Term frequency-inverse document frequency (TF-IDF) is a numerical statistic that reflects how important a word is to a document relative to a collection of documents, or corpus. It is widely used in information retrieval and text mining to score how relevant a word is to a specific document: the score rises with the number of times the word appears in that document and falls with the number of documents in the corpus that contain it.
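
The definition above can be made concrete with a minimal Python sketch. It assumes one common variant of the weighting, which the text does not specify: term frequency is the word's count divided by the document length, and inverse document frequency is the natural logarithm of the corpus size divided by the number of documents containing the word. The function name `tf_idf` and the toy corpus are hypothetical, chosen only for illustration; libraries often use smoothed or normalized variants that give different numbers.

```python
import math
from collections import Counter


def tf_idf(corpus):
    """Return one {term: weight} dict per document.

    `corpus` is a list of documents, each a list of tokens.
    Assumed variant: tf = count / document length,
    idf = ln(number of documents / documents containing the term).
    """
    n_docs = len(corpus)

    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for doc in corpus:
        doc_freq.update(set(doc))

    weights = []
    for doc in corpus:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Toy corpus (hypothetical data), already tokenized.
corpus = [
    ["the", "cat", "sat"],
    ["the", "dog", "barked"],
    ["the", "cat", "purred"],
]

for doc_weights in tf_idf(corpus):
    print(doc_weights)
```

In this toy corpus, "the" occurs in every document, so its inverse document frequency is ln(3/3) = 0 and its weight is 0 in each document. "sat" occurs in only one document and therefore receives a higher weight in the first document than "cat", which occurs in two.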