Structured collection of texts / From Wikipedia, the free encyclopedia
Dear Wikiwand AI, let's keep it short by simply answering these key questions:
Can you list the top facts and stats about Text corpus?
Summarize this article for a 10 years old
SHOW ALL QUESTIONS
In linguistics, a corpus (plural corpora) or text corpus is a language resource consisting of a large and structured set of texts (nowadays usually electronically stored and processed). In corpus linguistics, they are used to do statistical analysis and hypothesis testing, checking occurrences or validating linguistic rules within a specific language territory.
Structured collection of texts
In search technology, a corpus is the collection of documents which is being searched.