Corpus Search

CQL Query: query builder | visualize | options

Type in a search query in the CQL (Corpus WorkBench Query Language) format in the text box above to search in the corpus. The CQL syntax uses an intuitive system of defining properties of words you are looking for, as in for instance:

[upos="NUM.*"] [lemma="otázka"]

will search for any form of the word otázka preceded by a numeral. More information about the CQL language can be found here.

By default, TEITOK searches the entire corpus, which may contain multiple transcripts for a single recording. If you want to search only in the part of the corpus where each recording has only a single associated transcript, you must restrict the search to so-called canonical transcripts. For example:

[lemma = "situace"] :: match.text_canonical = "1"

searches for the lemma situace only in canonical transcripts.

To facilitate searching, the interface provides a query builder which provides an easy way to define simple queries in CQL. Just click on the query builder icon to open the query builder, define your query, and click on the button to insert that query in the CQL query box, after which you can modify it by hand if needed, or simply hit search.

You can use to Query Builder to just search for documents – you do this by not providing any token restrictions, which will make the system interpret the query as a search for document.


List all documents