
C.R.I.M.E.: The Corpus of Recorded Investigative, Media, and Evidence-based Proceedings

Please use the following text to cite this item:
Coats, Steven and Roemling, Dana, 2025, C.R.I.M.E.: The Corpus of Recorded Investigative, Media, and Evidence-based Proceedings, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11372/LRT-5992.
Date issued
2025
Size
133,310,369 tokens
19,376.84 hours
Language(s)
Description
CRIME is the Corpus of Recorded Investigative, Media, and Evidence-based Proceedings, a structured, searchable resource comprising ASR-generated transcripts from investigative interviews, courtroom interactions, and other criminal-justice-related media content. This version contains, for each transcript, a link to the corresponding audio.
Publisher
Acknowledgement
Files in this item
Name
forensic_transcript_df.csv
Size
2.25 GB
Format
text/csv
Description
CSV
MD5
112c63a9e11fddfef4a047e6a6c1fadf
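
Since the file is about 2.25 GB, it may be worth confirming that a download is complete before working with it. The following is a minimal sketch, not part of the published record, that computes the MD5 digest of a local copy in fixed-size chunks and compares it with the value listed above. The helper names (md5_of_file, matches_published_md5), the chunk size, and the assumption that the file sits in the current working directory under its published name are all illustrative.

```python
import hashlib
from pathlib import Path

# MD5 value as listed in the record for forensic_transcript_df.csv.
EXPECTED_MD5 = "112c63a9e11fddfef4a047e6a6c1fadf"


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in chunks.

    Chunked reading keeps memory use flat, which matters for a
    multi-gigabyte CSV. The 1 MiB default is an arbitrary choice.
    """
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_md5(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 digest equals the expected hex string."""
    return md5_of_file(path) == expected.lower()


if __name__ == "__main__":
    # Assumed local path: the published file name in the current directory.
    local_copy = Path("forensic_transcript_df.csv")
    if local_copy.exists():
        status = "OK" if matches_published_md5(local_copy) else "MISMATCH"
        print(f"{local_copy}: {status}")
    else:
        print(f"{local_copy} not found; download it first.")
```

A mismatch usually points to a truncated or corrupted download, in which case fetching the file again is the simplest remedy.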