Ancillary Monitor Corpus: Common Crawl - german web (YEAR 2015 – VERSION 1)
Please use the following text to cite this item or export to a predefined format:
Rüdiger, Jan Oliver, 2024,
Ancillary Monitor Corpus: Common Crawl - german web (YEAR 2015 – VERSION 1), LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11372/LRT-5789.
Authors
Item identifier
Date issued
2024-11-12
Size
5044582 articles,
3197261048 tokens
Language(s)
Description
*** german version see below ***
The ‘Ancillary Monitor Corpus: Common Crawl - german web’ was designed with the aim of enabling a broad-based linguistic analysis of the German-language (visible) internet over time - with the aim of achieving comparability with the DeReKo (‘German Reference Corpus’ of the Leibniz Institute for the German Language - DeReKo volume 57 billion tokens - status: DeReKo Release 2024-I). The corpus is separated by year (here year 2015) and versioned (here version 1). Version 1 comprises (all years 2013-2024) 97.45 billion tokens.
The corpus is based on the data dumps from CommonCrawl (https://commoncrawl.org/). CommonCrawl is a non-profit organisation that provides copies of the visible Internet free of charge for research purposes.
The CommonCrawl WET raw data was first filtered by TLD (top-level domain). Only pages ending in the following TLDs were taken into account: ‘.at; .bayern; .berlin; .ch; .cologne; .de; .gmbh; .hamburg; .koeln; .nrw; .ruhr; .saarland; .swiss; .tirol; .wien; .zuerich’. These are the exclusive German-language TLDs according to ICANN (https://data.iana.org/TLD/tlds-alpha-by-domain.txt) as of 1 June 2024 - TLDs with a purely corporate reference (e.g. ‘.edeka; .bmw; .ford’) were excluded. The language of the individual documents (URLs) was then estimated with the help of NTextCat (https://github.com/ivanakcheurov/ntextcat) (via the CORE14 profile of NTextCat) - only those documents/URLs for which German was the most likely language were processed further (e.g. to exclude foreign-language material such as individual subpages). The third step involved filtering for manual selectors and filtering for 1:1 duplicates (within one year).
The filtering and subsequent processing was carried out using CorpusExplorer (http://hdl.handle.net/11234/1-2634) and our own (supplementary) scripts, and the TreeTagger (http://hdl.handle.net/11372/LRT-323) was used for automatic annotation. The corpus was processed on the HELIX HPC cluster. The author would like to take this opportunity to thank the state of Baden-Württemberg and the German Research Foundation (DFG) for the possibility to use the bwHPC/HELIX HPC cluster - funding code HPC cluster: INST 35/1597-1 FUGG.
Data content:
- Tokens and record boundaries
- Automatic lemma and POS annotation (using TreeTagger)
- Metadata:
- GUID - Unique identifier of the document
- YEAR - Year of capture (please use this information for data slices)
- Url - Full URL
- Tld - Top-Level Domain
- Domain - Domain without TLD (but with sub-domains if applicable)
- DomainFull - Complete domain (incl. TLD)
- DomainFull - Complete domain (incl. TLD)
- Datum - (System Information): Date of the CorpusExplorer (date of capture by CommonCrawl - not date of creation/modification of the document).
- Hash - (System Information): SHA1 hash of the CommonCrawl
- Pfad - (System Information): Path of the cluster (raw data) - is supplied by the system.
Please note that the files are saved as *.cec6.gz. These are binary files of the CorpusExplorer (see above). These files ensure efficient archiving. You can use both CorpusExplorer and the ‘CEC6-Converter’ (available for Linux, MacOS and Windows - see: https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-5705) to convert the data. The data can be exported in the following formats:
- CATMA v6
- CoNLL
- CSV
- CSV (only meta-data)
- DTA TCF-XML
- DWDS TEI-XML
- HTML
- IDS I5-XML
- IDS KorAP XML
- IMS Open Corpus Workbench
- JSON
- OPUS Corpus Collection XCES
- Plaintext
- SaltXML
- SlashA XML
- SketchEngine VERT
- SPEEDy/CODEX (JSON)
- TLV-XML
- TreeTagger
- TXM
- WebLicht
- XML
Please note that an export increases the storage space requirement extensively. The ‘CorpusExplorerConsole’ (https://github.com/notesjor/CorpusExplorer.Terminal.Console - available for Linux, MacOS and Windows) also offers a simple solution for editing and analysing. If you have any questions, please contact the author.
Legal information
The data was downloaded on 01.11.2024. The use, processing and distribution is subject to §60d UrhG (german copyright law), which authorises the use for non-commercial purposes in research and teaching. LINDAT/CLARIN is responsible for long-term archiving in accordance with §69d para. 5 and ensures that only authorised persons can access the data. The data has been checked to the best of our knowledge and belief (on a random basis) - should you nevertheless find legal violations (e.g. right to be forgotten, personal rights, etc.), please write an e-mail to the author (amc_report@jan-oliver-ruediger.de) with the following information: 1) why this content is undesirable (please outline only briefly) and 2) how the content can be identified - e.g. file name, URL or domain, etc. The author will endeavour to identify the content. The author will endeavour to remove the content and re-upload the data (modified) within two weeks (new version). If you have any further questions, please contact CLARIN.
*** english version see above ***
Das ‚Ancillary Monitor Corpus: Common Crawl - german web‘ wurde mit dem Ziel konzipiert - eine breit angelegte und zeitlich begleitende linguistische Analyse des deutschsprachigen (sichtbaren) Internets zu ermöglichen - wobei eine Vergleichbarkeit mit dem DeReKo (‚Deutsches Referenz Korpus‘ des Leibniz-Instituts für Deutsche Sprache - DeReKo Umfang 57 Mrd. Token - Stand: DeReKo Release 2024-I) angestrebt wird. Das Korpus ist nach Jahren getrennt (hier Jahr 2015) und versioniert (hier Version 1). Die Version 1 umfasst (alle Jahre 2013-2024) 97,45 Mrd. Token.
Das Korpus basiert auf den Daten-Dumps von CommonCrawl (https://commoncrawl.org/). CommonCrawl ist eine Non-Profit-Organisation, die Kopien des sichtbaren Internets kostenlos für die Forschung zur Verfügung stellt.
Die CommonCrawl WET Rohdaten wurden zunächst nach TLD (Top-Level Domain) gefiltert. Es wurden nur Seiten berücksichtigt, die auf folgende TLDs enden: „.at; .bayern; .berlin; .ch; .cologne; .de; .gmbh; .hamburg; .koeln; .nrw; .ruhr; .saarland; .swiss; .tirol; .wien; .zuerich“. Dies sind die exklusiven deutschsprachigen TLDs laut ICANN (https://data.iana.org/TLD/tlds-alpha-by-domain.txt) zum Stand 01.06.2024 - ausgeschlossen wurden TLDs mit reinem Firmenbezug (z.B. „.edeka; .bmw; .ford“). Für die einzelnen Dokumente (URLs) wurde dann mit Hilfe von NTextCat (https://github.com/ivanakcheurov/ntextcat) die Sprache geschätzt (über das CORE14-Profil von NTextCat) - es wurden nur solche Dokumente/URLs weiterverarbeitet, bei denen Deutsch die wahrscheinlichste Sprache war (z.B. um möglichst auszuschließen, dass fremdsprachiges Material wie einzelne Unterseitenbereiche enthalten sind). Als dritter Schritt erfolgte eine Filterung nach manuellen Selektoren und eine Filterung nach 1:1-Dubletten (innerhalb eines Jahres).
Die Filterung und anschließende Aufbereitung erfolgte mit dem CorpusExplorer (http://hdl.handle.net/11234/1-2634) und eigenen (ergänzenden) Skripten, wobei für die automatische Annotation der TreeTagger (http://hdl.handle.net/11372/LRT-323) verwendet wurde. Die Aufbereitung des Korpus erfolgte auf dem HELIX-HPC-Cluster. Der Autor dankt an dieser Stelle dem Land Baden-Württemberg und der Deutschen Forschungsgemeinschaft (DFG) für die Möglichkeit das bwHPC/HELIX HPC-Cluster nutzen zu können – Förderkennzeichen HPC-Cluster: INST 35/1597-1 FUGG.
Dateninhalt:
- Token und Satzgrenzen
- Automatische Lemma- und POS-Annotation (mittels TreeTagger)
- Metadaten:
- GUID - Eindeutiger Identifikator des Dokuments
- YEAR - Jahr der Erfassung (bitte verwenden Sie diese Angabe für Datenschnitte)
- Url - Vollständige URL
- Tld – Top-Level Domain
- Domain – Domain ohne TLD (aber ggf. mit Sub-Domains)
- DomainFull – Vollständige Domain (inkl. TLD)
- DomainFull - Komplette Domain (inkl. TLD)
- Datum - (System Information): Datum des CorpusExplorers (Tag der Erfassung durch CommonCrawl - nicht Tag der Erstellung/Änderung des Dokuments).
- Hash - (System Information): SHA1-Hash des CommonCrawl
- Pfad - (System Information): Pfad des Clusters (Rohdaten) - wird systembedingt geliefert.
Bitte beachten Sie, dass die Dateien als *.cec6.gz gespeichert sind. Dies sind Binärdateien des CorpusExplorers (siehe oben). Diese Dateien gewährleisten eine effiziente Archivierung. Sie können sowohl den CorpusExplorer als auch den ‚CEC6-Converter‘ (verfügbar für Linux, MacOS und Windows - siehe: https://lindat.mff.cuni.cz/repository/xmlui/handle/11372/LRT-5705) zur Konvertierung der Daten verwenden. Die Daten können in folgende Formate exportiert werden:
- CATMA v6
- CoNLL
- CSV
- CSV (only meta-data)
- DTA TCF-XML
- DWDS TEI-XML
- HTML
- IDS I5-XML
- IDS KorAP XML
- IMS Open Corpus Workbench
- JSON
- OPUS Corpus Collection XCES
- Plaintext
- SaltXML
- SlashA XML
- SketchEngine VERT
- SPEEDy/CODEX (JSON)
- TLV-XML
- TreeTagger
- TXM
- WebLicht
- XML
Bitte beachten Sie, dass ein Export den Speicherplatzbedarf erheblich erhöht. Eine einfache Lösung zur Bearbeitung und Analyse bietet auch die „CorpusExplorerConsole“ (https://github.com/notesjor/CorpusExplorer.Terminal.Console - verfügbar für Linux, MacOS und Windows). Bei Fragen wenden Sie sich bitte an den Autor.
Rechtliche Hinweise
Die Daten wurden am 01.11.2024 heruntergeladen. Die Nutzung, Verarbeitung und Verbreitung unterliegt §60d UrhG, der die Nutzung für nicht kommerzielle Zwecke in Forschung und Lehre erlaubt. LINDAT/CLARIN übernimmt die Langzeitarchivierung nach §69d Abs. 5 und stellt sicher, dass nur berechtigte Personen auf die Daten zugreifen können. Die Daten wurden nach bestem Wissen und Gewissen (stichprobenartig) überprüft - sollten Sie dennoch Rechtsverletzungen (z.B. Recht auf Vergessenwerden, Persönlichkeitsrechte etc.) finden, schreiben Sie bitte eine E-Mail an den Autor (amc_report@jan-oliver-ruediger.de) mit folgenden Informationen: 1) warum dieser Inhalt unerwünscht ist (bitte nur kurz skizzieren) und 2) wie der Inhalt identifiziert werden kann - z.B. Dateiname, URL oder Domain etc. Der Autor wird sich bemühen, den Inhalt zu entfernen und die Daten innerhalb von zwei Wochen (verändert) wieder hochzuladen (neue Version). Bei weiteren Fragen wenden Sie sich bitte an CLARIN.
Publisher
Subject(s)
Collections
This item isPublicly Available
and licensed under:
Files in this item
- Name
- 2015_0084.cec6.gz
- Size
- 191.88 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- d71b4cf491d43e1c56775304035f9b1a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0055.cec6.gz
- Size
- 191.85 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 9ae0e3f41681feace0580907f40353dd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0059.cec6.gz
- Size
- 191.82 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 44397fda91e2901170bb24781a5d92ba

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0057.cec6.gz
- Size
- 191.89 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6d7eed963a6f353d8758903b6fb9ccb5

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0045.cec6.gz
- Size
- 191.61 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b3e4d02019dc440c82edb784ba2b3b1b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0001.cec6.gz
- Size
- 191.48 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4124ac60677e3676ecf0fafc3b71368a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0031.cec6.gz
- Size
- 192 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1b8680f85e6f33f79659527d5e5fd2ca

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0078.cec6.gz
- Size
- 192.09 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 0c6752422c9f9ef7d38f4a4ecb8da4f7

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0082.cec6.gz
- Size
- 192.22 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 246acd8a7b7cb6ec0ec9508dba06cd42

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0039.cec6.gz
- Size
- 192.23 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 62941a46b00ca9412ae098ed11c2d7fd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0065.cec6.gz
- Size
- 192.11 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1cea04c3cd0d8dd68524496c5941b283

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0070.cec6.gz
- Size
- 192.24 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1805be3dcbf01df72481635f393b8c82

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0025.cec6.gz
- Size
- 192.27 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- f1278ae020f021b7358389bb45b84200

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0020.cec6.gz
- Size
- 192.75 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 9d469a79be6a54e6989cbe8e9b0ce112

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0002.cec6.gz
- Size
- 192.57 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7622772c981927994867ec05f66fd0e0

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0004.cec6.gz
- Size
- 192.69 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 26dc3296a0e49952b9d811d4608202f4

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0033.cec6.gz
- Size
- 192.62 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a8919d009717ec12142d4cb5dedf1d47

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0022.cec6.gz
- Size
- 192.6 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- de2da59f2004a736f5450f51da80310e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0067.cec6.gz
- Size
- 192.79 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 2dd043971946ec60d6509af059aa83a5

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0038.cec6.gz
- Size
- 192.89 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a1b4d7bd9e91f74461fd5a09a0d10fa2

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0005.cec6.gz
- Size
- 193.05 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- e677fc86b4db50371f9971733d4cb351

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0075.cec6.gz
- Size
- 192.86 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 494272e95c5ab4df5454d7020b0788a0

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0060.cec6.gz
- Size
- 192.96 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6ea0a5e31ae53f6c906c0ffc6e00a0c4

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0068.cec6.gz
- Size
- 192.97 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 3469cd694fcffa1b6f42f87c53f55f49

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0072.cec6.gz
- Size
- 193.04 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6d3e0265f21c38f8a446e4fad4e2879a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0042.cec6.gz
- Size
- 193.24 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a6c7adf7961d8b83f9583deafc3d6fcf

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0081.cec6.gz
- Size
- 193.16 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1b3e9815145c634bd8688af0a75f9391

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0006.cec6.gz
- Size
- 193.07 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 97a250df26c64e63d95fb5359cd2ff65

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0043.cec6.gz
- Size
- 193.18 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 0c38616f89295830cbe35a73a1685ae7

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0023.cec6.gz
- Size
- 193.23 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- c112ef4ecf35835039846fa66d1cf032

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0035.cec6.gz
- Size
- 193.33 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 9486bdf8dd0156bdae1f2815387a922d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0010.cec6.gz
- Size
- 193.36 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- fe66fbc6809d534d52dfb75b9a41aa7c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0009.cec6.gz
- Size
- 193.41 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a85f81949ba53f2edacb9dc11c5b7e5b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0041.cec6.gz
- Size
- 193.43 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6acdae4de434b495bae6e95a4c31aaae

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0050.cec6.gz
- Size
- 193.51 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 99a0b55130d24cf155e9e85734454428

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0056.cec6.gz
- Size
- 193.45 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 30205d31d37171a0641fa2f077a5c15b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0040.cec6.gz
- Size
- 193.55 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 0d5ce4c296aec4a0ad4e94af3cfe592a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0051.cec6.gz
- Size
- 192.99 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 8ae798477b20d39218b62ebf3fdff26a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0061.cec6.gz
- Size
- 193.24 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7da47132ffc3c4fb4b9201d8541c340c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0029.cec6.gz
- Size
- 193.59 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 4c829ea8ec15635a3658d161959efa4b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0016.cec6.gz
- Size
- 193.56 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 1f2ef7234ad70ee377cc6321653d44da

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0028.cec6.gz
- Size
- 193.58 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 12534bb74cb03c6fe7da163feb7936d7

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0011.cec6.gz
- Size
- 193.58 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 76bed77866960d48533d3c9444abe154

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0003.cec6.gz
- Size
- 193.61 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 63fbb0af83387410b1aff80a0c7fdad3

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0076.cec6.gz
- Size
- 193.59 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- d6a914dcf35c676d9005ca91b76e2409

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0054.cec6.gz
- Size
- 193.61 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 77ffa5b1998fe0108a44578635130e2c

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0026.cec6.gz
- Size
- 193.63 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- fd9709469c90789f421703daaa9cae9d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0083.cec6.gz
- Size
- 193.72 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- f8c91b1b9383b2738ec4300d74c826e3

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0037.cec6.gz
- Size
- 194 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 87ad9eb3586570d53a3a0d75ce2aa95e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0019.cec6.gz
- Size
- 193.76 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- d49f47b218f74054a7e85174f812bf8d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0048.cec6.gz
- Size
- 193.9 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- fd39e04593afbb6e90275a2ab0f1f1e2

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0014.cec6.gz
- Size
- 194.01 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 19c29ee134af03567f18d5ad504c99fd

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0064.cec6.gz
- Size
- 194.16 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 566c9ea53efb3a15b33dd439b764b14a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0063.cec6.gz
- Size
- 194.32 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 060a9b8bf1c32d9d2949dd541c110cee

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0027.cec6.gz
- Size
- 194.34 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- b43a9f5084a5468a9dab2f611f9d485a

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0018.cec6.gz
- Size
- 194.52 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 5b9c9fc1713b2e8b41ad03b63a387319

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0062.cec6.gz
- Size
- 194.65 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 30bf4bbcab73d3d42412a7d597bc1164

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0030.cec6.gz
- Size
- 194.59 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- fe0cb0575c1ca2f36eeced287069337f

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0017.cec6.gz
- Size
- 194.63 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 5329273c02c0af0cd8c384951d1204eb

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0077.cec6.gz
- Size
- 195.18 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 792626998f4fc7ce916535e93e5a7ca2

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0007.cec6.gz
- Size
- 194.77 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- d00441dca466994684ce416e1d6ae646

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0044.cec6.gz
- Size
- 195.49 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- dee9a33089ad53bc6b80fcd24ac7d747

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0066.cec6.gz
- Size
- 195.27 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- f0c5faca579264010646ae4a9dc96f17

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0058.cec6.gz
- Size
- 194.01 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 5cb77d19c639ee90f7189ad1cce7abd8

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0013.cec6.gz
- Size
- 194.08 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 64cbf09477ff840065d97d5362627270

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0032.cec6.gz
- Size
- 194.26 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- e993ca04a5b6e7af9a527ad64f96520d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0071.cec6.gz
- Size
- 194.31 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 982ed3629f07f4b0958c9511f60587ea

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0085.cec6.gz
- Size
- 22.01 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 80876686f1ce3d5b7a30b7ba402f63d6

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0052.cec6.gz
- Size
- 195.75 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 139ef48b3423c342858de209c5d9ae93

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0047.cec6.gz
- Size
- 195.25 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 665d9eb7b682439fd9d26a6914fc4ceb

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0034.cec6.gz
- Size
- 187.96 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- ca28e23297824b6c31addfc25247de6b

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0012.cec6.gz
- Size
- 195.88 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- a0d0a761ad55aaf55c96f81cd0ad5b38

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0036.cec6.gz
- Size
- 190.67 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 7ac5710ca4ad7d4ad61b87b33360c520

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0024.cec6.gz
- Size
- 190.44 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- cb71f5635c6289a566b4357cba192449

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0015.cec6.gz
- Size
- 190.12 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 825ebc278e131d5bdaa95f5d43d3745d

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0079.cec6.gz
- Size
- 190.71 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 51085e2612c4141452f3830bfd8b20d1

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0080.cec6.gz
- Size
- 192.34 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 3841b7af49ce92a60c3ab10f62c9b361

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0021.cec6.gz
- Size
- 190.85 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 6af13509f0288603b2155053cb5b4f57

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0074.cec6.gz
- Size
- 191.16 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- c7ec739ba5111a27ed72a166fa8895cb

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0008.cec6.gz
- Size
- 191.42 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 2b7cd253383182898e52b991f3d75774

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0053.cec6.gz
- Size
- 192.43 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- c43fc198ec809c4f91845eb2328c824e

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0073.cec6.gz
- Size
- 192.29 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 61e873b64f5448dc1ff3bf2b899d52e6

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0049.cec6.gz
- Size
- 192.49 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 2bcf897d42d17bc28e2226545bc44f97

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0069.cec6.gz
- Size
- 192.28 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- cf578d0c15af900b348c5d63986ffdd8

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz
- Name
- 2015_0046.cec6.gz
- Size
- 192.39 MB
- Format
- application/x-gzip
- Description
- gzip Archive
- MD5
- 9a65293e9d8bb97d39a2a5bd0e80deb9

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

