Catalog
Repository
Education
Projects
Tools
Services
About
Partners
Mission Statement
CLARIN
DARIAH
Service integrations
Project partnerships
Login
LINDAT/CLARIAH-CZ Repository Home
View Item
Show/Hide Menu
Browse
All of the Repository
Issue Date
Authors
Titles
Subjects
Publisher
Language
Type
Rights Label
My Account
Login
Statistics
Statistics
BETA
General Information
Deposit
Cite
Submission Lifecycle
FAQ
About
Help Desk
enTenTen
LINDAT / CLARIAH-CZ
Authors
(:unav) Unknown author
Item identifier
http://hdl.handle.net/11858/00-097C-0000-0001-CCDF-8
Date issued
2011-12-16
Type
corpus
,
text
Size
3268798627 tokens
Language(s)
English
Description
Very large English web corpus enTenTEn, comprising 3,268,798,627 tokens.
Publisher
Masaryk University, NLP Centre
Acknowledgement
Lexical Computing Ltd.
Subject(s)
English large corpus
Collection(s)
LINDAT / CLARIAH-CZ Data & Tools
Show full item record
Files in this item
This item is
Academic Use
and licensed under:
NLP Centre Web Corpus License
Name
ententen08.vert.gz
Size
6.95 GB
Format
application/x-gzip
MD5
9bf7179d3643f0f42798ef9d75e25ba3
Download file