DZ Interset
Please use the following text to cite this item:
Zeman, Daniel, 2006,
DZ Interset, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11858/00-097C-0000-0007-70FD-E.
Date issued
2006-06
Size
1 MB
Description
DZ Interset is a tool for converting between tag sets used in natural language processing. The core idea is similar to interlingua-based machine translation: DZ Interset defines a set of features that the various tag sets encode, and this feature set serves as the intermediate representation. The feature set should be as universal as possible. It does not need to capture everything encoded by every tag set, but it should cover all information that people may want to access or port from one tag set to another.
New tag sets are added by writing a driver for them. Once the driver is ready, you can convert tags between the new set and any other set that also has a driver. This reusability is a clear advantage over writing a dedicated conversion procedure for every pair of tag sets: n tag sets need only n drivers, rather than up to n(n-1) directed pairwise converters.
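The driver idea can be illustrated with a minimal sketch. It is not the code shipped in the package (the distributed drivers are Perl modules, and their interface may differ); it is a toy Python model written under stated assumptions. The class `TableDriver`, the method names `decode` and `encode`, the feature names `pos`, `number` and `case`, and both miniature tag sets are hypothetical and chosen only for illustration.

```python
from typing import Dict

Features = Dict[str, str]


class TableDriver:
    """Toy driver (hypothetical): maps each tag of one tag set to shared features."""

    def __init__(self, table: Dict[str, Features]) -> None:
        self.table = table

    def decode(self, tag: str) -> Features:
        """Translate a tag into a copy of its feature structure."""
        return dict(self.table[tag])

    def encode(self, features: Features) -> str:
        """Pick the tag of this set that agrees best with the given features."""
        def score(tag: str) -> int:
            own = self.table[tag]
            matches = sum(1 for k, v in own.items() if features.get(k) == v)
            conflicts = sum(1 for k, v in own.items()
                            if k in features and features[k] != v)
            return matches - conflicts
        # max() returns the first best-scoring tag, so ties follow table order.
        return max(self.table, key=score)


# Two invented miniature tag sets (assumptions, not the real inventories).
penn_like = TableDriver({
    "NN": {"pos": "noun", "number": "sing"},
    "NNS": {"pos": "noun", "number": "plur"},
    "JJ": {"pos": "adj"},
})
positional = TableDriver({
    "NS1": {"pos": "noun", "number": "sing", "case": "nom"},
    "NS4": {"pos": "noun", "number": "sing", "case": "acc"},
    "NP1": {"pos": "noun", "number": "plur", "case": "nom"},
    "NP4": {"pos": "noun", "number": "plur", "case": "acc"},
    "A--": {"pos": "adj"},
})


def convert(tag: str, source: TableDriver, target: TableDriver) -> str:
    """Convert a tag by decoding it to features and re-encoding it."""
    return target.encode(source.decode(tag))


print(convert("NP4", positional, penn_like))  # NNS
```

In this toy setup, case is dropped when converting into the Penn-like set because that set has no way to express it. Converting `NNS` back therefore yields the first plural noun tag in table order (`NP1`), which shows why a conversion through shared features is not always reversible.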
Acknowledgement
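Ministry of Education, Youth and Sports of the Czech Republic (Ministerstvo školství, mládeže a tělovýchovy České republiky)
Project code: MSM 0021620838
Project name: Modern Methods, Structures and Systems of Computer Science (Moderní metody, struktury a systémy informatiky)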
Files in this item
- Name: interset-v1.2.zip
- Size: 2.1 MB
- Format: application/zip
- Description: Zip
- MD5: 6aefe932b29341941ffacd61797febe3

- interset
- bin
- print_trie.pl (2 kB)
- csts-zh-conll-cs-pdt.pl (1 kB)
- csts-bg-conll-cs-pdt.pl (746 B)
- collect_tags_from_pmk.pl (978 B)
- print_permitted_fs.pl (447 B)
- csts-en-conll-cs-pdt.pl (747 B)
- conll-da-conll-en-penn.pl (854 B)
- driver-test.pl (16 kB)
- conll-sv-conll-cs-pdt.pl (820 B)
- list_cs_conll_tags.pl (1 kB)
- csts-cs-pdt-en-penn.pl (559 B)
- index_examples.pl (12 kB)
- csts-ar-conll-cs-pdt.pl (746 B)
- collect_tags_from_conll.pl (851 B)
- conll-da-conll-cs-pdt.pl (848 B)
- csts_convert_tags.pl (1 kB)
- doc
- COPYING.txt (34 kB)
- wiki
- doku
- versions.txt (4 kB)
- common-problems.txt (9 kB)
- drivers.txt (19 kB)
- license.txt (745 B)
- features.txt (24 kB)
- to-do.txt (8 kB)
- pronouns.txt (8 kB)
- download.txt (2 kB)
- tagsets.txt (377 B)
- tagsets
- conll-2006-bg.txt (197 B)
- conll-2006-sl.txt (838 B)
- conll-2006-cs.txt (160 kB)
- urdu.txt (2 kB)
- verb-forms.txt (25 kB)
- brainstorming.txt (30 kB)
- references.txt (2 kB)
- how-to-use.txt (5 kB)
- how-to-write-a-driver.txt (18 kB)
- papers
- 2010-icgl-hongkong
- submitted1.pdf (215 kB)
- acl-ijcnlp2009.sty (14 kB)
- paper.tex (25 kB)
- submitted2-camera-ready.pdf (177 kB)
- paper.bib (10 kB)
- submitted0.pdf (193 kB)
- paper1-8pages.tex (39 kB)
- 2009-ufal-horni-misecky
- Interset.ppt (543 kB)
- Interset.pdf (237 kB)
- Interset.odp (51 kB)
- 2008-lrec-marrakech
- tagdrivers-marrakech-poster.odp (183 kB)
- tagdrivers-marrakech-styl-lrec.pdf (109 kB)
- tagdrivers-marrakech-styl-lrec.rtf (314 kB)
- tagdrivers-marrakech-poster.pdf (299 kB)
- lib
- tagset
- zh
- conll.pm (10 kB)
- en
- conll2009.pm (3 kB)
- conll.pm (3 kB)
- penn.pm (16 kB)
- ar
- conll.pm (19 kB)
- conll2007.pm (22 kB)
- pl
- ipipan.pm (65 kB)
- bg
- conll.pm (59 kB)
- common.pm (67 kB)
- cs
- conll2009.pm (3 kB)
- multext.pm (56 kB)
- conll.pm (165 kB)
- pmkdl.pm (1 MB)
- pmkkr.pm (21 kB)
- pdt.pm (104 kB)
- pmk.pm (141 kB)
- de
- conll2009.pm (24 kB)
- conll.pm (2 kB)
- stts.pm (18 kB)
- README.txt (598 B)
- sv
- svdahybrid.pm (12 kB)
- mamba.pm (10 kB)
- conll.pm (1 kB)
- hajic.pm (17 kB)
- da
- conll.pm (31 kB)
- pt
- conll.pm (40 kB)

