dc.contributor.author Zeman, Daniel
dc.date.accessioned 2012-10-25T12:42:49Z
dc.date.available 2012-10-25T12:42:49Z
dc.date.issued 2006-06
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0007-70FD-E
dc.description DZ Interset is a means of converting among various tag sets in natural language processing. The core idea is similar to interlingua-based machine translation. DZ Interset defines a set of features that are encoded by the various tag sets. The set of features should be as universal as possible. It does not need to encode everything that is encoded by any tag set but it should encode all information that people may want to access and/or port from one tag set to another. New tag sets are attached by writing a driver for them. Once the driver is ready, you can easily convert tags between the new set and any other set for which you also have a driver. This reusability is an obvious advantage over writing a targeted conversion procedure each time you need to convert between a particular pair of tag sets.
dc.description.sponsorship grant MSM 0021620838 of the Ministry of Education of the Czech Republic
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights GNU General Public License, version 2
dc.rights.uri http://www.gnu.org/licenses/gpl-2.0.html
dc.source.uri https://wiki.ufal.ms.mff.cuni.cz/user:zeman:interset
dc.subject morphology
dc.subject NLP
dc.subject Perl
dc.title DZ Interset
dc.type toolService
metashare.ResourceInfo#ContactInfo#PersonInfo.surname Zeman
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName Daniel
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName Charles University in Prague, UFAL
metashare.ResourceInfo#DistributionInfo.availability restrictedUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse academic-nonCommercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse attribution
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium downloadable
metashare.ResourceInfo#ValidationInfo.validated True
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName #1-Research plan (Výzkumný záměr)
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType #1-nationalFunds
metashare.ResourceInfo#TextInfo#SizeInfo.size 1
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit mb
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email zeman@ufal.mff.cuni.cz
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent false
metashare.ResourceInfo#ContentInfo.detailedType tool
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
demo.uri http://quest.ms.mff.cuni.cz/cgi-bin/interset/index.pl
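sponsor Ministry of Education, Youth and Sports of the Czech Republic (Ministerstvo školství, mládeže a tělovýchovy České republiky) MSM 0021620838 Modern Methods, Structures and Systems of Computer Science (Moderní metody, struktury a systémy informatiky) nationalFunds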
size.info 1 mb
files.size 2203707
files.count 1
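
The driver idea in dc.description can be illustrated with a minimal sketch. This is not the Interset API (the package itself is written in Perl); the class name ToyDriver, the function convert, the feature names, and the second tag set with tags N1, N2 and V are all hypothetical and exist only for illustration. Each driver decodes a tag into shared features and encodes features back into the closest tag of its own set, so a conversion between any two sets is a decode followed by an encode.

```python
from typing import Dict

Features = Dict[str, str]


class ToyDriver:
    """Hypothetical driver: maps each tag of one tag set to shared features."""

    def __init__(self, table: Dict[str, Features]) -> None:
        self.table = table

    def decode(self, tag: str) -> Features:
        # Unknown tags decode to an empty feature set.
        return dict(self.table.get(tag, {}))

    def encode(self, features: Features) -> str:
        # Pick the tag whose features agree best with the input:
        # +1 for each matching value, -1 for each conflicting value.
        def score(tag: str) -> int:
            total = 0
            for name, value in self.table[tag].items():
                if features.get(name) == value:
                    total += 1
                elif name in features:
                    total -= 1
            return total

        return max(self.table, key=score)


def convert(tag: str, source: ToyDriver, target: ToyDriver) -> str:
    """Convert a tag by going through the shared feature set."""
    return target.encode(source.decode(tag))


# Toy tables (assumptions, far smaller than any real driver).
penn_like = ToyDriver({
    "NN": {"pos": "noun", "number": "sing"},
    "NNS": {"pos": "noun", "number": "plur"},
    "VBD": {"pos": "verb", "tense": "past"},
})
toy = ToyDriver({
    "N1": {"pos": "noun", "number": "sing"},
    "N2": {"pos": "noun", "number": "plur"},
    "V": {"pos": "verb"},
})

print(convert("NNS", penn_like, toy))
print(convert("VBD", penn_like, toy))
```

In this sketch NNS converts to N2, while VBD converts to V and loses its tense, because the toy target set has no tag that encodes it. Adding a third tag set would need only one more table, not a new conversion routine for every pair.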


 Files in this item

This item is Publicly Available and licensed under: GNU General Public License, version 2
Name interset-v1.2.zip
Size 2.1 MB
Format application/zip
Description DZ Interset software package
MD5 6aefe932b29341941ffacd61797febe3
 File Preview  
  • interset
    • bin
      • csts-zh-conll-cs-pdt.pl (1 kB)
      • print_trie.pl (2 kB)
      • csts-bg-conll-cs-pdt.pl (746 B)
      • collect_tags_from_pmk.pl (978 B)
      • print_permitted_fs.pl (447 B)
      • csts-en-conll-cs-pdt.pl (747 B)
      • conll-da-conll-en-penn.pl (854 B)
      • driver-test.pl (16 kB)
      • conll-sv-conll-cs-pdt.pl (820 B)
      • csts-cs-pdt-en-penn.pl (559 B)
      • list_cs_conll_tags.pl (1 kB)
      • csts-ar-conll-cs-pdt.pl (746 B)
      • index_examples.pl (12 kB)
      • collect_tags_from_conll.pl (851 B)
      • conll-da-conll-cs-pdt.pl (848 B)
      • csts_convert_tags.pl (1 kB)
    • doc
      • COPYING.txt (34 kB)
      • wiki
        • doku
          • versions.txt (4 kB)
          • common-problems.txt (9 kB)
          • drivers.txt (19 kB)
          • features.txt (24 kB)
          • to-do.txt (8 kB)
          • license.txt (745 B)
          • pronouns.txt (8 kB)
          • download.txt (2 kB)
          • tagsets.txt (377 B)
          • tagsets
            • conll-2006-bg.txt (197 B)
            • conll-2006-sl.txt (838 B)
            • conll-2006-cs.txt (160 kB)
            • urdu.txt (2 kB)
          • verb-forms.txt (25 kB)
          • how-to-use.txt (5 kB)
          • references.txt (2 kB)
          • brainstorming.txt (30 kB)
          • how-to-write-a-driver.txt (18 kB)
      • papers
        • 2010-icgl-hongkong
          • submitted1.pdf (215 kB)
          • acl-ijcnlp2009.sty (14 kB)
          • paper.tex (25 kB)
          • submitted2-camera-ready.pdf (177 kB)
          • paper.bib (10 kB)
          • submitted0.pdf (193 kB)
          • paper1-8pages.tex (39 kB)
        • 2009-ufal-horni-misecky
          • Interset.ppt (543 kB)
          • Interset.pdf (237 kB)
          • Interset.odp (51 kB)
        • 2008-lrec-marrakech
          • tagdrivers-marrakech-poster.odp (183 kB)
          • tagdrivers-marrakech-styl-lrec.pdf (109 kB)
          • tagdrivers-marrakech-styl-lrec.rtf (314 kB)
          • tagdrivers-marrakech-poster.pdf (299 kB)
    • lib
      • tagset
        • zh
          • conll.pm (10 kB)
        • en
          • conll2009.pm (3 kB)
          • penn.pm (16 kB)
          • conll.pm (3 kB)
        • ar
          • conll.pm (19 kB)
          • conll2007.pm (22 kB)
        • pl
          • ipipan.pm (65 kB)
        • bg
          • conll.pm (59 kB)
        • common.pm (67 kB)
        • cs
          • conll2009.pm (3 kB)
          • multext.pm (56 kB)
          • pmkdl.pm (1 MB)
          • conll.pm (165 kB)
          • pmkkr.pm (21 kB)
          • pdt.pm (104 kB)
          • pmk.pm (141 kB)
        • README.txt (598 B)
        • de
          • conll2009.pm (24 kB)
          • conll.pm (2 kB)
          • stts.pm (18 kB)
        • sv
          • svdahybrid.pm (12 kB)
          • mamba.pm (10 kB)
          • conll.pm (1 kB)
          • hajic.pm (17 kB)
        • da
          • conll.pm (31 kB)
        • pt
          • conll.pm (40 kB)
