Please use the following text to cite this item:
Zeman, Daniel, 2006, DZ Interset, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11858/00-097C-0000-0007-70FD-E.
dc.contributor.author: Zeman, Daniel
dc.date.accessioned: 2012-10-25T12:42:49Z
dc.date.available: 2012-10-25T12:42:49Z
dc.date.issued: 2006-06
dc.description: DZ Interset is a means of converting among various tag sets in natural language processing. The core idea is similar to interlingua-based machine translation. DZ Interset defines a set of features that are encoded by the various tag sets. The set of features should be as universal as possible. It does not need to encode everything that any tag set encodes, but it should encode all information that people may want to access or port from one tag set to another. New tag sets are attached by writing a driver for them. Once the driver is ready, you can convert tags between the new set and any other set that also has a driver. This reusability is an advantage over writing a targeted conversion procedure each time you need to convert between a particular pair of tag sets.
dc.description.sponsorship: grant MSM 0021620838 of the Ministry of Education of the Czech Republic
dc.identifier.uri: http://hdl.handle.net/11858/00-097C-0000-0007-70FD-E
dc.publisher: Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights: GNU General Public License, version 2
dc.rights.label: PUB
dc.rights.uri: http://www.gnu.org/licenses/gpl-2.0.html
dc.source.uri: https://wiki.ufal.ms.mff.cuni.cz/user:zeman:interset
dc.subject: morphology
dc.subject: NLP
dc.subject: Perl
dc.title: DZ Interset
dc.type: toolService
local.branding: LINDAT / CLARIAH-CZ
local.demo.uri: http://quest.ms.mff.cuni.cz/cgi-bin/interset/index.pl
local.files.count: 1
local.files.size: 2203707
local.has.files: yes
local.size.info: 1 mb
local.sponsor: nationalFunds MSM 0021620838 Ministerstvo školství, mládeže a tělovýchovy České republiky Moderní metody, struktury a systémy informatiky
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo#CommunicationInfo.email: zeman@ufal.mff.cuni.cz
metashare.ResourceInfo#ContactInfo#PersonInfo#OrganizationInfo.organizationName: Charles University in Prague, UFAL
metashare.ResourceInfo#ContactInfo#PersonInfo.givenName: Daniel
metashare.ResourceInfo#ContactInfo#PersonInfo.surname: Zeman
metashare.ResourceInfo#ContentInfo.detailedType: tool
metashare.ResourceInfo#ContentInfo.resourceType: toolService
metashare.ResourceInfo#DistributionInfo#LicenseInfo.distributionAccessMedium: downloadable
metashare.ResourceInfo#DistributionInfo#LicenseInfo.license: Attribution-NonCommercial 3.0 Unported (CC BY-NC 3.0)
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse: academic-nonCommercialUse
metashare.ResourceInfo#DistributionInfo#LicenseInfo.restrictionsOfUse: attribution
metashare.ResourceInfo#DistributionInfo.availability: restrictedUse
metashare.ResourceInfo#IdentificationInfo.resourceName: DZ Interset
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent: false
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.fundingType: #1-nationalFunds
metashare.ResourceInfo#ResourceCreationInfo#FundingInfo#ProjectInfo.projectName: #1-Výzkumný záměr
metashare.ResourceInfo#TextInfo#SizeInfo.size: 1
metashare.ResourceInfo#TextInfo#SizeInfo.sizeUnit: mb
metashare.ResourceInfo#ValidationInfo.validated: True
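
The description above presents conversion as a two-step process: a driver decodes a tag into a shared feature set, and another driver encodes those features into the target tag set. The following is a minimal sketch of that idea in Python. It is not Interset's actual Perl API: the two toy tag sets, the feature names, and every function name here are hypothetical and chosen only for illustration.

```python
from typing import Callable, Dict, Tuple

Features = Dict[str, str]

# --- Hypothetical driver 1: a tiny Penn-Treebank-like tag set ---------------
PENN_LIKE: Dict[str, Features] = {
    "NN": {"pos": "noun", "number": "sing"},
    "NNS": {"pos": "noun", "number": "plur"},
    "JJ": {"pos": "adj"},
}


def penn_decode(tag: str) -> Features:
    """Map a tag to the shared features (a copy, so callers may modify it)."""
    return dict(PENN_LIKE[tag])


def penn_encode(features: Features) -> str:
    """Pick the tag that best expresses the given features."""
    pos = features.get("pos")
    if pos == "noun":
        return "NNS" if features.get("number") == "plur" else "NN"
    if pos == "adj":
        return "JJ"
    raise ValueError(f"no tag for features {features!r}")


# --- Hypothetical driver 2: a two-character positional tag set --------------
POS_CHARS = {"N": "noun", "A": "adj"}
NUM_CHARS = {"S": "sing", "P": "plur"}


def positional_decode(tag: str) -> Features:
    """First character is part of speech, second is number ('-' if unset)."""
    features = {"pos": POS_CHARS[tag[0]]}
    if tag[1] in NUM_CHARS:
        features["number"] = NUM_CHARS[tag[1]]
    return features


def positional_encode(features: Features) -> str:
    """Build the positional tag from the shared features."""
    pos_char = {v: k for k, v in POS_CHARS.items()}[features["pos"]]
    num_char = {v: k for k, v in NUM_CHARS.items()}.get(
        features.get("number", ""), "-"
    )
    return pos_char + num_char


# Registry of drivers: each entry is a (decode, encode) pair.
Driver = Tuple[Callable[[str], Features], Callable[[Features], str]]
DRIVERS: Dict[str, Driver] = {
    "penn-like": (penn_decode, penn_encode),
    "positional": (positional_decode, positional_encode),
}


def convert(tag: str, source: str, target: str) -> str:
    """Convert a tag by decoding with the source driver and encoding with the target."""
    decode, _ = DRIVERS[source]
    _, encode = DRIVERS[target]
    return encode(decode(tag))


print(convert("NNS", "penn-like", "positional"))
print(convert("A-", "positional", "penn-like"))
```

Adding a third tag set in this sketch means writing one more decode/encode pair and registering it; no pairwise conversion code is needed, which is the reusability the description refers to.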
This item is Publicly Available and licensed under: GNU General Public License, version 2
Files in this item
Name: interset-v1.2.zip
Size: 2.1 MB
Format: application/zip
Description: Zip
MD5: 6aefe932b29341941ffacd61797febe3
File Preview
  • interset
    • bin
      • print_trie.pl (2 kB)
      • csts-zh-conll-cs-pdt.pl (1 kB)
      • csts-bg-conll-cs-pdt.pl (746 B)
      • collect_tags_from_pmk.pl (978 B)
      • print_permitted_fs.pl (447 B)
      • csts-en-conll-cs-pdt.pl (747 B)
      • conll-da-conll-en-penn.pl (854 B)
      • driver-test.pl (16 kB)
      • conll-sv-conll-cs-pdt.pl (820 B)
      • list_cs_conll_tags.pl (1 kB)
      • csts-cs-pdt-en-penn.pl (559 B)
      • index_examples.pl (12 kB)
      • csts-ar-conll-cs-pdt.pl (746 B)
      • collect_tags_from_conll.pl (851 B)
      • conll-da-conll-cs-pdt.pl (848 B)
      • csts_convert_tags.pl (1 kB)
    • doc
      • COPYING.txt (34 kB)
      • wiki
        • doku
          • versions.txt (4 kB)
          • common-problems.txt (9 kB)
          • drivers.txt (19 kB)
          • license.txt (745 B)
          • features.txt (24 kB)
          • to-do.txt (8 kB)
          • pronouns.txt (8 kB)
          • download.txt (2 kB)
          • tagsets.txt (377 B)
          • tagsets
            • conll-2006-bg.txt (197 B)
            • conll-2006-sl.txt (838 B)
            • conll-2006-cs.txt (160 kB)
            • urdu.txt (2 kB)
          • verb-forms.txt (25 kB)
          • brainstorming.txt (30 kB)
          • references.txt (2 kB)
          • how-to-use.txt (5 kB)
          • how-to-write-a-driver.txt (18 kB)
      • papers
        • 2010-icgl-hongkong
          • submitted1.pdf (215 kB)
          • acl-ijcnlp2009.sty (14 kB)
          • paper.tex (25 kB)
          • submitted2-camera-ready.pdf (177 kB)
          • paper.bib (10 kB)
          • submitted0.pdf (193 kB)
          • paper1-8pages.tex (39 kB)
        • 2009-ufal-horni-misecky
          • Interset.ppt (543 kB)
          • Interset.pdf (237 kB)
          • Interset.odp (51 kB)
        • 2008-lrec-marrakech
          • tagdrivers-marrakech-poster.odp (183 kB)
          • tagdrivers-marrakech-styl-lrec.pdf (109 kB)
          • tagdrivers-marrakech-styl-lrec.rtf (314 kB)
          • tagdrivers-marrakech-poster.pdf (299 kB)
    • lib
      • tagset
        • zh
          • conll.pm (10 kB)
        • en
          • conll2009.pm (3 kB)
          • conll.pm (3 kB)
          • penn.pm (16 kB)
        • ar
          • conll.pm (19 kB)
          • conll2007.pm (22 kB)
        • pl
          • ipipan.pm (65 kB)
        • bg
          • conll.pm (59 kB)
        • common.pm (67 kB)
        • cs
          • conll2009.pm (3 kB)
          • multext.pm (56 kB)
          • conll.pm (165 kB)
          • pmkdl.pm (1 MB)
          • pmkkr.pm (21 kB)
          • pdt.pm (104 kB)
          • pmk.pm (141 kB)
        • de
          • conll2009.pm (24 kB)
          • conll.pm (2 kB)
          • stts.pm (18 kB)
        • README.txt (598 B)
        • sv
          • svdahybrid.pm (12 kB)
          • mamba.pm (10 kB)
          • conll.pm (1 kB)
          • hajic.pm (17 kB)
        • da
          • conll.pm (31 kB)
        • pt
          • conll.pm (40 kB)