This is a new version of the repository. Do let us know (lindat-help at ufal.mff.cuni.cz) if you encounter any issues.
 

Information extraction from EIA documents

Please use the following text to cite this item or export to a predefined format:
Lukšová, Ivana and Hladká, Barbora, 2015, Information extraction from EIA documents, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL), http://hdl.handle.net/11234/1-1515.
Date issued
2015-10-13
Language(s)
Description
Environmental impact assessment (EIA) is the formal process used to predict the environmental consequences of a plan. We present a rule-based extraction system to mine Czech EIA documents. The extraction rules work with a set of documents enriched with morphological information and manually created vocabularies of terms supposed to be extracted from the documents, e.g. basic information about the project (address, ID company, ...), data on the impacts and outcomes (waste substances, endangered species, ...), a final opinion. The documents Notice of Intent contains the section BI2 with the information on the scope (capacity) of the plan.
Acknowledgement
This item isPublicly Available
and licensed under:
 Files in this item
Name
intlib_eia_app.zip
Size
1.72 MB
Format
application/zip
Description
Zip
MD5
3bdefa5bb3cfa815886bba571095dcd6
Preview
  File Preview
  • intlib_eia_app
    • src
    • stanoviskoNatura.cmd224 B
    • configuration
      • eia_config.xml480 B
      • czsem_config.xml1 kB
    • resources
      • eia
        • feature_aggregator
          • subsections_aggr.txt150 B
          • main_sections_aggr.txt449 B
        • gazetteer
          • Oznameni_dictionary_cs.def404 B
          • Cislovky_slovne2.lst2 kB
          • Kategorizace.lst34 B
          • Prirodni_oblasti.lst517 kB
          • Odpady_kody.lst85 kB
          • Rozhodnuti.lst21 kB
          • Stavy.lst18 kB
          • Odpady.lst807 kB
          • typy_posNATURA.def57 B
          • odpad_nazvy.lst167 kB
          • typy_stanoviska.lst3 kB
          • Oznameni_headers.def47 B
          • BI2_entities_cs.def175 B
          • Skodlive_latky_cs.lst449 B
          • Terminy.lst14 kB
          • metrika_before.lst602 B
          • Casy.lst3 kB
          • Oznameni_headers_cs.def50 B
          • Kraje_CR.lst12 kB
          • Kraj_obec.lst1 MB
          • Skodlive_latky_zjednodusene.lst119 kB
          • typy_posNATURA.lst3 kB
          • entities.def49 B
          • Metrika.lst51 kB
          • Sousedni_zeme.lst4 kB
          • Kategorizace_zakonna_omezeni_varianty_kat.lst6 kB
          • Oznameni_headers.lst9 kB
          • Oznameni_dictionary.def385 B
          • BI2_entities.def292 B
          • Veliciny2.lst32 kB
          • Metrika2.lst278 B
          • Oznameni_headers_cs.lst371 B
          • Odpady_typy2.lst64 B
          • Pojmy.lst689 kB
          • Ohr+Chr_druhy.lst534 kB
          • Pojmy2.lst114 kB
          • typy_stanoviska.def52 B
          • entities.lst78 kB
          • Veliciny.lst83 kB
          • Chranene_druhy_synonyma.lst108 kB
          • Ohrozene_druhy.lst484 kB
          • Prirodni_oblasti_typy_cs.lst325 B
          • Skodlive_latky.lst95 kB
          • Prirodni_oblasti_typy.lst808 B
          • Pojmy_cs.lst153 B
          • Odpady_typy.lst160 B
        • linking
          • 140717
            • Pojem_velicina.csv84 kB
          • Velicina_metrika.csv37 kB
          • Pojem_velicina.csv141 kB
        • regexp
          • sentence.txt95 B
          • common_regexp.txt145 B
          • oznameni_regexp.txt252 B
          • number_regexp.txt1 kB
        • jape
          • attribute_entity.jape945 B
          • attribute_entity2.jape955 B
          • stav1.jape2 kB
          • number_unit.jape5 kB
          • attribute_number.jape1 kB
          • dict_linking.jape6 kB
          • entity_count.jape2 kB
          • stav2.jape1 kB
          • readme.txt441 B
          • add_id.jape743 B
          • add_id2.jape1 kB
          • entity_unit.jape2 kB
          • termin_attribute.jape655 B
    • stanoviskoNatura.sh304 B
    • oznameni.cmd260 B
    • generator
      • generator.pl9 kB
    • EIAdokumentace.pdf743 kB
    • lib
      • readme.txt68 B
    • stanovisko.sh286 B
    • EIAdokumentace.odt327 kB
    • oznameni.sh433 B
    • stanovisko.cmd205 B