Information extraction from EIA documents
Please use the following text to cite this item:
Lukšová, Ivana and Hladká, Barbora, 2015,
Information extraction from EIA documents, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11234/1-1515.
Authors
Lukšová, Ivana; Hladká, Barbora
Item identifier
http://hdl.handle.net/11234/1-1515
Date issued
2015-10-13
Type
Language(s)
Description
Environmental impact assessment (EIA) is the formal process used to predict the environmental consequences of a plan. We present a rule-based extraction system for mining Czech EIA documents. The extraction rules operate on documents enriched with morphological information and on manually created vocabularies of the terms to be extracted, e.g. basic information about the project (address, company ID, ...), data on impacts and outcomes (waste substances, endangered species, ...), and the final opinion. The Notice of Intent document contains section BI2, which gives information on the scope (capacity) of the plan.
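As a rough illustration of the vocabulary-driven part of such a system, the sketch below looks up terms from a small in-memory vocabulary in a piece of text and groups the hits by entity type. It is not the code shipped in the archive: the class name `GazetteerSketch`, the method `findTerms`, the entity types and the English sample terms are all hypothetical, and plain whole-word matching stands in for the morphological processing that the description mentions.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Pattern;

// Hypothetical sketch of vocabulary lookup; not the archive's implementation.
class GazetteerSketch {

    /** Returns, for each entity type, the vocabulary terms that occur in the text. */
    public static Map<String, List<String>> findTerms(
            String text, Map<String, List<String>> vocabulary) {
        Map<String, List<String>> found = new LinkedHashMap<>();
        for (Map.Entry<String, List<String>> entry : vocabulary.entrySet()) {
            List<String> hits = new ArrayList<>();
            for (String term : entry.getValue()) {
                // Whole-word, case-insensitive match. The lookarounds use Unicode
                // letter/digit classes so that accented Czech letters count as
                // part of a word.
                Pattern pattern = Pattern.compile(
                        "(?<![\\p{L}\\p{N}])" + Pattern.quote(term) + "(?![\\p{L}\\p{N}])",
                        Pattern.CASE_INSENSITIVE | Pattern.UNICODE_CASE);
                if (pattern.matcher(text).find()) {
                    hits.add(term);
                }
            }
            if (!hits.isEmpty()) {
                found.put(entry.getKey(), hits);
            }
        }
        return found;
    }

    public static void main(String[] args) {
        // Assumed sample vocabulary; the real term lists are far larger.
        Map<String, List<String>> vocabulary = new LinkedHashMap<>();
        vocabulary.put("waste", List.of("asbestos", "waste oil"));
        vocabulary.put("species", List.of("European otter"));

        String text = "Demolition will produce asbestos; the European otter occurs nearby.";
        System.out.println(findTerms(text, vocabulary));
    }
}
```

Exact string matching of this kind would miss inflected Czech word forms, which is presumably why the described system matches against morphologically enriched documents rather than raw text.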
Acknowledgement
Technologická agentura České republiky (Technology Agency of the Czech Republic)
Project code: TA02010182
Project name: Inteligentní knihovna - INTLIB (Intelligent Library)
Subject(s)
Collections
Files in this item
- Name: intlib_eia_app.zip
- Size: 1.72 MB
- Format: application/zip
- Description: Zip
- MD5: 3bdefa5bb3cfa815886bba571095dcd6

- intlib_eia_app
- src
- EIA_analysis
- pom.xml (2 kB)
- .project (772 B)
- .settings
- org.eclipse.jdt.core.prefs (243 B)
- org.eclipse.core.resources.prefs (191 B)
- .classpath (1 kB)
- src
- main
- java
- czsem
- gate
- cz
- intlib
- eia
- util
- EIAUtil.java (703 B)
- processing
- LinkEIAAnnotationPR.java (5 kB)
- EIANazevExtraction.java (3 kB)
- EIADictionariesFilter.java (11 kB)
- EIAHeadingsFeatureExtractor.java (2 kB)
- EIAHeadersFilter.java (4 kB)
- EIASectionBuilder.java (3 kB)
- EIAAnalysisConfig.java (1 kB)
- analysis
- SectionsDetection.java (6 kB)
- TreexAnalysis.java (1 kB)
- EntityDetection.java (6 kB)
- endtoend
- EIAMainClass.java (937 B)
- OznameniAnalysis.java (5 kB)
- StanoviskoNaturaAnalysis.java (2 kB)
- StanoviskoAnalysis.java (2 kB)
- io
- DataStoreImporter.java (1 kB)
- EIAXMLExporter.java (34 kB)
- util
- eia
- intlib
- resources
- eia_request_v1.3_template.xml (1 kB)
- eia_statement_v1.0_template.xml (624 B)
- java
- test
- main
- EIA_analysis
- stanoviskoNatura.cmd (224 B)
- configuration
- eia_config.xml (480 B)
- czsem_config.xml (1 kB)
- resources
- eia
- feature_aggregator
- subsections_aggr.txt (150 B)
- main_sections_aggr.txt (449 B)
- gazetteer
- Oznameni_dictionary_cs.def (404 B)
- Cislovky_slovne2.lst (2 kB)
- Kategorizace.lst (34 B)
- Prirodni_oblasti.lst (517 kB)
- Odpady_kody.lst (85 kB)
- Rozhodnuti.lst (21 kB)
- Stavy.lst (18 kB)
- Odpady.lst (807 kB)
- typy_posNATURA.def (57 B)
- odpad_nazvy.lst (167 kB)
- typy_stanoviska.lst (3 kB)
- Oznameni_headers.def (47 B)
- BI2_entities_cs.def (175 B)
- Skodlive_latky_cs.lst (449 B)
- Terminy.lst (14 kB)
- metrika_before.lst (602 B)
- Casy.lst (3 kB)
- Oznameni_headers_cs.def (50 B)
- Kraje_CR.lst (12 kB)
- Kraj_obec.lst (1 MB)
- Skodlive_latky_zjednodusene.lst (119 kB)
- typy_posNATURA.lst (3 kB)
- entities.def (49 B)
- Metrika.lst (51 kB)
- Sousedni_zeme.lst (4 kB)
- Kategorizace_zakonna_omezeni_varianty_kat.lst (6 kB)
- Oznameni_headers.lst (9 kB)
- Oznameni_dictionary.def (385 B)
- BI2_entities.def (292 B)
- Veliciny2.lst (32 kB)
- Metrika2.lst (278 B)
- Oznameni_headers_cs.lst (371 B)
- Odpady_typy2.lst (64 B)
- Pojmy.lst (689 kB)
- Ohr+Chr_druhy.lst (534 kB)
- Pojmy2.lst (114 kB)
- typy_stanoviska.def (52 B)
- entities.lst (78 kB)
- Veliciny.lst (83 kB)
- Chranene_druhy_synonyma.lst (108 kB)
- Ohrozene_druhy.lst (484 kB)
- Prirodni_oblasti_typy_cs.lst (325 B)
- Skodlive_latky.lst (95 kB)
- Prirodni_oblasti_typy.lst (808 B)
- Pojmy_cs.lst (153 B)
- Odpady_typy.lst (160 B)
- linking
- 140717
- Pojem_velicina.csv (84 kB)
- Velicina_metrika.csv (37 kB)
- Pojem_velicina.csv (141 kB)
- 140717
- regexp
- sentence.txt (95 B)
- common_regexp.txt (145 B)
- oznameni_regexp.txt (252 B)
- number_regexp.txt (1 kB)
- jape
- attribute_entity.jape (945 B)
- attribute_entity2.jape (955 B)
- stav1.jape (2 kB)
- number_unit.jape (5 kB)
- attribute_number.jape (1 kB)
- dict_linking.jape (6 kB)
- entity_count.jape (2 kB)
- stav2.jape (1 kB)
- readme.txt (441 B)
- add_id.jape (743 B)
- add_id2.jape (1 kB)
- entity_unit.jape (2 kB)
- termin_attribute.jape (655 B)
- feature_aggregator
- eia
- stanoviskoNatura.sh (304 B)
- oznameni.cmd (260 B)
- generator
- generator.pl (9 kB)
- EIAdokumentace.pdf (743 kB)
- lib
- readme.txt (68 B)
- stanovisko.sh (286 B)
- EIAdokumentace.odt (327 kB)
- oznameni.sh (433 B)
- stanovisko.cmd (205 B)
- src

