dc.contributor.author | Straka, Milan |
dc.contributor.author | Richter, Michal |
dc.date.accessioned | 2015-03-13T14:55:19Z |
dc.date.available | 2015-03-13T14:55:19Z |
dc.date.issued | 2015-03-13 |
dc.identifier.uri | http://hdl.handle.net/11234/1-1469 |
dc.description | Korektor is a statistical spell-checker and (occasionally) grammar-checker. It is released under the 2-Clause BSD license (http://opensource.org/licenses/BSD-2-Clause). Korektor started as Michal Richter's diploma thesis, "Advanced Czech Spellchecker" (https://redmine.ms.mff.cuni.cz/documents/1), and has been developed further since. There are two versions: a command-line utility (tested on Linux, Windows and OS X) and a REST service with a publicly available API (http://lindat.mff.cuni.cz/services/korektor/api-reference.php) and an HTML front end (https://lindat.mff.cuni.cz/services/korektor/). |
dc.language.iso | eng |
dc.publisher | Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.relation.replaces | http://hdl.handle.net/11858/00-097C-0000-000D-F67C-5 |
dc.rights | BSD 2-Clause "Simplified" or "FreeBSD" license |
dc.rights.uri | http://opensource.org/licenses/BSD-2-Clause |
dc.source.uri | http://ufal.mff.cuni.cz/korektor |
dc.subject | Korektor |
dc.subject | spellchecker |
dc.subject | spellchecking |
dc.subject | grammar checker |
dc.subject | diacritical marks generation |
dc.title | Korektor 2 |
dc.type | toolService |
metashare.ResourceInfo#ResourceComponentType#ToolServiceInfo.languageDependent | true |
metashare.ResourceInfo#ContentInfo.detailedType | tool |
dc.rights.label | PUB |
has.files | yes |
branding | LINDAT / CLARIAH-CZ |
demo.uri | http://lindat.mff.cuni.cz/services/korektor/ |
contact.person | Milan Straka straka@ufal.mff.cuni.cz Charles University in Prague, UFAL |
sponsor | Ministry of Education, Youth and Sports of the Czech Republic (Ministerstvo školství, mládeže a tělovýchovy České republiky); LM2010013; LINDAT/CLARIN: Institute for Analysis, Processing and Distribution of Linguistic Data; nationalFunds |
files.size | 5307632 |
files.count | 2 |
Files in this item
- Name
- korektor-2.0.0-bin.zip
- Size
- 5.03 MB
- Format
- application/zip
- Description
- Command-line binaries for Linux, Windows and OS X with documentation.
- MD5
- ce6d955b66cb5255a598b97b5331512e
- korektor-2.0.0-bin
- src
- tokenizer.cpp 2 kB
- persistent_structures
- bit_array.cpp 2 kB
- packed_array.h 2 kB
- comp_increasing_array.h 2 kB
- increasing_array.h 1 kB
- increasing_array.cpp 2 kB
- string_array.cpp 1 kB
- packed_array.cpp 5 kB
- value_mapping.cpp 6 kB
- mapped_double_array.cpp 1 kB
- string_array.h 2 kB
- bit_array.h 3 kB
- value_mapping.h 1 kB
- comp_increasing_array.cpp 2 kB
- mapped_double_array.h 1013 B
- version
- version.h 803 B
- version.cpp 1 kB
- .clang_complete 15 B
- common.h 833 B
- unilib
- utf8.cpp 2 kB
- utf16.cpp 1 kB
- unicode.cpp 177 kB
- CHANGES 898 B
- README 816 B
- version.h 793 B
- AUTHORS 82 B
- unistrip.h 1 kB
- version.cpp 706 B
- utf16.h 6 kB
- uninorms.h 1 kB
- uninorms.cpp 224 kB
- Makefile.include 534 B
- utf8.h 8 kB
- unicode.h 3 kB
- LICENSE 16 kB
- unistrip.cpp 35 kB
- decoder
- viterbi_state.h 2 kB
- viterbi_state.cpp 1 kB
- stage_possibility.cpp 1 kB
- decoder_multi_factor.cpp 5 kB
- stage_possibility.h 1 kB
- decoder_multi_factor.h 1 kB
- decoder_base.h 2 kB
- decoder_base.cpp 7 kB
- token
- tokenizer.cpp 2 kB
- token.cpp 1 kB
- output_format.h 1 kB
- input_format.h 1 kB
- tokenizer.h 1 kB
- token.h 946 B
- output_format.cpp 6 kB
- input_format.cpp 5 kB
- spellchecker
- constants.h 688 B
- spellchecker_correction.h 713 B
- spellchecker.cpp 8 kB
- configuration.cpp 7 kB
- constants.cpp 679 B
- spellchecker.h 1 kB
- configuration.h 2 kB
- rest_server
- microrestd
- libmicrohttpd
- platform_interface.h 10 kB
- response.cpp 14 kB
- connection.h 3 kB
- microhttpd.h 84 kB
- memorypool.h 3 kB
- MHD_config.h 3 kB
- response.h 1 kB
- platform.h 5 kB
- README 429 B
- tsearch.h 1 kB
- postprocessor.cpp 35 kB
- w32functions.cpp 19 kB
- autoinit_funcs.h 1 kB
- w32functions.h 5 kB
- reason_phrase.h 1 kB
- internal.h 33 kB
- COPYING 26 kB
- connection.cpp 97 kB
- memorypool.cpp 7 kB
- daemon.cpp 132 kB
- reason_phrase.cpp 3 kB
- internal.cpp 5 kB
- AUTHORS 2 kB
- CHANGES 82 B
- README 827 B
- AUTHORS 39 B
- rest_server
- xml_response_generator.cpp 794 B
- xml_builder.h 3 kB
- string_piece.h 1 kB
- response_generator.h 752 B
- rest_server.h 2 kB
- version.h 710 B
- version.cpp 623 B
- json_builder.h 3 kB
- json_response_generator.h 809 B
- rest_service.h 664 B
- rest_request.h 1 kB
- rest_server.cpp 23 kB
- json_builder.cpp 2 kB
- xml_response_generator.h 805 B
- xml_builder.cpp 1 kB
- json_response_generator.cpp 801 B
- pugixml.h 473 B
- Makefile.include 957 B
- pugixml
- pugiconfig.h 2 kB
- LICENSE 1 kB
- pugixml.cpp 158 kB
- pugixml.h 35 kB
- README 368 B
- AUTHORS 624 B
- LICENSE 16 kB
- microrestd.h 796 B
- libmicrohttpd
- rest_server.cpp 3 kB
- korektor_service.h 3 kB
- korektor_service.cpp 14 kB
- microrestd
- create
- create_error_model
- create_error_hierarchy.h 10 kB
- error_hierarchy.h 5 kB
- get_error_signature.h 3 kB
- estimate_error_model.h 3 kB
- create_error_model.cpp 6 kB
- create_lm_binary.cpp 2 kB
- create_morphology.cpp 21 kB
- create_error_model
- Makefile 2 kB
- .editorconfig 125 B
- korektor.cpp 5 kB
- lexicon
- sim_words_finder.h 1 kB
- sim_words_finder.cpp 5 kB
- lexicon.cpp 18 kB
- similar_words_map.h 602 B
- lexicon.h 3 kB
- error_model
- error_model_basic.h 5 kB
- error_model.h 2 kB
- error_model.cpp 662 B
- error_model_basic.cpp 14 kB
- language_model
- zip_lm_creation.cpp 9 kB
- lm_wrapper.h 1 kB
- zip_lm.h 3 kB
- ngram.cpp 1 kB
- zip_lm.cpp 4 kB
- lm_wrapper.cpp 1 kB
- ngram.h 1 kB
- Makefile.builtem 13 kB
- utils
- options.h 1 kB
- bits.h 866 B
- parse.cpp 1 kB
- utf.h 1 kB
- io.h 1 kB
- io.cpp 1 kB
- hash.h 641 B
- utf.cpp 3 kB
- options.cpp 2 kB
- parse.h 653 B
- morphology
- morphology.h 3 kB
- morphology.cpp 12 kB
- factor_list.h 691 B
- bin-win32
- create_error_model.exe 321 kB
- create_lm_binary.exe 302 kB
- create_morphology.exe 354 kB
- korektor.exe 451 kB
- tokenizer.exe 321 kB
- CHANGES 654 B
- MANUAL.pdf 158 kB
- README 1 kB
- MANUAL 25 kB
- AUTHORS 196 B
- bin-linux64
- create_error_model 518 kB
- korektor 671 kB
- create_lm_binary 496 kB
- create_morphology 552 kB
- tokenizer 545 kB
- INSTALL 2 kB
- LICENSE 1 kB
- bin-osx
- create_error_model 302 kB
- korektor 596 kB
- create_lm_binary 226 kB
- create_morphology 320 kB
- tokenizer 280 kB
- bin-linux32
- create_error_model 526 kB
- korektor 677 kB
- create_lm_binary 505 kB
- create_morphology 561 kB
- tokenizer 553 kB
- MANUAL.html 37 kB
- bin-win64
- create_error_model.exe 390 kB
- create_lm_binary.exe 368 kB
- create_morphology.exe 440 kB
- korektor.exe 539 kB
- tokenizer.exe 378 kB
- src
- Name
- MANUAL.html
- Size
- 37.38 KB
- Format
- HTML
- Description
- User manual
- MD5
- ce116a99e5753b169c2337a2ac502d26
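The MD5 values listed for the two files can be used to check that a download arrived intact. The following is a minimal sketch, assuming the files were saved locally under the names shown in this record; the helper names `md5_of_file` and `verify_download` are hypothetical and are not part of Korektor.

```python
import hashlib
from pathlib import Path

# MD5 checksums copied from the file listing in this record.
EXPECTED_MD5 = {
    "korektor-2.0.0-bin.zip": "ce6d955b66cb5255a598b97b5331512e",
    "MANUAL.html": "ce116a99e5753b169c2337a2ac502d26",
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hexadecimal MD5 digest of a file, read in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path, expected=EXPECTED_MD5):
    """Return True if the file's MD5 matches the value listed for its name.

    Hypothetical helper: raises KeyError for file names not in `expected`.
    """
    name = Path(path).name
    if name not in expected:
        raise KeyError(f"no checksum listed for {name!r}")
    return md5_of_file(path) == expected[name]


if __name__ == "__main__":
    # Assumes the files sit in the current directory under their listed names.
    for file_name in EXPECTED_MD5:
        if Path(file_name).exists():
            status = "OK" if verify_download(file_name) else "MISMATCH"
            print(f"{file_name}: {status}")
        else:
            print(f"{file_name}: not found")
```

MD5 is adequate here for detecting a corrupted or truncated download; it is not a defence against deliberate tampering.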