dc.contributor.author |
Marek, Michal |
dc.date.accessioned |
2011-06-28T09:40:25Z |
dc.date.available |
2009-11-02T09:48:39Z |
dc.date.issued |
2009-11-02T09:48:39Z |
dc.identifier.uri |
http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B |
dc.description |
Victor is a web page cleaning tool. It removes menus, ads, footers, headers, and similar boilerplate from HTML web pages so that only the main page content remains. Victor is based on a conditional random fields algorithm. |
dc.publisher |
Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL) |
dc.rights |
GNU General Public License, version 2 |
dc.rights.uri |
http://www.gnu.org/licenses/gpl-2.0.html |
dc.source.uri |
http://ufal.mff.cuni.cz/victor/ |
dc.subject |
html cleaning |
dc.title |
Victor |
dc.type |
toolService |
dc.rights.label |
PUB |
has.files |
yes |
branding |
LINDAT / CLARIAH-CZ |
files.size |
1877749 |
files.count |
1 |