Show simple item record

 
dc.contributor.author Marek, Michal
dc.date.accessioned 2011-06-28T09:40:25Z
dc.date.available 2009-11-02T09:48:39Z
dc.date.issued 2009-11-02T09:48:39Z
dc.identifier.uri http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B
dc.description Victor is a web page cleaning tool. It is aimed at removing menu, ads, footers, headers, etc. from HTML web pages, so that only main web page content remains. Victor is based on a conditional random fields algorithm.
dc.publisher Charles University, Faculty of Mathematics and Physics, Institute of Formal and Applied Linguistics (UFAL)
dc.rights GNU General Public License, version 2
dc.rights.uri http://www.gnu.org/licenses/gpl-2.0.html
dc.source.uri http://ufal.mff.cuni.cz/victor/
dc.subject html cleaning
dc.title Victor
dc.type toolService
dc.rights.label PUB
has.files yes
branding LINDAT / CLARIAH-CZ
files.size 1877749
files.count 1


 Files in this item

This item is
Publicly Available
and licensed under:
GNU General Public License, version 2
GNU General Public License, version 2.0
Icon
Name
victor-1.0-beta.tar.bz2
Size
1.79 MB
Format
application/x-bzip2
Description
Installation file (Linux, 32bits)
MD5
3cbeda259d5eefee2d5bd8fed1a531ee
 Download file

Show simple item record