Victor
Please use the following text to cite this item or export to a predefined format:
Marek, Michal, 2009,
Victor, LINDAT/CLARIAH-CZ digital library at the Institute of Formal and Applied Linguistics (ÚFAL),
http://hdl.handle.net/11858/00-097C-0000-0001-48FD-B.
Authors
Item identifier
Project URL
Date issued
2009-11-02T09:48:39Z
Type
Description
Victor is a web page cleaning tool. It is aimed at removing menu, ads, footers, headers, etc. from HTML web pages, so that only main web page content remains. Victor is based on a conditional random fields algorithm.
Subject(s)
Collections
Files in this item
- Name
- victor-1.0-beta.tar.bz2
- Size
- 1.79 MB
- Format
- application/x-bzip2
- Description
- bzip2 Archive
- MD5
- 3cbeda259d5eefee2d5bd8fed1a531ee

The file preview has not been generated yet. Please try again later or contact the system administrator lindat-help@ufal.mff.cuni.cz

