Files in this item
This item is
Creative Commons - Attribution 4.0 International (CC BY 4.0)
Publicly Available
and licensed under:Creative Commons - Attribution 4.0 International (CC BY 4.0)



- Name
- README
- Size
- 2.24 KB
- Format
- Unknown
- Description
- description
- MD5
- 438062a5dbd764121d6787637ad0c0f6

- Name
- 01 DH around us.pptx
- Size
- 1.4 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- c6e6ef7a2c53b03a5d5d13b0ba7c3f5a

- Name
- DH 1 - Text Analysis.ipynb
- Size
- 56.63 KB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- af17717a34fd852ed992fe4b34a0b3da

- Name
- 01 intro.pptx
- Size
- 15.46 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- c6017e3639c524f5dba19c5beaffc5ed
- Name
- 01 intro.mp4
- Size
- 219.65 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 66c465299ea1a6aa105634b049d50b8d

- Name
- DH 2 - Cloud Based OCR.ipynb
- Size
- 2.37 MB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- 47e5cd6e8f643bd41feba5a67c9cac9d

- Name
- 02 - digitization.pptx
- Size
- 3.56 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 39de288efcdaa77038f869b8a3eae012
- Name
- 02 - digitization.mp4
- Size
- 185.39 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 1fae3f636b93828433b39dd7e160cf9a

- Name
- 03 - using APIs.pptx
- Size
- 1.7 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 23b1031e5c8e62eb3fdddcf3bec37830

- Name
- DH 3 - Text Analysis Using APIs.ipynb
- Size
- 3.09 MB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- 323a62421d8eb3140204f7f1ef393dd8
- Name
- 03 - using APIs.mp4
- Size
- 167.46 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 1d50e62a2a57a9055b59776b06395727

- Name
- DH 4 - tasks.txt
- Size
- 330 bytes
- Format
- Text file
- Description
- practical: install software and try it
- MD5
- 2ef05c6ced82bd06edb9dd92fff23821
Install Apache Tika App from https://tika.apache.org/download.html You must have Java installed. Run the application: either by double clicking on it or from the command line: java -jar tika-app-version.jar Process a PDF file, a DOC(X) file, an Excel file. See the metadata, observe the results. Try files in different languages. . . .

- Name
- 04 - encoding.pptx
- Size
- 1.41 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 406552e0853602af835c81b6b1379430

- Name
- 04 - preprocessing.pptx
- Size
- 2.19 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 2b3fdba0fef5301bf558f24e75f3bc1d
- Name
- 04 - encoding.mp4
- Size
- 213.4 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 8f19850fb3b860c7ae33fcabb95b0710
- Name
- 04 - preprocessing.mp4
- Size
- 360.61 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 3622a7433effc87691a0c9569172dab1

- Name
- 05 - visualization.pptx
- Size
- 1.83 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- bcd99c33f447d294b5f9c4e6e52e0cf8

- Name
- DH 5 - Text Analysis and Visualization.ipynb
- Size
- 4.14 MB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- f276c44b98fa341a24d9ca31886acc69
- Name
- 05 - visualization.mp4
- Size
- 213.42 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 0c376b3c593259e5194a85036f4ee78d

- Name
- DH 6 - tasks.txt
- Size
- 217 bytes
- Format
- Text file
- Description
- practical: install software and try it
- MD5
- 25142ba80b916e58971539e72696b06a

- Name
- 06 - image processing.pptx
- Size
- 35.43 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 0137f77f4238f1260af5e61450638c3f
- Name
- 06 - image processing.mp4
- Size
- 317.11 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- d55b26cfbe9bfba9735b866d35b25d36

- Name
- 07 - word embeddings.pptx
- Size
- 1.77 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- dae092598b01dc7b625b741529ffe0b0

- Name
- DH 7 - Text Analysis using Word Embeddings.ipynb
- Size
- 3.79 MB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- 356407cea813f1091ab3e8248dd162c3
- Name
- 07 - word embeddings.mp4
- Size
- 408.83 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 0aee43f06d816db7277a4727aeb75714

- Name
- 08 - assessment of DH.pptx
- Size
- 1.41 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- ca683fd7f400bb5f9b8aca9b83edd1b7

- Name
- 08 - evaluation.pptx
- Size
- 1.74 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- 903a6df99dc5689f2d802fa42e7f4c53

- Name
- DH 8 Evaluation.ipynb
- Size
- 125.93 KB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- d5dd363d32fca65df2d9f68c712b7799
- Name
- 08 - assessment of DH.mp4
- Size
- 167.9 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- 5d090befcdd4587faba6b94fd050d3d2
- Name
- 08 - evaluation.mp4
- Size
- 310.86 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- db78a5f1fbdbd846ef21e25545dc96d7

- Name
- DH 9 - tasks.txt
- Size
- 206 bytes
- Format
- Text file
- Description
- practical: read specification, try software
- MD5
- 035f6c659e05898ce4a14e8215d1ebf3

- Name
- 09 - metadata.pptx
- Size
- 1.52 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- a4ea7d32d598135f14a0936d65f46de2
- Name
- 09 - metadata.mp4
- Size
- 166.72 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- eb6eb300d2bb96317d521e9dcd294cd4

- Name
- 10 - infrastructure.pptx
- Size
- 1.6 MB
- Format
- Microsoft PowerPoint 2007
- Description
- slides
- MD5
- aae995facc889d6b3fd4f1884de7bf63

- Name
- DH 10 Complete Analysis.ipynb
- Size
- 1.08 MB
- Format
- Unknown
- Description
- practical: Jupyter Notebook
- MD5
- a44daee12a35f371d1520df4600c3488
- Name
- 10 - infrastructure.mp4
- Size
- 144.52 MB
- Format
- MPEG-4 video
- Description
- video
- MD5
- a9287093b10d9b9be300a21e171f6305

- Name
- key.json
- Size
- 2.25 KB
- Format
- Unknown
- Description
- example Google API key (not functional)
- MD5
- 6879dfd7056f793bc0d57468e6886f1e

- Name
- maj.txt
- Size
- 30.04 KB
- Format
- Text file
- Description
- example text for processing
- MD5
- 416fcf0323c8fd68a93771c2870db8ab
1 Byl pozdní večer – první máj – večerní máj – byl lásky čas. Hrdliččin zval ku lásce hlas, kde borový zaváněl háj. O lásce šeptal tichý mech; květoucí strom lhal lásky žel, svou lásku slavík růži pěl, růžinu jevil vonný vzdech. Jezero hladké v křovích stinných zvučelo temně tajný bol, břeh je objímal kol a kol; a slunce jasná světů jiných bloudila blankytnými pásky, planoucí tam co slzy lásky. I světy jich v oblohu skvoucí co ve chrám věčné lásky vzešly; až se – milostí k sobě vroucí změnivše se v jiskry hasnoucí – bloudící co milenci sešly. Ouplné lůny krásná tvář – tak bledě jasná, jasně bledá, jak milence milenka hledá – ve růžovou vzplanula zář; na vodách obrazy své zřela a sama k sobě láskou mřela. Dál blyštil bledý dvorů stín, jenž k sobě šly vzdy blíž a blíž, jak v objetí by níž a níž se vinuly v soumraku klín, až posléze šerem v jedno splynou. S nimi se stromy k stromům vinou. – Nejzáze stíní šero hor, tam bříza k boru, k bříze bor se klon . . .