Install Apache Tika App from https://tika.apache.org/download.html You must have Java installed. Run the application: either by double clicking on it or from the command line: java -jar tika-app-version.jar Process a PDF file, a DOC(X) file, an Excel file. See the metadata, observe the results. Try files in different languages.