Looking for a Tika-as-a-service?
If you are interested in extracting text and metadata from documents, you have probably heard about Apache Tika. It is a great tool that can extract a bunch of information from a wide range of file formats, including PDF, Word, Excel, PowerPoint, HTML, XML, and many more. The problem is that it is a Java …