How to build next-gen search engine using Apache Tika (with python code)

Maciej Zalwert
8 min readJun 14, 2024

--

In this article, you will learn how to set up & run Apache Tika and use it in Python for semantic search.

The goal is to find relevant documents and specific parts of those documents regardless of the file format.

In the second article of this series, I will demonstrate how to utilize evidences with LLM model to…

--

--

Maciej Zalwert

Experienced in building data-intensive solutions for diverse industries