Blog
Tim Spann: Why I Joined Zilliz

Tim Spann: Why I Joined Zilliz

May 29, 20243 min read

Introduction

My name is Tim Spann and I work at Zilliz on developer advocacy for the amazing Open Source project, Milvus. Open Source and helping developers, engineers and cool projects has been my passion for a number of years covering things like Hadoop, Spark, Kafka, NiFi, Flink, Iceberg, Kudu, HBase, Hive and Spring.

My Medium posts: https://medium.com/@tspann

My YouTube Channel: https://www.youtube.com/@FLaNK-Stack

New Challenges

The last two years I have been working on the intersection of streaming and AI and this is where I first saw the importance of a database for AI that could store and query any type of data in any mode that is needed.

I have been working with generative AI, but I needed to be where the future is going and where the new data processing is happening. Unstructured data processing is needed now and I need to spread the word. This is the place. With Milvus, Towhee, Attu and integrations with Kafka and all the cool LlamaX frameworks, this is how to get it done. We need to build up a global group of unstructured data engineers and data superstars. I am so excited to continue this accelerated journey. I have been interested in machine learning, natural language processing and edge AI for nearly a decade.

More Than Vector Databases

Milvus alone is a powerful datastore and reason to want to work for Zilliz. This is just the start of a new paradigm shift for the next Generative AI-powered Data Revolution. The need for powerful, fast ways to do unstructured data processing and Vector ETL is already evident and growing. In the next few years, we will see a rise in unstructured data engineering and processing like we did with Spark, Flink and Kafka for structured and semistructured data.

The need to load logs, email, documents, slack messages, photos, images, videos, audio files and even more binary formats will transform industries. When I started with Big Data, we had to move a lot of JSON, CSV, XML, Relational Tables and structured data. We still have those files and we have them streaming, but we need our data available for similarity search and to be vectorized for fast access.

We will be building as many prompts as we build SQL statements. Many of these data formats will need to be used for the same applications. We can add JSON metadata along with our vectors for additional types of searching, while the lines between unstructured data and structured data becomes blurred as models and prompts require a federated view of data especially for live use cases.

I have already seen this for mass transit applications and this will move into all enterprise applications including IoT and fraud analytics.

The future has a lot more data, a huge need for unstructured data processing, a scalable open-source AI database that can handle the new data and an ever-increasing variety of AI Models.

It Takes a Team

I am very fortunate to have collaborated with a number of my coworkers before and was eager to work with them. I was also incredibly impressed with the everyone I spoke with before joining. This is an incredibly skilled, intelligent team with a deep background in what it takes to bring innovative technology to the mainstream. The future starts now, let’s dive in.

Community

Join me in the New York City area for meetups and other events.

I am also assisting many of the AI events in Princeton and work with StartupGrind Princeton and Trenton, Applied Generative AI and the NJ GAI Meetup.

Updated on Aug 03, 2026

Tim Spann
Tim Spann is a Principal Developer Advocate at Zilliz.

Keep Reading

Zilliz Skills Breakdown: How AI Agents Master Vector Databases

Zilliz's Milvus Skill (pymilvus, 7 files) and Zilliz Cloud Skill (zilliz-cli, 14 modules) bring vector-DB dev and ops into one Claude Code session.

The Great AI Agent Protocol Race: Function Calling vs. MCP vs. A2A

Compare Function Calling, MCP, and A2A protocols for AI agents. Learn which standard best fits your development needs and future-proof your applications.

What is the K-Nearest Neighbors (KNN) Algorithm in Machine Learning?

KNN is a supervised machine learning technique and algorithm for classification and regression. This post is the ultimate guide to KNN.