Amin Ahmad - CTO, Vectara - Algolia / Elasticsearch-like search product on neural search principles

Amin Ahmad - CTO, Vectara - Algolia / Elasticsearch-like search product on neural search principles

Vector Podcast · 2022-02-16

Update: ZIR.AI has relaunched as Vectara: https://vectara.com/

Topics:

00:00 Intro

00:54 Amin’s background at Google Research and affinity to NLP and vector search field

05:28 Main focus areas of ZIR.AI in neural search

07:26 Does the company offer neural network training to clients? Other support provided with ranking and document format conversions

08:51 Usage of open source vs developing own tech

10:17 The core of ZIR.AI product

14:36 API support, communication protocols and P95/P99 SLAs, dedicated pools of encoders

17:13 Speeding up single node / single customer throughput and challenge of productionizing off the shelf models, like BERT

23:01 Distilling transformer models and why it can be out of reach of smaller companies

25:07 Techniques for data augmentation from Amin’s and Dmitry’s practice (key search team: margin loss)

30:03 Vector search algorithms used in ZIR.AI and the need for boolean logic in company’s client base

33:51 Dynamics of open source in vector search space and cloud players: Google, Amazon, Microsoft

36:03 Implementing a multilingual search with BM25 vs neural search and impact on business

38:56 Is vector search a hype similar to big data few years ago? Prediction for vector search algorithms influence relations databases

43:09 Is there a need to combine BM25 with neural search? Ideas from Amin and features offered in ZIR.AI product

51:31 Increasing the robustness of search — or simply making it to work

55:10 How will Search Engineer profession change with neural search in the game?

Get a $100 discount (first month free) for a 50mb plan, using the code VectorPodcast (no lock-in, you can cancel any time): https://zir-ai.com/signup/user

Vector Podcast

Vector Podcast is here to bring you the depth and breadth of Search Engine Technology, Product, Marketing, Business. In the podcast we talk with engineers, entrepreneurs, thinkers and tinkerers, who put their soul into search. Depending on your interest, you should find a matching topic for you -- whether it is deep algorithmic aspect of search engines and information retrieval field, or examples of products offering deep tech to its users. "Vector" -- because it aims to cover an emerging field of vector similarity search, giving you the ability to search content beyond text: audio, video, images and more. "Vector" also because it is all about vector in your profession, product, marketing and business.

Podcast website: https://www.vectorpodcast.com/

Dmitry is blogging on https://dmitry-kan.medium.com/

Waar kun je luisteren?

Apple Podcasts Logo Spotify Logo Podtail Logo Google Podcasts Logo RSS

Afleveringen