Keep the queries short, no more than 3-5 words long. Focus on a single task and a single species, and include the species name in the query. This allows a more focused context to be presented to the LLM. Here are a few examples to get you started:
- What is Saigona baiseensis?
- Describe Saigona sinicola
- Describe Carvalhoma malcolmae
- Does Rhinolophus sinicus live in caves?
- Where does Laephotis botswanae roost?
- What cockroach species are there in India?
- What distinguishes Choanolaimus sparsiporus?
- What is the etymology of Gammanema lunatum?
- Are there any bats in Southern China?
- Describe Amnestus sinuosus
- What distinguishes Cynodon gibbus from other cofamilial genera?
About
Zai is the Zenodeo AI, providing Q&A access to the ~1M treatments extracted by Plazi and made available on TreatmentBank and the Biodiversity Literature Repository (BLR) on Zenodo.
Zai responds to simple, focused questions with easy-to-understand summaries or pointed answers. It cannot (yet) answer questions that span many documents, and is best at extracting answers from a single treatment that can be pinpointed with a full-text search.
Zai is also the Chinese verb 在 (zài) for describing existence in a location, similar to how we say in English, "to be at" or "to be in."
Check out more info on how this works.
Semantic Cache
We use the Supabase/gte-small model for generating vector embeddings of queries.
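gte-small produces 384-dimensional embedding vectors, and two queries are considered semantically close when the cosine similarity of their vectors is high. The measure itself is simple; here is a minimal sketch (the function name is ours, not part of the Zai codebase):

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two embedding vectors (1.0 = identical direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)
```

Identical vectors score 1.0, orthogonal (unrelated) vectors score 0.0; the semantic cache below compares scores against a preset threshold.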
1. The user asks a question, for example "what is the meaning of life?"
2. The query is converted to a unique cache key, and the cache is searched for an exact match on that key. If found, the cached answer (42, naturally) is returned to the user.
3. If no exact key matches, the query is converted to an embedding vector.
4. The cache is then searched for the stored query with the highest cosine similarity above a preset threshold. If one is found, its cached answer is returned.
5. If neither lookup succeeds, the question proceeds to the full pipeline described under "How this works".
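The two-stage lookup above can be sketched in a few lines. This is an illustrative reduction, not Zai's implementation: the class name, the SHA-256 cache key, and the 0.9 default threshold are all assumptions.

```python
import hashlib
import math


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(x * x for x in b)))


class SemanticCache:
    """Two-stage lookup: exact cache key first, embedding similarity second."""

    def __init__(self, embed, threshold: float = 0.9):
        self.embed = embed          # callable: query text -> embedding vector
        self.threshold = threshold  # preset cosine-similarity threshold
        self.by_key = {}            # exact cache key -> cached answer
        self.by_vector = []         # list of (embedding, cached answer)

    @staticmethod
    def key(query: str) -> str:
        # convert the query to a unique cache key
        return hashlib.sha256(query.strip().lower().encode()).hexdigest()

    def get(self, query: str):
        # 1. search for the exact key in the cache
        k = self.key(query)
        if k in self.by_key:
            return self.by_key[k]
        # 2. search for the stored query with the highest cosine
        #    similarity above the preset threshold
        vec = self.embed(query)
        best, best_sim = None, self.threshold
        for stored_vec, answer in self.by_vector:
            sim = cosine(vec, stored_vec)
            if sim >= best_sim:
                best, best_sim = answer, sim
        return best  # None means: fall through to the full pipeline

    def put(self, query: str, answer: str):
        self.by_key[self.key(query)] = answer
        self.by_vector.append((self.embed(query), answer))
```

A paraphrased query misses the exact key but can still hit via the embedding stage, which is what makes the cache "semantic" rather than a plain key-value store.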
How this works
1. The user asks a question, for example "what is the meaning of life?"
2. If the answer is already in the semantic cache, it is returned immediately.
3. If not, a vector search of the question against the vector db identifies related treatments.
4. A context is created from the fulltext of the top-ranked treatment.
5. The question and the context are sent together to the LLM.
6. The LLM generates a response, which is returned to the user and stored in the cache.
A vector search using usearch finds documents relevant to the question. The question is converted to a vector embedding, and similar embeddings are located (by cosine similarity) in a database of ~3.5M embeddings generated from the 1M+ treatments. The fulltext of the top-ranked treatments is used to build a context for the question.
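Stripped of the index machinery that usearch provides, the search step reduces to ranking stored embeddings by cosine similarity to the question's embedding. A brute-force sketch for illustration only; a real index over ~3.5M vectors would not do a linear scan:

```python
import math


def top_k_treatments(query_vec, treatments, k=3):
    """Rank (treatment_id, embedding) pairs by cosine similarity to the query.

    `treatments` is an illustrative in-memory list; usearch replaces this
    linear scan with an approximate nearest-neighbour index.
    """
    def cos(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        na = math.sqrt(sum(x * x for x in a))
        nb = math.sqrt(sum(x * x for x in b))
        return dot / (na * nb)

    ranked = sorted(treatments, key=lambda t: cos(query_vec, t[1]), reverse=True)
    return ranked[:k]
```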
The LLM uses the context to generate the answer. Generated answers are stored in the semantic cache for fast retrieval on subsequent queries.
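Building the context amounts to wrapping the question and the retrieved treatment fulltext in a prompt. A minimal sketch; the template wording here is an assumption, not Zai's actual prompt:

```python
def build_prompt(question: str, context: str) -> str:
    """Assemble a question-plus-context prompt for the LLM (illustrative template)."""
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )
```

Constraining the model to the supplied context is what keeps the answers grounded in a single, pinpointed treatment.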
We are testing various LLMs including Alibaba's Qwen 3:0.6b, Meta's Llama 3.2:1b and 3b, and Google's Gemma 3 to generate the answers.