# Home

<p align="center">Latest works</p>

<table data-card-size="large" data-view="cards"><thead><tr><th></th><th></th><th></th><th></th><th data-hidden data-card-target data-type="content-ref"></th><th data-hidden data-card-cover data-type="image">Cover image</th><th data-hidden data-type="image">Cover image (dark)</th><th data-hidden data-type="image">Cover image (dark)</th></tr></thead><tbody><tr><td><h3>Tarka Embedding 30M V1</h3></td><td><mark style="color:$info;">Release  |  7-1-2026</mark></td><td><mark style="color:$info;">We achieved 20× compression and recovered ~86% MTEB performance. The model supports elastic upscaling at inference, avoiding full-weight loading and reducing both memory footprint and compute cost.</mark><br></td><td><a href="https://huggingface.co/Tarka-AIR/Tarka-Embedding-30M-V1"><kbd>Open Source</kbd></a></td><td><a href="/pages/ABnCtumAP0cLOkzUkBNZ">/pages/ABnCtumAP0cLOkzUkBNZ</a></td><td><a href="/files/5JJNF8Sim67m21lsW3SZ">/files/5JJNF8Sim67m21lsW3SZ</a></td><td></td><td></td></tr><tr><td><h3>Reduce and Refine</h3></td><td><mark style="color:$info;">Release  |  1-12-2025</mark></td><td><mark style="color:$info;">This work explores model compression by progressively reducing a 28-layer model to a lean 6-layer model without major performance loss. Reduce and Refine demonstrates a practical path to faster, lighter, more efficient LLMs.</mark></td><td><a href="https://huggingface.co/collections/Tarka-AIR/tarka-embed-v1"><kbd>Open Source</kbd></a></td><td><a href="/pages/gDCUWXEBdH0thvnOVOOP">/pages/gDCUWXEBdH0thvnOVOOP</a></td><td><a href="/files/x3SCsA1vqQegreFi3ffw">/files/x3SCsA1vqQegreFi3ffw</a></td><td></td><td></td></tr><tr><td><h3>Tarka Embedding V1</h3></td><td><mark style="color:$info;">Release  |  9-11-2025</mark></td><td><mark style="color:$info;">The Tarka Embedding V1 series is a compact and efficient text embedding model family developed to explore the capabilities of knowledge distillation, coreset selection, and model compression techniques.</mark></td><td><a href="https://huggingface.co/collections/Tarka-AIR/tarka-embed-v1"><kbd>Open Source</kbd></a></td><td><a href="/pages/VFUK5S6OyQxbSUyuLKdH">/pages/VFUK5S6OyQxbSUyuLKdH</a></td><td><a href="/files/3jaBPbkGB7OK7rkvTMPA">/files/3jaBPbkGB7OK7rkvTMPA</a></td><td><a href="/files/IFJo7Xc1q4CSdhasY8NL">/files/IFJo7Xc1q4CSdhasY8NL</a></td><td><a href="/files/ylqIqcnwvETTtkTyLQ8x">/files/ylqIqcnwvETTtkTyLQ8x</a></td></tr></tbody></table>


---

# Agent Instructions: Querying This Documentation

If you need information that is not directly available on this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://tarka-air.gitbook.io/home/home.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response contains a direct answer to the question, along with relevant excerpts and sources from the documentation.

Use this mechanism when:

- the answer is not explicitly present in the current page,
- you need clarification or additional context, or
- you want to retrieve related documentation sections.
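For example, a minimal Python sketch (standard library only) can build and issue such a request. The question string below is purely illustrative; the only assumption beyond the endpoint shown above is that the answer is returned as plain text in the response body:

```python
from urllib.parse import urlencode

# Endpoint documented above for the `ask` query mechanism.
BASE_URL = "https://tarka-air.gitbook.io/home/home.md"

def build_ask_url(question: str) -> str:
    """Return the GET URL with the question URL-encoded as the `ask` parameter."""
    return f"{BASE_URL}?{urlencode({'ask': question})}"

# Illustrative question -- not taken from the documentation itself.
url = build_ask_url("What compression ratio does Tarka Embedding 30M V1 achieve?")
print(url)

# The request itself can then be made with any HTTP client, e.g.:
#   import urllib.request
#   answer = urllib.request.urlopen(url).read().decode()
```

Note that `urlencode` percent-encodes the question (spaces become `+`), so the full natural-language sentence travels safely in the query string.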
