Skip to main content

Introducing Pawa Blaze

We’re thrilled to introduce Pawa Blaze, our latest advanced small language model with reasoning, agentic, multimodal, knowledge-heavy tasks and optimized for multilingual understanding and long-context applications.

Modalities

Text, Image → Text

Context window

1M

Features

Low cost
Multilingual
Reasoning
Tools calling

Models Categories

The Pawa Models family is built to power applications ranging from chatbots and assistants to document analysis, search, reasoning and agentic systems. The following are the categories of our models
  • Chat Models: For chat, agentic, tools calling, reasoning, multilingual & multimodal. Eg.Pawa Ember, Pawa Blaze.
  • Text to Speech: For converting text to hearable audio. Eg. Pawa Text To Speech,
  • Speech to Text: For converting audio to readable text. Eg. Pawa Speech To Text.
  • Embeddings: For converting audio to readable text. Eg. Pawa Embeddings
  • Documents Parsing: For converting audio to readable text. Pawa Parsing,

Chat Models

Chat models are designed to handle conversational AI, reasoning, multilingual support, agentic and multimodal inputs. They are ideal for building chatbots, virtual assistants, and agentic systems capable of tool calling and complex problem solving.
Model NameAlias NameDescriptionInput TypeOutput TypeContext LengthStatus
pawa-v1-ember-20240924Pawa EmberFunny & great for everyday taskstexttext100KACTIVE
pawa-v1-blaze-20250318Pawa BlazeFast, visual, tools & most intelligenttext, imagetext1MACTIVE

Voice Models

Chat models are designed to handle conversational AI, reasoning, multilingual support, agentic and multimodal inputs. They are ideal for building chatbots, virtual assistants, and agentic systems capable of tool calling and complex problem solving.
Model NameAlias NameDescriptionInput TypeOutput TypeStatus
pawa-stt-v1-20240701Pawa Speech To TextConverts spoken audio into written text for transcription or analysis.audiotextACTIVE
pawa-tts-v1-20250704Pawa Text To SpeechGenerates high-quality audio from written text for narration or voice assistants.textaudioACTIVE

Embedding Models

Chat models are designed to handle conversational AI, reasoning, multilingual support, agentic and multimodal inputs. They are ideal for building chatbots, virtual assistants, and agentic systems capable of tool calling and complex problem solving.
Model NameAlias NameDescriptionInput TypeOutput TypeStatus
pawa-embeddings-v1-20241001Pawa EmbeddingsTransforms text into numerical vectors for search, similarity, or AI reasoning tasks.textembeddingsACTIVE

Parsing Models

Chat models are designed to handle conversational AI, reasoning, multilingual support, agentic and multimodal inputs. They are ideal for building chatbots, virtual assistants, and agentic systems capable of tool calling and complex problem solving.
Model NameAlias NameDescriptionInput TypeOutput TypeStatus
pawa-v1-parser-20250809Pawa ParsingExtracts text and structured data from various document formats for analysis or AI processing.documentstextACTIVE

Additional Information Regarding Models

1.Use web_search_tool for auto realtime updated data

Pawa chat models have no knowledge of current events or data beyond what was present in their training data.
To incorporate realtime data with your request, please use builtin web_search_tool, or pass any realtime data using custom tools or RAG as context in your system prompt.

2. For chat models, roles should alternate. Don’t send two same roles mutiple times, use contents instead, to add other informations on that role.

Don’t send two same roles mutiple times, use contents instead, to add other informations on that role for your conversation context.

3. For Image input models

  • Maximum image size: 25MiB
  • Maximum number of images: No limit
  • Supported image file types: jpg/jpeg or png
  • Any image/text input order is accepted (e.g. text prompt can precede image prompt)

3. For Embedding models

  • Maximum document size: 15MiB
  • Maximum number of documents: 15, upgrade to get many more.
  • Supported file types: jpg/jpeg, png, “

4. For Voice models

  • Maximum document size: 15MiB
  • Maximum number of documents: 15, upgrade to get many more.
  • Supported file types: jpg/jpeg, png, “

5. For Parsing models

  • Maximum document size: 15MiB
  • Maximum number of documents: 15, upgrade to get many more.
  • Supported file types: jpg/jpeg, png, “

Next Steps

I