AQ-5B
A foundation model by Zyora Labs
AQ-5B is Zyora Labs' own foundation model — a decoder-only Transformer trained from scratch in pure PyTorch to understand every educational stream, from Physics and Maths to Botany, Tamil and Engineering. It is the base that will power the next generation of ZyoraByte and Zyora AI.
- Type: Foundation model (decoder-only, pre-trained from scratch)
- Parameters: 5 billion
- Architecture: Decoder-only Transformer with RoPE, RMSNorm, SwiGLU and GQA
- Implementation: Pure PyTorch (hand-coded, not HuggingFace transformers)
- Tokenizer: Custom BPE, 32,000-token vocabulary
- Languages: English and Tamil
- Domain: All educational streams (base to advanced concepts)
- Developer: Zyora Labs
What is AQ-5B?
AQ-5B is Zyora Labs' own foundation model — built from scratch, not fine-tuned from someone else's weights. It is a decoder-only Transformer designed to understand every educational stream, from Physics and Maths to Botany, Zoology, Tamil, Engineering and Arts & Science.
Rather than memorising a specific syllabus, AQ-5B is trained on broad conceptual knowledge, from base to advanced, so that it forms a durable foundation for the next generation of ZyoraByte and Zyora AI products.
A modern decoder, hand-built
AQ-5B is a decoder-only Transformer (RoPE, RMSNorm, SwiGLU and GQA) implemented in pure PyTorch: the layers are hand-coded in torch.nn rather than imported from HuggingFace transformers. The config is kept HF-compatible so that the model can later be loaded for inference and shared.
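- Rotary position embeddings (RoPE), which encode token positions as rotations of the query and key vectors, for long-range context.
- RMSNorm, which rescales activations by their root mean square without subtracting the mean, for stable, efficient normalisation.
- SwiGLU feed-forward blocks, a gated variant of the standard Transformer feed-forward layer.
- Grouped-query attention (GQA), in which several query heads share each key/value head, for efficient inference.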
- Custom BPE tokenizer with a 32,000-token vocabulary.
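To make the building blocks above concrete, here is a minimal sketch of RMSNorm, SwiGLU and RoPE written directly in torch.nn. It is an illustration under stated assumptions, not AQ-5B's actual source: the class and function names, the `eps` value, the RoPE `base` of 10000 and the "rotate half" channel pairing are all choices made for this example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class RMSNorm(nn.Module):
    """Rescale each vector by its root mean square (no mean subtraction)."""

    def __init__(self, dim: int, eps: float = 1e-6):  # eps is an assumed default
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))  # learned per-channel gain

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        inv_rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return x * inv_rms * self.weight


class SwiGLU(nn.Module):
    """Gated feed-forward block: down(silu(gate(x)) * up(x))."""

    def __init__(self, dim: int, hidden_dim: int):
        super().__init__()
        self.gate_proj = nn.Linear(dim, hidden_dim, bias=False)
        self.up_proj = nn.Linear(dim, hidden_dim, bias=False)
        self.down_proj = nn.Linear(hidden_dim, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.down_proj(F.silu(self.gate_proj(x)) * self.up_proj(x))


def apply_rope(x: torch.Tensor, base: float = 10000.0) -> torch.Tensor:
    """Rotate channel pairs of x by a position-dependent angle.

    x has shape (batch, heads, seq_len, head_dim) with an even head_dim.
    Channel i is paired with channel i + head_dim // 2 ("rotate half" layout).
    """
    seq_len, head_dim = x.shape[-2], x.shape[-1]
    half = head_dim // 2
    # Frequencies fall geometrically from 1 down towards 1 / base.
    inv_freq = base ** (-torch.arange(half, dtype=torch.float32) / half)
    positions = torch.arange(seq_len, dtype=torch.float32)
    angles = positions[:, None] * inv_freq[None, :]  # (seq_len, half)
    cos, sin = angles.cos(), angles.sin()
    x1, x2 = x[..., :half], x[..., half:]
    return torch.cat([x1 * cos - x2 * sin, x1 * sin + x2 * cos], dim=-1)
```

In a decoder layer, RMSNorm would typically run before the attention and feed-forward sub-blocks, and `apply_rope` would be applied to the query and key tensors before the attention scores are computed. Because RoPE is a pure rotation, it leaves vector norms unchanged and leaves position 0 untouched.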
Bilingual, every stream
AQ-5B is bilingual in English and Tamil and treats all educational streams as first-class — the sciences, mathematics, languages, engineering and the arts. It is trained on conceptual knowledge from foundational to advanced, giving downstream tutors a broad, reliable base to reason from.
Scope
AQ-5B is a foundation model — a base intended for further training and alignment rather than a finished consumer product. It is built to give downstream tutors like ZyoraByte a broad, reliable base to reason from across every educational stream.
Frequently asked questions
What is AQ-5B?
AQ-5B is Zyora Labs' own foundation model — a decoder-only Transformer trained from scratch to understand every educational stream. It is the base model for the next generation of ZyoraByte and Zyora AI products.
What architecture does AQ-5B use?
AQ-5B is a decoder-only Transformer using RoPE, RMSNorm, SwiGLU and grouped-query attention (GQA), hand-coded in pure PyTorch rather than HuggingFace transformers.
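As an illustration of the grouped-query attention mentioned in this answer, the sketch below shows a causal self-attention layer in which several query heads share each key/value head. It is a hedged example rather than AQ-5B's implementation: the class name, the projection names and the head counts in the usage lines are hypothetical, and RoPE is left out to keep the block short.

```python
import math

import torch
import torch.nn as nn


class GroupedQueryAttention(nn.Module):
    """Causal self-attention with fewer key/value heads than query heads."""

    def __init__(self, dim: int, n_heads: int, n_kv_heads: int):
        super().__init__()
        assert dim % n_heads == 0 and n_heads % n_kv_heads == 0
        self.n_heads = n_heads
        self.n_kv_heads = n_kv_heads
        self.head_dim = dim // n_heads
        self.q_proj = nn.Linear(dim, n_heads * self.head_dim, bias=False)
        # K and V are projected to n_kv_heads only, which shrinks the KV cache.
        self.k_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.v_proj = nn.Linear(dim, n_kv_heads * self.head_dim, bias=False)
        self.o_proj = nn.Linear(dim, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        b, t, _ = x.shape
        q = self.q_proj(x).view(b, t, self.n_heads, self.head_dim).transpose(1, 2)
        k = self.k_proj(x).view(b, t, self.n_kv_heads, self.head_dim).transpose(1, 2)
        v = self.v_proj(x).view(b, t, self.n_kv_heads, self.head_dim).transpose(1, 2)

        # Each K/V head serves a group of n_heads // n_kv_heads query heads.
        groups = self.n_heads // self.n_kv_heads
        k = k.repeat_interleave(groups, dim=1)
        v = v.repeat_interleave(groups, dim=1)

        scores = q @ k.transpose(-2, -1) / math.sqrt(self.head_dim)
        # Causal mask: a token may not attend to later positions.
        future = torch.triu(
            torch.ones(t, t, dtype=torch.bool, device=x.device), diagonal=1
        )
        scores = scores.masked_fill(future, float("-inf"))
        out = torch.softmax(scores, dim=-1) @ v
        return self.o_proj(out.transpose(1, 2).reshape(b, t, -1))


# Hypothetical sizes, chosen only so the example runs quickly.
attn = GroupedQueryAttention(dim=32, n_heads=4, n_kv_heads=2)
out = attn(torch.randn(1, 6, 32))  # output keeps the input shape (1, 6, 32)
```

Setting `n_kv_heads` equal to `n_heads` recovers ordinary multi-head attention, and setting it to 1 gives multi-query attention. GQA sits between the two.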
How large is AQ-5B?
AQ-5B is a 5-billion-parameter foundation model built from scratch by Zyora Labs.
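For a sense of how a parameter count like this arises from a decoder's dimensions, the sketch below adds up the weights of a bias-free decoder-only Transformer with GQA, SwiGLU and RMSNorm. The function name and every dimension in the usage line are hypothetical: AQ-5B's hidden size, layer count, head counts and feed-forward width are not stated here, so the numbers are illustrative only.

```python
def count_decoder_params(
    vocab_size: int,
    dim: int,
    n_layers: int,
    n_heads: int,
    n_kv_heads: int,
    ffn_dim: int,
    tie_embeddings: bool = False,
) -> int:
    """Count weights in a bias-free decoder with GQA, SwiGLU and RMSNorm."""
    head_dim = dim // n_heads
    attn = dim * dim                          # q_proj
    attn += 2 * dim * n_kv_heads * head_dim   # k_proj and v_proj (GQA-reduced)
    attn += dim * dim                         # o_proj
    ffn = 3 * dim * ffn_dim                   # gate_proj, up_proj, down_proj
    norms = 2 * dim                           # two RMSNorm gains per layer
    per_layer = attn + ffn + norms

    embed = vocab_size * dim
    lm_head = 0 if tie_embeddings else vocab_size * dim
    final_norm = dim
    return embed + n_layers * per_layer + final_norm + lm_head


# Hypothetical configuration, NOT AQ-5B's published dimensions.
total = count_decoder_params(
    vocab_size=32_000, dim=4096, n_layers=28,
    n_heads=32, n_kv_heads=8, ffn_dim=11_008,
)
print(f"{total / 1e9:.2f}B parameters")
```

With these assumed dimensions the total comes to roughly 5.2 billion, most of it in the feed-forward blocks. The 32,000-token vocabulary contributes only about 0.26 billion across the input embedding and the output head.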
Which languages and subjects does AQ-5B cover?
AQ-5B is bilingual in English and Tamil and is trained on broad conceptual knowledge across all educational streams: Physics, Chemistry, Botany, Zoology, Maths, English, Tamil, Engineering, and Arts & Science.
Who is building AQ-5B?
AQ-5B is built by Zyora Labs, led by Vasanth (Chief Researcher) with Adithi S (Junior AI Engineer).