๐Ÿค– 11+ Models Available

All models,
one interface

Powered by NVIDIA NIM infrastructure. Switch models mid-conversation with a single click.

Flagship Live
GLM-5.1
GLM ยท v5.1-0520 ยท 130B params
Best-in-class reasoning with native chain-of-thought. Excels at math, coding, analysis, and complex problem-solving. Flagship model for Axecodi.
Context
128K
Speed
~60 tok/s
Thinking
โœ“ CoT
reasoningcodemathmultilingual
Use GLM-5.1 โ†’
DeepSeek Live
DeepSeek R1
DeepSeek ยท R1-671B ยท MoE
Advanced reasoning model with reinforcement learning. Matches o1-level performance on math and science tasks. Full chain-of-thought output.
Context
64K
Speed
~45 tok/s
Thinking
โœ“ CoT
reasoningmathscienceopen-source
Use DeepSeek R1 โ†’
Meta Live
Llama 3.3
Meta ยท Llama-3.3-70B-Instruct
Meta's most powerful open-source LLM. Exceptional instruction following, coding, and multilingual support. Great for general tasks.
Context
128K
Speed
~90 tok/s
Params
70B
generalcodemultilingualopen-source
Use Llama 3.3 โ†’
Mistral Live
Mixtral 8x7B
Mistral ยท Mixtral-8x7B-Instruct-v0.1
Mixture-of-Experts architecture with 8 expert networks. High throughput and efficiency. Excellent for rapid iteration and real-time applications.
Context
32K
Speed
~130 tok/s
Type
MoE
fastMoEopen-source
Use Mixtral 8x7B โ†’
Google Live
Gemma 2
Google ยท Gemma-2-9B-IT
Google's lightweight but capable open model. Punches above its weight class for summarization, Q&A, and chat. Ideal for lightweight tasks.
Context
8K
Speed
~160 tok/s
Params
9B
lightweightfastopen-source
Use Gemma 2 โ†’
Alibaba Live
Qwen 2.5
Alibaba ยท Qwen2.5-72B-Instruct
Alibaba's flagship open model with strong multilingual capabilities. Excellent for Chinese/English tasks, coding, and complex instruction following.
Context
128K
Speed
~75 tok/s
Params
72B
multilingualcodeopen-source
Use Qwen 2.5 โ†’
Meta Live
CodeLlama 70B
Meta ยท CodeLlama-70B-Instruct
Specialized code generation model. Trained on 500B tokens of code. Supports 80+ programming languages with fill-in-the-middle capability.
Context
100K
Speed
~80 tok/s
Focus
Code
codefill-in-middle80+ langs
Use CodeLlama 70B โ†’
Mistral Live
Mistral 7B
Mistral ยท Mistral-7B-Instruct-v0.3
Ultra-fast, lightweight model for quick queries and prototyping. Surprisingly capable at general tasks despite its compact size.
Context
32K
Speed
~200 tok/s
Params
7B
ultra-fastlightweightopen-source
Use Mistral 7B โ†’

Model comparison

Model Context Window Speed Reasoning Code Open Source
GLM-5.1 128K Fast โœ“ CoT โœ“ โœ—
DeepSeek R1 64K Moderate โœ“ CoT โœ“ โœ“
Llama 3.3 70B 128K Fast โœ“ โœ“ โœ“
Mixtral 8x7B 32K Very Fast Partial โœ“ โœ“
Gemma 2 9B 8K Ultra Fast โœ— Partial โœ“
Qwen 2.5 72B 128K Fast โœ“ โœ“ โœ“