API هوش مصنوعی Kaya AI چیست؟

Kaya AI یک درگاه API هوش مصنوعی یکپارچه است که به توسعهدهندگان ایرانی امکان دسترسی به مدلهای برتر مثل GPT-4o، Claude 3.5 Sonnet، Gemini Pro و دهها مدل دیگر را از طریق یک API واحد و سازگار با فرمت OpenAI میدهد.

آیا API شما با کتابخانه OpenAI سازگار است؟

بله، API ما کاملاً با فرمت OpenAI سازگار است. کافیست base URL را به آدرس Kaya AI تغییر دهید.

نحوه پرداخت و قیمتگذاری چگونه است؟

Kaya AI از سیستم اعتباری استفاده میکند. شما اعتبار میخرید با پرداخت ریالی از طریق درگاه زیبال و فقط به ازای توکنهای مصرفشده هزینه پرداخت میکنید.

آیا به VPN نیاز دارم؟

خیر. Kaya AI مستقیماً از ایران قابل دسترسی است. سرورهای ما درخواستها را به مدلهای هوش مصنوعی ارسال میکنند و نیازی به VPN نیست.

کاتالوگ مدل‌ها

تمام مدل‌های فعال Kaya AI — GPT-4o، Claude، Gemini و صدها مدل دیگر. فیلتر بر اساس ارائه‌دهنده، نوع استفاده و قیمت.

340 مدل فعال|دریافت کلید API ←

340 از 340 مدل

نوع استفاده

مرتب‌سازی

AI21: Jamba Large 1.7

ai21/jamba-large-1.7

AI21

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction-following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...

گفتگو256K context

ورودی / ۱M توکن

۳۴۰٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۳۶۰٬۰۰۰ تومان

AionLabs: Aion-1.0

aion-labs/aion-1.0

Aion-labs

Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...

گفتگو131K context

ورودی / ۱M توکن

۶۸۰٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۳۶۰٬۰۰۰ تومان

AionLabs: Aion-1.0-Mini

aion-labs/aion-1.0-mini

Aion-labs

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, coding, and logic. It is a modified variant...

گفتگو131K context

ورودی / ۱M توکن

۱۱۹٬۰۰۰ تومان

خروجی / ۱M توکن

۲۳۸٬۰۰۰ تومان

AionLabs: Aion-2.0

aion-labs/aion-2.0

Aion-labs

Aion-2.0 is a variant of DeepSeek V3.2 optimized for immersive roleplaying and storytelling. It is particularly strong at introducing tension, crises, and conflict into stories, making narratives feel more engaging....

گفتگو131K context

ورودی / ۱M توکن

۱۳۶٬۰۰۰ تومان

خروجی / ۱M توکن

۲۷۲٬۰۰۰ تومان

AionLabs: Aion-RP 1.0 (8B)

aion-labs/aion-rp-llama-3.1-8b

Aion-labs

Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...

گفتگو33K context

ورودی / ۱M توکن

۱۳۶٬۰۰۰ تومان

خروجی / ۱M توکن

۲۷۲٬۰۰۰ تومان

AllenAI: Olmo 3 32B Think

allenai/olmo-3-32b-think

Allenai

Olmo 3 32B Think is a large-scale, 32-billion-parameter model purpose-built for deep reasoning, complex logic chains and advanced instruction-following scenarios. Its capacity enables strong performance on demanding evaluation tasks and...

گفتگو66K context

ورودی / ۱M توکن

۲۵٬۵۰۰ تومان

خروجی / ۱M توکن

۸۵٬۰۰۰ تومان

Amazon: Nova 2 Lite

amazon/nova-2-lite-v1

Amazon

Nova 2 Lite is a fast, cost-effective reasoning model for everyday workloads that can process text, images, and videos to generate text. Nova 2 Lite demonstrates standout capabilities in processing...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱٬۰۰۰ تومان

خروجی / ۱M توکن

۴۲۵٬۰۰۰ تومان

Amazon: Nova Lite 1.0

amazon/nova-lite-v1

Amazon

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

گفتگوبینایی300K context

ورودی / ۱M توکن

۱۰٬۲۰۰ تومان

خروجی / ۱M توکن

۴۰٬۸۰۰ تومان

Amazon: Nova Micro 1.0

amazon/nova-micro-v1

Amazon

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

گفتگو128K context

ورودی / ۱M توکن

۵٬۹۵۰ تومان

خروجی / ۱M توکن

۲۳٬۸۰۰ تومان

Amazon: Nova Premier 1.0

amazon/nova-premier-v1

Amazon

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۱۲۵٬۰۰۰ تومان

Amazon: Nova Pro 1.0

amazon/nova-pro-v1

Amazon

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

گفتگوبینایی300K context

ورودی / ۱M توکن

۱۳۶٬۰۰۰ تومان

خروجی / ۱M توکن

۵۴۴٬۰۰۰ تومان

Anthropic Claude Haiku Latest

~anthropic/claude-haiku-latest

~anthropic

This model always redirects to the latest model in the Anthropic Claude Haiku family.

گفتگوبینایی200K context

ورودی / ۱M توکن

۱۷۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸۵۰٬۰۰۰ تومان

Anthropic Claude Sonnet Latest

~anthropic/claude-sonnet-latest

~anthropic

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

Anthropic: Claude 3 Haiku

anthropic/claude-3-haiku

Anthropic

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

گفتگوبینایی200K context

ورودی / ۱M توکن

۴۲٬۵۰۰ تومان

خروجی / ۱M توکن

۲۱۲٬۵۰۰ تومان

Anthropic: Claude Fable 5

anthropic/claude-fable-5

Anthropic

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸٬۵۰۰٬۰۰۰ تومان

Anthropic: Claude Fable Latest

~anthropic/claude-fable-latest

~anthropic

This model always redirects to the latest model in the Claude Fable family.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸٬۵۰۰٬۰۰۰ تومان

Anthropic: Claude Haiku 4.5

anthropic/claude-haiku-4.5

Anthropic

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

گفتگوبینایی200K context

ورودی / ۱M توکن

۱۷۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4

anthropic/claude-opus-4

Anthropic

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

گفتگوبینایی200K context

ورودی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۱۲٬۷۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.1

anthropic/claude-opus-4.1

Anthropic

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

گفتگوبینایی200K context

ورودی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۱۲٬۷۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.5

anthropic/claude-opus-4.5

Anthropic

Claude Opus 4.5 is Anthropic’s frontier reasoning model optimized for complex software engineering, agentic workflows, and long-horizon computer use. It offers strong multimodal capabilities, competitive performance across real-world coding and...

گفتگوبینایی200K context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۴٬۲۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.6

anthropic/claude-opus-4.6

Anthropic

Opus 4.6 is Anthropic’s strongest model for coding and long-running professional tasks. It is built for agents that operate across entire workflows rather than single prompts, making it especially effective...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۴٬۲۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.6 (Fast)

anthropic/claude-opus-4.6-fast

Anthropic

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵٬۱۰۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲۵٬۵۰۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.7

anthropic/claude-opus-4.7

Anthropic

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۴٬۲۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.7 (Fast)

anthropic/claude-opus-4.7-fast

Anthropic

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵٬۱۰۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲۵٬۵۰۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.8

anthropic/claude-opus-4.8

Anthropic

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۴٬۲۵۰٬۰۰۰ تومان

Anthropic: Claude Opus 4.8 (Fast)

anthropic/claude-opus-4.8-fast

Anthropic

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸٬۵۰۰٬۰۰۰ تومان

Anthropic: Claude Opus Latest

~anthropic/claude-opus-latest

~anthropic

This model always redirects to the latest model in the Claude Opus family.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۴٬۲۵۰٬۰۰۰ تومان

Anthropic: Claude Sonnet 4

anthropic/claude-sonnet-4

Anthropic

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

Anthropic: Claude Sonnet 4.5

anthropic/claude-sonnet-4.5

Anthropic

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

Anthropic: Claude Sonnet 4.6

anthropic/claude-sonnet-4.6

Anthropic

Sonnet 4.6 is Anthropic's most capable Sonnet-class model yet, with frontier performance across coding, agents, and professional work. It excels at iterative development, complex codebase navigation, end-to-end project management with...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

Arcee AI: Coder Large

arcee-ai/coder-large

Arcee-ai

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...

گفتگو33K context

ورودی / ۱M توکن

۸۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱۳۶٬۰۰۰ تومان

Arcee AI: Trinity Large Thinking

arcee-ai/trinity-large-thinking

Arcee-ai

Trinity Large Thinking is a powerful open source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...

گفتگو262K context

ورودی / ۱M توکن

۴۲٬۵۰۰ تومان

خروجی / ۱M توکن

۱۳۶٬۰۰۰ تومان

Arcee AI: Trinity Mini

arcee-ai/trinity-mini

Arcee-ai

Trinity Mini is a 26B-parameter (3B active) sparse mixture-of-experts language model featuring 128 experts with 8 active per token. Engineered for efficient reasoning over long contexts (131k) with robust function...

گفتگو131K context

ورودی / ۱M توکن

۷٬۶۵۰ تومان

خروجی / ۱M توکن

۲۵٬۵۰۰ تومان

Arcee AI: Virtuoso Large

arcee-ai/virtuoso-large

Arcee-ai

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

گفتگو131K context

ورودی / ۱M توکن

۱۲۷٬۵۰۰ تومان

خروجی / ۱M توکن

۲۰۴٬۰۰۰ تومان

Auto Router

openrouter/auto

Openrouter

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

رایگانگفتگوبیناییتولید تصویر2.0M context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Baidu: ERNIE 4.5 VL 424B A47B

baidu/ernie-4.5-vl-424b-a47b

Baidu

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...

گفتگوبینایی131K context

ورودی / ۱M توکن

۷۱٬۴۰۰ تومان

خروجی / ۱M توکن

۲۱۲٬۵۰۰ تومان

Body Builder (beta)

openrouter/bodybuilder

Openrouter

Transform your natural language requests into structured OpenRouter API request objects. Describe what you want to accomplish with AI models, and Body Builder will construct the appropriate API calls. Example:...

رایگانگفتگو128K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

ByteDance Seed: Seed 1.6

bytedance-seed/seed-1.6

Bytedance-seed

Seed 1.6 is a general-purpose model released by the ByteDance Seed team. It incorporates multimodal capabilities and adaptive deep thinking with a 256K context window.

گفتگوبینایی262K context

ورودی / ۱M توکن

۴۲٬۵۰۰ تومان

خروجی / ۱M توکن

۳۴۰٬۰۰۰ تومان

ByteDance Seed: Seed 1.6 Flash

bytedance-seed/seed-1.6-flash

Bytedance-seed

Seed 1.6 Flash is an ultra-fast multimodal deep thinking model by ByteDance Seed, supporting both text and visual understanding. It features a 256k context window and can generate outputs of...

گفتگوبینایی262K context

ورودی / ۱M توکن

۱۲٬۷۵۰ تومان

خروجی / ۱M توکن

۵۱٬۰۰۰ تومان

ByteDance Seed: Seed-2.0-Lite

bytedance-seed/seed-2.0-lite

Bytedance-seed

Seed-2.0-Lite is a versatile, cost‑efficient enterprise workhorse that delivers strong multimodal and agent capabilities while offering noticeably lower latency, making it a practical default choice for most production workloads across...

گفتگوبینایی262K context

ورودی / ۱M توکن

۴۲٬۵۰۰ تومان

خروجی / ۱M توکن

۳۴۰٬۰۰۰ تومان

ByteDance Seed: Seed-2.0-Mini

bytedance-seed/seed-2.0-mini

Bytedance-seed

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...

گفتگوبینایی262K context

ورودی / ۱M توکن

۱۷٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

ByteDance: UI-TARS 7B

bytedance/ui-tars-1.5-7b

Bytedance

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

گفتگوبینایی128K context

ورودی / ۱M توکن

۱۷٬۰۰۰ تومان

خروجی / ۱M توکن

۳۴٬۰۰۰ تومان

Cohere: Command A

cohere/command-a

Cohere

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

گفتگو256K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

Cohere: Command R (08-2024)

cohere/command-r-08-2024

Cohere

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

گفتگو128K context

ورودی / ۱M توکن

۲۵٬۵۰۰ تومان

خروجی / ۱M توکن

۱۰۲٬۰۰۰ تومان

Cohere: Command R+ (08-2024)

cohere/command-r-plus-08-2024

Cohere

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...

گفتگو128K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

Cohere: Command R7B (12-2024)

cohere/command-r7b-12-2024

Cohere

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, tool use, agents, and similar tasks requiring complex reasoning...

گفتگو128K context

ورودی / ۱M توکن

۶٬۳۷۵ تومان

خروجی / ۱M توکن

۲۵٬۵۰۰ تومان

Cohere: North Mini Code (free)

cohere/north-mini-code:free

Cohere

North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...

رایگانگفتگو256K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Deep Cogito: Cogito v2.1 671B

deepcogito/cogito-v2.1-671b

Deepcogito

Cogito v2.1 671B MoE represents one of the strongest open models globally, matching performance of frontier closed and open models. This model is trained using self play with reinforcement learning...

گفتگو128K context

ورودی / ۱M توکن

۲۱۲٬۵۰۰ تومان

خروجی / ۱M توکن

۲۱۲٬۵۰۰ تومان

DeepSeek: DeepSeek V3

deepseek/deepseek-chat

DeepSeek

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

گفتگو131K context

ورودی / ۱M توکن

۳۴٬۰۳۴ تومان

خروجی / ۱M توکن

۱۳۶٬۰۱۷ تومان

DeepSeek: DeepSeek V3 0324

deepseek/deepseek-chat-v3-0324

DeepSeek

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

گفتگو164K context

ورودی / ۱M توکن

۳۴٬۰۰۰ تومان

خروجی / ۱M توکن

۱۳۰٬۹۰۰ تومان

DeepSeek: DeepSeek V3.1

deepseek/deepseek-chat-v3.1

DeepSeek

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

گفتگو164K context

ورودی / ۱M توکن

۳۵٬۷۰۰ تومان

خروجی / ۱M توکن

۱۳۴٬۳۰۰ تومان

DeepSeek: DeepSeek V3.1 Terminus

deepseek/deepseek-v3.1-terminus

DeepSeek

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

گفتگو164K context

ورودی / ۱M توکن

۴۵٬۹۰۰ تومان

خروجی / ۱M توکن

۱۶۱٬۵۰۰ تومان

DeepSeek: DeepSeek V3.2

deepseek/deepseek-v3.2

DeepSeek

DeepSeek-V3.2 is a large language model designed to harmonize high computational efficiency with strong reasoning and agentic tool-use performance. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

گفتگو131K context

ورودی / ۱M توکن

۳۸٬۸۹۶ تومان

خروجی / ۱M توکن

۵۸٬۳۴۴ تومان

DeepSeek: DeepSeek V3.2 Exp

deepseek/deepseek-v3.2-exp

DeepSeek

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

گفتگو164K context

ورودی / ۱M توکن

۴۵٬۹۰۰ تومان

خروجی / ۱M توکن

۶۹٬۷۰۰ تومان

DeepSeek: DeepSeek V4 Flash

deepseek/deepseek-v4-flash

DeepSeek

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

گفتگو1.0M context

ورودی / ۱M توکن

۱۵٬۳۰۰ تومان

خروجی / ۱M توکن

۳۰٬۶۰۰ تومان

DeepSeek: DeepSeek V4 Pro

deepseek/deepseek-v4-pro

DeepSeek

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

گفتگو1.0M context

ورودی / ۱M توکن

۷۳٬۹۵۰ تومان

خروجی / ۱M توکن

۱۴۷٬۹۰۰ تومان

DeepSeek: R1

deepseek/deepseek-r1

DeepSeek

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

گفتگو164K context

ورودی / ۱M توکن

۱۱۹٬۰۰۰ تومان

خروجی / ۱M توکن

۴۲۵٬۰۰۰ تومان

DeepSeek: R1 0528

deepseek/deepseek-r1-0528

DeepSeek

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

گفتگو164K context

ورودی / ۱M توکن

۸۵٬۰۰۰ تومان

خروجی / ۱M توکن

۳۶۵٬۵۰۰ تومان

DeepSeek: R1 Distill Llama 70B

deepseek/deepseek-r1-distill-llama-70b

DeepSeek

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

گفتگو128K context

ورودی / ۱M توکن

۱۳۶٬۰۰۰ تومان

خروجی / ۱M توکن

۱۳۶٬۰۰۰ تومان

EssentialAI: Rnj 1 Instruct

essentialai/rnj-1-instruct

Essentialai

Rnj-1 is an 8B-parameter, dense, open-weight model family developed by Essential AI and trained from scratch with a focus on programming, math, and scientific reasoning. The model demonstrates strong performance...

گفتگو33K context

ورودی / ۱M توکن

۲۵٬۵۰۰ تومان

خروجی / ۱M توکن

۲۵٬۵۰۰ تومان

Free Models Router

openrouter/free

Openrouter

The simplest way to get free inference. openrouter/free is a router that selects free models at random from the models available on OpenRouter. The router smartly filters for models that...

رایگانگفتگوبینایی200K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Google Gemini Flash Latest

~google/gemini-flash-latest

~google

This model always redirects to the latest model in the Google Gemini Flash family.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۲۵۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۵۳۰٬۰۰۰ تومان

Google Gemini Pro Latest

~google/gemini-pro-latest

~google

This model always redirects to the latest model in the Google Gemini Pro family.

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۳۴۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۰۴۰٬۰۰۰ تومان

Google: Gemini 2.5 Flash

google/gemini-2.5-flash

Google

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۵۱٬۰۰۰ تومان

خروجی / ۱M توکن

۴۲۵٬۰۰۰ تومان

Google: Gemini 2.5 Flash Lite

google/gemini-2.5-flash-lite

Google

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۱۷٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

Google: Gemini 2.5 Flash Lite Preview 09-2025

google/gemini-2.5-flash-lite-preview-09-2025

Google

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۱۷٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

Google: Gemini 2.5 Pro

google/gemini-2.5-pro

Google

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۲۱۲٬۵۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

Google: Gemini 2.5 Pro Preview 05-06

google/gemini-2.5-pro-preview-05-06

Google

گفتگوبینایی1.0M context

ورودی / ۱M توکن

۲۱۲٬۵۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

google/gemma-3-27b-it

Google

گفتگوبینایی131K context

ورودی / ۱M توکن

۱۳٬۶۰۰ تومان

خروجی / ۱M توکن

۲۷٬۲۰۰ تومان

google/gemma-4-26b-a4b-it:free

Google

رایگانگفتگوبینایی262K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Google: Gemma 4 31B

google/gemma-4-31b-it

Google

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

گفتگوبینایی262K context

ورودی / ۱M توکن

۲۰٬۴۰۰ تومان

خروجی / ۱M توکن

۵۹٬۵۰۰ تومان

google/gemini-3-pro-image

Google

گفتگوبیناییتولید تصویر66K context

ورودی / ۱M توکن

۳۴۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۰۴۰٬۰۰۰ تومان

IBM: Granite 4.0 Micro

ibm-granite/granite-4.0-h-micro

Ibm-granite

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...

گفتگو131K context

ورودی / ۱M توکن

۲٬۸۹۰ تومان

خروجی / ۱M توکن

۱۹٬۰۴۰ تومان

IBM: Granite 4.1 8B

ibm-granite/granite-4.1-8b

Ibm-granite

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

گفتگو131K context

ورودی / ۱M توکن

۸٬۵۰۰ تومان

خروجی / ۱M توکن

۱۷٬۰۰۰ تومان

Inception: Mercury 2

inception/mercury-2

Inception

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

گفتگو128K context

ورودی / ۱M توکن

۴۲٬۵۰۰ تومان

خروجی / ۱M توکن

۱۲۷٬۵۰۰ تومان

inclusionAI: Ling-2.6-1T

inclusionai/ling-2.6-1t

Inclusionai

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...

گفتگو262K context

ورودی / ۱M توکن

۱۲٬۷۵۰ تومان

خروجی / ۱M توکن

۱۰۶٬۲۵۰ تومان

inclusionAI: Ling-2.6-flash

inclusionai/ling-2.6-flash

Inclusionai

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

گفتگو262K context

ورودی / ۱M توکن

۱٬۷۰۰ تومان

خروجی / ۱M توکن

۵٬۱۰۰ تومان

inclusionAI: Ring-2.6-1T

inclusionai/ring-2.6-1t

Inclusionai

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...

گفتگو262K context

ورودی / ۱M توکن

۱۲٬۷۵۰ تومان

خروجی / ۱M توکن

۱۰۶٬۲۵۰ تومان

Inflection: Inflection 3 Pi

inflection/inflection-3-pi

Inflection

Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like customer support and roleplay. Pi...

گفتگو8K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

Inflection: Inflection 3 Productivity

inflection/inflection-3-productivity

Inflection

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional...

گفتگو8K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

Kwaipilot: KAT-Coder-Pro V2

kwaipilot/kat-coder-pro-v2

Kwaipilot

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...

گفتگو256K context

ورودی / ۱M توکن

۵۱٬۰۰۰ تومان

خروجی / ۱M توکن

۲۰۴٬۰۰۰ تومان

LiquidAI: LFM2-24B-A2B

liquid/lfm-2-24b-a2b

Liquid

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...

گفتگو128K context

ورودی / ۱M توکن

۵٬۱۰۰ تومان

خروجی / ۱M توکن

۲۰٬۴۰۰ تومان

LiquidAI: LFM2.5-1.2B-Instruct (free)

liquid/lfm-2.5-1.2b-instruct:free

Liquid

LFM2.5-1.2B-Instruct is a compact, high-performance instruction-tuned model built for fast on-device AI. It delivers strong chat quality in a 1.2B parameter footprint, with efficient edge inference and broad runtime support.

رایگانگفتگو33K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

LiquidAI: LFM2.5-1.2B-Thinking (free)

liquid/lfm-2.5-1.2b-thinking:free

Liquid

LFM2.5-1.2B-Thinking is a lightweight reasoning-focused model optimized for agentic tasks, data extraction, and RAG—while still running comfortably on edge devices. It supports long context (up to 32K tokens) and is...

رایگانگفتگو33K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Magnum v4 72B

anthracite-org/magnum-v4-72b

Anthracite-org

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

گفتگو33K context

ورودی / ۱M توکن

۵۱۰٬۰۰۰ تومان

خروجی / ۱M توکن

۸۵۰٬۰۰۰ تومان

Mancer: Weaver (alpha)

mancer/weaver

Mancer

An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

گفتگو8K context

ورودی / ۱M توکن

۱۲۷٬۵۰۰ تومان

خروجی / ۱M توکن

۱۷۰٬۰۰۰ تومان

Meta: Llama 3 8B Instruct

meta-llama/llama-3-8b-instruct

Meta

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

گفتگو8K context

ورودی / ۱M توکن

۲۳٬۸۰۰ تومان

خروجی / ۱M توکن

۲۳٬۸۰۰ تومان

Meta: Llama 3.1 70B Instruct

meta-llama/llama-3.1-70b-instruct

Meta

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...

گفتگو131K context

ورودی / ۱M توکن

۶۸٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

Meta: Llama 3.1 8B Instruct

meta-llama/llama-3.1-8b-instruct

Meta

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

گفتگو131K context

ورودی / ۱M توکن

۳٬۴۰۰ تومان

خروجی / ۱M توکن

۵٬۱۰۰ تومان

Meta: Llama 3.2 11B Vision Instruct

meta-llama/llama-3.2-11b-vision-instruct

Meta

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

گفتگوبینایی131K context

ورودی / ۱M توکن

۵۸٬۶۵۰ تومان

خروجی / ۱M توکن

۵۸٬۶۵۰ تومان

Meta: Llama 3.2 1B Instruct

meta-llama/llama-3.2-1b-instruct

Meta

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

گفتگو131K context

ورودی / ۱M توکن

۴٬۵۹۰ تومان

خروجی / ۱M توکن

۳۴٬۱۷۰ تومان

Meta: Llama 3.2 3B Instruct

meta-llama/llama-3.2-3b-instruct

Meta

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

گفتگو131K context

ورودی / ۱M توکن

۸٬۶۵۳ تومان

خروجی / ۱M توکن

۵۶٬۹۵۰ تومان

nousresearch/hermes-3-llama-3.1-405b:free

Nousresearch

رایگانگفتگو131K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Nous: Hermes 3 70B Instruct

nousresearch/hermes-3-llama-3.1-70b

Nousresearch

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

گفتگو131K context

ورودی / ۱M توکن

۱۱۹٬۰۰۰ تومان

خروجی / ۱M توکن

۱۱۹٬۰۰۰ تومان

Nous: Hermes 4 405B

nousresearch/hermes-4-405b

Nousresearch

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

گفتگو131K context

ورودی / ۱M توکن

۱۷۰٬۰۰۰ تومان

خروجی / ۱M توکن

۵۱۰٬۰۰۰ تومان

Nous: Hermes 4 70B

nousresearch/hermes-4-70b

Nousresearch

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

گفتگو131K context

ورودی / ۱M توکن

۲۲٬۱۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

NVIDIA: Llama 3.3 Nemotron Super 49B V1.5

nvidia/llama-3.3-nemotron-super-49b-v1.5

NVIDIA

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

گفتگو131K context

ورودی / ۱M توکن

۶۸٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸٬۰۰۰ تومان

NVIDIA: Nemotron 3 Nano 30B A3B

nvidia/nemotron-3-nano-30b-a3b

NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

گفتگو262K context

ورودی / ۱M توکن

۸٬۵۰۰ تومان

خروجی / ۱M توکن

۳۴٬۰۰۰ تومان

NVIDIA: Nemotron 3 Nano 30B A3B (free)

nvidia/nemotron-3-nano-30b-a3b:free

NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B is a small language MoE model with highest compute efficiency and accuracy for developers to build specialized agentic AI systems. The model is fully...

رایگانگفتگو256K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

NVIDIA: Nemotron 3 Nano Omni (free)

nvidia/nemotron-3-nano-omni-30b-a3b-reasoning:free

NVIDIA

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

رایگانگفتگوبینایی256K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

NVIDIA: Nemotron 3 Super

nvidia/nemotron-3-super-120b-a12b

NVIDIA

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

گفتگو1.0M context

ورودی / ۱M توکن

۱۵٬۳۰۰ تومان

خروجی / ۱M توکن

۷۶٬۵۰۰ تومان

NVIDIA: Nemotron 3 Super (free)

nvidia/nemotron-3-super-120b-a12b:free

NVIDIA

رایگانگفتگو1.0M context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

NVIDIA: Nemotron 3 Ultra

nvidia/nemotron-3-ultra-550b-a55b

NVIDIA

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

گفتگو1.0M context

ورودی / ۱M توکن

۸۵٬۰۰۰ تومان

خروجی / ۱M توکن

۳۷۴٬۰۰۰ تومان

openai/gpt-4o-2024-05-13

OpenAI

گفتگوبینایی128K context

ورودی / ۱M توکن

۸۵۰٬۰۰۰ تومان

خروجی / ۱M توکن

۲٬۵۵۰٬۰۰۰ تومان

OpenAI: GPT-4o (2024-08-06)

openai/gpt-4o-2024-08-06

OpenAI

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

گفتگوبینایی128K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

OpenAI: GPT-4o (2024-11-20)

openai/gpt-4o-2024-11-20

OpenAI

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...

گفتگوبینایی128K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

OpenAI: GPT-4o Search Preview

openai/gpt-4o-search-preview

OpenAI

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

گفتگو128K context

ورودی / ۱M توکن

۴۲۵٬۰۰۰ تومان

خروجی / ۱M توکن

۱٬۷۰۰٬۰۰۰ تومان

OpenAI: GPT-4o-mini

openai/gpt-4o-mini

OpenAI

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

گفتگوبینایی128K context

ورودی / ۱M توکن

۲۵٬۵۰۰ تومان

خروجی / ۱M توکن

۱۰۲٬۰۰۰ تومان

openai/gpt-oss-120b:free

OpenAI

رایگانگفتگو131K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

OpenAI: gpt-oss-20b

openai/gpt-oss-20b

OpenAI

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

گفتگو131K context

ورودی / ۱M توکن

۴٬۹۳۰ تومان

خروجی / ۱M توکن

۲۳٬۸۰۰ تومان

poolside/laguna-m.1:free

Poolside

رایگانگفتگو262K context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Poolside: Laguna XS.2

poolside/laguna-xs.2

Poolside

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

گفتگو262K context

ورودی / ۱M توکن

۱۷٬۰۰۰ تومان

خروجی / ۱M توکن

۳۴٬۰۰۰ تومان

qwen/qwen3-coder:free

Qwen

رایگانگفتگو1.0M context

ورودی / ۱M توکن

رایگان

خروجی / ۱M توکن

رایگان

Qwen: Qwen3 Coder Flash

qwen/qwen3-coder-flash

Qwen

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

گفتگو1.0M context

ورودی / ۱M توکن

۳۳٬۱۵۰ تومان

خروجی / ۱M توکن

۱۶۵٬۷۵۰ تومان

Qwen: Qwen3 Coder Next

qwen/qwen3-coder-next

Qwen

Qwen3-Coder-Next is an open-weight causal language model optimized for coding agents and local development workflows. It uses a sparse MoE design with 80B total parameters and only 3B activated per...

گفتگو262K context

ورودی / ۱M توکن

۱۸٬۷۰۰ تومان

خروجی / ۱M توکن

۱۳۶٬۰۰۰ تومان

Qwen: Qwen3 Coder Plus

qwen/qwen3-coder-plus

Qwen

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

گفتگو1.0M context

ورودی / ۱M توکن

۱۱۰٬۵۰۰ تومان

خروجی / ۱M توکن

۵۵۲٬۵۰۰ تومان

Qwen: Qwen3 Max

qwen/qwen3-max

Qwen

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 version. It...

گفتگو262K context

ورودی / ۱M توکن

۱۳۲٬۶۰۰ تومان

خروجی / ۱M توکن

۶۶۳٬۰۰۰ تومان

Qwen: Qwen3 Max Thinking

qwen/qwen3-max-thinking

Qwen

Qwen3-Max-Thinking is the flagship reasoning model in the Qwen3 series, designed for high-stakes cognitive tasks that require deep, multi-step reasoning. By significantly scaling model capacity and reinforcement learning compute, it...

گفتگو262K context

ورودی / ۱M توکن

۱۳۲٬۶۰۰ تومان

خروجی / ۱M توکن

۶۶۳٬۰۰۰ تومان

Qwen: Qwen3 Next 80B A3B Instruct

qwen/qwen3-next-80b-a3b-instruct

Qwen

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

گفتگو262K context

ورودی / ۱M توکن

۱۵٬۳۰۰ تومان

خروجی / ۱M توکن

۱۸۷٬۰۰۰ تومان

z-ai/glm-5v-turbo

Z-ai

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

گفتگوبینایی203K context

ورودی / ۱M توکن

۲۰۴٬۰۰۰ تومان

خروجی / ۱M توکن

۶۸۰٬۰۰۰ تومان