How to Use the Hy3 Preview API for Free ?

Call Tencent's new Hy3 Preview (295B MoE, 256K context) for free via OpenRouter. Step-by-step setup, reasoning modes, benchmarks, limits, and a free API path with Apidog.

Ashley Innocent

Ashley Innocent

24 April 2026

How to Use the Hy3 Preview API for Free ?

Apidog for Enterprise

On-Premises Deploy

SSO & RBAC

SOC 2 Compliant

Explore Apidog Enterprise

Tencent open-sourced Hy3 Preview on April 22, 2026, and within a day OpenRouter listed it as a fully free endpoint. No credit card, no token metering, no trial window. You can call the same 295B-parameter Mixture-of-Experts model that powers Tencent’s Yuanbao app and CodeBuddy assistant from your own code, today, for zero dollars.

This guide shows how to use the Hy3 Preview API for free through OpenRouter, the Hugging Face Space, and the raw Hy3 repo. It also covers the reasoning modes that make Hy3 different from most 2026 open models, and how to test the API inside Apidog without writing throwaway scripts.

button

If you want the fastest route to your first response, jump to “Step-by-step: call Hy3 Preview free on OpenRouter.”

TL;DR

What is Hy3 Preview?

Hy3 Preview is the first flagship release from Tencent’s restructured Hunyuan foundation-model team, now led by Yao Shunyu, a former OpenAI researcher the company hired to push its reasoning stack. Let's frame it as Tencent’s most capable model yet and a direct answer to the top Chinese open-weights releases from DeepSeek, Alibaba, and Zhipu.

The technical profile from the official model card is agent-first:

What sets it apart from a generic 200B-range MoE is the agentic training. Tencent rebuilt its RL infrastructure for multi-turn tool use, and the published scores on SWE-bench Verified, Terminal-Bench 2.0, and the internal WildClawBench suite land it close to the top closed models on code and shell tasks.

Three free ways to use Hy3 Preview

You have three paths depending on whether you want a chat UI, an API, or local weights.

Path What it is Free? Good for
OpenRouter tencent/hy3-preview:free Hosted OpenAI-compatible API Yes, $0 in/out Building agents, scripts, and backend features
Hugging Face Space Browser chat demo Yes Quick prompts, kicking the tires, smoke tests
Self-hosted weights (vLLM / SGLang) Run the open weights on your own GPUs Free software, hardware cost applies Privacy-sensitive workloads, high volume

Most developers will want the OpenRouter route. It is the shortest path from signup to a working API call, and the rate limits on the free tier are generous enough for prototyping.

Step-by-step: call Hy3 Preview free on OpenRouter

Here is the minimal path from zero to a working tencent/hy3-preview:free response.

  1. Create an OpenRouter account. Sign up at openrouter.ai. Email is enough; no payment method required for free-tier models.
  2. Generate an API key. In the OpenRouter dashboard, open “Keys” and create a new key. Copy it into an environment variable, for example export OPENROUTER_API_KEY=sk-or-....
  3. Open the model page. Go to the Hy3 Preview free listing and confirm the status banner reads “Free.” You will also see usage stats there; at launch the endpoint was handling 6.81B prompt tokens per day across all users.

Send your first request. OpenRouter exposes the OpenAI Chat Completions schema, so any OpenAI SDK works:

curl https://openrouter.ai/api/v1/chat/completions \
  -H "Authorization: Bearer $OPENROUTER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "tencent/hy3-preview:free",
    "messages": [
      {"role": "user", "content": "Explain the MoE routing decision inside a top-8 of 192 setup in 3 sentences."}
    ],
    "temperature": 0.9,
    "top_p": 1.0
  }'
  1. Turn on reasoning when you need it. Hy3 accepts a reasoning parameter with effort set to low or high. OpenRouter returns the thinking trace in a separate reasoning_details array, billed as its own token bucket:
{
  "model": "tencent/hy3-preview:free",
  "messages": [
    {"role": "user", "content": "Plan, then write a Bash script that rotates daily log files older than 30 days into a dated archive folder."}
  ],
  "reasoning": {"effort": "high"}
}
  1. Iterate. Keep the session in the same thread if you want the model to build on earlier context; Hy3’s 256K window handles most full codebases end to end.

That is the whole flow. The model you are calling is the same one published on Hugging Face; quality on the OpenRouter free tier is identical to the paid routes on other providers.

Free, Plus, and self-host: where they differ

Free is not the only path, and it helps to see the real diff before you commit to one.

Capability OpenRouter Free OpenRouter Paid (non-free endpoints) Self-hosted (vLLM / SGLang)
Per-token cost $0 Per provider Electricity plus GPU amortization
Reasoning modes no_think, low, high Same Same
Context length 256K 256K 256K (memory permitting)
Throughput under load Shared pool, deprioritized under demand Dedicated Whatever your cluster serves
Rate limits OpenRouter free-tier cap (flexes) Provider-specific None
Data retention OpenRouter logging policy Provider-specific Stays on your hardware
Reasoning token visibility Yes, via reasoning_details Yes Yes

Free is the right choice for prototypes, side projects, evaluation benchmarks, and low-traffic agents. Paid or self-hosted makes sense the moment latency matters or you exceed the rate cap.

Prompt and parameter tips that get more out of Hy3

Hy3 rewards explicit setup more than smaller models. A few habits help.

Limits worth knowing before you ship

A few gotchas will trip you up if you skip them.

The developer fast path: Hy3 Preview plus Apidog

Command-line curl is fine for a demo. For real iteration, a visual API client saves hours.

  1. Open Apidog and create a new project. Import the OpenAI Chat Completions OpenAPI spec; OpenRouter uses the same schema.
  2. Set the base URL to https://openrouter.ai/api/v1 and add an environment variable for OPENROUTER_API_KEY.
  3. Create a request that hits /chat/completions with the model set to tencent/hy3-preview:free.
  4. Fork the request to compare reasoning modes. Apidog lets you duplicate a request and tweak one parameter, so you can run the same prompt with no_think, low, and high side by side and inspect the latency and output diff.
  5. Save prompt templates. Agentic prompts get long. Apidog’s environment and variable system keeps system prompts, tool schemas, and user turns separated so you can reuse them across tests.

If you are coming off Postman, the shift is quick; our API testing without Postman in 2026 guide covers the migration. Teams that live in their editor can run the same workflow inside VS Code with Apidog inside VS Code, which keeps prompt tuning next to the code that consumes the output.

Free alternatives when you hit the cap

If the OpenRouter free pool throttles you during peak hours, two paths worth trying first.

None of these match Hy3’s SWE-bench and Terminal-Bench numbers for agentic coding, but they cover chat, multilingual, and multimodal use cases the free Hy3 tier does not prioritize. For a production build, Download Apidog and set up one collection per model; side-by-side benchmarks on your actual prompts beat reading any leaderboard.

button

Self-hosting Hy3 Preview with vLLM

If you have the hardware, local inference is the fourth free path. The model card recommends vLLM with tensor parallelism of 8 and multi-token prediction enabled for speculative decoding:

vllm serve tencent/Hy3-preview \
  --tensor-parallel-size 8 \
  --speculative-config.method mtp \
  --speculative-config.num_speculative_tokens 1 \
  --tool-call-parser hy_v3 \
  --reasoning-parser hy_v3 \
  --enable-auto-tool-choice \
  --served-model-name hy3-preview

The equivalent SGLang command uses --tool-call-parser hunyuan and --reasoning-parser hunyuan. Once the server is up at http://localhost:8000/v1, any OpenAI SDK points at it the same way it would point at OpenRouter; only the base URL and key change.

Expect eight H100-class GPUs at BF16 for the full model. Quantized community builds will appear, but at launch the official path is full precision.

FAQ

Is Hy3 Preview free?Yes. OpenRouter lists tencent/hy3-preview:free with $0 per million input tokens and $0 per million output tokens. Reasoning tokens on the free tier are also free, though they count against rate limits. Confirm the current status on the OpenRouter model page before depending on it for production.

How does Hy3 Preview compare to DeepSeek V3 and Qwen 3?Hy3 Preview’s SWE-bench Verified score of 74.4 and Terminal-Bench 2.0 of 54.4 put it in the same tier as the top Chinese open models, with a clear agent and tool-use tilt. For pure chat, Qwen 3 and DeepSeek V3 are competitive; for agent and coding workflows, Hy3’s RL-trained tool use is the differentiator.

What are Hy3’s reasoning modes?Three: no_think (default, direct answer), low, and high. Switch them through the reasoning parameter on OpenRouter or via chat_template_kwargs={"reasoning_effort": "high"} when calling the model directly. Use high for planning, multi-step code, and math; leave it off for chat.

Can I use Hy3 Preview commercially?Yes, under the Tencent Hy Community License. The license permits commercial use with attribution and usage-policy compliance. Read the full terms on the Hy3 GitHub repo before deploying it in a revenue-generating product.

What context length does the free tier support?256K tokens end to end. OpenRouter’s listing shows 262,144 tokens, matching the model card. You can paste an entire mid-size codebase and still have room for tool schemas and conversation history.

How do I test Hy3 Preview without writing code?Use the Hugging Face Space for a browser chat demo, or point Apidog at the OpenRouter endpoint. Apidog imports the OpenAI OpenAPI spec, so configuring the request is three fields: base URL, API key, and model name.

Explore more

Moving From Keploy to Apidog CLI

Moving From Keploy to Apidog CLI

Moving from Keploy to Apidog CLI: an honest switching guide from recorded tests to designed, maintainable API suites. Import a spec, author, run in CI.

17 June 2026

Best Keploy Alternatives for API Testing

Best Keploy Alternatives for API Testing

Looking for a Keploy alternative? Compare Apidog CLI, Newman, Hoppscotch, Schemathesis and record-replay tools with honest pros, cons, and a feature table.

17 June 2026

How to Build a Fake REST API in Minutes (with JSONPlaceholder)

How to Build a Fake REST API in Minutes (with JSONPlaceholder)

Use json-server to turn a JSON file into a full REST API in seconds, call JSONPlaceholder with zero setup, and learn when to move up to a schema-aware mock.

17 June 2026

Practice API Design-first in Apidog

Discover an easier way to build and use APIs

How to Use the Hy3 Preview API for Free ?