Agent skill
llm-inference
Use when wanting to interact with any LLM - Explains available inference endpoints so the agent selects suitable models.
Install this agent skill to your Project
npx add-skill https://github.com/majiayu000/claude-skill-registry/tree/main/skills/design/llm-inference
SKILL.md
LLM Inference
The Cloudflare Pages function functions/cerebras-chat.ts provides OpenAI-compatible LLM inference. See tools/cerebras-llm-inference/index.html for a working example.
Available models
| Model | Max context tokens | Requests / minute | Tokens / minute |
|---|---|---|---|
| gpt-oss-120b | 65,536 | 30 | 64,000 |
| llama-3.3-70b | 65,536 | 30 | 64,000 |
| llama3.1-8b | 8,192 | 30 | 60,000 |
| qwen-3-235b-a22b-instruct-2507 | 65,536 | 30 | 64,000 |
| qwen-3-235b-a22b-thinking-2507 | 65,536 | 30 | 60,000 |
| qwen-3-32b | 65,536 | 30 | 64,000 |
| zai-glm-4.6 | 64,000 | 10 | 150,000 |
llama3.1-8bis the fastest option.zai-glm-4.6is the most powerful option.gpt-oss-120bremains the best all rounder.
LLMs are not just for chat: they can be used to process any string in any arbitrary way. If making a tool that requires the LLM to respond in a specific way or format then be very clear and explicit in its system prompt; eg what to include/exclude, plain/markdown formatting, length, etc.
Recommended Agent Skills
Expand your agent's capabilities with these related and highly-rated skills.
agent-ops-spec
Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.
agent-ops-state
Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.
agent-ops-spec
Manage specification documents in .agent/specs/. Use when user provides requirements, acceptance criteria, or feature descriptions that need to be tracked and validated against implementation.
agent-ops-testing
Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.
agent-ops-testing
Test strategy, execution, and coverage analysis. Use when designing tests, running test suites, or analyzing test results beyond baseline checks.
agent-ops-state
Maintain .agent state files. Use at session start, after meaningful steps, and before concluding: read/update constitution/memory/focus/issues/baseline consistently.
Didn't find tool you were looking for?