logo
WeChat Login
aigc
aigc
No description

Pinned

🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
TypeScript
17500
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
0000
vLLM官方镜像部署DeepSeek模型,生产环境中提供类OpenAI接口服务。
Markdown
01000
An easy API for making Event Source requests, with all the features of fetch(), Supports browsers and node.js
TypeScript
0000
LLM inference in C/C++ Fork from https://github.com/ggerganov/llama.cpp.git 自动化构建Docker镜像
C++
2000
SGLang is a fast serving framework for large language models and vision language models.
Python
0000
Recent updates
WeClaws 是一个可快速部署的多用户(注册、登录)微信Agent机器人管理面板。可在Web端管理你的多个AI机器人,支持工具调用、Skills、MCP、子智能体、记忆、做梦、定时任务和沙盒执行等能力。
TypeScript
1100
🤖 Free Search with AI, 💡 Open-Source Perplexity, 📍 Support Ollama/SearXNG, Support Docker deployment. 让AI大模型和搜索引擎回答你的问题,支持本地大模型(Ollama)、聚合搜索引擎SearXNG,支持Docker一键部署。
TypeScript
17500
Claude/OpenAI 协议转换网关 CLI (可以在cc/codex中相互用对方的模型)
TypeScript
0000
Run MCP stdio servers over SSE and SSE over stdio. AI gateway.
TypeScript
0000
AI模型聚合管理中转分发系统,支持将多种大模型转为统一格式调用,支持OpenAI、Claude、Gemini等格式,可供个人或者企业内部管理与分发渠道使用。🍥 The next-generation LLM gateway and AI asset management system supports multiple languages.
JavaScript
Go
0000
LLM inference in C/C++ Fork from https://github.com/ggerganov/llama.cpp.git 自动化构建Docker镜像
C++
2000
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
0000
https://github.com/vllm-project/FlashMLA.git
Cuda
Python
0000
Fast and memory-efficient exact attention
Python
0000
https://github.com/oneapi-src/oneDNN.git
C++
0000
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python
0000
vLLM官方镜像部署DeepSeek模型,生产环境中提供类OpenAI接口服务。
Markdown
01000
DeepResearch Agent with LangGraph, using any LLM models, search engine, RAG retrieval.
TypeScript
0000
Xorbits Inference(Xinference) is a powerful and versatile library designed to serve language, speech recognition, and multimodal models. With Xorbits Inference, you can effortlessly deploy and serve your or state-of-the-art built-in models using just a single command. Whether you are a researcher, developer, or data scientist...
Python
0000
SGLang is a fast serving framework for large language models and vision language models.
Python
0000