Starred repositories
Stable Diffusion web UI
A feature-rich command-line audio/video downloader
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
2025年最新总结,阿里,腾讯,百度,美团,头条等技术面试题目,以及答案,专家出题人分析汇总。
Real-time face swap for PC streaming or video calls
🍰 Desktop utility to download images/videos/music/text from various websites, and more.
Google Chromium, sans integration with Google
Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
🔥 MaxKB is an open-source platform for building enterprise-grade agents. 强大易用的开源企业级智能体平台。
Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
Lets make video diffusion practical!
Translate the video from one language to another and embed dubbing & subtitles.
Bringing Old Photo Back to Life (CVPR 2020 oral)
Llama中文社区,实时汇总最新Llama学习资料,构建最好的中文Llama大模型开源生态,完全开源可商用
Self-hosted YouTube downloader (web UI for youtube-dl / yt-dlp)
[CVPR 2024] Official repository for "MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model"
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key