- Phi-4 is a 14B parameter, state-of-the-art open model from Microsoft.
- Llama 3.1 is a new state-of-the-art model from Meta, available in 8B, 70B, and 405B parameter sizes.
- A high-performing open embedding model with a large token context window.
- The 7B model released by Mistral AI, updated to version 0.3.
- Qwen 1.5 is a series of large language models by Alibaba Cloud, spanning from 0.5B to 110B parameters.
- Qwen2 is a new series of large language models from Alibaba Group.
- Google Gemma 2 is a high-performing and efficient model available in three sizes: 2B, 9B, and 27B.
- Llama 2 is a collection of foundation language models ranging from 7B to 70B parameters.
- A large language model that can use text prompts to generate and discuss code.
- The TinyLlama project is an open endeavor to train a compact 1.1B Llama model on 3 trillion tokens.
- A suite of text embedding models by Snowflake, optimized for performance.
- DeepSeek Coder is a capable coding model trained on two trillion code and natural language tokens.