OpenAI-compatible vLLM-ready Production-grade

Model Catalog

Deploy state-of-the-art models on Xerotier.ai infrastructure -- fast, reliable, and built for builders.

26+
Available Models
14
Model Families
100%
OpenAI Compatible

All models listed are compatible with vLLM inference backends. Simple deployment, resulting in an OpenAI-compatible API.

Gemma Models

Google's lightweight open language models

Models
gemma-3-4b-it
EA5D2652-3D78-4D47-A576-FCC3EA9879F6
XIM Only

---
license: gemma
libraryname: transformers
pipeline
tag: image-text-to-text
extragatedheading: Access Gemma on Hugging Face
extragatedprompt: To access Gemma on Hugging Face, you’re required to review and
agree to Google’s usage license. To do ...

Parameters 4.3B
Context 15K
License Unknown
Architecture gemma3
gemma-3-12b-it
C92FB30F-F677-4F29-92D3-6026BD715F2D
XIM Only Not Deployed

---
license: gemma
libraryname: transformers
pipeline
tag: image-text-to-text
extragatedheading: Access Gemma on Hugging Face
extragatedprompt: To access Gemma on Hugging Face, you’re required to review and
agree to Google’s usage license. To do ...

Parameters 12.2B
Context 32K
License Unknown
Architecture gemma3
gemma-3-27b-it
0422A634-819E-4F77-9899-3598D694BFE9
XIM Only Not Deployed

---
license: gemma
libraryname: transformers
pipeline
tag: image-text-to-text
extragatedheading: Access Gemma on Hugging Face
extragatedprompt: To access Gemma on Hugging Face, you’re required to review and
agree to Google’s usage license. To do ...

Parameters 27.4B
Context Unknown
License Unknown
Architecture gemma3

Gemma Models

Google's lightweight open language models

Models
gemma-4-E4B
4546C80E-450B-471A-8043-9F2817CB6620
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://ai.google.dev/gemma/docs/gemma4license
pipelinetag: any-to-any
---

<div align="center">
<img src=https://ai.google.dev/gemma/images/gemma4
banner.png>
</div>

<p align="cent...

Parameters 5.7B
Context 131K
License Unknown
Architecture gemma4
gemma-4-E2B
7ACD4EF5-96D1-4C3D-8FED-E7B00A040416
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://ai.google.dev/gemma/docs/gemma4license
pipelinetag: any-to-any
---

<div align="center">
<img src=https://ai.google.dev/gemma/images/gemma4
banner.png>
</div>

<p align="cent...

Parameters 2.1B
Context 131K
License Unknown
Architecture gemma4
gemma-4-26B-A4B
3D154A2B-C0E2-4345-838A-FB27BC5B6C2C
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://ai.google.dev/gemma/docs/gemma4license
pipelinetag: image-text-to-text
---

<div align="center">
<img src=https://ai.google.dev/gemma/images/gemma4
banner.png>
</div>

<p ali...

Parameters 5.3B
Context 262K
License Unknown
Architecture gemma4

GLM Models

Zhipu AI's general language models

Models
GLM-4.7-Flash
B0545E60-0048-421A-8EE8-093B126F7340
XIM Only Not Deployed

---
language:

  • en
  • zh

libraryname: transformers
license: mit
pipeline
tag: text-generation
---

GLM-4.7-Flash

<div align="center">
<img src=https://raw.githubusercontent.com/zai-org/GLM-4.5/refs/heads/main/resources/logo.svg width="15%"/>
</di...

Parameters 3.8B
Context 202K
License MIT
Architecture glm4_moe_lite

Granite Models

IBM's enterprise-focused language models

Models
granite-4.0-h-small
F879E17F-5521-4CCD-A6B6-C47E0036834E
XIM Only Not Deployed

---
license: apache-2.0
libraryname: transformers
tags:

  • language
  • granite-4.0

---

mof-class3-qualified

Granite-4.0-H-Small

📣 **Updat...

Parameters 11.6B
Context 131K
License Unknown
Architecture granitemoehybrid
granite-4.0-h-tiny
C29307CD-71DD-43D4-B3AC-A271BC80BB9B
XIM Only

---
license: apache-2.0
libraryname: transformers
tags:

  • language
  • granite-4.0

---

mof-class3-qualified

Granite-4.0-H-Tiny

📣 **Update...

Parameters 1.8B
Context 131K
License Unknown
Architecture granitemoehybrid

Mixture of Experts

Mixture-of-experts architecture models

Models
LFM2-8B-A1B
7E32DB9D-FA97-40B4-A277-6F2D41E2C760
XIM Only

---
libraryname: transformers
license: other
license
name: lfm1.0
licenselink: LICENSE
language:

  • en
  • ar
  • zh
  • fr
  • de
  • ja
  • ko
  • es

pipeline
tag: text-generation
tags:
  • liquid
  • lfm2
  • edge
  • moe

---

<center>
<div style="text-align: center;">
...

Parameters 1.9B
Context 68K
License LFM Open License v1.0
Architecture lfm2_moe

Llama Models

Meta's open-weight large language models

Models
Llama-4-Scout-17B-16E-Instruct-quantized.w4a16
4103842E-F281-41AF-AB47-7409DCE49B01
XIM Only Not Deployed

---
language:

  • ar
  • de
  • en
  • es
  • fr
  • hi
  • id
  • it
  • pt
  • th
  • tl
  • vi

basemodel:
  • meta-llama/Llama-4-Scout-17B-16E-Instruct

pipeline
tag: image-text-to-text
tags:
  • facebook
  • meta
  • pytorch
  • llama
  • llama4
  • neuralmagic
  • redhat
  • llmcompressor

...

Parameters 22.2B
Context 10485K
License Unknown
Architecture llama4

Mistral Models

Mistral AI's efficient language models

Models
Ministral-3-14B-Reasoning-2512
85AC319C-D50B-45B2-B9C5-B3A0093A9885
XIM Only Popular

---
libraryname: vllm
language:

  • en
  • fr
  • es
  • de
  • it
  • pt
  • nl
  • zh
  • ja
  • ko
  • ar

license: apache-2.0
inference: false
base
model:
  • mistralai/Ministral-3-14B-Base-2512

extragateddescription: >-
If you want to learn more about how we process y...

Parameters 18.1B
Context 262K
License Unknown
Architecture mistral3
Ministral-3-3B-Reasoning-2512
973063DC-E3A7-42F8-A0F6-4F3043473E7B
XIM Only Popular

---
libraryname: vllm
language:

  • en
  • fr
  • es
  • de
  • it
  • pt
  • nl
  • zh
  • ja
  • ko
  • ar

license: apache-2.0
inference: false
base
model:
  • mistralai/Ministral-3-3B-Base-2512

extragateddescription: >-
If you want to learn more about how we process yo...

Parameters 4.7B
Context 262K
License Unknown
Architecture mistral3
Devstral-Small-2-24B-Instruct-2512
F7E5ED05-1378-4DD8-9D41-C62CD37404CB
XIM Only

---
libraryname: vllm
inference: false
base
model:

  • mistralai/Mistral-Small-3.1-24B-Base-2503

extragateddescription: >-
If you want to learn more about how we process your personal data, please read
our <a href="https://mistral.ai/terms/">Privac...

Parameters 18.1B
Context 82K
License Unknown
Architecture mistral3

Community Models

User-shared models from the Xerotier community

Models
granite-embedding-reranker-english-r2
3B871E57-ED1D-4845-868F-3D538F06B2D5
XIM Only

---
license: apache-2.0
language:

  • en

basemodel:
  • ibm-granite/granite-embedding-english-r2

pipeline
tag: text-ranking
library_name: sentence-transformers
tags:
  • granite
  • transformers
  • embeddings
  • mteb
  • text-embeddings-inference

---

granite-em...

Parameters 285M
Context 8K
License Unknown
Architecture modernbert
granite-embedding-english-r2
C9776EBA-C4BF-4171-A376-7B43F0874EDE
XIM Only

---
language:

  • en

libraryname: sentence-transformers
license: apache-2.0
pipeline
tag: feature-extraction
tags:
  • granite
  • embeddings
  • transformers
  • mteb

---

Granite-Embedding-English-R2

<!-- Provide a quick summary of what the model is/does. -...

Parameters 285M
Context 8K
License Unknown
Architecture modernbert

Community Models

User-shared models from the Xerotier community

Models
nomic-embed-text-v1.5
CB042730-1149-40C8-BDB1-7574E33DDC30
XIM Only

---
libraryname: sentence-transformers
pipeline
tag: sentence-similarity
tags:

  • feature-extraction
  • sentence-similarity
  • mteb
  • transformers
  • transformers.js

model-index:
  • name: epoch0model

results:
  • task:

type: Classification
dat...

Parameters 160M
Context 2K
License Unknown
Architecture nomic_bert

Community Models

User-shared models from the Xerotier community

Models
gpt-oss-20b
A2CDFE3C-3F89-44C7-AD8D-D8AB6986E90D
XIM Only Not Deployed

---
license: apache-2.0
pipelinetag: text-generation
library
name: transformers
tags:

  • vllm

---

<p align="center">
<img alt="gpt-oss-20b" src="https://raw.githubusercontent.com/openai/gpt-oss/main/docs/gpt-oss-20b.svg">
</p>

<p align="center">
<...

Parameters 4.3B
Context Unknown
License Apache-2.0
Architecture Unknown

Phi Models

Microsoft's compact high-performance models

Models
phi-4
96EFB1B3-84A1-4660-AF02-B4E25EDB2A5D
XIM Only Not Deployed

---
license: mit
licenselink: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:

  • en

pipeline
tag: text-generation
tags:
  • phi
  • nlp
  • math
  • code
  • chat
  • conversational

inference:
parameters:
temperature: 0
widget:
  • message...

Parameters 17.8B
Context 16K
License MIT
Architecture phi3
Phi-4-mini-instruct
A8F06DE3-8E0F-4684-804E-20BE19BBD37A
XIM Only Not Deployed

---
language:

  • multilingual
  • ar
  • zh
  • cs
  • da
  • nl
  • en
  • fi
  • fr
  • de
  • he
  • hu
  • it
  • ja
  • ko
  • 'no'
  • pl
  • pt
  • ru
  • es
  • sv
  • th
  • tr
  • uk

libraryname: transformers
license: mit
license
link: https://huggingface.co/microsoft/Phi-4-mini-instruct/...

Parameters 6.1B
Context 131K
License MIT
Architecture phi3

Qwen Models

Alibaba Cloud's multilingual language models

Models
Qwen3-0.6B
BCEF18DA-1F3B-4543-ACD4-B00598CCBD0F
XIM Only

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3-0.6B/blob/main/LICENSE
pipelinetag: text-generation
base
model:

  • Qwen/Qwen3-0.6B-Base

---

Qwen3-0.6B

<a href="https://chat.qwen.ai/" target="_blank" ...
Parameters 781M
Context 40K
License Apache-2.0
Architecture qwen3
Qwen3-8B
B59EBB93-175E-4B1B-9802-4A9AC213B795
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3-8B/blob/main/LICENSE
pipelinetag: text-generation
base
model:

  • Qwen/Qwen3-8B-Base

---

Qwen3-8B

<a href="https://chat.qwen.ai/" target="_blank" style=...
Parameters 10.9B
Context 40K
License Apache-2.0
Architecture qwen3
Qwen3-14B-AWQ
AC410C0A-F022-44CA-AE56-6CFED22E8E35
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3-14B/blob/main/LICENSE
pipelinetag: text-generation
base
model: Qwen/Qwen3-14B
---

Qwen3-14B-AWQ

<a href="https://chat.qwen.ai/" target="_blank" style=...
Parameters 18.3B
Context 40K
License Apache-2.0
Architecture qwen3

Qwen Models

Alibaba Cloud's multilingual language models

Models
Qwen3.5-0.8B
714E40DE-1EC2-4F89-9AA0-A1E15E535C9A
XIM Only

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3.5-0.8B/blob/main/LICENSE
pipelinetag: image-text-to-text
base
model:

  • Qwen/Qwen3.5-0.8B-Base

---

Qwen3.5-0.8B

<img width="400px" src="https://qianwe...

Parameters 911M
Context 262K
License Apache-2.0
Architecture qwen3_5
Qwen3.5-27B-FP8
77BDCA79-81D6-4484-9CF4-93EE22B1B6DC
XIM Only Not Deployed

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3.5-27B-FP8/blob/main/LICENSE
pipelinetag: image-text-to-text
base
model:

  • Qwen/Qwen3.5-27B

---

Qwen3.5-27B-FP8

<img width="400px" src="https://qianwen...

Parameters 29.4B
Context 262K
License Apache-2.0
Architecture qwen3_5

Qwen Models

Alibaba Cloud's multilingual language models

Models
Qwen3.5-35B-A3B
33F46EFB-365A-4437-B7AF-306F20EE8D16
XIM Only Popular

---
libraryname: transformers
license: apache-2.0
license
link: https://huggingface.co/Qwen/Qwen3.5-35B-A3B/blob/main/LICENSE
pipelinetag: image-text-to-text
base
model:

  • Qwen/Qwen3.5-35B-A3B-Base

---

Qwen3.5-35B-A3B

<img width="400px" src="https...

Parameters 3.7B
Context 138K
License Apache-2.0
Architecture qwen3_5_moe