|
1 | 1 | --- |
2 | 2 | title: Together AI |
3 | | -description: Access hundreds of open-source AI models including Llama, Mistral, and Qwen through Together's optimized inference platform. |
| 3 | +description: Access hundreds of open-source AI models through Together's optimized platform. |
4 | 4 | --- |
5 | 5 |
|
| 6 | +**Website:** [https://api.together.xyz/](https://api.together.xyz/) |
6 | 7 |
|
7 | | -Together AI provides access to a comprehensive collection of open-source AI models from leading research organizations, offering high-performance inference with competitive pricing and extensive model variety. |
| 8 | +## Getting an API Key |
8 | 9 |
|
9 | | -## Overview |
| 10 | +1. Go to [Together AI](https://api.together.xyz/) and sign in |
| 11 | +2. Navigate to Settings > API Keys |
| 12 | +3. Create a new API key |
| 13 | +4. Copy the key immediately |
10 | 14 |
|
11 | | -Together AI serves as a marketplace for open-source AI models, providing access to cutting-edge models from Meta, Mistral, Google, and other leading AI research labs. Their platform offers both static model access and dynamic model discovery. |
| 15 | +## Configuration |
12 | 16 |
|
13 | | -<CardGroup cols={3}> |
14 | | - <Card title="Model Variety" icon="layers"> |
15 | | - Access to hundreds of open-source models |
16 | | - </Card> |
17 | | - <Card title="High Performance" icon="zap"> |
18 | | - Optimized infrastructure for fast inference |
19 | | - </Card> |
20 | | - <Card title="Research Models" icon="microscope"> |
21 | | - Latest models from top AI research labs |
22 | | - </Card> |
23 | | -</CardGroup> |
| 17 | +1. Click the settings icon (⚙️) in CodinIT |
| 18 | +2. Select "Together AI" as the API Provider |
| 19 | +3. Paste your API key |
| 20 | +4. Choose your model |
24 | 21 |
|
25 | 22 | ## Popular Models |
26 | 23 |
|
27 | | -<AccordionGroup> |
28 | | - <Accordion title="Meta Llama Series" icon="brain"> |
29 | | - ### Llama Models |
30 | | - Industry-leading open-source models from Meta. |
| 24 | +- **Llama series:** Llama 3.1 70B, Llama 3.2 90B Vision |
| 25 | +- **Mistral series:** Mixtral 8x7B, Devstral Small, Magistral Small |
| 26 | +- **Google series:** Gemma 3 (27B, 12B, 4B, 1B) |
| 27 | +- **Coding models:** Qwen3-Coder 480B, Arcee AI Coder |
| 28 | +- **Reasoning models:** Kimi K2 Thinking, DeepSeek-V3.2-Exp, Cogito V1 |
31 | 29 |
|
32 | | - - **Llama 3.1 70B**: Powerful general-purpose model |
33 | | - - **Llama 3.2 90B Vision**: Multimodal model with vision capabilities |
34 | | - - **Best for**: Advanced reasoning, creative tasks, multimodal applications |
| 30 | +## Features |
35 | 31 |
|
36 | | - </Accordion> |
| 32 | +- **Model variety:** Hundreds of open-source models |
| 33 | +- **Research access:** Latest models from top labs |
| 34 | +- **High performance:** Optimized inference infrastructure |
| 35 | +- **Dynamic discovery:** Automatic model catalog updates |
| 36 | +- **Cost effective:** Flexible usage-based pricing |
37 | 37 |
|
38 | | - <Accordion title="Mistral Models" icon="wind"> |
39 | | - ### Mistral AI Series |
40 | | - Efficient and capable models from Mistral AI. |
| 38 | +## Notes |
41 | 39 |
|
42 | | - - **Mixtral 8x7B**: High-performance mixture-of-experts model |
43 | | - - **Devstral Small**: Specialized coding model |
44 | | - - **Magistral Small**: Advanced reasoning model |
45 | | - - **Best for**: Efficient inference, specialized tasks |
46 | | - |
47 | | - </Accordion> |
48 | | - |
49 | | - <Accordion title="Google Models" icon="search"> |
50 | | - ### Google/Gemma Series |
51 | | - Lightweight and capable models from Google. |
52 | | - |
53 | | - - **Gemma 3 27B**: Advanced general-purpose model |
54 | | - - **Gemma 3 12B/4B/1B**: Range of model sizes for different needs |
55 | | - - **Best for**: Balanced performance, research applications |
56 | | - |
57 | | - </Accordion> |
58 | | - |
59 | | - <Accordion title="Coding Models" icon="code"> |
60 | | - ### Specialized Coding |
61 | | - Models optimized for programming and technical tasks. |
62 | | - |
63 | | - - **Qwen3-Coder 480B**: Massive coding model |
64 | | - - **Arcee AI Coder**: Specialized coding assistant |
65 | | - - **Best for**: Code generation, debugging, technical writing |
66 | | - |
67 | | - </Accordion> |
68 | | - |
69 | | - <Accordion title="Reasoning Models" icon="lightbulb"> |
70 | | - ### Advanced Reasoning |
71 | | - Models with enhanced reasoning and thinking capabilities. |
72 | | - |
73 | | - - **Kimi K2 Thinking**: Advanced thinking model |
74 | | - - **DeepSeek-V3.2-Exp**: Experimental reasoning model |
75 | | - - **Cogito V1 Preview**: Best-in-class reasoning |
76 | | - - **Best for**: Complex problem-solving, analysis |
77 | | - |
78 | | - </Accordion> |
79 | | -</AccordionGroup> |
80 | | - |
81 | | -## Setup Instructions |
82 | | - |
83 | | -<Steps> |
84 | | - <Step title="Create Account">Visit [Together AI](https://api.together.xyz/) and create an account</Step> |
85 | | - <Step title="Access API Keys">Navigate to Settings > API Keys in your dashboard</Step> |
86 | | - <Step title="Generate API Key">Create a new API key for model access</Step> |
87 | | - <Step title="Configure in Codinit">Add your Together API key to the provider settings</Step> |
88 | | - <Step title="Explore Models">Browse available models and test different options</Step> |
89 | | -</Steps> |
90 | | - |
91 | | -## Key Features |
92 | | - |
93 | | -<BadgeGroup> |
94 | | - <Badge variant="secondary">Open Source</Badge> |
95 | | - <Badge variant="secondary">Model Variety</Badge> |
96 | | - <Badge variant="secondary">High Performance</Badge> |
97 | | - <Badge variant="secondary">Dynamic Discovery</Badge> |
98 | | - <Badge variant="secondary">Cost Effective</Badge> |
99 | | -</BadgeGroup> |
100 | | - |
101 | | -### Platform Advantages |
102 | | - |
103 | | -- **Extensive Catalog**: Access to hundreds of open-source models |
104 | | -- **Research Access**: Latest models from top AI research institutions |
105 | | -- **Performance Optimized**: Fast inference with optimized infrastructure |
106 | | -- **Flexible Pricing**: Pay only for what you use |
107 | | -- **Regular Updates**: New models added frequently |
108 | | - |
109 | | -## Use Cases |
110 | | - |
111 | | -<AccordionGroup> |
112 | | - <Accordion title="Research & Experimentation" icon="flask"> |
113 | | - ### AI Research |
114 | | - Perfect for researchers and developers exploring different models. |
115 | | - |
116 | | - - Model comparison and evaluation |
117 | | - - Research prototyping |
118 | | - - Algorithm testing |
119 | | - - Performance benchmarking |
120 | | - |
121 | | - </Accordion> |
122 | | - |
123 | | - <Accordion title="Development Tasks" icon="code"> |
124 | | - ### Software Development |
125 | | - Excellent for coding and technical development work. |
126 | | - |
127 | | - - Code generation and completion |
128 | | - - Technical documentation |
129 | | - - API development |
130 | | - - Debugging assistance |
131 | | - |
132 | | - </Accordion> |
133 | | - |
134 | | - <Accordion title="Content Creation" icon="edit"> |
135 | | - ### Creative Applications |
136 | | - Suitable for content creation and creative tasks. |
137 | | - |
138 | | - - Creative writing and ideation |
139 | | - - Content generation |
140 | | - - Marketing copy |
141 | | - - Educational materials |
142 | | - |
143 | | - </Accordion> |
144 | | - |
145 | | - <Accordion title="Business Applications" icon="building"> |
146 | | - ### Enterprise Use |
147 | | - Reliable for business and productivity applications. |
148 | | - |
149 | | - - Business analysis |
150 | | - - Customer service automation |
151 | | - - Process optimization |
152 | | - - Data analysis |
153 | | - |
154 | | - </Accordion> |
155 | | -</AccordionGroup> |
156 | | - |
157 | | -## Pricing Information |
158 | | - |
159 | | -Together AI offers flexible pricing based on model size and usage: |
160 | | - |
161 | | -<TokenPricing> |
162 | | - <TokenPricingItem model="Small Models (<10B)" input="$0.001" output="$0.001" /> |
163 | | - <TokenPricingItem model="Medium Models (10-50B)" input="$0.002" output="$0.002" /> |
164 | | - <TokenPricingItem model="Large Models (50B+)" input="$0.003" output="$0.003" /> |
165 | | - <TokenPricingItem model="XL Models (100B+)" input="$0.005" output="$0.005" /> |
166 | | -</TokenPricing> |
167 | | - |
168 | | -<Callout type="info">**Dynamic Pricing**: Actual prices may vary based on current model popularity and demand.</Callout> |
169 | | - |
170 | | -<Callout type="tip"> |
171 | | - **Model Selection**: Start with smaller models for testing, then scale up to larger models for production use. |
172 | | -</Callout> |
173 | | - |
174 | | -<Callout type="warning"> |
175 | | - **Model Availability**: Some models may have limited availability or higher costs during peak usage. |
176 | | -</Callout> |
177 | | - |
178 | | -## Model Management |
179 | | - |
180 | | -<AccordionGroup> |
181 | | - <Accordion title="Dynamic Model Discovery" icon="search"> |
182 | | - ### Model Catalog |
183 | | - How Together AI provides access to available models. |
184 | | - |
185 | | - - **API Integration**: Automatic model discovery through API |
186 | | - - **Real-time Updates**: New models added as they become available |
187 | | - - **Pricing Information**: Cost details included in model listings |
188 | | - - **Performance Metrics**: Context window and capability information |
189 | | - |
190 | | - </Accordion> |
191 | | - |
192 | | - <Accordion title="Model Categories" icon="tags"> |
193 | | - ### Model Organization |
194 | | - Understanding the different types of models available. |
195 | | - |
196 | | - - **Text Generation**: General-purpose language models |
197 | | - - **Code Models**: Specialized for programming tasks |
198 | | - - **Vision Models**: Multimodal models with image understanding |
199 | | - - **Reasoning Models**: Enhanced logical reasoning capabilities |
200 | | - - **Experimental Models**: Cutting-edge research models |
201 | | - |
202 | | - </Accordion> |
203 | | - |
204 | | - <Accordion title="Performance Optimization" icon="gauge"> |
205 | | - ### Optimization Features |
206 | | - Features to improve model performance and cost efficiency. |
207 | | - |
208 | | - - **Context Management**: Efficient handling of large context windows |
209 | | - - **Caching**: Request caching for repeated queries |
210 | | - - **Load Balancing**: Automatic distribution across available resources |
211 | | - - **Cost Optimization**: Suggestions for cost-effective model selection |
212 | | - |
213 | | - </Accordion> |
214 | | -</AccordionGroup> |
215 | | - |
216 | | -## Best Practices |
217 | | - |
218 | | -<AccordionGroup> |
219 | | - <Accordion title="Model Selection" icon="target"> |
220 | | - ### Choosing the Right Model |
221 | | - Guidelines for selecting appropriate models for your use case. |
222 | | - |
223 | | - - **Task Matching**: Choose models specialized for your specific task |
224 | | - - **Cost Consideration**: Balance performance needs with budget constraints |
225 | | - - **Context Requirements**: Ensure model context window meets your needs |
226 | | - - **Testing Phase**: Test multiple models before committing to one |
227 | | - |
228 | | - </Accordion> |
229 | | - |
230 | | - <Accordion title="Usage Optimization" icon="settings"> |
231 | | - ### Optimizing Performance |
232 | | - Tips for getting the best performance from Together AI models. |
233 | | - |
234 | | - - **Prompt Engineering**: Craft clear, specific prompts |
235 | | - - **Context Management**: Keep context focused and relevant |
236 | | - - **Batch Processing**: Group similar requests when possible |
237 | | - - **Caching Strategy**: Cache frequent queries to reduce costs |
238 | | - |
239 | | - </Accordion> |
240 | | - |
241 | | - <Accordion title="Cost Management" icon="dollar-sign"> |
242 | | - ### Managing Costs |
243 | | - Strategies for controlling and optimizing usage costs. |
244 | | - |
245 | | - - **Usage Monitoring**: Track usage patterns and costs |
246 | | - - **Model Switching**: Use smaller models for simpler tasks |
247 | | - - **Caching**: Implement caching to reduce API calls |
248 | | - - **Bulk Operations**: Combine multiple operations when feasible |
249 | | - |
250 | | - </Accordion> |
251 | | -</AccordionGroup> |
| 40 | +- **Pricing:** Based on model size and usage |
| 41 | +- **Model updates:** New models added frequently |
0 commit comments