This repository was archived by the owner on Jul 4, 2025. It is now read-only.
2 files changed, +6 -8 lines
@@ -4,8 +4,6 @@ title: Architecture
 
 ![Nitro Architecture](img/architecture.drawio.png)
 
-### Details element example
-
 ## Key Concepts
 
 ## Inference Server
@@ -167,7 +167,7 @@ paths:
 operationId: createChatCompletion
 tags:
   - Chat Completion
-summary: Create an chat with the model.
+summary: Create a chat with the model.
 requestBody:
   content:
     application/json:
@@ -544,23 +544,23 @@ components:
 stream:
   type: boolean
   default: true
-  description: Enables continuous output generation, allowing for streaming of model responses.
+  description: Enables continuous output generation, allowing for streaming of model responses
 model:
   type: string
   example: "gpt-3.5-turbo"
-  description: Specifies the model being used for inference or processing tasks.
+  description: Specifies the model being used for inference or processing tasks
 max_tokens:
   type: number
   default: 2048
   description: The maximum number of tokens the model will generate in a single response
 stop:
   type: arrays
   example: ["hello"]
-  description: Defines specific tokens or phrases at which the model will stop generating further output.
+  description: Defines specific tokens or phrases at which the model will stop generating further output
 frequency_penalty:
   type: number
   default: 0
-  description: Adjusts the likelihood of the model repeating words or phrases in its output.
+  description: Adjusts the likelihood of the model repeating words or phrases in its output
 presence_penalty:
   type: number
   default: 0
@@ -570,7 +570,7 @@ components:
   default: 0.7
   min: 0
   max: 1
-  description: Controls the randomness of the model's output.
+  description: Controls the randomness of the model's output
 top_p:
   type: number
   default: 0.95
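
For orientation, the schema fields touched by this diff can be read as the shape of a chat-completion request body. The sketch below is illustrative only: `build_chat_request` and `SCHEMA_DEFAULTS` are hypothetical names, not part of the repository. The defaults are copied from the hunks above. Fields the diff does not show (including the message list a real request would presumably need) are left out rather than guessed.

```python
import json

# Defaults copied from the schema lines visible in the diff. Fields that the
# diff does not show are deliberately omitted (assumption: they exist elsewhere).
SCHEMA_DEFAULTS = {
    "stream": True,
    "max_tokens": 2048,
    "frequency_penalty": 0,
    "presence_penalty": 0,
    "top_p": 0.95,
}


def build_chat_request(model, stop=None, **overrides):
    """Hypothetical helper: merge caller overrides onto the schema defaults."""
    unknown = set(overrides) - set(SCHEMA_DEFAULTS)
    if unknown:
        raise ValueError(f"fields not shown in the diff: {sorted(unknown)}")
    body = {"model": model, **SCHEMA_DEFAULTS, **overrides}
    if stop is not None:
        # The schema example gives `stop` as a list of strings, e.g. ["hello"].
        body["stop"] = list(stop)
    return body


payload = build_chat_request("gpt-3.5-turbo", stop=["hello"], max_tokens=256)
print(json.dumps(payload, indent=2))
```

Omitted arguments fall back to the documented defaults, so only `max_tokens` differs from the schema here.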