Open AI (ChatGPT)

Create transcription

POST
https://api.openai.com/v1/audio/transcriptions
Learn how to turn audio into text.
Related guide: Speech to text
Transcribes audio into the input language.
Request Example (Shell)
curl --location --request POST 'https://api.openai.com/v1/audio/transcriptions' \
--header 'Accept: application/json' \
--header 'Authorization: Bearer {{YOUR_API_KEY}}' \
--header 'Content-Type: multipart/form-data' \
--form 'file=@""' \
--form 'model="whisper-1"' \
--form 'prompt="eiusmod nulla"' \
--form 'response_format="json"' \
--form 'temperature="0"' \
--form 'language=""'
Response Example
{
  "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger. This is a place where you can get to do that."
}
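
The request above can also be assembled without curl. The following is a minimal sketch, not an official client: it builds the same multipart/form-data POST using only the Python standard library. The helper names `encode_multipart` and `build_transcription_request` are hypothetical, and the file part is labeled `application/octet-stream` as an assumption.

```python
import uuid
import urllib.request

API_URL = "https://api.openai.com/v1/audio/transcriptions"


def encode_multipart(fields, file_name, file_bytes):
    """Encode text fields plus one file part as multipart/form-data.

    Returns (body_bytes, content_type_header_value).
    Assumes file_name contains no double quotes or line breaks.
    """
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            (
                f"--{boundary}\r\n"
                f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
                f"{value}\r\n"
            ).encode("utf-8")
        )
    # The audio itself goes in the "file" field, as in the curl example.
    parts.append(
        (
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
            "Content-Type: application/octet-stream\r\n\r\n"
        ).encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_transcription_request(api_key, file_name, file_bytes,
                                model="whisper-1", **optional):
    """Build (but do not send) the POST request for this endpoint."""
    fields = {"model": model}
    # Skip optional fields left as None; keep falsy values such as temperature=0.
    fields.update({k: v for k, v in optional.items() if v is not None})
    body, content_type = encode_multipart(fields, file_name, file_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Accept": "application/json",
            "Authorization": f"Bearer {api_key}",
            "Content-Type": content_type,
        },
    )
```

To send the request, pass the returned object to `urllib.request.urlopen` and read the response body. The boundary is generated per request, which is why the Content-Type header is computed rather than hard-coded.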

Request

Authorization
Provide your bearer token in the Authorization header when making requests to protected resources.
Example: Authorization: Bearer ********************
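
As a small illustration of the header format above, the sketch below builds the Authorization header from an API key. The function name `bearer_header` and the default environment variable name `OPENAI_API_KEY` are assumptions made for this example and are not specified on this page.

```python
import os


def bearer_header(api_key=None, env_var="OPENAI_API_KEY"):
    """Return the Authorization header dict for a bearer token.

    Falls back to an environment variable (name is an assumption)
    when no key is passed explicitly.
    """
    key = (api_key or os.environ.get(env_var, "")).strip()
    if not key:
        raise ValueError("No API key provided; cannot build Authorization header")
    return {"Authorization": f"Bearer {key}"}
```

Reading the key from the environment keeps it out of source code; failing early on an empty key avoids sending a request with a bare `Bearer ` value.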
Header Params
Content-Type (string, required)
Example: multipart/form-data
Accept (string, required)
Example: application/json
Authorization (string, optional)
Example: Bearer {{YOUR_API_KEY}}
Body Params (multipart/form-data)
file (file, required)
The audio file object (not the file name) to transcribe, in one of these formats: mp3, mp4, mpeg, mpga, m4a, wav, or webm.
model (string, required)
ID of the model to use. Only whisper-1 is currently available.
Example: whisper-1
prompt (string, optional)
Optional text to guide the model's style or to continue a previous audio segment. The prompt should be in the same language as the audio.
Example: eiusmod nulla
response_format (string, optional)
The format of the transcript output. One of: json, text, srt, verbose_json, or vtt.
Example: json
temperature (number, optional)
The sampling temperature, between 0 and 1. Higher values such as 0.8 make the output more random, while lower values such as 0.2 make it more focused and deterministic. If set to 0, the model uses log probability to automatically increase the temperature until certain thresholds are hit.
Example: 0
language (string, optional)
The language of the input audio. Supplying the input language in ISO-639-1 format improves accuracy and latency.
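
The constraints listed for the body parameters can be checked on the client before uploading. The following is a hedged sketch: `validate_params` is a hypothetical helper, the extension check only inspects the file name, and the language check only verifies the two-letter shape of an ISO-639-1 code rather than looking it up in the standard.

```python
import os

# Values taken from the parameter descriptions above.
ALLOWED_EXTENSIONS = {"mp3", "mp4", "mpeg", "mpga", "m4a", "wav", "webm"}
ALLOWED_RESPONSE_FORMATS = {"json", "text", "srt", "verbose_json", "vtt"}


def validate_params(file_name, model="whisper-1", response_format="json",
                    temperature=0, language=None):
    """Return a list of human-readable problems; an empty list means OK."""
    errors = []
    ext = os.path.splitext(file_name)[1].lstrip(".").lower()
    if ext not in ALLOWED_EXTENSIONS:
        errors.append(f"unsupported audio format: {ext or '(none)'}")
    if model != "whisper-1":
        errors.append(f"unsupported model: {model}")
    if response_format not in ALLOWED_RESPONSE_FORMATS:
        errors.append(f"unsupported response_format: {response_format}")
    if not 0 <= temperature <= 1:
        errors.append(f"temperature out of range [0, 1]: {temperature}")
    # Shape check only: ISO-639-1 codes are two lowercase ASCII letters.
    if language is not None and not (
        len(language) == 2 and language.isascii()
        and language.isalpha() and language.islower()
    ):
        errors.append(f"language is not a two-letter ISO-639-1 code: {language}")
    return errors
```

Returning a list of problems instead of raising on the first one lets a caller report every invalid field at once.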

Responses

200 OK
application/json
Body
text (string, required)
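
The documented 200 response is a JSON object with a required `text` field. The sketch below extracts the transcript from a response body. It assumes that `json` and `verbose_json` both return a JSON object containing `text`, and that `text`, `srt`, and `vtt` return the transcript as a plain body; this page only documents the JSON case, so treat the other branch as an assumption. `extract_text` is a hypothetical helper name.

```python
import json


def extract_text(body, response_format="json"):
    """Return the transcript from a raw response body (bytes or str)."""
    if isinstance(body, bytes):
        body = body.decode("utf-8")
    if response_format in ("json", "verbose_json"):
        # Documented shape: {"text": "..."}; a missing key raises KeyError.
        return json.loads(body)["text"]
    # Assumed: text, srt, and vtt responses are returned as-is.
    return body
```

For example, `extract_text(b'{"text": "hello"}')` returns the string `hello`.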
Modified at 2023-06-28 05:57:00