How do I stream chat completions with OpenAI’s Python API? #2462

I'm using the official openai Python package to call the Chat API (gpt-3.5 or gpt-4), and I'd like to stream the response instead of waiting for the full reply.

I tried this:

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello"}]
)

But it waits until everything is returned.

How can I make it stream the tokens one by one as they’re generated?

Great question!

To stream chat completions with the openai Python package, set stream=True in the ChatCompletion.create call and then iterate over the response, which becomes a generator that yields chunks as the model produces them.

Here’s how you can do it:

import openai

openai.api_key = "your-api-key"

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,  # ✅ this enables streaming
)

for chunk in response:
    if "choices" in chunk:
        # each chunk carries a small "delta"; the first/last may have no content
        content = chunk["choices"][0]["delta"].get("content", "")
        print(content, end="", flush=True)

This will print the generated message token-by-token in real time.
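
If you also need the complete reply once the stream finishes (say, to append it to the conversation history), you can collect the pieces while printing them. Here's a minimal sketch building on the snippet above (the full_reply name is just illustrative):

import openai

openai.api_key = "your-api-key"

response = openai.ChatCompletion.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)

full_reply = []  # collect each streamed piece as it arrives
for chunk in response:
    content = chunk["choices"][0]["delta"].get("content", "")
    print(content, end="", flush=True)
    full_reply.append(content)

message = "".join(full_reply)  # the complete assistant message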

Let me know if that works — and feel free to mark this as the answer if it helps! ✅
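
One caveat: the snippet above targets the pre-1.0 openai package. If you have openai>=1.0 installed, openai.ChatCompletion was removed; a rough equivalent with the newer client interface looks like this:

from openai import OpenAI

client = OpenAI(api_key="your-api-key")

stream = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[{"role": "user", "content": "Hello!"}],
    stream=True,
)

for chunk in stream:
    delta = chunk.choices[0].delta
    if delta.content is not None:  # some chunks (first/last) carry no content
        print(delta.content, end="", flush=True)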

Replies: 2 comments 2 replies


Thank you so much!

Answer selected by Istituto-freudinttheprodev

@AhmedGMurtaza Thanks for the link to the article; I didn't realize OpenAI had released an official paper on this specific topic!
In any case, I'll gladly read it to learn more.
