You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat!: v6 — fully generated SDK with latest APIs and WebSocket support (#640)
## Summary
This is the Deepgram Python SDK v6 release. The SDK moves to a fully
Fern-generated architecture, replacing all hand-rolled WebSocket code
from v5 with generated, type-safe implementations aligned with the
latest API spec.
### What's changing
- **Fully generated WebSocket clients** — Listen v1/v2, Speak v1, and
Agent v1 WebSocket implementations are now generated from the API spec,
replacing the manually maintained code in v5. This means faster feature
delivery and fewer SDK-specific bugs.
- **Latest APIs and features** — Includes all current Deepgram API
capabilities: Listen v2 (conversational speech recognition with turn
detection), Agent v1 (voice agents), and the latest Speak v1 features.
- **Simplified send methods** — `send_media()` now accepts raw `bytes`
directly. Control messages use dedicated methods (`send_keep_alive()`,
`send_finalize()`, `send_flush()`, etc.) instead of the generic
`send_control()` pattern.
- **New type system** — Types are generated per-domain
(`deepgram.listen.v1.types`, `deepgram.agent.v1.types`,
`deepgram.types`) instead of the shared
`deepgram.extensions.types.sockets` barrel import.
- **22 production-ready examples** covering authentication,
transcription (file, URL, live), voice agents, TTS, text intelligence,
and management APIs.
- **CI/CD improvements** — Matrix testing across Python 3.8–3.13,
release-please workflow, PR title validation.
### Breaking changes
- All imports from `deepgram.extensions.types.sockets` must be updated
to domain-specific type packages
- `send_control()` replaced by dedicated methods per WebSocket client
- `send_media()` now takes `bytes` instead of wrapper message types
- Agent settings types renamed to match generated schema hierarchy (e.g.
`AgentV1SettingsMessage` → `AgentV1Settings`)
### Documentation
- **[Migration guide](docs/Migrating-v5-to-v6.md)** — Complete v5 to v6
migration guide
- **[API Reference](reference.md)** — Full REST and WebSocket reference
with v6 types and examples
---------
Co-authored-by: fern-api[bot] <115122769+fern-api[bot]@users.noreply.github.com>
Copy file name to clipboardExpand all lines: README.md
+34-32Lines changed: 34 additions & 32 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -13,8 +13,9 @@ Comprehensive API documentation and guides are available at [developers.deepgram
13
13
14
14
### Migrating From Earlier Versions
15
15
16
+
-[v5 to v6](./docs/Migrating-v5-to-v6.md) (current)
17
+
-[v3+ to v5](./docs/Migrating-v3-to-v5.md)
16
18
-[v2 to v3+](./docs/Migrating-v2-to-v3.md)
17
-
-[v3+ to v5](./docs/Migrating-v3-to-v5.md) (current)
18
19
19
20
## Installation
20
21
@@ -26,8 +27,7 @@ pip install deepgram-sdk
26
27
27
28
## Reference
28
29
29
-
-**[API Reference](./reference.md)** - Complete reference for all SDK methods and parameters
30
-
-**[WebSocket Reference](./websockets-reference.md)** - Detailed documentation for real-time WebSocket connections
30
+
-**[API Reference](./reference.md)** - Complete reference for all SDK methods, parameters, and WebSocket connections
31
31
32
32
## Usage
33
33
@@ -37,7 +37,7 @@ The Deepgram SDK provides both synchronous and asynchronous clients for all majo
37
37
38
38
#### Real-time Speech Recognition (Listen v2)
39
39
40
-
Our newest and most advanced speech recognition model with contextual turn detection ([WebSocket Reference](./websockets-reference.md#listen-v2-connect)):
40
+
Our newest and most advanced speech recognition model with contextual turn detection ([Reference](./reference.md#listen-v2-connect)):
0 commit comments