- Home
- Hardware
- SDKs
- Cloud
- Solutions
- Support
- Ecosystem
- Company
- Contact
The ESP-WebRTC Communication Solution is Espressif’s real-time communication framework for smart devices. Based on the ESP32 series SoCs, it provides an end-to-end development framework covering on-device audio and video capture, real-time transmission, remote playback, and cloud service integration.
Architecture
ESP-WebRTC
Key Benefits
Ultra-Low Latency
Real-time audio, video, and DataChannel transmission for video intercom, remote monitoring, and collaborative device control.
Reliable Connectivity
Built-in ICE, STUN, TURN, dual ICE roles, and optimized candidate pairing, combined with DTLS-SRTP encryption and integrity protection for secure and reliable connections.
Broad Codec Support
Supports H.264, MJPEG, Opus, G.711A, and G.711U, while carrying application control and event messages alongside media streams.
Lightweight and Efficient
Multi-threaded architecture with a deeply optimized protocol stack, balancing performance, stability, code size, and resource efficiency.
Flexible Signaling Integration
Integrate with existing signaling protocols for seamless interoperability with existing systems and devices.
One-Step Integration
Unified components for PeerConnection, RTP, SCTP, signaling, media capture, and playback, enabling rapid integration, validation, and production deployment.
Development Resources
ESP-WebRTC SDK
The open-source ESP-WebRTC solution provides a reusable real-time audio and video foundation for smart devices. It delivers a complete real-time communication framework for ESP32 series SoCs, covering media capture, peer-to-peer connectivity, signaling, rendering and playback, and example applications, helping developers quickly complete prototype validation and product integration.
Additional Resources
Recommended Development Boards
ESP32-P4 Series
ESP32-P4-Function-EV-Board
Designed for multimedia applications such as video doorbells, device-to-device video calls, WHIP streaming, and WebRTC USB camera bridging, enabling fast prototype validation.
- Hardware encoding (H264, JPEG), decoding (JPEG)
- Pixel Processing Accelerator (PPA)
- Rich HMI peripheral interfaces
ESP32-S31 Series
ESP32-S3-Korvo-1
Integrated with a dual-microphone array, LCD screen, and DVP camera, suitable for diverse multimedia interaction scenarios.
- Dual-microphone array
- Near/Far-Field Wake Word
- LCD screen and DVP camera


