TeleFuser
A high-performance runtime for world model inference and multimodal generation.
Features
- π World Model Runtime β Continuous execution, stateful sessions, bidirectional control loops
- π High Performance β Optimized Triton kernels, feature caching, and multi-GPU inference
- π¨ Multimodal Generation β Image/video generation, super-resolution, speech-to-video
- π‘ Streaming Transport β WebRTC with media tracks plus DataChannel for real-time inference
- π§ Flexible Configuration β Attention implementations, parallel strategies, quantization, offloading
- π¦ Extensible β Easy to add new models, stages, and pipelines
Supported Models
World Model and Real-Time
| Model | Tasks | Description |
| LingBot-World-Fast | Bidirectional streaming | Interactive world model via WebRTC DataChannel |
Video Generation
| Model | Tasks | Description |
| WanVideo (Wan2.Β½.2) | T2V, I2V, FL2V | Video generation and editing |
| HunyuanVideo | T2V, I2V | Video generation |
| LTX Video | I2V + Audio | Video generation with audio |
| FlashVSR | VSR | Video super-resolution |
| LiveAct | S2V | Speech-to-video |
| LongCat-Video | T2V, I2V | Long video generation |
Image Generation
| Model | Tasks | Description |
| Qwen-Image | T2I, Edit | Image generation and editing |
| Z-Image | T2I | Image generation |
| Flux2 Klein | T2I | Image generation |
Quick Start
# Install
pip install telefuser
# Batch serving
telefuser serve /path/to/pipeline.py --port 8000
# Stream serving (requires WebRTC)
pip install -e ".[webrtc]"
telefuser stream-serve examples/stream_server/stream_lingbot_world_fast.py -p 8088
Documentation Sections
Switch to Chinese π¨π³