Skip to content

TeleFuserΒΆ

A high-performance runtime for world model inference and multimodal generation.

FeaturesΒΆ

  • 🌍 World Model Runtime β€” Continuous execution, stateful sessions, bidirectional control loops
  • πŸš€ High Performance β€” Optimized Triton kernels, feature caching, and multi-GPU inference
  • 🎨 Multimodal Generation β€” Image/video generation, super-resolution, speech-to-video
  • πŸ“‘ Streaming Transport β€” WebRTC with media tracks plus DataChannel for real-time inference
  • πŸ”§ Flexible Configuration β€” Attention implementations, parallel strategies, quantization, offloading
  • πŸ“¦ Extensible β€” Easy to add new models, stages, and pipelines

Supported ModelsΒΆ

World Model and Real-TimeΒΆ

Model Tasks Description
LingBot-World-Fast Bidirectional streaming Interactive world model via WebRTC DataChannel

Video GenerationΒΆ

Model Tasks Description
WanVideo (Wan2.Β½.2) T2V, I2V, FL2V Video generation and editing
HunyuanVideo T2V, I2V Video generation
LTX Video I2V + Audio Video generation with audio
FlashVSR VSR Video super-resolution
LiveAct S2V Speech-to-video
LongCat-Video T2V, I2V Long video generation

Image GenerationΒΆ

Model Tasks Description
Qwen-Image T2I, Edit Image generation and editing
Z-Image T2I Image generation
Flux2 Klein T2I Image generation

Quick StartΒΆ

# Install
pip install telefuser

# Batch serving
telefuser serve /path/to/pipeline.py --port 8000

# Stream serving (requires WebRTC)
pip install -e ".[webrtc]"
telefuser stream-serve examples/stream_server/stream_lingbot_world_fast.py -p 8088

Documentation SectionsΒΆ


Switch to Chinese πŸ‡¨πŸ‡³