A lightweight native unified multimodal model for image and video understanding, generation, and editing.
Awesome samples for Volcengine AgentKit Platform with VeADK.
Official repo for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing
最近更新: 11个月前Official implementation in ComfyUI of CVPR 2025 paper "HyperLoRA: Parameter-Efficient Adaptive Generation for Portrait Synthesis"
最近更新: 11个月前BytevalKit-Emb is a modular embedding model evaluation framework that implements automated model performance assessment through standardized proces...
最近更新: 12个月前Findings of ACL'25 DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models
最近更新: 12个月前PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search ...
最近更新: 1年前WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?
最近更新: 1年前