Tr-dq: Time-rotation diffusion quantization
Y Shao, D Lin, M Yan, S Chen, F Zeng, M Liao, A Ma, Z Yan, H Wang, et al. "Tr-dq: Time-rotation diffusion quantization." AAAI 2026, 40(11), 8869-8877.
Image Generation; Video Generation; Multimodal Generation; Model Compression
Y Shao, D Lin, M Yan, S Chen, F Zeng, M Liao, A Ma, Z Yan, H Wang, et al. "Tr-dq: Time-rotation diffusion quantization." AAAI 2026, 40(11), 8869-8877.
et al., M Zhang (5th author). "Robust Detection in Complex Construction Sites: HiPA-DETR with Weather-Aware and Cross-Domain Generalization." ACM MM 2025 (under review).
et al., M Zhang (6th author). "Memory Efficient Point Cloud Segmentation with Spatial Group Attention." ACM MM 2025 (under review).
et al., M Zhang (4th author). "Vision Also You Need: Navigating Out-of-Distribution Detection with Multimodal Large Language Model." ICCV 2025 (under review).
M Zhang et al. "AccidentX: A Large-Scale Multimodal BEV Dataset for Traffic Accident Analysis and Prevention." IROS 2025 (under review).
M Zhang et al. "Controllable Panoramic Video Generation with 360-Degree Motion Consistency for Multiple Control Tasks using a Unified Framework." ICCV 2025 (under review).
et al., M Zhang (3rd author). "AASD: Accelerate Inference by Aligning Speculative Decoding in Multimodal Large Language Models." DAC 2025.
M Zhang, W Meng, M Jia, J Gu, Y Shao, C Wang, R Xu, Z Ma, X Zhang. "PDFT: parameter-diminish fine-tuning for transformer-based models." The Visual Computer, 41(9), 6745-6755.
M Zhang, Y Chen, R Xu, C Wang, JM Yang, W Meng, J Guo, H Zhao, et al. "PanoDit: Panoramic videos generation with diffusion transformer." AAAI 2025, 39(10), 10040.
B Wang, X Wang, C Ni, G Zhao, Z Yang, Z Zhu, M Zhang, Y Zhou, X Chen, et al. "Humandreamer: Generating controllable human-motion videos via decoupled generation." CVPR 2025, 12391.
Y Shao, H He, S Li, S Chen, X Long, F Zeng, Y Fan, M Zhang, Z Yan, et al. "Eventvad: Training-free event-aware video anomaly detection." ACM MM 2025, 2586-2595.
M Zhang, J Yang, Y Xian, W Li, J Gu, W Meng, J Zhang, X Zhang. "AG-SDM: Aquascape generation based on stable diffusion model with low-rank adaptation." Computer Animation and Virtual Worlds, 35(3), e2252.
Z Ma, W Li, M Zhang, W Meng, S Xu, X Zhang. "HTCViT: an effective network for image classification and segmentation based on natural disaster datasets." The Visual Computer, 39(8), 3285-3297.
J Gu, J Zhang, M Zhang, W Meng, S Xu, J Zhang, X Zhang. "Feaco: Reaching robust feature-level consensus in noisy pose conditions." ACM MM 2023, 3628-3636.