I'm currently pursuing my M.S. at the Institute of Computing Technology, Chinese Academy of Sciences, after earning my B.Eng. at Huazhong University of Science and Technology. My research focuses on multimodal LLMs and efficient inference.
- UTC +08:00
- https://jjjymmm.github.io
- https://www.jjjymmm.cn
Pinned
- QwenLM/Qwen3.5 (Public): Qwen3.5 is the large language model series developed by Qwen team, Alibaba Cloud.
- QwenLM/Qwen3-VL (Public): Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
- huggingface/transformers (Public): 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
- Multimodal-RoPEs (Public): Official implementation of the paper "Revisiting Multimodal Positional Encoding in Vision–Language Models", ICLR 2026.
- Pix2SeqV2-Pytorch (Public): Simple implementation of Pix2seqV2 (multi-task).


