🌴
On vacation
  • UCAS | TongYi Laboratory
  • Beijing, China

RainBowLuoCS/README.md

Hi, I am Run Luo 👋

My PhD Research Objectives for the Coming Years:

Objective 1 (RUN1): Develop a GUI-integrated Multimodal Large Language Model (MLLM) capable of autonomously completing complex tasks at the operating-system level, with the goal of replacing 90% of routine work with a single RL-optimized MLLM.

Objective 2 (RUN2): Enable MLLMs to achieve L4-level autonomous navigation for unmanned systems, demonstrating full-scenario driving capability without human intervention.

Objective 3 (RUN3): Generalize MLLMs to embodied intelligent coordination and control, creating a unified super-intelligent model that integrates spatial perception, task planning, motion generation, feedback execution, and cross-modal understanding.

This research roadmap aims to push the boundaries of MLLM capabilities across three critical dimensions of human-machine interaction and autonomous systems.


Pinned

  1. DiffusionTrack Public

    [AAAI 2024] DiffusionTrack: Diffusion Model For Multi-Object Tracking. DiffusionTrack is the first work to employ the diffusion model for multi-object tracking by formulating it as a generative noi…

    Python 183 7

  2. DEEM Public

    (ICLR 2025 Spotlight) DEEM: Official implementation of "Diffusion Models Serve as the Eyes of Large Language Models for Image Perception".

    Python 23 2

  3. MMEvol Public

    🔥🔥🔥 Code for "Empowering Multimodal Large Language Models with Evol-Instruct"

    Jupyter Notebook 13

  4. OpenOmni Public

    OpenOmni: Official implementation of "Advancing Open-Source Omnimodal Large Language Models with Progressive Multimodal Alignment and Real-Time Self-Aware Emotional Speech Synthesis".

    Python 32 1

  5. VCM Public

    2

  6. RUN1 Public

    2