Skip to content
View shuangkouyizu's full-sized avatar

Block or report shuangkouyizu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🔥 Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Python 949 59 Updated Feb 25, 2025

LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding

22 1 Updated Feb 27, 2025

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Python 1,097 72 Updated Jan 23, 2025

【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 3,180 228 Updated Dec 3, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 14,404 1,488 Updated Dec 25, 2024

A list of referring video object segmentation papers

27 Updated Mar 6, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 49,187 5,804 Updated Sep 18, 2024

Deep Interactive Thin Object Selection

Python 89 9 Updated Mar 8, 2021

A Survey on Vision-Language Geo-Foundation Models (VLGFMs)

154 8 Updated Feb 16, 2025

Efficient vision foundation models for high-resolution generation and perception.

Python 2,694 214 Updated Jan 24, 2025

A modern GUI client based on Tauri, designed to run in Windows, macOS and Linux for tailored proxy experience

TypeScript 49,901 3,904 Updated Mar 7, 2025

List of datasets, codes, and contests related to remote sensing change detection

1,771 349 Updated Nov 16, 2024

Free ChatGPT Site List 这儿为你准备了众多免费好用的ChatGPT镜像站点

17,100 1,450 Updated Mar 5, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 15,868 1,452 Updated Sep 5, 2024

SAM (Segment Anything Model) for generating rotated bounding boxes with MMRotate, which is a comparison method of H2RBox-v2.

Python 184 14 Updated Jul 31, 2023

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

Python 1,443 277 Updated Mar 19, 2021

[CVPR 2022] Learning Affinity from Attention: End-to-End Weakly-Supervised Semantic Segmentation with Transformers

Python 213 23 Updated Oct 17, 2022

Implementation for "Context Prior for Scene Segmentation"

Python 250 14 Updated Feb 26, 2021

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Cuda 4,378 919 Updated Aug 30, 2024

realtime multiple people tracking (centerNet based person detector + deep sort algorithm with pytorch)

Python 596 146 Updated Jun 10, 2020

这是一个deeplabv3-plus-pytorch的源码,可以用于训练自己的模型。

Python 1,034 176 Updated Oct 18, 2023

Pretrained DeepLabv3 and DeepLabv3+ for Pascal VOC & Cityscapes

Python 2,176 466 Updated Nov 15, 2022

This is an official implementation of TransVOS

Python 31 3 Updated Oct 13, 2021

Training code for "SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation"

Cuda 87 3 Updated Nov 21, 2021

Download City Scapes Dataset using this script

Shell 210 26 Updated Dec 19, 2023

Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges

Python 183 42 Updated Feb 26, 2023

[WACV 2022] Pixel-Level Bijective Matching for Video Object Segmentation

Python 27 1 Updated Dec 17, 2024

Code for our CVPR2021 paper coordinate attention

Python 1,041 123 Updated Jun 8, 2021

FEELVOS implementation in PyTorch; FEELVOS: Fast End-to-End Embedding Learning for Video Object Segmentation

Python 68 12 Updated Apr 2, 2020

GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond

Python 1,202 163 Updated Feb 16, 2021
Next
Showing results