numb3r3

felix-wang numb3r3

@jina-ai working on Embedding, Ranking and Small Language Models. Past @HUYA-AI, @ Tencent-AI

178 followers · 1.2k following

@jina-ai
Shenzhen, China
@felix1987_

Achievements

x3 x2 x3

Achievements

x3 x2 x3

Organizations

Lists (2)

Sort

🪨 ANN

4 repositories

whisper

4 repositories

Starred repositories

Lightning-AI / lightning-thunder

Thunder gives you PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and parallelism, or roll out your own.

Python 1,335 93 Updated Apr 29, 2025

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 757 63 Updated Apr 8, 2025

shangshang-wang / Tina

Tina: Tiny Reasoning Models via LoRA

Python 135 12 Updated Apr 23, 2025

lightblue-tech / lb-reranker

Jupyter Notebook 24 2 Updated Jan 30, 2025

ShiyinTan / ReREF

Python 2 Updated Apr 15, 2025

KRLabsOrg / LettuceDetect

LettuceDetect is a hallucination detection framework for RAG applications.

Python 397 22 Updated Apr 5, 2025

atoma-network / atoma-infer

Fast serverless LLM inference, in Rust.

Rust 69 18 Updated Mar 1, 2025

lucasjinreal / coreai

Common used component in AI applications. (inference interface, processing utils, serving etc)

Python 5 Updated Apr 23, 2025

lucasjinreal / Crane

A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework.

Rust 102 7 Updated Mar 26, 2025

SmallDoges / small-doge

Doge Family of Small Language Model

Python 132 10 Updated Apr 29, 2025

haon-chen / mmE5

Python 47 1 Updated Feb 27, 2025

zhudotexe / fanoutqa

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)

Python 53 3 Updated Mar 5, 2025

HansiZeng / scaling-retriever

[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"

Python 12 1 Updated Mar 31, 2025

linjc16 / Rec-R1

A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289

Python 78 4 Updated Apr 13, 2025

zeroentropy-ai / zchunk

A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.

Python 179 11 Updated Jan 28, 2025

lablup / callosum

An RPC Transport Library for asyncio

Python 20 2 Updated Apr 7, 2025

allenai / infinigram-api

Python 53 5 Updated Apr 18, 2025

superlinear-ai / python-gpu

🐳 Python GPU adds a minimal install of CUDA and cuDNN on top of the official python:3.x-slim base image

Dockerfile 13 1 Updated Dec 20, 2024

superlinear-ai / wtpsplit-lite

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

Python 11 Updated Mar 13, 2025

microsoft / REBEL

Python 34 2 Updated Apr 22, 2025

ses4255 / Versatile-OCR-Program

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

Python 613 45 Updated Apr 8, 2025

Yelp / nrtsearch

A high performance gRPC server on top of Apache Lucene

Java 282 43 Updated Apr 29, 2025

RapidAI / RapidOCR

📄 Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 3,984 429 Updated Apr 26, 2025

AmineDiro / ferrules

Modern, fast, document parser written in 🦀

C 476 4 Updated Mar 23, 2025

mangopy / Generate-then-Ground

Official Repo for ACL 2024 "Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering"

Python 3 Updated Apr 6, 2025

mangopy / SearchLM

Official code for "SearchLM: Language Models Can Self-Incentivize as Search Reasoners"

Python 2 Updated Mar 24, 2025

RyanLiu112 / GenPRM

Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

Python 70 Updated Apr 24, 2025

BatsResearch / sycl

Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance

Python 6 Updated Apr 19, 2025

foundation-model-stack / fastsafetensors

High-performance safetensors model loader

Python 24 5 Updated Apr 7, 2025

clovaai / synthtiger

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Python 519 104 Updated Jun 14, 2024

felix-wang numb3r3

Organizations

Lists (2)

🪨 ANN

whisper

Starred repositories

infini-attention

large-language-models

Rust

3D

document-similarity

vosk

speech-recognition

adversarial-networks

Neural Network

Machine learning