Organizations

@jina-ai

Starred repositories

Thunder gives your PyTorch models superpowers for training and inference. Unlock out-of-the-box optimizations for performance, memory and parallelism, or roll your own.

Python 1,335 93 Updated Apr 29, 2025

Ring attention implementation with flash attention

Python 757 63 Updated Apr 8, 2025

Tina: Tiny Reasoning Models via LoRA

Python 135 12 Updated Apr 23, 2025
Jupyter Notebook 24 2 Updated Jan 30, 2025
Python 2 Updated Apr 15, 2025

LettuceDetect is a hallucination detection framework for RAG applications.

Python 397 22 Updated Apr 5, 2025

Fast serverless LLM inference, in Rust.

Rust 69 18 Updated Mar 1, 2025

Commonly used components in AI applications (inference interface, processing utils, serving, etc.)

Python 5 Updated Apr 23, 2025

A pure-Rust inference engine for LLMs (and any LLM-based MLLM, such as Spark-TTS), powered by the Candle framework.

Rust 102 7 Updated Mar 26, 2025

Doge family of small language models

Python 132 10 Updated Apr 29, 2025
Python 47 1 Updated Feb 27, 2025

Companion code for FanOutQA: Multi-Hop, Multi-Document Question Answering for Large Language Models (ACL 2024)

Python 53 3 Updated Mar 5, 2025

[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"

Python 12 1 Updated Mar 31, 2025

A general framework for bridging LLMs and recommendation systems via reinforcement learning. https://arxiv.org/pdf/2503.24289

Python 78 4 Updated Apr 13, 2025

A new chunking strategy developed by ZeroEntropy for general semantic chunking using Llama-70B.

Python 179 11 Updated Jan 28, 2025

An RPC Transport Library for asyncio

Python 20 2 Updated Apr 7, 2025
Python 53 5 Updated Apr 18, 2025

🐳 Python GPU adds a minimal install of CUDA and cuDNN on top of the official python:3.x-slim base image

Dockerfile 13 1 Updated Dec 20, 2024

✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models

Python 11 Updated Mar 13, 2025
Python 34 2 Updated Apr 22, 2025

Multi-modal OCR pipeline optimized for ML training (text, figure, math, tables, diagrams)

Python 613 45 Updated Apr 8, 2025

A high performance gRPC server on top of Apache Lucene

Java 282 43 Updated Apr 29, 2025

📄 Awesome OCR toolkits for multiple programming languages, based on ONNXRuntime, OpenVINO, PaddlePaddle and PyTorch.

Python 3,984 429 Updated Apr 26, 2025

Modern, fast, document parser written in 🦀

C 476 4 Updated Mar 23, 2025

Official Repo for ACL 2024 "Generate-then-Ground in Retrieval-Augmented Generation for Multi-hop Question Answering"

Python 3 Updated Apr 6, 2025

Official code for "SearchLM: Language Models Can Self-Incentivize as Search Reasoners"

Python 2 Updated Mar 24, 2025

Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

Python 70 Updated Apr 24, 2025

Beyond Contrastive Learning: Synthetic Data Enables List-wise Training with Multiple Levels of Relevance

Python 6 Updated Apr 19, 2025

High-performance safetensors model loader

Python 24 5 Updated Apr 7, 2025

Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021

Python 519 104 Updated Jun 14, 2024