Skip to content
@unum-cloud

Unum

Scaling Intelligence. Rebuilding the cloud bottom-up — from Storage to Compute to AI — innovating across Storage, Numerics, and Modeling.

Cloud Infrastructure Designed to Scale

Discord     LinkedIn     Twitter     Blog     Youtube     GitHub


What if we could redesign the cloud from the ground up?

  1. We would realize that the foundation of the modern cloud stack is small and can be maintained by a tiny team.
  2. Despite being fast, networking isn't free. The concept of "disaggregated storage and compute" is great for convenience but not for software efficiency.
  3. Even the best compilers struggle to vectorize high-level code, resulting in approximately 10× lower hardware utilization for computational workloads.
  4. Most operating systems have custom mechanisms to accelerate I/O that are unused by 99% of modern software, leading to about 10× higher latency for networking and storage.
  5. Data-center servers often feature purpose-built accelerators, meaning a single simple build toolchain is unlikely to handle bleeding-edge heterogeneous software.
  6. Reimplementing solutions in every language isn't feasible. An abundant language with a well-defined committee and strong industry support, like C99 and C++17, should be used.
  7. Consolidating all cloud technology into a mono-repo increases component interdependencies, hindering the adoption of individual parts. A modular design with clear isolation is preferable.

Since 2015, Unum has been striving to meet all these conditions. We have developed a unified framework consisting of concise, low-level implementations for storage, computing, and AI modeling systems, all designed with efficiency in mind. This endeavor compelled us to build our own liquid-cooled clusters for R&D, collaborate closely with multiple cloud providers, and implement assembly-level optimizations often unique in the software industry.

Today, some of our projects run on hundreds of millions of devices, trusted by unicorns, decacorns, trillion-dollar tech companies, governments, and even intelligence agencies. Our primary goal is to power the next generation of computing, focusing on applications in AI and computational science. Since 2022, we've been increasingly open-sourcing our work and look forward to sharing much more soon!


1 Most database management systems are built on top of just a few key-value stores, like RocksDB. Proximity graphs and algorithms like HNSW can replace most indexing data structures. Most networking is built on top of TCP/IP and relies on just a few algorithms. Similar statements hold true for numeric libraries, machine learning frameworks, and even the models built on top of them. 2 InfiniBand is now powering the majority of Top-500 supercomputers, and Remote Direct Memory Access systems provide convenient abstractions for users, but their latency is still orders of magnitude higher than accessing local memory. 3 Our optimizations encompass SIMD instructions across AVX2, AVX-512 generations, NEON, SVE, SVE2, Intel and Apple AMX variants, SME, WMMA, and other NVIDIA extensions. 4 We utilize SPDK, DPDK, and io_uring for Linux kernel bypass. 5 The last 10 years of attempts to build heterogeneous compilers, like SyCL, have failed, so multiple tools have to be used in conjunction. 6 Many mechanisms exist for implementing language bindings. Unum generally focuses on Python, Rust, and JavaScript as the primary languages in machine learning and the web.

Pinned Loading

  1. usearch usearch Public

    Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

    C++ 2.3k 143

  2. uform uform Public

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

    Python 1.1k 62

  3. ucall ucall Public

    Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

    C 1.2k 43

  4. ustore ustore Public

    Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

    C++ 553 31

Repositories

Showing 10 of 25 repositories
  • .github Public

    GitHub Profile Header

    unum-cloud/.github’s past year of commit activity
    0 0 0 0 Updated Nov 30, 2024
  • usearch Public

    Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

    unum-cloud/usearch’s past year of commit activity
    C++ 2,284 Apache-2.0 143 56 (5 issues need help) 7 Updated Nov 21, 2024
  • awesome Public

    A list of awesome resources and blogs on topics related to Unum

    unum-cloud/awesome’s past year of commit activity
    31 2 0 0 Updated Oct 15, 2024
  • ucall Public

    Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring ☎️

    unum-cloud/ucall’s past year of commit activity
    C 1,151 Apache-2.0 43 10 (3 issues need help) 3 Updated Oct 4, 2024
  • uform Public

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️

    unum-cloud/uform’s past year of commit activity
    Python 1,058 Apache-2.0 62 8 (2 issues need help) 1 Updated Oct 1, 2024
  • usearch-benchmarks Public

    Comparing USearch to FAISS and other Vector Search engines on Billion-scale datasets

    unum-cloud/usearch-benchmarks’s past year of commit activity
    Python 8 2 0 0 Updated Nov 29, 2023
  • unum-cloud/uform-benchmarks’s past year of commit activity
    Python 3 0 0 0 Updated Oct 6, 2023
  • ucsb Public

    Wide NoSQL benchmark for RocksDB, LevelDB, Redis, WiredTiger and MongoDB extending the Yahoo Cloud Serving Benchmark

    unum-cloud/ucsb’s past year of commit activity
    C++ 50 5 3 1 Updated Sep 8, 2023
  • ustore Public

    Multi-Modal Database replacing MongoDB, Neo4J, and Elastic with 1 faster ACID solution, with NetworkX and Pandas interfaces, and bindings for C 99, C++ 17, Python 3, Java, GoLang 🗄️

    unum-cloud/ustore’s past year of commit activity
    C++ 553 Apache-2.0 31 26 (7 issues need help) 3 Updated Sep 1, 2023
  • coco-sm Public

    Evaluation of Vision-Language models' multilingual properties in 20 different languages.

    unum-cloud/coco-sm’s past year of commit activity
    Python 7 1 0 0 Updated Aug 17, 2023

People

This organization has no public members. You must be a member to see who’s a part of this organization.