Infini-AI-Lab

Next Generation AI algorithms and systems

Popular repositories

  1. Sequoia (Public)

    Scalable and robust tree-based speculative decoding algorithm

    Python · 366 stars · 37 forks

  2. TriForce (Public)

    [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

    Python · 276 stars · 17 forks

  3. MagicPIG (Public)

    [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

    Python · 247 stars · 17 forks

  4. MagicDec (Public)

    [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

    Python · 137 stars · 9 forks

  5. UMbreLLa (Public)

    LLM Inference on consumer devices

    Python · 129 stars · 15 forks

  6. Multiverse (Public)

    Python · 110 stars · 10 forks

Repositories

Showing 10 of 31 repositories
