Machine Learning Inference IP 

Quadric's Chimera General Purpose Neural Processing Unit (GPNPU)

Quadric is the leading processor architecture optimized for on-device artificial intelligence computing. Only the Quadric Chimera GPNPU delivers high ML inference performance and also runs complex C++ code without forcing the developer to artificially partition code between two or three different kinds of processors.

Quadric’s Chimera GPNPU is a licensable processor that scales from 1 to 16 TOPs.  Chimera GPNPUs run all types ML networks - including classical backbones, vision transformers, and large language models.

Design your Soc faster with Chimera GPNPU

Traditional Design

Quadric GPNPU Design

One architecture for ML inference plus pre-and-post processing simplifies SoC hardware design and software programming.

Three REASONS TO CHOOSE the chimera GPNPU

1
Handles matrix and vector operations and scalar (control) code in one execution pipeline. No need to artificially partition application code (C++ code, ML graph code) between different kinds of processors.
2
Executes diverse workloads - including state-of-the-art ViT models - with efficiency, low power and faster speed, all in a single processor.
3
Scales from 1 to 16 TOPs.
Find out more about the chimera GPNPU

Quadric Developer studio

Quadric’s hosted SDK provides easy simulation and deployment of AI software.
Learn more about developer studio

Quadric Insights

How to Unlock the Power of Operator Fusion to Accelerate AI

In July of 1887, Carl Benz held the first public outing for his “vehicle powered by a gas engine” – the first automobile ever invented. Nine short months after the first car was publicly displayed, […]

Read More
Compiler-driven Performance Boosts for GPNPUs

Graph Compilers are just getting started! The GNU C Compiler – GCC – was first released in 1987.  36 years ago.  Several version streams are still actively being developed and enhanced, with GCC13 being the […]

Read More
Vision Transformers Change the AI Acceleration Rules

Transformers were first introduced by the team at Google Brain in 2017 in their paper, “Attention is All You Need“. Since their introduction, transformers have inspired a flurry of investment and research which have produced […]

Read More
How to Unlock the Power of Operator Fusion to Accelerate AI

In July of 1887, Carl Benz held the first public outing for his “vehicle powered by a gas engine” – the first automobile ever invented. Nine short months after the first car was publicly displayed, […]

Read More
Compiler-driven Performance Boosts for GPNPUs

Graph Compilers are just getting started! The GNU C Compiler – GCC – was first released in 1987.  36 years ago.  Several version streams are still actively being developed and enhanced, with GCC13 being the […]

Read More
Explore more quadric blogs

© Copyright 2023  Quadric    All Rights Reserved     Privacy Policy

linkedin facebook pinterest youtube rss twitter instagram facebook-blank rss-blank linkedin-blank pinterest youtube twitter instagram