Performance

A community for posts relating to performance

Wormhole

founded 2 years ago

MODERATORS

Ategon

agilob

LLaMA Now Goes Faster on CPUs (justine.lol)

submitted 1 year ago by agilob to c/performance

My kernels go 2x faster than MKL for matrices that fit in L2 cache, which makes them a work in progress, since the speedup works best for prompts having fewer than 1,000 tokens.
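For anyone wondering what "fit in L2 cache" could mean in practice, here is a rough back-of-the-envelope sketch. It is not from the linked article: the 2 MiB L2 size, the 2-byte (F16) element width, and the helper names `gemm_footprint_bytes` and `fits_in_cache` are all assumptions for illustration. It simply adds up the bytes needed to hold the two inputs and the output of one matrix multiplication and compares that to a cache size.

```python
def gemm_footprint_bytes(m: int, n: int, k: int, bytes_per_element: int = 2) -> int:
    """Bytes needed to hold A (m x k), B (k x n), and C (m x n) at the same time."""
    return (m * k + k * n + m * n) * bytes_per_element


def fits_in_cache(m: int, n: int, k: int, cache_bytes: int,
                  bytes_per_element: int = 2) -> bool:
    """True if all three matrices of the multiplication fit in cache_bytes."""
    return gemm_footprint_bytes(m, n, k, bytes_per_element) <= cache_bytes


# Assumption: a hypothetical 2 MiB L2 cache. Real sizes vary by CPU.
L2_BYTES = 2 * 1024 * 1024

if __name__ == "__main__":
    for size in (256, 512, 1024):
        footprint = gemm_footprint_bytes(size, size, size)
        verdict = "fits" if fits_in_cache(size, size, size, L2_BYTES) else "does not fit"
        print(f"{size}x{size}x{size}: {footprint} bytes, {verdict} in L2")
```

Under these assumptions, square F16 matrices stop fitting somewhere between 512 and 1024 per side. Real kernels tile their inputs, so treat this only as an intuition for why the matrix dimensions (and with them the prompt length) matter.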
