Cactus

Pre-Seed

www.cactuscompute.comB2BSan Francisco, CA, USA

Market data is refreshed once per day from public sources. Information may be incomplete or outdated — verify independently before making decisions. This is not investment advice.

Beta

DealFlow OS uses public web data and automated enrichment. Research may be incomplete, outdated, or incorrect. Verify important information before making investment or outreach decisions.

Investor read

Evidence-bound summary — expand sections for movement, risks, and signals.

Memo snapshot · May 20, 2026, 6:21 PM

Beta

DealFlow OS uses public web data and automated enrichment. Research may be incomplete, outdated, or incorrect. Verify important information before making investment or outreach decisions.

What they do

Cactus - On-device AI for Smartphones, Laptops & Edge One inference engine for on-device AI across smartphones, laptops, and edge hardware

Funding

Raised $500K across 1 funding round. Latest: $500K Pre-seed (Jun 2025). Investors: Y Combinator. (High).

Quick read

•Cactus - On-device AI for Smartphones, Laptops & Edge One inference engine for on-device AI across smartphones, laptops, and edge hardware
•Reported angle: TurboQuant-H: Hadamard Rotation for 2-Bit Embedding Quantization
•Indexed activity snapshot: 1 funding‑related row(s), 0 hiring‑related, 0 GitHub‑tagged, 7 product/news‑style — scoring reflects corpus coverage only.

Stage

Seed (YC)

Evidence summary

Verified facts

•Cactus - On-device AI for Smartphones, Laptops & Edge One inference engine for on-device AI across smartphones, laptops, and edge hardware
•Reported angle: TurboQuant-H: Hadamard Rotation for 2-Bit Embedding Quantization
•Indexed activity snapshot: 1 funding‑related row(s), 0 hiring‑related, 0 GitHub‑tagged, 7 product/news‑style — scoring reflects corpus coverage only.

Recent movers

•May 11, 2026 · Blog / news — TurboQuant-H: Hadamard Rotation for 2-Bit Embedding Quantization
•May 11, 2026 · Blog / news — Ridiculously Fast On-Device Transcription: Reviewing Parakeet CTC 1.1B with Cactus
•May 11, 2026 · Blog / news — LFM-2.5-350m on Cactus: 140 tok/sec, Single Core, 355 MB

+5 more in Recent movement below

•May 11, 2026 · Blog / news — TurboQuant-H: Hadamard Rotation for 2-Bit Embedding Quantization
•May 11, 2026 · Blog / news — Ridiculously Fast On-Device Transcription: Reviewing Parakeet CTC 1.1B with Cactus
•May 11, 2026 · Blog / news — LFM-2.5-350m on Cactus: 140 tok/sec, Single Core, 355 MB
•May 11, 2026 · Blog / news — The Sweet Spot for Mac Code Use: Reviewing LFM2 24B MoE A2B with Cactus
•May 11, 2026 · Blog / news — Sub-150ms Transcription with Cloud-Level Accuracy: Why We Built a Hybrid Engine
•May 11, 2026 · Blog / news — Gemma 4 on Cactus: The first model you can talk to, show things, and trust to know when it needs help
•May 11, 2026 · Blog / news — Engineering Blog | Cactus
•May 20, 2026 · ycombinator.com — Cactus | Y Combinator

•Y Combinator

Funding

Raised $500K across 1 funding round. Latest: $500K Pre-seed (Jun 2025). Investors: Y Combinator. (High).

Hiring

No hiring/careers evidence indexed (Low).

GitHub

No GitHub‑linked evidence indexed (Low).

Product / news

7 product/news‑styled row(s); headline risk without filings (High).

Traffic / social

No traffic/social evidence indexed (Low).

Funding & hiring signals

funding_articleMay 20, 2026Confidence: high

Cactus | Y Combinator

Summer 2025 batch; low-latency on-device AI engine for mobile and wearables.

Affected score: yesObserved: May 20, 2026Round: Pre-Seed

raisedpre-seed

Open roles (indexed)

No open roles indexed yet.

Failed or blocked links

public_page:_pressnot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404
public_page:_newsnot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404
public_page:_jobsnot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404
public_page:_companynot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404
public_page:_careersnot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404
public_page:_aboutnot_found
Last checked Mon, May 11, 04:56 AM
HTTP 404

DealFlow growth score

42.0Limited recent public signal

7D+0%

30D+0%

Needs Review

The score is an algorithmic estimate based on observed public company-level signals. It may be incomplete, stale, or inaccurate and is not investment, legal, tax, or business advice.

Source health

public_market_enrichmentok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_turboquant-hok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_parakeetok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_lfm2-5-350mok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_lfm2-24b-a2bok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_hybrid-transcriptionok
Last checked Mon, May 11, 04:56 AM
public_page:_blog_gemma4ok
Last checked Mon, May 11, 04:56 AM
public_page:_blogok
Last checked Mon, May 11, 04:56 AM
public_page:_ok
Last checked Mon, May 11, 04:56 AM
public_page:homeok
Last checked Mon, May 11, 04:56 AM

DealFlow score momentum

427D +030D +0

100500

More runs will build history.

The score is an algorithmic estimate based on observed public company-level signals. It may be incomplete, stale, or inaccurate and is not investment, legal, tax, or business advice.

Signal breakdown

Latest momentum signal per category. Expand a card to inspect raw payloads.

Public source summary

Total evidence rows: 11
Latest evidence: Wed, May 20, 06:21 PM

Source types found

blogcompany_sitefunding_articleofficial_sitepress

Strongest / recent news-style rows

Cactus | Y Combinator
Wed, May 20, 06:21 PM · confidence 88%high quality
https://www.ycombinator.com/companies/cactus

Public signal timeline

Newest first · 11 event(s)

Wed, May 20, 06:21 PM · funding_article · 88% · verified_publichigh quality

Cactus | Y Combinator

Summer 2025 batch; low-latency on-device AI engine for mobile and wearables.

Source ↗

Wed, May 20, 06:21 PM · company_site · 85% · verified_publichigh quality

Cactus - On-device AI for Smartphones, Laptops & Edge

One inference engine for on-device AI across hardware targets.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

TurboQuant-H: Hadamard Rotation for 2-Bit Embedding Quantization

Source: Blog / news

A simplified offline variant of TurboQuant using Hadamard rotation and per-group Lloyd-Max codebooks — 4× compression of per-layer embeddings in Gemma 4 E2B at +0.06 PPL.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

Ridiculously Fast On-Device Transcription: Reviewing Parakeet CTC 1.1B with Cactus

Source: Blog / news

Review of NVIDIA's Parakeet-CTC-1.1B model running locally on Mac with Cactus. Architecture breakdown, benchmarks, and transcription use cases.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

LFM-2.5-350m on Cactus: 140 tok/sec, Single Core, 355 MB

Source: Blog / news

Benchmarking Liquid's LFM-2.5-350m across seven devices with Cactus. INT8 quantization, single-core CPU decode, zero-copy loading, and why this configuration makes on-device inference practical.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

The Sweet Spot for Mac Code Use: Reviewing LFM2 24B MoE A2B with Cactus

Source: Blog / news

Review of LiquidAI's LFM2-24B-A2B mixture-of-experts model running locally on Mac with Cactus. Architecture breakdown, benchmarks, and coding agent use cases.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

Sub-150ms Transcription with Cloud-Level Accuracy: Why We Built a Hybrid Engine

Source: Blog / news

How Cactus combines on-device and cloud inference for real-time speech transcription with sub-150ms latency and automatic cloud handoff for noisy audio.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

Gemma 4 on Cactus: The first model you can talk to, show things, and trust to know when it needs help

Source: Blog / news

Gemma 4 runs natively on your device with real-time voice, vision, and audio, and routes hard problems to the cloud when it should.

Source ↗

Mon, May 11, 04:56 AM · blog · 90% · publichigh quality

Engineering Blog | Cactus

Source: Blog / news

Deep dives into on-device AI, inference optimization, and running models on smartphones, laptops, and edge hardware.

Source ↗

Mon, May 11, 04:56 AM · official_site · 90% · publichigh quality

Cactus - On-device AI for Smartphones, Laptops & Edge

Source: Homepage

One inference engine for on-device AI across smartphones, laptops, and edge hardware. Run LLMs, transcription, and embeddings locally with automatic cloud fallback.

Source ↗

Wed, May 20, 06:21 PM · press · 85% · verified_publiclow quality

cactus-compute/cactus on GitHub

Open-source low-latency mobile AI engine.

Source ↗

Official / company site

2 row(s)

company_site·Wed, May 20, 06:21 PM·Confidence 85%high qualityverified_public

Cactus - On-device AI for Smartphones, Laptops & Edge

One inference engine for on-device AI across hardware targets.

https://www.cactuscompute.com

official_site·Mon, May 11, 04:56 AM·Confidence 90%high qualitypublic

Cactus - On-device AI for Smartphones, Laptops & Edge

Source name: Homepage

One inference engine for on-device AI across smartphones, laptops, and edge hardware. Run LLMs, transcription, and embeddings locally with automatic cloud fallback.

https://cactuscompute.com/

Funding / news

1 row(s)

funding_article·Wed, May 20, 06:21 PM·Confidence 88%high qualityverified_public

Cactus | Y Combinator

Summer 2025 batch; low-latency on-device AI engine for mobile and wearables.

https://www.ycombinator.com/companies/cactus

GitHub

1 row(s)

press·Wed, May 20, 06:21 PM·Confidence 85%low qualityverified_public

cactus-compute/cactus on GitHub

Open-source low-latency mobile AI engine.

https://github.com/cactus-compute/cactus