Sent — we'll reply within 48 hours
Pittsburgh-built · Cloud · On-Prem · Hybrid

AI deployed
your way.

We deploy AI for your business — cloud, on-prem, or hybrid. We build the workflow so you're never locked in. About a week to ship. We answer the phone.

Deploy time ~1 week
Customers 12 deployed
Models Open + frontier
Lock-in None
01 / 06 — The Problem

AI vendors built the cage. We didn't.

Most "AI strategies" end the same way: three contracts, four logins, and a quarterly bill that grows faster than the value. Here's what we hear in the discovery call.

// Problem 01 — Vendor lock-in

One vendor, one cloud, one pricing curve.

Your workflow lives inside their platform. The model upgrade is theirs. The data export is "coming soon." When the price doubles, the migration cost is the moat.

// Problem 02 — Per-seat sprawl

$30/seat × 8 tools × 47 people.

Copilot here, ChatGPT Team there, a Notion AI add-on, a Zapier upgrade. Nobody knows what's used. Finance signs the renewal because cancelling is harder than paying.

// Problem 03 — Data exfiltration

Your contracts are training someone's model.

Legal flagged it. So did the auditor. Sensitive docs shouldn't leave the building, but the team is pasting them into a free chatbot every day because the policy doesn't have a substitute.

02 / 06 — What we deploy

Five SKUs. One workflow layer. Your choice of deployment.

Each SKU runs cloud, on-prem, or hybrid. The Server is optional hardware — we recommend it when volume or privacy make the math work.

// SKU 01

Alluqi.AI
Front Desk

AI receptionist, website chat, and lead qualifier. Sounds like the person who's been at your front desk for twelve years. 24/7. Picks up on the first ring.

Cloud / On-Prem / Hybrid
// SKU 02

Alluqi.AI
Workflows

Custom agent automations across your tool stack — CRM, email, accounting, ticketing. We build the routing. You keep the keys. The workflow moves with you.

Cloud / On-Prem / Hybrid
// SKU 03

Alluqi.AI
Brain

Private RAG knowledge base — "ask the company." Every contract, SOP, ticket, and email indexed and answerable. Nothing leaves the building unless you say so.

Cloud / On-Prem / Hybrid
// SKU 04

Alluqi.AI
Back Office

Document extraction, bookkeeping, month-end automation. Invoices, receipts, expense reports — read, coded, posted. Your bookkeeper goes from data entry to review.

Cloud / On-Prem / Hybrid
// SKU 05

Alluqi.AI
Server

Optional on-prem hardware. $5K / $15K / $40K tiers sized to your usage. Ships racked, models pre-loaded. You own the box. No monthly fee. Yours forever.

On-Prem only
03 / 06 — Deployment Models

Cloud. On-prem. Hybrid.

Three equal paths. We pick the one that fits your volume, data sensitivity, and budget — and build the workflow so the other two are a config change away.

Cloud

// Pay-as-you-go

  • Light or sporadic volume — under ~50K calls/month
  • Frontier-model reasoning (Claude, GPT-5, Gemini)
  • Buyers not ready for capex
  • Variable monthly spend, no install fee
Starts at $0.30 / call · orchestrated

On-Prem

// Owned hardware

  • High volume — 50K+ calls/month
  • Legal, medical, financial, IP-sensitive workloads
  • Capex-friendly buyers — own, don't rent
  • Predictable cost, zero data egress
One time. Yours forever.
04 / 06 — Customer Results

The math, anonymized.

Three customers, three deployments, three different problems. We don't promise ROI numbers we can't back with a real case.

// Dental group · 4 locations
$34K/yr
Replaced one front-desk hire.

Front Desk on hybrid. AI answers after-hours and triages new-patient calls. Saved a $42K salaried role; kept the receptionist they liked for daytime.

Deployment · Hybrid Live · 11 days Payback · 4 mo
// Regional law firm · 38 attorneys
$18K/mo
Killed the per-seat ChatGPT bill.

Brain on a $15K server. Every contract and matter file indexed locally. Replaced 47 Copilot seats. Zero documents leave the building. Auditor signed off.

Deployment · On-Prem Live · 9 days Payback · 2 mo
// HVAC shop · 22 trucks
$210K+ revenue
Caught the leads they were losing.

Front Desk + Workflows on cloud. Picks up after-hours service calls, books them straight into the dispatch board. 31% lift in booked jobs over 6 months.

Deployment · Cloud Live · 6 days Payback · 3 wk
05 / 06 — Pricing

Server tiers. Install fees. No surprises.

Three Server tiers, sized to monthly volume. Workflow SKUs run on top — same install fee whether you're cloud, on-prem, or hybrid.

// Spec
What's on the box
// Tier 01 — Starter
Server 5K
$5,000One time · yours forever
// Tier 02 — Most chosen
Server 15K
$15,000One time · yours forever
// Tier 03 — Scale
Server 40K
$40,000One time · yours forever
Sized for
Up to 30 users · <25K calls/mo
Up to 150 users · <120K calls/mo
500+ users · unlimited internal
Hardware class
1× consumer GPU · 64GB
2× pro GPU · 128GB · ECC
4× datacenter GPU · 256GB · redundant PSU
Models pre-loaded
Llama 3.1 8B · Mistral 7B
Llama 3.1 70B · Mistral · Whisper
Llama 3.1 405B · pick any open-weights
Concurrent sessions
Up to 8
Up to 40
Unlimited
Front Desk / Workflows / Brain / Back Office
Included
Included
Included
On-site install
One afternoon · included
One day · included
Two days · included
Warranty + remote support
1 year · email
3 years · phone + remote
3 years · on-call + on-site
// Monthly fee
$0 — no subscription
$0 — no subscription
$0 — no subscription
// Tier 01 — Starter
Server 5K
$5,000One time · yours forever
  • Up to 30 users · <25K calls/mo
  • 1× consumer GPU · 64GB
  • Llama 3.1 8B · Mistral 7B pre-loaded
  • All four SKUs included
  • One-afternoon install · 1yr warranty
  • $0 monthly fee
// Tier 02 — Most chosen
Server 15K
$15,000One time · yours forever
  • Up to 150 users · <120K calls/mo
  • 2× pro GPU · 128GB ECC
  • Llama 3.1 70B · Mistral · Whisper
  • All four SKUs included
  • One-day install · 3yr phone support
  • $0 monthly fee
// Tier 03 — Scale
Server 40K
$40,000One time · yours forever
  • 500+ users · unlimited internal calls
  • 4× datacenter GPU · 256GB · redundant PSU
  • Llama 3.1 405B · any open-weights model
  • All four SKUs included
  • Two-day install · 3yr on-site support
  • $0 monthly fee
Workflow build (cloud-only customers) Per-SKU install fee. Same workflow runs on-prem later — no rebuild.
$4,800 / SKU
06 / 06 — Frequently Asked

Six honest answers.

If your question isn't here, call 412-207-4282. We pick up between 7am and 6pm Eastern. Mike answers most days.

Cloud vs on-prem — how do you decide?

Three numbers: monthly call volume, data sensitivity, capex appetite.

Under ~50K calls/month with no sensitive data — cloud. Frontier-model quality, no install, pay-as-you-go.

Over 50K calls or any legal/medical/financial data — on-prem. The $15K server pays for itself against a $1,500/mo cloud bill in ten months.

Most customers land on hybrid: sensitive stuff on the box, the rest to the cloud. Same workflow either way.

What happens to our data?

On-prem: nothing leaves the building. Inference runs on your server. We can't see it.

Cloud: routed through Claude, GPT, or Gemini under business-tier terms — your data isn't used for training, isn't logged for human review.

Hybrid: you tell us which data classes are sensitive. Those route to the local box. Everything else goes cloud. The routing rules are yours to edit.

How long from contract to live?

Cloud-only deployments: 3 to 7 days. No hardware to ship.

On-prem with a Server 5K or 15K: ~2 weeks. Hardware ships in a week, install is one afternoon to one day.

Server 40K: 3 to 4 weeks. Custom build, two-day on-site install. We've never blown past four.

What does support look like?

Mike answers the phone. So does Tom. There are three of us in Trafford.

Server 5K: email support, business hours.

Server 15K and up: phone, remote screen-share, and we drive out if something needs hands on it. We've made the trip nine times in two years.

Which models do you run?

On-prem: Llama 3.1 (8B / 70B / 405B), Mistral, Qwen, Whisper for speech, embedding models for RAG. All open-weights. All swappable.

Cloud: Claude (Sonnet, Opus), GPT-5, Gemini 2.5. We orchestrate — you don't sign separate contracts.

We don't have a favorite. We pick by job. Speech → Whisper. Long-context legal review → Claude. Cheap structured extraction → Llama 8B local.

What if we outgrow the deployment?

That's the point of building it this way.

Outgrow a Server 5K? We swap it for the 15K. Workflow doesn't change. We credit 60% of the old box against the new one.

Outgrow us entirely? Your workflow is portable — open models, open standards, no proprietary glue we control. Take the stack and go. We'll help you move it.

07 / 07 — Get a quote

Tell us what you need. We'll quote you in 48 hours.

No discovery deck. No "synergy session." Two emails, maybe a phone call, a fixed-price quote with a deploy date.

Pittsburgh-built.
We answer the phone.

Mike Brown runs the shop. Stop by Friday and we'll show you a server running.

Phone 412-207-4282
Shop 514 Cavitt Ave · Trafford, PA
Hours · 7am–6pm ET · Mon–Fri  ·  Quote SLA · 48 hrs
Quote request

Tell us volume, sensitivity, and what you'd like the AI to do. We'll come back with a fixed price, a deploy date, and a recommended deployment model.

No newsletter. No drip campaign. One reply within 48hrs.
Sent. Mike will reply within 48 hours from [email protected].