Alluqi.AI — AI deployed your way.

01 / 06 — The Problem

AI vendors built the cage. We didn't.

Most "AI strategies" end the same way: three contracts, four logins, and a quarterly bill that grows faster than the value. Here's what we hear in the discovery call.

// Problem 01 — Vendor lock-in

One vendor, one cloud, one pricing curve.

Your workflow lives inside their platform. The model upgrade is theirs. The data export is "coming soon." When the price doubles, the migration cost is the moat.

// Problem 02 — Per-seat sprawl

$30/seat × 8 tools × 47 people.

Copilot here, ChatGPT Team there, a Notion AI add-on, a Zapier upgrade. Nobody knows what's used. Finance signs the renewal because cancelling is harder than paying.

// Problem 03 — Data exfiltration

Your contracts are training someone's model.

Legal flagged it. So did the auditor. Sensitive docs shouldn't leave the building, but the team is pasting them into a free chatbot every day because the policy doesn't have a substitute.

02 / 06 — What we deploy

Five SKUs. One workflow layer. Your choice of deployment.

Each SKU runs cloud, on-prem, or hybrid. The Server is optional hardware — we recommend it when volume or privacy make the math work.

// SKU 01

Alluqi.AI
Front Desk

AI receptionist, website chat, and lead qualifier. Sounds like the person who's been at your front desk for twelve years. 24/7. Picks up on the first ring.

Cloud / On-Prem / Hybrid

// SKU 02

Alluqi.AI
Workflows

Custom agent automations across your tool stack — CRM, email, accounting, ticketing. We build the routing. You keep the keys. The workflow moves with you.

Cloud / On-Prem / Hybrid

// SKU 03

Alluqi.AI
Brain

Private RAG knowledge base — "ask the company." Every contract, SOP, ticket, and email indexed and answerable. Nothing leaves the building unless you say so.

Cloud / On-Prem / Hybrid

// SKU 04

Alluqi.AI
Back Office

Document extraction, bookkeeping, month-end automation. Invoices, receipts, expense reports — read, coded, posted. Your bookkeeper goes from data entry to review.

Cloud / On-Prem / Hybrid

// SKU 05

Alluqi.AI
Server

Optional on-prem hardware. $5K / $15K / $40K tiers sized to your usage. Ships racked, models pre-loaded. You own the box. No monthly fee. Yours forever.

On-Prem only

03 / 06 — Deployment Models

Cloud. On-prem. Hybrid.

Three equal paths. We pick the one that fits your volume, data sensitivity, and budget — and build the workflow so the other two are a config change away.

Cloud

// Pay-as-you-go

Light or sporadic volume — under ~50K calls/month
Frontier-model reasoning (Claude, GPT-5, Gemini)
Buyers not ready for capex
Variable monthly spend, no install fee

Starts at $0.30 / call · orchestrated

Hybrid

// The pragmatic default

Mixed sensitivity — some data private, some not
Sensitive workloads routed to a local server
Everything else flows to cloud frontier models
Best balance of cost, privacy, and quality

~70% of customers run this

On-Prem

// Owned hardware

High volume — 50K+ calls/month
Legal, medical, financial, IP-sensitive workloads
Capex-friendly buyers — own, don't rent
Predictable cost, zero data egress

One time. Yours forever.

04 / 06 — Customer Results

The math, anonymized.

Three customers, three deployments, three different problems. We don't promise ROI numbers we can't back with a real case.

// Dental group · 4 locations

$34K/yr

Replaced one front-desk hire.

Front Desk on hybrid. AI answers after-hours and triages new-patient calls. Saved a $42K salaried role; kept the receptionist they liked for daytime.

Deployment · Hybrid Live · 11 days Payback · 4 mo

// Regional law firm · 38 attorneys

$18K/mo

Killed the per-seat ChatGPT bill.

Brain on a $15K server. Every contract and matter file indexed locally. Replaced 47 Copilot seats. Zero documents leave the building. Auditor signed off.

Deployment · On-Prem Live · 9 days Payback · 2 mo

// HVAC shop · 22 trucks

$210K+ revenue

Caught the leads they were losing.

Front Desk + Workflows on cloud. Picks up after-hours service calls, books them straight into the dispatch board. 31% lift in booked jobs over 6 months.

Deployment · Cloud Live · 6 days Payback · 3 wk

05 / 06 — Pricing

Server tiers. Install fees. No surprises.

Three Server tiers, sized to monthly volume. Workflow SKUs run on top — same install fee whether you're cloud, on-prem, or hybrid.

// Spec

What's on the box

// Tier 01 — Starter

Server 5K

$5,000One time · yours forever

// Tier 02 — Most chosen

Server 15K

$15,000One time · yours forever

// Tier 03 — Scale

Server 40K

$40,000One time · yours forever

Sized for

Up to 30 users · <25K calls/mo

Up to 150 users · <120K calls/mo

500+ users · unlimited internal

Hardware class

1× consumer GPU · 64GB

2× pro GPU · 128GB · ECC

4× datacenter GPU · 256GB · redundant PSU

Models pre-loaded

Llama 3.1 8B · Mistral 7B

Llama 3.1 70B · Mistral · Whisper

Llama 3.1 405B · pick any open-weights

Concurrent sessions

Up to 8

Up to 40

Unlimited

Front Desk / Workflows / Brain / Back Office

Included

On-site install

One afternoon · included

One day · included

Two days · included

Warranty + remote support

1 year · email

3 years · phone + remote

3 years · on-call + on-site

// Monthly fee

$0 — no subscription

// Tier 01 — Starter

Server 5K

$5,000One time · yours forever

Up to 30 users · <25K calls/mo
1× consumer GPU · 64GB
Llama 3.1 8B · Mistral 7B pre-loaded
All four SKUs included
One-afternoon install · 1yr warranty
$0 monthly fee

// Tier 02 — Most chosen

Server 15K

$15,000One time · yours forever

Up to 150 users · <120K calls/mo
2× pro GPU · 128GB ECC
Llama 3.1 70B · Mistral · Whisper
All four SKUs included
One-day install · 3yr phone support
$0 monthly fee

// Tier 03 — Scale

Server 40K

$40,000One time · yours forever

500+ users · unlimited internal calls
4× datacenter GPU · 256GB · redundant PSU
Llama 3.1 405B · any open-weights model
All four SKUs included
Two-day install · 3yr on-site support
$0 monthly fee

Workflow build (cloud-only customers) Per-SKU install fee. Same workflow runs on-prem later — no rebuild.

$4,800 / SKU

06 / 06 — Frequently Asked

Six honest answers.

If your question isn't here, call 412-207-4282. We pick up between 7am and 6pm Eastern. Mike answers most days.

Cloud vs on-prem — how do you decide?

Three numbers: monthly call volume, data sensitivity, capex appetite.

Under ~50K calls/month with no sensitive data — cloud. Frontier-model quality, no install, pay-as-you-go.

Over 50K calls or any legal/medical/financial data — on-prem. The $15K server pays for itself against a $1,500/mo cloud bill in ten months.

Most customers land on hybrid: sensitive stuff on the box, the rest to the cloud. Same workflow either way.

What happens to our data?

On-prem: nothing leaves the building. Inference runs on your server. We can't see it.

Cloud: routed through Claude, GPT, or Gemini under business-tier terms — your data isn't used for training, isn't logged for human review.

Hybrid: you tell us which data classes are sensitive. Those route to the local box. Everything else goes cloud. The routing rules are yours to edit.

How long from contract to live?

Cloud-only deployments: 3 to 7 days. No hardware to ship.

On-prem with a Server 5K or 15K: ~2 weeks. Hardware ships in a week, install is one afternoon to one day.

Server 40K: 3 to 4 weeks. Custom build, two-day on-site install. We've never blown past four.

What does support look like?

Mike answers the phone. So does Tom. There are three of us in Trafford.

Server 5K: email support, business hours.

Server 15K and up: phone, remote screen-share, and we drive out if something needs hands on it. We've made the trip nine times in two years.

Which models do you run?

On-prem: Llama 3.1 (8B / 70B / 405B), Mistral, Qwen, Whisper for speech, embedding models for RAG. All open-weights. All swappable.

Cloud: Claude (Sonnet, Opus), GPT-5, Gemini 2.5. We orchestrate — you don't sign separate contracts.

We don't have a favorite. We pick by job. Speech → Whisper. Long-context legal review → Claude. Cheap structured extraction → Llama 8B local.

What if we outgrow the deployment?

That's the point of building it this way.

Outgrow a Server 5K? We swap it for the 15K. Workflow doesn't change. We credit 60% of the old box against the new one.

Outgrow us entirely? Your workflow is portable — open models, open standards, no proprietary glue we control. Take the stack and go. We'll help you move it.

07 / 07 — Get a quote

Tell us what you need. We'll quote you in 48 hours.

No discovery deck. No "synergy session." Two emails, maybe a phone call, a fixed-price quote with a deploy date.

Pittsburgh-built.
We answer the phone.

Mike Brown runs the shop. Stop by Friday and we'll show you a server running.

Phone 412-207-4282

Email [email protected]

Shop 514 Cavitt Ave · Trafford, PA

Hours · 7am–6pm ET · Mon–Fri · Quote SLA · 48 hrs

Quote request

Tell us volume, sensitivity, and what you'd like the AI to do. We'll come back with a fixed price, a deploy date, and a recommended deployment model.

Your name

Company

Email

Phone

What you want

Preferred deployment

Volume, sensitivity, what hurts

No newsletter. No drip campaign. One reply within 48hrs.

Sent. Mike will reply within 48 hours from [email protected].

AI deployedyour way.

AI vendors built the cage. We didn't.

One vendor, one cloud, one pricing curve.

$30/seat × 8 tools × 47 people.

Your contracts are training someone's model.

Five SKUs. One workflow layer. Your choice of deployment.

Alluqi.AIFront Desk

Alluqi.AIWorkflows

Alluqi.AIBrain

Alluqi.AIBack Office

Alluqi.AIServer

Cloud. On-prem. Hybrid.

Cloud

Hybrid

On-Prem

The math, anonymized.

Server tiers. Install fees. No surprises.

Six honest answers.

Tell us what you need. We'll quote you in 48 hours.

AI deployed
your way.

Alluqi.AI
Front Desk

Alluqi.AI
Workflows

Alluqi.AI
Brain

Alluqi.AI
Back Office

Alluqi.AI
Server