All teardowns
Teardown · 12 min

What breaks when a Lovable app meets real scale

A Lovable demo is one of the best things to happen to early-stage product. You describe an app, and ninety seconds later something real is on screen. The gap we keep getting hired to close is the one nobody films: what happens between that demo and the first thousand real users.

We took a representative Lovable build — a B2B SaaS dashboard with auth, a Supabase backend, and one AI feature — and pushed it toward production conditions. Below is where it held, where it settled, and exactly how we'd rebuild it. No logos required. Just the thinking.

LovableAI app builderTeardown · 12 min
01 · The premise

Lovable (the rebranded GPT Engineer) turns plain-English prompts into full-stack TypeScript and React apps, with an optional Supabase backend wired in. It reached roughly $100M ARR in about eight months with a team of around 45 — a number worth sitting with, because it tells you how badly founders wanted this. The promise is direct: skip the React boilerplate, the schema design, the deploy config, and get a working product in front of users this week.

We picked it because it's the single most common starting point we see in rescue conversations. Founders arrive having shipped something genuinely useful in days. Then they try to grow it, and the questions change from "can we build this" to "why is this falling over."

02 · What they got right

Credit where it's due, because there's a lot.

The time-to-first-working-thing is real, not a party trick. For validation, that speed is a legitimate competitive advantage — you learn whether anyone wants the thing before you've spent $15k on a freelancer. The generated code is real React and TypeScript you can own and export, not a locked no-drag-and-drop cage. Agent Mode handling its own exploration and debugging is a sensible direction. And wiring Supabase in by default removes the single most common reason prototypes never get a backend at all.

If your goal is "prove the idea is worth building properly," Lovable is a good call. We tell founders this directly. The trouble starts when "prove it" silently becomes "run it."

03 · Where they settled

Here's where the trade-offs land on the user — and they land hardest exactly when the product starts working.

The Technical Cliff

The demo produces a frontend. Production needs infrastructure: row-level security policies, environment variables, real auth flows, a deploy pipeline that doesn't break on the third push. That work gets handed back to a founder who chose Lovable specifically to avoid it. The magic evaporates at the exact moment the product gets traction.

The backend coupling

The tight default binding to Supabase is a gift on day one and a constraint by month three. Once your data model, your access rules, and your AI calls all assume that one wiring, moving any piece of it becomes a project in itself.

The credit treadmill

Pricing is credit-based, and you spend credits fixing the model's own mistakes. Founders describe burning a week's budget re-prompting the same bug the AI swore it had solved. "Cheap development" turns into expensive operational experimentation with an unpredictable monthly number — the worst kind of cost when you're managing runway.

Security you can't see

AI generates functional code. Functional is not the same as secure. In 2026 Lovable took public criticism after users found a permissions issue that could expose project information under certain conditions — a specific instance of the general vibe-coding risk: nobody senior reviewed the auth model, so nobody caught it. For anything touching user data, payments, or PII, that's not a quirk. It's the whole risk.

None of this means the tool is bad. It means the tool is a prototyper, and the market keeps using prototypes as products.

04 · The rebuild

This is how we'd take that same Lovable app to production. The principle: keep everything Lovable did well, replace only what doesn't survive contact with real users. We don't throw away the frontend — we own it and harden everything under it.

1. Extract and own the code

First move is taking the generated code out of the credit loop and into a real repo with version control, so fixes are diffs and reviews, not re-prompts you pay for.

2. Put a real boundary on the backend

We keep Supabase where it earns its place but stop letting the frontend talk to it raw. A thin API/edge layer gives us one place to enforce auth, rate limits, and validation — and the option to move pieces to dedicated Postgres later without a rewrite.

3. Write the security the demo skipped

Explicit row-level security on every table, real session handling, secrets out of the client, and an actual permissions model reviewed by someone who's shipped one before.

4. Isolate the AI feature

Any model call moves behind its own service with input/output validation, a fallback path, and an eval gate so a model change can't silently degrade the product. Here's how we'd think about model choice:

Use case in the appCandidate modelEst. latencyEst. cost / 1k callsWhy
High-volume classificationSmall / fast tier~0.4–0.8s~$0.50–$2Cheap, good enough, scales
User-facing generationMid frontier model~1–2.5s~$8–$20Quality where users see it
Rare complex reasoningTop frontier model~3–6s~$30–$60Only on the 5% that needs it

Planning-stage estimates, not a benchmark. The point isn't the exact figures — it's that the original build used one model for everything, and routing by job is usually where margin and speed both come from.

5. Wrap it in observability and CI

Error tracking, basic load testing, and a deploy pipeline that catches the third-push breakage before users do.

05 · The 6-week plan

What we'd cut, and how we'd ship it.

Week 1

Audit & triage

Map what exists, find the security holes, rank by blast radius. Cut any half-built feature that isn't load-bearing.

Week 2

Security pass

RLS on every table, auth hardening, secrets moved server-side. This ships first because it's the one that ends companies.

Weeks 3–4

Backend boundary

Introduce the API layer, decouple the frontend, set up migrations so the schema stops being a live wire.

Weeks 4–5

AI layer

Isolate the model service, add guardrails and the eval gate, route by job per the table above.

Week 5

Observability & CI/CD

Error tracking, a real pipeline, a basic load test against expected traffic.

Week 6

Load test & handoff

Push it past target load, fix what breaks, document it, hand over a codebase a senior team can extend.

06 · The verdict

Twelve months out, Lovable is going to keep winning the part it owns — getting a real product on screen faster than anything else. We expect the prototyping gap to keep narrowing and the production gap to stay roughly where it is, because secure auth, sane data architecture, and cost discipline aren't prompt problems. They're judgment problems.

So the honest verdict: build your first version in Lovable. Validate fast, spend little, learn everything. Then, the moment real users show up, treat the output as a sharp first draft — not a foundation. The founders who get burned are the ones who never made that switch.

That switch is most of what we do.

FAQ

For validation and early MVPs, yes. For production with real users, payments, or sensitive data, the generated code needs a senior security and architecture pass first — the tool optimizes for speed, not for what survives scale.

Yes. We keep the generated frontend, own the code, and rebuild the backend, security, and AI layer underneath it — typically in about six weeks.

Credit-based pricing means you pay to re-prompt fixes, including fixes for the AI's own errors, which makes monthly cost hard to predict as the app grows.

Usually rescue. The frontend and product logic are often worth keeping — the cost is concentrated in backend, security, and infrastructure, not in a full rewrite.