Pangaea Labs // AI Infrastructure

AI,
implemented.

From strategy to production — evaluated, audited, shipped. We build the models, the agents, and the platform around them, then prove they work.

Book an audit →

See the agents

Secure by design — your data stays in your environment.

Trusted by teams shipping across Indonesia

Why Pangaea

Adoption is a discipline.
We install it.

Most teams lose 6–12 months learning what works with AI — and ship a pile of unguarded prototypes along the way. We bring the method that skips the detour, and we hand it to your people to keep.

We compress the learning curve

Teams reach competent, safe AI delivery in weeks — not the 6–12 months it takes alone. We bring the playbook; your people keep it.

We sell the discipline, not the hype

Guardrails, gates and evals first. The interesting capabilities come after the controls — because that’s the only way they last in production.

We prove it with metrics

We baseline your delivery numbers on day one and report against them. Full observability included — you see every metric and exactly what each agent costs to run, so spend always maps to outcomes. No runaway bills.
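As a rough illustration of what "see and cap spend" could look like in code, here is a minimal sketch of a per-agent spend ledger with a hard cap. The names `SpendTracker`, `BudgetExceeded`, `cap_cents` and the agent labels are hypothetical and chosen for this example only; this is not Pangaea's implementation, and a real setup would read costs from provider billing or token counts.

```python
from dataclasses import dataclass, field


class BudgetExceeded(RuntimeError):
    """Raised when a charge would push total spend over the cap."""


@dataclass
class SpendTracker:
    # Hypothetical ledger: amounts are integer cents to avoid float rounding.
    cap_cents: int
    spent_by_agent: dict[str, int] = field(default_factory=dict)

    @property
    def total_cents(self) -> int:
        return sum(self.spent_by_agent.values())

    @property
    def remaining_cents(self) -> int:
        return self.cap_cents - self.total_cents

    def record(self, agent: str, cost_cents: int) -> None:
        """Attribute a charge to an agent, refusing it if the cap would be exceeded."""
        if cost_cents < 0:
            raise ValueError("cost_cents must be non-negative")
        if self.total_cents + cost_cents > self.cap_cents:
            raise BudgetExceeded(
                f"{agent}: charge of {cost_cents} exceeds remaining {self.remaining_cents}"
            )
        self.spent_by_agent[agent] = self.spent_by_agent.get(agent, 0) + cost_cents
```

Checking the cap before recording the charge means a rejected call leaves the ledger unchanged, so the per-agent totals always add up to spend that was actually allowed.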

Build

AI Implementation

We take AI from whiteboard to production — RAG pipelines, LLM apps and agentic systems, built across the cost spectrum (self-hosted, hybrid or fully managed) and wired into your stack with guardrails, evals, observability and real cost controls.

Production RAG — parsing, PII redaction, hybrid retrieval & reranking

Self-correcting answers (CRAG / Self-RAG), gated on faithfulness

Agent & tool-use orchestration — agentic RAG, A2A

Eval-gated CI/CD with full observability and a feedback loop

Production RAG pipeline

01 parse() ↓
02 redact() ↓
03 chunk() ↓
04 embed() ↓
05 retrieve() ↓
06 rerank() ↓
07 generate() ↓
08 verify()

✓ every response gated on faithfulness ≥ 0.90
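To make the shape of the pipeline concrete, here is a minimal sketch of a few of the stages ending in a faithfulness gate. Only the stage names and the 0.90 threshold come from the page. Everything else is an assumption for illustration: the email-only redaction, the word-overlap retrieval (standing in for embed, retrieve and rerank), and the toy faithfulness score, which is far cruder than an LLM-judged metric. Generation is left out, so answers are passed in directly.

```python
import re
from dataclasses import dataclass

FAITHFULNESS_GATE = 0.90  # threshold from the page; the scoring below is a toy


@dataclass
class Answer:
    text: str
    faithfulness: float
    released: bool


def redact(text: str) -> str:
    """Mask email addresses (a stand-in for real PII redaction)."""
    return re.sub(r"[\w.+-]+@[\w-]+\.[\w.-]+", "[EMAIL]", text)


def chunk(text: str) -> list[str]:
    """Split text into sentence-sized chunks."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokens(text: str) -> set[str]:
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(query: str, chunks: list[str], k: int = 2) -> list[str]:
    """Rank chunks by word overlap with the query and keep the top k."""
    q = tokens(query)
    ranked = sorted(chunks, key=lambda c: len(q & tokens(c)), reverse=True)
    return ranked[:k]


def faithfulness(answer: str, context: list[str]) -> float:
    """Share of answer sentences whose words all appear in the context."""
    ctx = tokens(" ".join(context))
    sentences = chunk(answer)
    if not sentences:
        return 0.0
    supported = sum(1 for s in sentences if tokens(s) <= ctx)
    return supported / len(sentences)


def verify(answer: str, context: list[str]) -> Answer:
    """Score the answer and release it only if it clears the gate."""
    score = faithfulness(answer, context)
    return Answer(answer, score, released=score >= FAITHFULNESS_GATE)
```

The design point is that `verify()` returns the score alongside the release decision, so a blocked answer can still be logged and inspected rather than silently dropped.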

Prove it

Eval & Audit

No AI ships on vibes. Faithfulness, relevancy, precision and recall are measured on every release — gated against targets agreed up front — and we red-team the same build for jailbreaks, injection and PII leakage before it reaches a user.

Representative targets

Illustrative gate thresholds a production build should clear, not past client results. Your baseline and the targets that define “done” are set together on day one.

Faithfulness: 0.94
Answer Relevancy: 0.91
Context Precision: 0.89
Jailbreak Resistance: 98.6%

Measured against the standards

OWASP Top 10 for LLM Applications
NIST AI Risk Management Framework
RAGAS
Retrieval-Augmented Generation (Lewis et al., 2020)
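A release gate of the kind described here can be sketched as a small check that a CI job runs after the eval suite. The thresholds below reuse the illustrative targets from this page. The metric keys, the `gate` and `exit_code` helpers, and the idea of feeding in a plain dictionary of scores are assumptions for this example, not a description of any particular eval tool's output format.

```python
# Illustrative floors taken from the representative targets on this page.
TARGETS = {
    "faithfulness": 0.94,
    "answer_relevancy": 0.91,
    "context_precision": 0.89,
    "jailbreak_resistance": 0.986,
}


def gate(scores: dict[str, float], targets: dict[str, float] = TARGETS) -> list[str]:
    """Return one message per metric that is missing or below its floor."""
    failures = []
    for metric, floor in targets.items():
        value = scores.get(metric)
        if value is None:
            failures.append(f"{metric}: missing (target {floor:.3f})")
        elif value < floor:
            failures.append(f"{metric}: {value:.3f} < {floor:.3f}")
    return failures


def exit_code(scores: dict[str, float]) -> int:
    """0 when every target is met, 1 otherwise, so CI can block the release."""
    return 1 if gate(scores) else 0
```

Treating a missing metric as a failure is a deliberate choice in this sketch: a build that skipped an eval should not pass by omission.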

Ready to run

Plugins & Agents

A library of autonomous agents you can demo today — each one a node in the constellation, each one doing real work in your pipeline.

AI Audit

eval + security

Scores faithfulness, relevancy and recall, and red-teams for jailbreaks, injection and PII leaks.

Prod RAG Pipeline

rag

Crawl → chunk → embed → retrieve, with guarded tool calls on every hop.

CodeGen AI

codegen

Best-practice templates and flows for Go and Elysia/Bun — deterministic, traceable, consistent code, no freeform drift.

AI Review

ci/cd

LSP + SAST + standards on every merge, then a CI/CD-gated rollout with a human in the loop.

Shipped

Software Development

AI is one half of the work. The other half is the platform it runs on — here is some of what we have built and shipped.

Clients & Partners

Golden Rama

PERF · SEO

Travel commerce, rebuilt

Modernised a 50-year travel brand’s booking funnel — a faster catalog across 200+ tours and a cleaner path to conversion.

Next.js · Node · RDS MySQL · AWS · Jenkins · Docker Swarm · Elasticsearch · Redis · NSQ

TripDeals.id

LAUNCH · CONVERSION

Greenfield youth travel

Built a budget, multi-country tour site for a younger market from scratch — mobile-first and made to convert.

Next.js · Node · RDS MySQL · AWS · Jenkins · Docker Swarm · Elasticsearch · Redis · NSQ

Prakerja

SCALE · RELIABILITY

Government-scale microservices

High-throughput services for a national skilling program serving millions of citizens.

Go · gRPC · Kafka · Aliyun · Kubernetes

Wadugs

MAPS · RBAC

Geospatial multi-tenant SaaS

Tenant-isolated geospatial dashboards with live cloud charts at scale.

React · Go · PostGIS · Python · AWS · Lambda

Jari PMI

SOCIAL · GOV

Migrant-worker info network

An information network for Indonesian migrant workers and their families — protection, empowerment and financial health in one place.

Web · CMS · Analytics · Railway

Adeeptive

PARTNER · SEO

Delivery partner

We work with Adeeptive to make sure every web platform we deliver ships with strong, search-ready SEO.

Visit adeeptive.com ↗

Proof

Numbers that survive an audit

We baseline your delivery on day one and report against it every sprint. Here is what changes once the discipline is installed.

Weeks to safe AI delivery, not 6–12 months
Week 1: gates + metric baseline live
12+ production systems shipped
100% yours: code, evals, guardrails

“They shipped a visible lift in our tour site’s performance — and fixed the web structure, which moved our SEO rankings up.”

Golden Rama · travel commerce

“The platform has reliably handled 100 million registered users over four years, and 47 million transactions.”

Prakerja · national skilling program

Shipping across

Government · Travel · Geospatial · Payments

How we work

From scope to shipped in four moves

Scope

We map the use-case, the data, and the eval that defines “done”.

Build

Models, agents, and the platform around them — wired to your stack.

Eval

Faithfulness, safety and cost, measured before anything launches.

Ship

Gated deploys, observability, and a feedback loop that keeps learning.

Reach

Coming soon

Digital Marketing Platform

Own your storefront, your brand and your customer list instead of renting a marketplace stall. Your spend grows your own store rather than funding a marketplace’s cut on every sale plus fees to be seen in their app. You run your ads across channels and compete on your own terms; our one-stop engine builds the store, keeps the winners, and tracks sales against spend.

Your customers already live on their phones. In Indonesia the average person spends 3+ hours a day on social media, and WhatsApp reaches roughly nine in ten internet users — while businesses worldwide pour over USD 700 billion a year into digital ads. They are already online; digital marketing is simply how you meet them there.

One platform across the whole funnel — a story to earn attention, proof to win the comparison, a click-to-WhatsApp or live session to close, and retention that compounds (a 5% lift can raise profit 25%+).

Click-to-WhatsApp ads that turn spend into real chats

Live-selling support — Indonesia’s ~3× converter

The right message per stage: story → proof → offer → return

Audiences you own — your list & profiles, not rented reach

On-brand content at volume — Bahasa Indonesia + English

Measured on what matters: cost per customer vs profit

See pricing →

Talk to us

Pricing

Priced to approve,
structured to stop.

Buy a single rung when you want a contained win, or retain an embedded team to climb the whole ladder. Either way, you see every metric and the cost behind it — spend maps to outcomes.

01 Adopt
Install the method on one use-case: guardrails, gates and an eval that defines done.

02 Amplify
Scale it across teams and surfaces, with metrics compounding each sprint.

03 Autonomize
Hand the wheel to gated agents, with humans on the exceptions and costs in check.

Per-rung

Fixed · per rung

Project-based: one rung at a time, fixed scope. Easy to approve, easy to budget, easy to stop.

→ Defined deliverables and exit criteria
→ A single rung: Adopt, Amplify or Autonomize
→ Baseline + target metrics agreed up front
→ No long-term commitment

Scope a single rung

Most teams start here

Monthly retainer

Monthly · embedded team

Embedded advisory + delivery across all three rungs, moving in lockstep with your roadmap.

→ Advisory + hands-on delivery across all rungs
→ Continuous guardrail, gate and eval ownership
→ Metrics reported every sprint
→ Priority access in Bahasa Indonesia + English

Book an Adopt Assessment

Why us

Built by people who ship

Secure by design, quality over quantity, and real production pedigree — here is what you get when our team embeds with yours.

SECURE

Secure by design

Protection built in from day one — least-privilege access, guardrails and PII handling — not patched on later.

QUALITY

Quality over quantity

Fewer things, done right and to industry best practices. We ship what survives production, not demos.

PEDIGREE

Real pedigree

A team from real tech companies — 10+ years shipping production software, not slideware.

Disciplines on the team

Harry Osmar Sitohang

Principal Engineer · Architecture & agents

The hard production calls — system design, agent orchestration, and code that has to hold.

Evaluation & Safety

Faithfulness & red-team

The gates that decide what ships: faithfulness, jailbreak resistance and PII safety.

Platform & Delivery

CI, gates & observability

Guardrails wired into your pipeline, shipping on cadence with the metrics in view.

Indonesia + Global

Bahasa Indonesia & English

Embedded with your team in either language — built for Indonesian and international orgs.

Questions

Methodology, security, commercials.

Methodology

Is Pangaea a tool or a service?

A service. We embed with your team and install a method — guardrails, gates and evals wired into your own pipeline. You keep everything we build; there’s no Pangaea product to license afterwards.

What does Adopt deliver in week one?

Working guardrails and CI gates, an eval harness running against real changes, and a measured baseline of your current delivery numbers. By Friday of week one, nothing ships without passing the gates.

Do you replace our senior engineers?

No — we make them sharper. The discipline frees seniors from routine work and gives them leverage. Adoption that sidelines your best people doesn’t survive contact with production.

How fast is the first rung?

Adopt is scoped in weeks, not quarters. The first gates and the metric baseline are live within the first week; the full rung typically completes inside a month.

What do we own at the end?

All of it. The guardrails, gate configs, eval suites, prompts and playbooks live in your repos under your license. If we walked away, your team would keep shipping the same way.

Security

Which CI / Git platforms do you support?

GitHub, GitLab and Bitbucket — cloud or self-managed. Gates and evals run inside your existing CI; we don’t route your pipeline through anything of ours.

How is our source code and data handled?

It stays in your environment. We work against your repos and infrastructure with least-privilege access, and we don’t exfiltrate code or train on your data. Specifics are set in a DPA before kickoff.

Can you run on-prem or in our VPC?

Yes. The entire method runs inside your VPC or on-prem — model endpoints, gates and evals included. Air-gapped variants are available for regulated environments.

Commercials

How do I know what I’m paying for?

Total observability. Every metric and the cost and usage behind it, in real time — you can see and cap spend, so cost always maps to outcomes.

Do you work in Bahasa Indonesia and English?

Both, fluently — workshops, docs and day-to-day delivery in either language. Our team is built for Indonesian and international engineering orgs alike.

Let’s build

Let’s start.

Tell us what you are building. We will scope an AI workstream, audit what you already have, or stand up an agent this week.

sales@pangaea.id · Jakarta, ID
