Stop waiting on compliance. Stop labeling by hand. Get realistic synthetic transaction data โ structured + narrative โ ready to train your fraud model today.
{
"transaction_id": "synth_a3f2b91c",
"card_hash": "card_7d4e9f2a1b3c",
"amount_usd": 4250.00,
"merchant_mcc": "5944",
"merchant_mcc_label": "Jewelry Stores",
"timestamp": "2026-06-15T02:47:00Z",
"hour_of_day": 2,
"location": { "city": "Lagos", "region": "NG" },
"is_card_present": false,
"velocity_last_1h": 6,
"velocity_last_24h": 22,
"distance_from_home_km": 8432.1,
"fraud_label": 1,
"fraud_pattern": "account_takeover",
"fraud_score": 0.934,
"fraud_narrative": "New device fingerprint โ no match to cardholder's known devices. Transaction in Lagos (8,432km from billing address) at 2:47 local time..."
}
Your ML team needs millions of labeled examples. Here's what getting real data actually looks like.
Real transaction data has cardholder names, cards, and behavioral signals. Every dataset needs legal review, DPA agreements, and audit trails. 6 months before your data engineer can even open the file.
Hand-labeling fraud patterns takes months. Rare fraud types (synthetic identity, bust-out rings) are underrepresented in real data โ exactly the cases your model needs most.
Real fraud rates are 0.1-2%. Training on imbalanced data produces models that flag nothing or everything. You need controlled synthetic datasets with exactly the right fraud-to-legit ratio.
No legal review. No labeling sprint. No PII headaches.
Dataset size, fraud rate, fraud patterns to include, industry vertical, feature set. We configure the generator to match your model's needs.
Realistic synthetic transactions โ structured fields + narrative fraud descriptions โ generated to your exact specifications. JSON + CSV, ready for your pipeline.
Drop the dataset into your training pipeline. No DPA. No compliance sign-off. Your model is in production faster.
Every dataset includes configurable proportions of each pattern โ including the rare ones your real data doesn't have enough of.
New device, off-hours, geographic anomaly
CNP, billing mismatch, CVV failures
Shell merchants, MCC mismatch, ring activity
Coordinated cluster, cross-bank, cash-out
High-frequency low-value, systematic draining
Authorized push payment, KBA coaching
ITIN, mail drop, thin-file bust-out
Rapid spend, no payments, credit limit approach
No monthly commitments. No contracts. Just order the data you need.
Enterprise (100K+ records)? Reply to your sample email or book a 15-min call for custom pricing + SLA.
1,000 synthetic fraud transactions. All 8 patterns. No credit card required. Delivered within 24 hours.
After you download, we'll send you three emails: Try it, Scale it, and Talk to us. Pick whichever fits.