VynFi Accounting Network Explorer
Interactive ISO 21378 account-class flow graph (v5.9.0)
Synthetic Data
Privacy-preserving, fully synthetic financial datasets and ML demos โ generated by a deterministic engine that produces balanced double-entry general ledgers, multi-entity group consolidations, AML/banking transactions, and OCEL process logs, all carrying ground-truth fraud / anomaly labels and grounded in real accounting standards (IFRS ยท US/French/German GAAP ยท ISA) and the ISO 21378 audit-data model.
Everything here is synthetic โ no client or real-world data โ so it can be used freely to train, benchmark, and stress-test audit, fraud-detection, and graph-ML systems.
| Space | What it does |
|---|---|
| ๐ Inverse-Audit Detector | Label-free anomaly detection on a synthetic GL โ fit the normal-system manifold, then flag journal entries by deviation via two fit-on-self residual arms (per-JE density + relational account-flow-graph) routed into one risk score. Pick a fraud scenario, see per-arm ROC, recall @ audit budget, and the top suspicious entries. |
| ๐ Counterfactual GL Explorer | Seed-locked baseline vs counterfactual ledgers from a causal-DAG intervention โ pick a scenario (control-stress / SoD breakdown), see the effect-field distribution shift, the intervention trace, and the exact changed lines. Byte-deterministic generation, so the diff is signal, not noise. |
| ๐ธ Financial Sankey Explorer | Multi-step funds-flow view of synthetic financial statements โ the income-statement waterfall (Revenue โ Gross Profit โ Operating Income โ Net Income, with operating-expense sub-bands) and the cash-flow flow (Operating/Investing/Financing โ Net change in cash), per entity / period. |
| ๐ก๏ธ Fraud-GNN Demo | Graph-neural-network fraud detection on the JE network โ edge fraud predictor, node anomaly explorer, and a live check with confusion matrix + ROC. |
| ๐ Accounting Network Explorer | Interactive ISO 21378 account-class flow graph โ filter by business process, fraud, anomaly, amount, top-N; drill from Level-2 classes into Level-3 sub-classes. |
| ๐ Process Mining Demo | pm4py directly-follows graphs, variants, and statistics on the supply-chain OCEL 2.0 event log. |
| ๐๏ธ Data Explorer | Browse and inspect the VynFi synthetic datasets. |
| ๐ต๏ธ Perfect Audit Crime Challenge | Two-track community leaderboard โ flag the planted fraud in synthetic GLs and help map the detectability frontier. Track A (ledger only): the mimetic perfect crime โ fraud drawn from the ledger's own normal distribution โ is provably uncatchable. Track B (ledger + ISA-520/505 evidence): it becomes catchable. Upload a submission โ PR-AUC + per-observability recall on held-out labels. |
ERP showcase / financial statements
| Dataset | Highlights |
|---|---|
| vynfi-sap-showcase | "All capabilities" SAP-style ERP + audit bundle โ per-period financial statements (BS / IS / CF) with prior-period comparatives, operating-expense breakout, and Sankey funds-flow exports, plus subledgers, FX, intercompany + consolidation, audit (ISA/SOX) + COSO controls, HR / manufacturing / treasury / tax / ESG, and ERP-coupled banking. Explore the flows in the Financial Sankey Explorer. |
Group audit & consolidation
| Dataset | Highlights |
|---|---|
| vynfi-group-audit-enterprise-2000 | End-to-end 2 000-entity group: matched intercompany pairs, eliminations, IFRS-consolidated financial statements + schedules + notes + CTA/NCI/equity-method rollforwards. |
| vynfi-group-audit-3yr-medium | Multi-period (3-year) group-audit bundle โ period N+1 opens from period N's closing trial balance. |
| vynfi-je-network-2k | 68.5 M-edge consolidated journal-entry network from the 2 000-entity group โ drop-in for GNN training (PyG / DGL), with is_fraud, ic_pair_id, is_eliminated. |
General ledger / journal entries
| Dataset | Highlights |
|---|---|
| vynfi-journal-entries-1m | ~1 M-entry manufacturing GL with ISA 240 manual flags, fraud labels, and chart of accounts. |
| vynfi-journal-entries-10m | Research-scale ~10.9 M-entry synthetic GL. |
| vynfi-audit-p2p | Procure-to-Pay document chain (PO/GR/VI/Payment) with fraud labels โ audit-engagement grade. |
Causal / counterfactual
| Dataset | Highlights |
|---|---|
| vynfi-counterfactual-gl | Seed-locked, byte-deterministic baseline โ counterfactual GL pairs under named causal-DAG interventions (control-environment, SoD) โ each pair differs only by the intervention's effect; the diff split isolates the changed lines. A clean treatment/control substrate for causal ML, treatment-effect estimation, and residual-based audit analytics. |
AML / banking
| Dataset | Highlights |
|---|---|
| vynfi-aml-100k | 748 K banking transactions with AML/SAR-style labels and velocity features. |
| vynfi-sar-narratives | 156 K transactions paired with suspicious-activity-report narratives + AML labels. |
Process mining (OCEL 2.0)
| Dataset | Highlights |
|---|---|
| vynfi-ocel-manufacturing | Manufacturing OCEL 2.0 event log โ production-order lifecycle + quality inspections. |
| vynfi-supply-chain-ocel | 5-company manufacturing supply-chain OCEL 2.0 event log for cross-process mining. |
Challenge
| Dataset | Highlights |
|---|---|
| perfect-audit-crime-data | The ledgers behind the Perfect Audit Crime Challenge โ 3 multi-entity GLs across two tracks (ledger / ledger + ISA-520/505 evidence) with a planted mimetic perfect-crime family; labels held out for scoring. |
| Model | What it is |
|---|---|
| je-fraud-gnn | GraphSAGE 2-layer journal-entry fraud classifier (test AUC 0.914) + attribute-reconstruction GAE node-anomaly scorer (per-edge AUC 0.654, unsupervised). Includes weights, preprocessor, and full metrics. |
All datasets and demos are synthetic and contain no client or real-world data.
Interactive ISO 21378 account-class flow graph (v5.9.0)
GraphSAGE fraud + GAE anomaly on synthetic JE network
Funds-flow Sankeys from synthetic financial statements
Seed-locked baseline vs counterfactual GL pairs
Score your fraud detection submissions on synthetic ledgers