Predicting financial reporting failures before they happen.

FilingRisk uses EDGAR metadata and machine learning to quantify reporting infrastructure strain. We bridge the gap between academic research and real-world compliance monitoring.

Read the research Explore the API

The EDGAR-ReSTMT Architecture

Our foundation is EDGAR-ReSTMT, the first public dataset of SEC financial restatements derived entirely from EDGAR metadata. No proprietary dependencies. Fully deterministic.

Metadata as a Signal

Financial content models miss the process signatures of filing behavior. We analyze acceptance lags, document fragmentation, and amendment frequencies as empirical proxies for reporting system quality.

Deterministic Labeling

Our pipeline combines form-type filtering, regex keyword matching, and NLP context disambiguation to achieve reliable, reproducible labels without black-box LLM classification.

Predictive Modeling

We train logistic regression and random forest classifiers on millions of historical filings, using metadata to predict subsequent restatements and amendments with measurable AUC improvements.

Enterprise API (Coming Soon)

Embed our predictive scores directly into your compliance or investment workflows. Identify high-entropy filings and quantify infrastructure risk in real-time.