DATA BROKERAGE

DIGITAL DATASETS

Curated user-generated data for AI training. Real human behavior, structured for model consumption.

[ USER GENERATED ][ 250+ TB ][ 15+ YEARS ]
AVAILABLE DATASETS

PRODUCT CATALOG

RETAIL INVESTOR DATA

SOCIAL FINANCE

The largest social finance platform — retail investor posts, sentiment signals, and engagement data spanning 15+ years. Already licensed by leading quantitative funds.

Total volume250–270 TB
Messages426M+
Tickers50,000+
Data sinceJuly 2009
Current throughput6–7M posts/mo
FormatJSON (S3)
Latency200ms (firehose)
SUB-PRODUCTS

Firehose

Full message stream with user metadata, tickers, sentiment flags, and activity (likes, reshares, follows).

Symbol Event

Ticker-level engagement — message volume, likes, pageviews, watchlist changes. Real-time.

Sentiment

Per-ticker sentiment scores refreshed every 5 minutes. Backtested: 18.1% CAGR on Nasdaq 100 L/S.

SPORTS + FINANCE NLP DATA

SPORTS NLP

The only sports data provider offering natural language query to structured data response. Sole data provider to Grok for sports — six months of engineering, half their team, because it is real-time.

Total queries1Bn+
Growth500M+ last year alone
Output formatJSON
CoverageAll historical seasons
Data typeNLP → structured stats
Provider statusSole Grok data source
KEY VALUE

NLP Query Layer

Natural language in, structured data out. AI labs cannot get this elsewhere — the conversational query layer is the moat.

API Access Model

Recurring API licensing — no raw data dump. Same model powering the Grok integration. Stop paying, lose access.

Full Coverage

All major sports, all historical seasons, all available statistics. Finance data layer included.

INTERESTED?

We broker curated datasets for AI training. Reach out to discuss licensing, pricing, and trial access.

team@gerra.com