Methodology — TickerTruth

Source-to-Schema Process

These notebooks demonstrate each pipeline stage on real sample data — runnable in Google Colab with no install required.

↗

↗

Split/bonus ratio parsing, confidence flag filtering, RELIANCE cumulative adjustment chain

↗

Applies cumulative factors to produce a continuous RELIANCE price series; shows the adjustment formula

↗

Shows the wrong momentum signal generated by raw prices and the corrected P&L with adjustment factors

Trust over completeness Mark uncertain data with confidence flags rather than excluding it.

Provenance first Log source and transformation for every record.

Audit trail Keep all versions in Dolt for regulatory and analytical review.

Fail gracefully Missing data is better than wrong data; gaps are logged explicitly.

Table	Purpose
dim_security_master	Central security identifier hub; NSE symbol, ISIN, active status
dim_issuer	Company/issuer identity; sector and market cap category
dim_exchange	Exchange reference (NSE, BSE); enables multi-exchange support
dim_symbol_alias	All historical symbols for a security with effective date ranges
dim_corporate_action_type	Immutable lookup: SPLIT, DIVIDEND, BONUS, MERGER, DELISTING, etc.

Table	Purpose
fact_equity_eod	End-of-day OHLCV price snapshots
fact_corporate_action_event	Normalized corporate action records with confidence scores
fact_adjustment_factor	Pre-computed cumulative adjustment multipliers for backtesting
fact_symbol_lineage_event	Ticker and name change history (renames, mergers, delistings)
fact_listing_status_history	Active, suspended, delisted, relisted status over time

→Surrogate keys — decouple logical identity from business keys
→Soft deletes via flags — keep historical records for audit trails
→Temporal tracking — created_at / updated_at on all tables
→Confidence scoring — every event carries a confidence_score for quality filtering
→Flexible fact granularity — old_value/new_value design supports multiple action types without separate tables

All data is sourced from official NSE public sources: equity master, corporate actions board, daily bhavcopy, and circulars.

Confidence	Sources
High ≥ 0.95	NSE EOD OHLCV, NSE Corporate Action Board, NSDL ISIN Registry
Medium 0.7–0.95	NSE historical archives, parsed web content, BSE cross-references
Low < 0.7	Estimated adjustment factors, reconstructed lineage from sparse data

Every record includes a confidence_score and _source_file field for full traceability.