WIP UI rework with qt6

This commit is contained in:
2025-09-10 15:39:16 +08:00
parent 36385af6f3
commit ebf232317c
63 changed files with 4005 additions and 5221 deletions


@@ -1,120 +0,0 @@
# ADR-001: Persistent Metrics Storage
## Status
Accepted
## Context
The original orderflow backtest system kept all orderbook snapshots in memory during processing, leading to excessive memory usage (>1GB for typical datasets). With the addition of OBI and CVD metrics calculation, we needed to decide how to handle the computed metrics and manage memory efficiently.
## Decision
We will implement persistent storage of calculated metrics in the SQLite database with the following approach:
1. **Metrics Table**: Create a dedicated `metrics` table to store OBI, CVD, and related data
2. **Streaming Processing**: Process snapshots one-by-one, calculate metrics, store results, then discard snapshots
3. **Batch Operations**: Use batch inserts (1000 records) for optimal database performance
4. **Query Interface**: Provide time-range queries for metrics retrieval and analysis
## Consequences
### Positive
- **Memory Reduction**: >70% reduction in peak memory usage during processing
- **Avoid Recalculation**: Metrics calculated once and reused for multiple analysis runs
- **Scalability**: Can process months/years of data without memory constraints
- **Performance**: Batch database operations provide high throughput
- **Persistence**: Metrics survive between application runs
- **Analysis Ready**: Stored metrics enable complex time-series analysis
### Negative
- **Storage Overhead**: Metrics table adds ~20% to database size
- **Complexity**: Additional database schema and management code
- **Dependencies**: Tighter coupling between processing and database layer
- **Migration**: Existing databases need schema updates for metrics table
## Alternatives Considered
### Option 1: Keep All Snapshots in Memory
**Rejected**: Unsustainable memory usage for large datasets. Would limit analysis to small time ranges.
### Option 2: Calculate Metrics On-Demand
**Rejected**: Recalculating metrics for every analysis run is computationally expensive and time-consuming.
### Option 3: External Metrics Database
**Rejected**: Adds deployment complexity. SQLite co-location provides better performance and simpler management.
### Option 4: Compressed In-Memory Cache
**Rejected**: Still faces fundamental memory scaling issues. Compression/decompression adds CPU overhead.
## Implementation Details
### Database Schema
```sql
CREATE TABLE metrics (
id INTEGER PRIMARY KEY AUTOINCREMENT,
snapshot_id INTEGER NOT NULL,
timestamp TEXT NOT NULL,
obi REAL NOT NULL,
cvd REAL NOT NULL,
best_bid REAL,
best_ask REAL,
FOREIGN KEY (snapshot_id) REFERENCES book(id)
);
CREATE INDEX idx_metrics_timestamp ON metrics(timestamp);
CREATE INDEX idx_metrics_snapshot_id ON metrics(snapshot_id);
```
### Processing Pipeline
1. Create metrics table if not exists
2. Stream through orderbook snapshots
3. For each snapshot:
- Calculate OBI and CVD metrics
- Batch store metrics (1000 records per commit)
- Discard snapshot from memory
4. Provide query interface for time-range retrieval
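The pipeline above can be sketched as follows. This is a minimal sketch: the table layout matches the schema above, but `process_snapshots` and `calc_metrics` are illustrative names, not the project's actual API.

```python
import sqlite3

BATCH_SIZE = 1000  # records per commit, as described above

def process_snapshots(conn, snapshots, calc_metrics):
    """Stream snapshots one at a time, compute metrics, batch-insert results."""
    conn.execute("""
        CREATE TABLE IF NOT EXISTS metrics (
            id INTEGER PRIMARY KEY AUTOINCREMENT,
            snapshot_id INTEGER NOT NULL,
            timestamp TEXT NOT NULL,
            obi REAL NOT NULL,
            cvd REAL NOT NULL,
            best_bid REAL,
            best_ask REAL
        )
    """)
    insert_sql = ("INSERT INTO metrics "
                  "(snapshot_id, timestamp, obi, cvd, best_bid, best_ask) "
                  "VALUES (?, ?, ?, ?, ?, ?)")
    batch = []
    # snapshots is expected to be a generator, so only one snapshot
    # (plus the pending batch) is in memory at any time
    for snap in snapshots:
        batch.append(calc_metrics(snap))
        if len(batch) >= BATCH_SIZE:
            conn.executemany(insert_sql, batch)
            conn.commit()
            batch.clear()
    if batch:  # flush the final partial batch
        conn.executemany(insert_sql, batch)
        conn.commit()
```

The generator-plus-batch structure is what delivers both the memory reduction and the batch-insert throughput described above.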
### Memory Management
- **Before**: Store all snapshots → Calculate on demand → High memory usage
- **After**: Stream snapshots → Calculate immediately → Store metrics → Low memory usage
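For reference, widely used definitions of the two metrics look like this. This is a sketch only; the project's exact formulas, level depth (`top_n`), and sign conventions are assumptions and may differ.

```python
def order_book_imbalance(bids, asks, top_n=5):
    """OBI in [-1, 1]: (bid size - ask size) / (bid size + ask size) over top-N levels."""
    bid_sz = sum(size for _price, size in bids[:top_n])
    ask_sz = sum(size for _price, size in asks[:top_n])
    total = bid_sz + ask_sz
    return (bid_sz - ask_sz) / total if total else 0.0

def cumulative_volume_delta(trades):
    """CVD: running sum of signed trade sizes (+ for buys, - for sells)."""
    cvd = 0.0
    for trade in trades:
        cvd += trade["size"] if trade["side"] == "buy" else -trade["size"]
    return cvd
```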
## Migration Strategy
### Backward Compatibility
- Existing databases continue to work without metrics table
- System auto-creates metrics table on first processing run
- Fallback to real-time calculation if metrics unavailable
### Performance Impact
- **Processing Time**: Slight increase due to database writes (~10%)
- **Query Performance**: Significant improvement for repeated analysis
- **Overall**: Net positive performance for typical usage patterns
## Monitoring and Validation
### Success Metrics
- **Memory Usage**: Target >70% reduction in peak memory usage
- **Processing Speed**: Maintain >500 snapshots/second processing rate
- **Storage Efficiency**: Metrics table <25% of total database size
- **Query Performance**: <1 second retrieval for typical time ranges
### Validation Methods
- Memory profiling during large dataset processing
- Performance benchmarks vs. original system
- Storage overhead analysis across different dataset sizes
- Query performance testing with various time ranges
## Future Considerations
### Potential Enhancements
- **Compression**: Consider compression for metrics storage if overhead becomes significant
- **Partitioning**: Time-based partitioning for very large datasets
- **Caching**: In-memory cache for frequently accessed metrics
- **Export**: Direct export capabilities for external analysis tools
### Scalability Options
- **Database Upgrade**: PostgreSQL if SQLite becomes limiting factor
- **Parallel Processing**: Multi-threaded metrics calculation
- **Distributed Storage**: For institutional-scale datasets
---
This decision provides a solid foundation for efficient, scalable metrics processing while maintaining simplicity and performance characteristics suitable for the target use cases.


@@ -0,0 +1,122 @@
# ADR-001: SQLite Database Choice
## Status
Accepted
## Context
The orderflow backtest system needs to efficiently store and stream large volumes of historical orderbook and trade data. Key requirements include:
- Fast sequential read access for time-series data
- Minimal setup and maintenance overhead
- Support for concurrent reads from visualization layer
- Ability to handle databases ranging from 100MB to 10GB+
- No network dependencies for data access
## Decision
We will use SQLite as the primary database for storing historical orderbook and trade data.
## Consequences
### Positive
- **Zero configuration**: No database server setup or administration required
- **Excellent read performance**: Optimized for sequential scans with proper PRAGMA settings
- **Built-in Python support**: No external dependencies or connection libraries needed
- **File portability**: Database files can be easily shared and archived
- **ACID compliance**: Ensures data integrity during writes (for data ingestion)
- **Small footprint**: Minimal memory and storage overhead
- **Fast startup**: No connection pooling or server initialization delays
### Negative
- **Single writer limitation**: Cannot handle concurrent writes (acceptable for read-only backtest)
- **Limited scalability**: Not suitable for high-concurrency production trading systems
- **No network access**: Cannot query databases remotely (acceptable for local analysis)
- **File locking**: Potential issues with file system sharing (mitigated by read-only access)
## Implementation Details
### Schema Design
```sql
-- Orderbook snapshots with timestamp windows
CREATE TABLE book (
id INTEGER PRIMARY KEY,
instrument TEXT,
bids TEXT NOT NULL, -- JSON array of [price, size] pairs
asks TEXT NOT NULL, -- JSON array of [price, size] pairs
timestamp TEXT NOT NULL
);
-- Individual trade records
CREATE TABLE trades (
id INTEGER PRIMARY KEY,
instrument TEXT,
trade_id TEXT,
price REAL NOT NULL,
size REAL NOT NULL,
side TEXT NOT NULL, -- "buy" or "sell"
timestamp TEXT NOT NULL
);
-- Indexes for efficient time-based queries
CREATE INDEX idx_book_timestamp ON book(timestamp);
CREATE INDEX idx_trades_timestamp ON trades(timestamp);
```
### Performance Optimizations
```python
import sqlite3

# Read-only connection with optimized PRAGMA settings
connection_uri = f"file:{db_path}?immutable=1&mode=ro"
conn = sqlite3.connect(connection_uri, uri=True)
conn.execute("PRAGMA query_only = 1")
conn.execute("PRAGMA temp_store = MEMORY")
conn.execute("PRAGMA mmap_size = 268435456")  # 256MB memory-mapped I/O
conn.execute("PRAGMA cache_size = 10000")     # page-cache size, in pages
```
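With the timestamp indexes above, time-range retrieval is a plain index range scan. A sketch (`fetch_trades` is an illustrative helper, not the project's API; timestamps are assumed to be stored in a lexicographically sortable form such as ISO-8601):

```python
import sqlite3

def fetch_trades(conn, start_ts, end_ts):
    """Return trades in [start_ts, end_ts), served by idx_trades_timestamp."""
    cur = conn.execute(
        "SELECT timestamp, price, size, side FROM trades "
        "WHERE timestamp >= ? AND timestamp < ? ORDER BY timestamp",
        (start_ts, end_ts),
    )
    return cur.fetchall()
```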
## Alternatives Considered
### PostgreSQL
- **Rejected**: Requires server setup and maintenance
- **Pros**: Better concurrent access, richer query features
- **Cons**: Overkill for read-only use case, deployment complexity
### Parquet Files
- **Rejected**: Limited query capabilities for time-series data
- **Pros**: Excellent compression, columnar format
- **Cons**: No indexes, complex range queries, requires additional libraries
### MongoDB
- **Rejected**: Document structure not optimal for time-series data
- **Pros**: Flexible schema, good aggregation pipeline
- **Cons**: Requires server, higher memory usage, learning curve
### CSV Files
- **Rejected**: Poor query performance for large datasets
- **Pros**: Simple format, universal compatibility
- **Cons**: No indexing, slow filtering, type conversion overhead
### InfluxDB
- **Rejected**: Overkill for historical data analysis
- **Pros**: Optimized for time-series, good compression
- **Cons**: Additional service dependency, learning curve
## Migration Path
If scalability becomes an issue in the future:
1. **Phase 1**: Implement database abstraction layer in `db_interpreter`
2. **Phase 2**: Add PostgreSQL adapter for production workloads
3. **Phase 3**: Implement data partitioning for very large datasets
4. **Phase 4**: Consider distributed storage for multi-terabyte datasets
## Monitoring
Track the following metrics to validate this decision:
- Database file sizes and growth rates
- Query performance for different date ranges
- Memory usage during streaming operations
- Time to process complete backtests
## Review Triggers
This decision should be reviewed if:
- Database files consistently exceed 50GB
- Query performance degrades below 1000 rows/second
- Concurrent access requirements change
- Network-based data sharing becomes necessary


@@ -0,0 +1,162 @@
# ADR-002: JSON File-Based Inter-Process Communication
## Status
Accepted
## Context
The orderflow backtest system requires communication between the data processing pipeline and the web-based visualization frontend. Key requirements include:
- Real-time data updates from processor to visualization
- Tolerance for timing mismatches between writer and reader
- Simple implementation without external dependencies
- Support for different update frequencies (OHLC bars vs. orderbook depth)
- Graceful handling of process crashes or restarts
## Decision
We will use JSON files with atomic write operations for inter-process communication between the data processor and Dash visualization frontend.
## Consequences
### Positive
- **Simplicity**: No message queues, sockets, or complex protocols
- **Fault tolerance**: File-based communication survives process restarts
- **Debugging friendly**: Data files can be inspected manually
- **No dependencies**: Built-in JSON support, no external libraries
- **Atomic operations**: Temp file + rename prevents partial reads
- **Language agnostic**: Any process can read/write JSON files
- **Bounded memory**: Rolling data windows prevent unlimited growth
### Negative
- **File I/O overhead**: Disk writes may be slower than in-memory communication
- **Polling required**: Reader must poll for updates (500ms interval)
- **Limited throughput**: Not suitable for high-frequency (microsecond) updates
- **No acknowledgments**: Writer cannot confirm reader has processed data
- **File system dependency**: Performance varies by storage type
## Implementation Details
### File Structure
```
ohlc_data.json # Rolling array of OHLC bars (max 1000)
depth_data.json # Current orderbook depth snapshot
metrics_data.json # Rolling array of OBI/CVD metrics (max 1000)
```
### Atomic Write Pattern
```python
import json
import os
from pathlib import Path
from typing import Any

def atomic_write(file_path: Path, data: Any) -> None:
    """Write data atomically to prevent partial reads."""
    temp_path = file_path.with_suffix('.tmp')
    with open(temp_path, 'w') as f:
        json.dump(data, f)
        f.flush()
        os.fsync(f.fileno())  # flush OS buffers before the rename
    temp_path.replace(file_path)  # atomic on POSIX when temp and target share a filesystem
```
### Data Formats
```python
# OHLC format: [timestamp_ms, open, high, low, close, volume]
ohlc_data = [
[1640995200000, 50000.0, 50100.0, 49900.0, 50050.0, 125.5],
[1640995260000, 50050.0, 50200.0, 50000.0, 50150.0, 98.3]
]
# Depth format: top-N levels per side
depth_data = {
"bids": [[49990.0, 1.5], [49985.0, 2.1]],
"asks": [[50010.0, 1.2], [50015.0, 1.8]]
}
# Metrics format: [timestamp_ms, obi_open, obi_high, obi_low, obi_close]
metrics_data = [
[1640995200000, 0.15, 0.22, 0.08, 0.18],
[1640995260000, 0.18, 0.25, 0.12, 0.20]
]
```
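The depth snapshot above maps directly onto the cumulative depth curves used by the line charts. A small sketch (assumes bids are sorted best-first and asks ascending, as in the example):

```python
def cumulative_depth(levels):
    """Accumulate sizes level by level: [[price, size], ...] -> [[price, cum_size], ...]."""
    out, total = [], 0.0
    for price, size in levels:
        total += size
        out.append([price, total])
    return out

bids_cum = cumulative_depth([[49990.0, 1.5], [49985.0, 2.1]])
# bids_cum[-1][1] is the total visible bid size across the shown levels
```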
### Error Handling
```python
import json
import logging

_LAST_DATA = None  # module-level cache of the last successful read

def read_data(data_file):
    """Reader pattern with graceful fallback to cached data."""
    global _LAST_DATA
    try:
        with open(data_file) as f:
            _LAST_DATA = json.load(f)  # cache successful read
    except (FileNotFoundError, json.JSONDecodeError) as e:
        logging.warning(f"Using cached data: {e}")
    return _LAST_DATA  # cached (or None) if the read failed
```
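The bounded rolling windows (max 1000 bars) combine with the atomic write pattern like this. A sketch; `append_bar` and `MAX_BARS` are illustrative names, not the project's API.

```python
import json
import os
from pathlib import Path

MAX_BARS = 1000  # rolling window bound from the file structure above

def append_bar(file_path: Path, bar: list) -> None:
    """Append one OHLC bar, trim to the last MAX_BARS, write atomically."""
    try:
        bars = json.loads(file_path.read_text())
    except (FileNotFoundError, json.JSONDecodeError):
        bars = []  # start fresh if the file is missing or corrupt
    bars.append(bar)
    bars = bars[-MAX_BARS:]  # bounded memory: drop the oldest bars
    temp_path = file_path.with_suffix(".tmp")
    with open(temp_path, "w") as f:
        json.dump(bars, f)
        f.flush()
        os.fsync(f.fileno())
    temp_path.replace(file_path)  # atomic rename, no partial reads
```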
## Performance Characteristics
### Write Performance
- **Small files**: < 1MB typical, writes complete in < 10ms
- **Atomic operations**: Add ~2-5ms overhead for temp file creation
- **Throttling**: Updates limited to prevent excessive I/O
### Read Performance
- **Parse time**: < 5ms for typical JSON file sizes
- **Polling overhead**: 500ms interval balances responsiveness and CPU usage
- **Error recovery**: Cached data eliminates visual glitches
### Memory Usage
- **Bounded datasets**: Max 1000 bars × 6 fields × 8 bytes = ~48KB per file
- **JSON overhead**: ~2x memory during parsing
- **Total footprint**: < 500KB for all IPC data
## Alternatives Considered
### Redis Pub/Sub
- **Rejected**: Additional service dependency, overkill for simple use case
- **Pros**: True real-time updates, built-in data structures
- **Cons**: External dependency, memory overhead, configuration complexity
### ZeroMQ
- **Rejected**: Additional library dependency, more complex than needed
- **Pros**: High performance, flexible patterns
- **Cons**: Learning curve, binary dependency, networking complexity
### Named Pipes/Unix Sockets
- **Rejected**: Platform-specific, more complex error handling
- **Pros**: Better performance, no file I/O
- **Cons**: Platform limitations, harder debugging, process lifetime coupling
### SQLite as Message Queue
- **Rejected**: Overkill for simple data exchange
- **Pros**: ACID transactions, complex queries possible
- **Cons**: Schema management, locking considerations, overhead
### HTTP API
- **Rejected**: Too much overhead for local communication
- **Pros**: Standard protocol, language agnostic
- **Cons**: Network stack overhead, port management, authentication
## Future Considerations
### Scalability Limits
Current approach suitable for:
- Update frequencies: 1-10 Hz
- Data volumes: < 10MB total
- Process counts: 1 writer, few readers
### Migration Path
If performance becomes insufficient:
1. **Phase 1**: Add compression (gzip) to reduce I/O
2. **Phase 2**: Implement shared memory for high-frequency data
3. **Phase 3**: Consider message queue for complex routing
4. **Phase 4**: Migrate to streaming protocol for real-time requirements
## Monitoring
Track these metrics to validate the approach:
- File write latency and frequency
- JSON parse times in visualization
- Error rates for partial reads
- Memory usage growth over time
## Review Triggers
Reconsider this decision if:
- Update frequency requirements exceed 10 Hz
- File I/O becomes a performance bottleneck
- Multiple visualization clients need the same data
- Complex message routing becomes necessary
- Platform portability becomes a concern


@@ -1,217 +0,0 @@
# ADR-002: Separation of Visualization from Strategy
## Status
Accepted
## Context
The original system embedded visualization functionality within the `DefaultStrategy` class, creating tight coupling between trading analysis logic and chart rendering. This design had several issues:
1. **Mixed Responsibilities**: Strategy classes handled both trading logic and GUI operations
2. **Testing Complexity**: Strategy tests required mocking GUI components
3. **Deployment Flexibility**: Strategies couldn't run in headless environments
4. **Timing Control**: Visualization timing was tied to strategy execution rather than application flow
Displaying visualizations after each database file is processed was an explicit requirement, which calls for better control over visualization timing than the embedded design allowed.
## Decision
We will separate visualization from strategy components with the following architecture:
1. **Remove Visualization from Strategy**: Strategy classes focus solely on trading analysis
2. **Main Application Control**: `main.py` orchestrates visualization timing and updates
3. **Independent Configuration**: Strategy and Visualizer get database paths independently
4. **Clean Interfaces**: No direct dependencies between strategy and visualization components
## Consequences
### Positive
- **Single Responsibility**: Strategy focuses on trading logic, Visualizer on charts
- **Better Testability**: Strategy tests run without GUI dependencies
- **Flexible Deployment**: Strategies can run in headless/server environments
- **Timing Control**: Visualization updates precisely when needed (after each DB)
- **Maintainability**: Changes to visualization don't affect strategy logic
- **Performance**: No GUI overhead during strategy analysis
### Negative
- **Increased Complexity**: Main application handles more orchestration logic
- **Coordination Required**: Must ensure strategy and visualizer get same database path
- **Breaking Change**: Existing strategy initialization code needs updates
## Alternatives Considered
### Option 1: Keep Visualization in Strategy
**Rejected**: Violates single responsibility principle. Makes testing difficult and deployment inflexible.
### Option 2: Observer Pattern
**Rejected**: Adds unnecessary complexity for this use case. Direct control in main.py is simpler and more explicit.
### Option 3: Visualization Service
**Rejected**: Over-engineering for current requirements. May be considered for future multi-strategy scenarios.
## Implementation Details
### Before (Coupled Design)
```python
class DefaultStrategy:
    def __init__(self, instrument: str, enable_visualization: bool = True):
        self.visualizer = Visualizer(...) if enable_visualization else None

    def on_booktick(self, book: Book):
        # Trading analysis
        # ...
        # Visualization update
        if self.visualizer:
            self.visualizer.update_from_book(book)
```
### After (Separated Design)
```python
# Strategy focuses on analysis only
class DefaultStrategy:
    def __init__(self, instrument: str):
        ...  # no visualization dependencies

    def on_booktick(self, book: Book):
        ...  # pure trading analysis, no visualization code

# Main application orchestrates both
def main():
    strategy = DefaultStrategy(instrument)
    visualizer = Visualizer(...)
    for db_path in db_paths:
        strategy.set_db_path(db_path)
        visualizer.set_db_path(db_path)
        # Process data
        storage.build_booktick_from_db(db_path, db_date)
        # Analysis
        strategy.on_booktick(storage.book)
        # Visualization (controlled timing)
        visualizer.update_from_book(storage.book)
    # Final display
    visualizer.show()
```
### Interface Changes
#### Strategy Interface (Simplified)
```python
class DefaultStrategy:
    def __init__(self, instrument: str): ...           # removed visualization param
    def set_db_path(self, db_path: Path) -> None: ...  # no visualizer.set_db_path()
    def on_booktick(self, book: Book) -> None: ...     # no visualization calls
```
#### Main Application (Enhanced)
```python
def main():
    # Separate initialization
    strategy = DefaultStrategy(instrument)
    visualizer = Visualizer(window_seconds=60, max_bars=500)
    # Independent configuration
    for db_path in db_paths:
        strategy.set_db_path(db_path)
        visualizer.set_db_path(db_path)
        # Controlled execution
        strategy.on_booktick(storage.book)         # Analysis
        visualizer.update_from_book(storage.book)  # Visualization
```
## Migration Strategy
### Code Changes Required
1. **Strategy Classes**: Remove visualization initialization and calls
2. **Main Application**: Add visualizer creation and orchestration
3. **Tests**: Update strategy tests to remove visualization mocking
4. **Configuration**: Remove visualization parameters from strategy constructors
### Backward Compatibility
- **API Breaking**: Strategy constructor signature changes
- **Functionality Preserved**: All visualization features remain available
- **Test Updates**: Strategy tests become simpler (no GUI mocking needed)
### Migration Steps
1. Update `DefaultStrategy` to remove visualization dependencies
2. Modify `main.py` to create and manage `Visualizer` instance
3. Update all strategy constructor calls to remove `enable_visualization`
4. Update tests to reflect new interfaces
5. Verify visualization timing meets requirements
## Benefits Achieved
### Clean Architecture
- **Strategy**: Pure trading analysis logic
- **Visualizer**: Pure chart rendering logic
- **Main**: Application flow and component coordination
### Improved Testing
```python
# Before: Complex mocking required
def test_strategy():
    with patch('visualizer.Visualizer') as mock_viz:
        strategy = DefaultStrategy("BTC", enable_visualization=True)
        # Complex mock setup...

# After: Simple, direct testing
def test_strategy():
    strategy = DefaultStrategy("BTC")
    # Direct testing of analysis logic
```
### Flexible Deployment
```python
# Headless server deployment
strategy = DefaultStrategy("BTC")
# No GUI dependencies, can run anywhere
# Development with visualization
strategy = DefaultStrategy("BTC")
visualizer = Visualizer(...)
# Full GUI functionality when needed
```
### Precise Timing Control
```python
# Visualization updates exactly when requested
for db_file in database_files:
    process_database(db_file)          # Data processing
    strategy.analyze(book)             # Trading analysis
    visualizer.update_from_book(book)  # Chart update after each DB
```
## Monitoring and Validation
### Success Criteria
- **Test Simplification**: Strategy tests run without GUI mocking
- **Timing Accuracy**: Visualization updates after each database as requested
- **Performance**: No GUI overhead during pure analysis operations
- **Maintainability**: Visualization changes don't affect strategy code
### Validation Methods
- Run strategy tests in headless environment
- Verify visualization timing matches requirements
- Performance comparison of analysis-only vs. GUI operations
- Code complexity metrics for strategy vs. visualization modules
## Future Considerations
### Potential Enhancements
- **Multiple Visualizers**: Support different chart types or windows
- **Visualization Plugins**: Pluggable chart renderers for different outputs
- **Remote Visualization**: Web-based charts for server deployments
- **Batch Visualization**: Process multiple databases before chart updates
### Extensibility
- **Strategy Plugins**: Easy to add strategies without visualization concerns
- **Visualization Backends**: Swap chart libraries without affecting strategies
- **Analysis Pipeline**: Clear separation enables complex analysis workflows
---
This separation provides a clean, maintainable architecture that supports the requested visualization timing while improving code quality and testability.


@@ -0,0 +1,204 @@
# ADR-003: Dash Web Framework for Visualization
## Status
Accepted
## Context
The orderflow backtest system requires a user interface for visualizing OHLC candlestick charts, volume data, orderbook depth, and derived metrics. Key requirements include:
- Real-time chart updates with minimal latency
- Professional financial data visualization capabilities
- Support for multiple chart types (candlesticks, bars, line charts)
- Interactive features (zooming, panning, hover details)
- Dark theme suitable for trading applications
- Python-native solution to avoid JavaScript development
## Decision
We will use Dash (by Plotly) as the web framework for building the visualization frontend, with Plotly.js for chart rendering.
## Consequences
### Positive
- **Python-native**: No JavaScript development required
- **Plotly integration**: Best-in-class financial charting capabilities
- **Reactive architecture**: Automatic UI updates via callback system
- **Professional appearance**: High-quality charts suitable for trading applications
- **Interactive features**: Built-in zooming, panning, hover tooltips
- **Responsive design**: Bootstrap integration for modern layouts
- **Development speed**: Rapid prototyping and iteration
- **WebGL acceleration**: Smooth performance for large datasets
### Negative
- **Performance overhead**: Heavier than custom JavaScript solutions
- **Limited customization**: Constrained by Dash component ecosystem
- **Single-page limitation**: Not suitable for complex multi-page applications
- **Memory usage**: Can be heavy for resource-constrained environments
- **Learning curve**: Callback patterns require understanding of reactive programming
## Implementation Details
### Application Structure
```python
# Main application with Bootstrap theme
app = dash.Dash(__name__, external_stylesheets=[dbc.themes.FLATLY])

# Responsive layout with a 9:3 column split (charts : depth)
app.layout = dbc.Container([
    dbc.Row([
        dbc.Col([  # OHLC + Volume + Metrics
            dcc.Graph(id='ohlc-chart', style={'height': '100vh'})
        ], width=9),
        dbc.Col([  # Orderbook Depth
            dcc.Graph(id='depth-chart', style={'height': '100vh'})
        ], width=3)
    ]),
    dcc.Interval(id='interval-update', interval=500, n_intervals=0)
])
```
### Chart Architecture
```python
# Multi-subplot chart with shared x-axis
fig = make_subplots(
    rows=3, cols=1,
    row_heights=[0.6, 0.2, 0.2],  # OHLC, Volume, Metrics
    vertical_spacing=0.02,
    shared_xaxes=True,
    subplot_titles=['Price', 'Volume', 'OBI Metrics']
)

# Candlestick trace with dark-theme colors
fig.add_trace(go.Candlestick(
    x=timestamps, open=opens, high=highs, low=lows, close=closes,
    increasing_line_color='#00ff00', decreasing_line_color='#ff0000'
), row=1, col=1)
```
### Real-time Updates
```python
@app.callback(
    [Output('ohlc-chart', 'figure'), Output('depth-chart', 'figure')],
    [Input('interval-update', 'n_intervals')]
)
def update_charts(n_intervals):
    # Read data from the JSON files (with cached-data fallback on errors),
    # then rebuild both figures; helper names below are illustrative
    ohlc_fig = build_ohlc_figure()
    depth_fig = build_depth_figure()
    return ohlc_fig, depth_fig
```
## Performance Characteristics
### Update Latency
- **Polling interval**: 500ms for near real-time updates
- **Chart render time**: 50-200ms depending on data size
- **Memory usage**: ~100MB for typical chart configurations
- **Browser requirements**: Modern browser with WebGL support
### Scalability Limits
- **Data points**: Up to 10,000 candlesticks without performance issues
- **Update frequency**: Optimal at 1-2 Hz, maximum ~10 Hz
- **Concurrent users**: Single user design (development server)
- **Memory growth**: Linear with data history size
## Alternatives Considered
### Streamlit
- **Rejected**: Less interactive, slower updates, limited charting
- **Pros**: Simpler programming model, good for prototypes
- **Cons**: Poor real-time performance, limited financial chart types
### Flask + Custom JavaScript
- **Rejected**: Requires JavaScript development, more complex
- **Pros**: Complete control, potentially better performance
- **Cons**: Significant development overhead, maintenance burden
### Jupyter Notebooks
- **Rejected**: Not suitable for production deployment
- **Pros**: Great for exploration and analysis
- **Cons**: No real-time updates, not web-deployable
### Bokeh
- **Rejected**: Less mature ecosystem, fewer financial chart types
- **Pros**: Good performance, Python-native
- **Cons**: Smaller community, limited examples for financial data
### Custom React Application
- **Rejected**: Requires separate frontend team, complex deployment
- **Pros**: Maximum flexibility, best performance potential
- **Cons**: High development cost, maintenance overhead
### Desktop GUI (Tkinter/PyQt)
- **Rejected**: Not web-accessible, limited styling options
- **Pros**: No browser dependency, good performance
- **Cons**: Deployment complexity, poor mobile support
## Configuration Options
### Theme and Styling
```python
# Dark theme configuration (applied when building figure layouts)
dark_theme = {
    'plot_bgcolor': '#000000',
    'paper_bgcolor': '#000000',
    'font_color': '#ffffff',
    'grid_color': '#333333'
}
```
### Chart Types
- **Candlestick charts**: OHLC price data with volume
- **Bar charts**: Volume and metrics visualization
- **Line charts**: Cumulative depth and trend analysis
- **Scatter plots**: Trade-by-trade analysis (future)
### Interactive Features
- **Zoom and pan**: Time-based navigation
- **Hover tooltips**: Detailed data on mouse over
- **Crosshairs**: Precise value reading
- **Range selector**: Quick time period selection
## Future Enhancements
### Short-term (1-3 months)
- Add range selector for time navigation
- Implement chart annotation for significant events
- Add export functionality for charts and data
### Medium-term (3-6 months)
- Multi-instrument support with tabs
- Advanced indicators and overlays
- User preference persistence
### Long-term (6+ months)
- Real-time alerts and notifications
- Strategy backtesting visualization
- Portfolio-level analytics
## Monitoring and Metrics
### Performance Monitoring
- Chart render times and update frequencies
- Memory usage growth over time
- Browser compatibility and error rates
- User interaction patterns
### Quality Metrics
- Chart accuracy compared to source data
- Visual responsiveness during heavy updates
- Error recovery from data corruption
## Review Triggers
Reconsider this decision if:
- Update frequency requirements exceed 10 Hz consistently
- Memory usage becomes prohibitive (> 1GB)
- Custom visualization requirements cannot be met
- Multi-user deployment becomes necessary
- Mobile responsiveness becomes a priority
- Integration with external charting libraries is needed
## Migration Path
If replacement becomes necessary:
1. **Phase 1**: Abstract chart building logic from Dash specifics
2. **Phase 2**: Implement alternative frontend while maintaining data formats
3. **Phase 3**: A/B test performance and usability
4. **Phase 4**: Complete migration with feature parity