19 Commits

c9f294bb4c fix: Improve FPS calculation and filter async results
Key improvements:
- Switch the FPS calculation to a cumulative scheme to avoid the unstable, inflated FPS readings seen right after startup (sketched after this entry)
- Filter out results with async or processing status; they are neither displayed nor counted in statistics
- Count only genuine inference results toward FPS and the processed total
- Add a _has_valid_inference_result method to validate results
- Improve MultiDongle's stop method so it disconnects devices correctly
- Remove unneeded files and update the test configuration

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-30 19:45:34 +08:00
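
A minimal sketch of the cumulative FPS scheme this commit describes. The `_has_valid_inference_result` name comes from the commit message; the surrounding class, method, and field names are hypothetical:

```python
import time

class FPSCounter:
    """Cumulative FPS: total valid results over total elapsed time.

    Unlike an instantaneous (per-interval) measurement, the cumulative
    average cannot spike during the first few frames after startup.
    """

    def __init__(self):
        self.start_time = None
        self.valid_count = 0

    def _has_valid_inference_result(self, result):
        # Skip placeholder statuses; only real outputs count.
        if result is None:
            return False
        if isinstance(result, str) and result.lower() in ("async", "processing"):
            return False
        return True

    def record(self, result):
        if not self._has_valid_inference_result(result):
            return  # not displayed, not counted
        if self.start_time is None:
            self.start_time = time.monotonic()
        self.valid_count += 1

    def fps(self):
        if self.start_time is None:
            return 0.0
        elapsed = time.monotonic() - self.start_time
        return self.valid_count / elapsed if elapsed > 0 else 0.0
```
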
cde1aac908 debug: Add comprehensive logging to diagnose pipeline hanging issue
- Add pipeline activity logging every 10 results to track processing (see the sketch after this entry)
- Add queue size monitoring in InferencePipeline coordinator
- Add camera frame capture logging every 100 frames
- Add MultiDongle send/receive thread logging every 100 operations
- Add error handling for repeated callback failures in camera source

This will help identify where the pipeline stops processing:
- Camera capture stopping
- MultiDongle threads blocking
- Pipeline coordinator hanging
- Queue capacity issues

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 19:49:00 +08:00
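
The throttled "every N operations" logging this commit adds can be expressed as a simple modulo check on a counter; a hypothetical sketch:

```python
import logging

logging.basicConfig(level=logging.INFO)
log = logging.getLogger("pipeline")

class ResultLogger:
    """Log pipeline activity only every Nth result to avoid log spam."""

    def __init__(self, every=10):
        self.every = every
        self.count = 0

    def on_result(self, result, queue_size):
        self.count += 1
        if self.count % self.every == 0:
            # Periodic heartbeat: proves the coordinator is still draining
            # its queues and shows where backpressure is building up.
            log.info("processed %d results, queue size=%d", self.count, queue_size)
```
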
f41d9ae5c8 feat: Implement output queue based FPS calculation for accurate throughput measurement
- Add time-window based FPS calculation using output queue timestamps (sketched after this entry)
- Replace misleading "Theoretical FPS" (based on processing time) with real "Pipeline FPS"
- Track actual inference output generation rate over 10-second sliding window
- Add thread-safe FPS calculation with proper timestamp management
- Display realistic FPS values (4-9 FPS) instead of inflated values (90+ FPS)

Key improvements:
- _record_output_timestamp(): Records when each output is generated
- get_current_fps(): Calculates FPS based on actual throughput over time window
- Thread-safe implementation with fps_lock for concurrent access
- Automatic cleanup of old timestamps outside the time window
- Integration with GUI display to show meaningful FPS metrics

This provides users with accurate inference throughput measurements that reflect
real-world performance, especially important for multi-dongle setups where
understanding actual scaling is crucial.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 19:17:18 +08:00
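
A sketch of the time-window FPS calculation described above, assuming the 10-second sliding window from the commit message. `_record_output_timestamp`, `get_current_fps`, and `fps_lock` are named in the commit; everything else is an assumption:

```python
import threading
import time
from collections import deque

class PipelineFPS:
    """FPS over a sliding time window of output timestamps."""

    def __init__(self, window_seconds=10.0):
        self.window = window_seconds
        self.timestamps = deque()
        self.fps_lock = threading.Lock()

    def _record_output_timestamp(self):
        """Call once per inference output generated."""
        with self.fps_lock:
            self.timestamps.append(time.monotonic())

    def get_current_fps(self):
        """Outputs per second over the last `window` seconds."""
        now = time.monotonic()
        with self.fps_lock:
            # Automatic cleanup: drop timestamps that aged out of the window.
            while self.timestamps and now - self.timestamps[0] > self.window:
                self.timestamps.popleft()
            return len(self.timestamps) / self.window
```

Dividing by the full window length keeps the reading stable once the window has filled; during the first ten seconds it ramps up from zero rather than spiking.
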
2ba0f4ae27 fix: Remove duplicate inference result logging to prevent terminal spam
- Comment out print() statements in InferencePipeline that duplicate GUI callback output
- Prevents each inference result from appearing multiple times in the terminal
- Keeps logging system clean while maintaining GUI formatted display
- This was causing terminal output to show each result 2-3 times due to:
  1. InferencePipeline print() statements captured by StdoutCapture
  2. Same results formatted and sent via terminal_output callback

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 19:10:37 +08:00
23d8c4ff61 fix: Replace undefined 'processed_result' with 'inference_result'
Fixed a NameError where 'processed_result' was referenced but never defined.
The code should use 'inference_result', which contains the actual inference
output from MultiDongle.get_latest_inference_result().

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 12:11:44 +08:00
18f1426cbc Merge branch 'main' of github.com:HuangMason320/cluster4npu 2025-07-24 11:56:40 +08:00
bab06b9fa4 Merge branch 'main' of github.com:HuangMason320/cluster4npu 2025-07-24 11:56:32 +08:00
f902659017 fix: Remove incompatible parameters to match standalone MultiDongle API
Key fixes:
1. Remove 'block' parameter from put_input() call - not supported in standalone code
2. Remove 'timeout' parameter from get_latest_inference_result() call
3. Improve _has_inference_result() logic to properly detect real inference results
   - Don't count "Processing" or "async" status as valid results
   - Only count actual tuple (prob, result_str) or meaningful dict results (see the sketch after this entry)
   - Match standalone code behavior for FPS calculation

This should resolve the "unexpected keyword argument" errors and
provide accurate FPS counting like the standalone baseline.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 11:56:01 +08:00
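
Based on the result shapes this commit names, a (prob, result_str) tuple or a meaningful dict, a hedged sketch of the `_has_inference_result()` logic; the dict key checked here is an assumption:

```python
def _has_inference_result(result):
    """Return True only for genuine inference outputs, so FPS reflects
    inference throughput rather than polling rate."""
    if result is None:
        return False
    # "Processing"/"async" status strings are transient placeholders.
    if isinstance(result, str):
        return result.lower() not in ("processing", "async")
    # The standalone baseline returns (prob, result_str) tuples.
    if isinstance(result, tuple):
        return len(result) == 2
    # Dicts count only when they carry a real payload; the 'status'
    # key checked here is a hypothetical placeholder.
    if isinstance(result, dict):
        return bool(result) and result.get("status") not in ("processing", "async")
    return False
```
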
80275bc774 fix: Correct FPS calculation to count actual inference results only
Key changes:
1. FPS Calculation: Only count when stage receives actual inference results
   - Add _has_inference_result() method to check for valid results
   - Only increment processed_count when real inference result is available
   - This measures "inferences per second" not "frames per second"

2. Reduced Log Spam: Remove excessive preprocessing debug logs
   - Remove shape/dtype logs for every frame
   - Only log successful inference results
   - Keep essential error logs

3. Maintain Async Pattern: Keep non-blocking processing
   - Still use timeout=0.001 for get_latest_inference_result
   - Still use block=False for put_input
   - No blocking while loops

Expected result: ~4 FPS (1 dongle) vs ~9 FPS (2 dongles)
matching standalone code behavior.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 11:30:13 +08:00
273ae71846 fix: Correct FPS calculation and reduce log spam
Key fixes:
1. FPS Calculation: Only count actual inference results, not frame processing
   - Previous: counted every frame processed (~90 FPS, incorrect)
   - Now: only counts when actual inference results are received (~9 FPS, correct)
   - Return None from _process_data when no inference result available
   - Skip FPS counting for iterations without real results

2. Log Reduction: Significantly reduced verbose logging
   - Removed excessive debug prints for preprocessing steps
   - Removed "No inference result" spam messages
   - Only log actual successful inference results

3. Async Processing: Maintain proper async pattern
   - Still use non-blocking get_latest_inference_result(timeout=0.001)
   - Still use non-blocking put_input(block=False)
   - But only count real inference throughput for FPS

This should now match standalone code behavior: ~4 FPS (1 dongle) vs ~9 FPS (2 dongles)

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 11:12:42 +08:00
67a1031009 fix: Remove blocking while loop that prevented multi-dongle scaling
The key issue was in InferencePipeline._process_data(), where a 5-second
while loop blocked while waiting for inference results. This completely
serialized processing and prevented multiple dongles from working in parallel.

Changes:
- Replace the blocking while loop with a single non-blocking call (sketched after this entry)
- Use timeout=0.001 for get_latest_inference_result (async pattern)
- Use block=False for put_input to prevent queue blocking
- Increase worker queue timeout from 0.1s to 1.0s
- Handle async processing status properly

This matches the pattern from the standalone code that achieved
4.xx FPS (1 dongle) vs 9.xx FPS (2 dongles) scaling.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-24 10:52:58 +08:00
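
A hypothetical before/after sketch of the change this commit describes; `put_input(block=False)` and `get_latest_inference_result(timeout=0.001)` are taken from the commit message (a later commit, f902659017, removes these keyword arguments again for API compatibility with the standalone MultiDongle):

```python
import queue

# Before (serialized): each frame blocked in a while loop for up to 5 s
# waiting for its result, so two dongles could never overlap work.
#
#   deadline = time.monotonic() + 5.0
#   while time.monotonic() < deadline:
#       result = dongle.get_latest_inference_result()
#       if result is not None:
#           break

def process_frame(dongle, frame):
    """Submit the frame without blocking, then poll once and move on."""
    try:
        dongle.put_input(frame, block=False)  # never stall on a full input queue
    except queue.Full:
        pass  # drop the frame rather than block the pipeline
    # Single short-timeout poll (async pattern); None means "still processing",
    # which lets other dongles make progress in parallel.
    return dongle.get_latest_inference_result(timeout=0.001)
```
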
0e8d75c85c cleanup: Remove debug output after successful fix verification
- Remove all debug print statements from deployment dialog
- Remove debug output from workflow orchestrator and inference pipeline
- Remove test signal emissions and unused imports
- Code is now clean and production-ready
- Results are successfully flowing from inference to GUI display
2025-07-23 22:50:34 +08:00
2dec66edad debug: Add callback chain debugging to InferencePipeline and WorkflowOrchestrator
- Add debug output in InferencePipeline result callback to see if it's called
- Add debug output in WorkflowOrchestrator handle_result to trace callback flow
- This will help identify exactly where the callback chain is breaking
- Previous test showed GUI can receive signals but callbacks aren't triggered
2025-07-23 22:43:06 +08:00
be44e6214a update debugging for deployment 2025-07-17 12:05:10 +08:00
0a70df4098 fix: Complete array comparison fix and improve stop button functionality
- Fix remaining array comparison error in inference result validation
- Update PyQt signal signature for proper numpy array handling
- Improve DeploymentWorker to keep running after deployment
- Enhance stop button with non-blocking UI updates and better error handling

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-17 10:03:59 +08:00
183300472e fix: Resolve array comparison error and add inference stop functionality
- Fix ambiguous truth value error in InferencePipeline result handling (illustrated after this entry)
- Add stop inference button to deployment dialog with proper UI state management
- Improve error handling for tuple vs dict result types

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-17 09:46:31 +08:00
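
The "ambiguous truth value" error is NumPy's complaint when a multi-element array is used in a boolean context; a minimal illustration of the bug class and the style of fix (variable names hypothetical):

```python
import numpy as np

result = np.array([0.1, 0.9])

# Buggy: a multi-element array in a boolean context raises
#   ValueError: The truth value of an array with more than one element
#   is ambiguous. Use a.any() or a.all()
# if result:          # <- this line would raise
#     ...

# Fixed: check for presence explicitly, never array truthiness,
# and branch on the concrete result type (tuple vs dict/array).
if result is not None:
    if isinstance(result, tuple):
        prob, label = result  # standalone-style (prob, result_str)
        print(f"{label}: {prob:.2f}")
    else:
        print("raw result:", result)
```
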
af9adc8e82 fix: Address file path and data processing bugs, add real-time viewer 2025-07-17 09:18:27 +08:00
e0169cd845 Fix device detection format and pipeline deployment compatibility
Device Detection Updates:
- Update device series detection to use product_id mapping (0x100 -> KL520, 0x720 -> KL720); see the sketch after this entry
- Handle JSON dict format from kp.core.scan_devices() properly
- Extract usb_port_id correctly from device descriptors
- Support multiple device descriptor formats (dict, list, object)
- Enhanced debug output shows Product ID for verification

Pipeline Deployment Fixes:
- Remove invalid preprocessor/postprocessor parameters from MultiDongle constructor
- Add max_queue_size parameter support to MultiDongle
- Fix pipeline stage initialization to match MultiDongle constructor
- Add auto_detect parameter support for pipeline stages
- Store stage processors as instance variables for future use

Example Updates:
- Update device_detection_example.py to show Product ID in output
- Enhanced error handling and format detection

Resolves pipeline deployment error: "MultiDongle.__init__() got an unexpected keyword argument 'preprocessor'"
Now properly handles real device descriptors with correct product_id to series mapping.

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-16 21:45:14 +08:00
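
A hedged sketch of the product_id-to-series mapping described above. The 0x100/0x720 values come from the commit message; the descriptor key names are assumptions about the dict form returned by kp.core.scan_devices():

```python
# Product ID to device series mapping from the commit message.
PRODUCT_ID_TO_SERIES = {
    0x100: "KL520",
    0x720: "KL720",
}

def detect_series(descriptor):
    """Map one device descriptor to a dongle series.

    `descriptor` may be a dict (JSON form) or an object; the field names
    'product_id' and 'usb_port_id' are assumptions based on the commit.
    """
    if isinstance(descriptor, dict):
        product_id = descriptor.get("product_id")
        usb_port_id = descriptor.get("usb_port_id")
    else:  # object-style descriptor fallback
        product_id = getattr(descriptor, "product_id", None)
        usb_port_id = getattr(descriptor, "usb_port_id", None)
    series = PRODUCT_ID_TO_SERIES.get(product_id, "unknown")
    # Debug output shows the Product ID for verification.
    pid = f"{product_id:#x}" if product_id is not None else "n/a"
    print(f"Product ID: {pid} -> {series} (USB port {usb_port_id})")
    return series, usb_port_id

# Example with a dict-form descriptor:
series, port = detect_series({"product_id": 0x720, "usb_port_id": 3})
```
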
080eb5b887 Add intelligent pipeline topology analysis and comprehensive UI framework
Major Features:
• Advanced topological sorting algorithm with cycle detection and resolution (sketched after this entry)
• Intelligent pipeline optimization with parallelization analysis
• Critical path analysis and performance metrics calculation
• Comprehensive .mflow file converter for seamless UI-to-API integration
• Complete modular UI framework with node-based pipeline editor
• Enhanced model node properties (scpu_fw_path, ncpu_fw_path)
• Professional output formatting without emoji decorations

Technical Improvements:
• Graph theory algorithms (DFS, BFS, topological sort)
• Automatic dependency resolution and conflict prevention
• Multi-criteria pipeline optimization
• Real-time stage count calculation and validation
• Comprehensive configuration validation and error handling
• Modular architecture with clean separation of concerns

New Components:
• MFlow converter with topology analysis (core/functions/mflow_converter.py)
• Complete node system with exact property matching
• Pipeline editor with visual node connections
• Performance estimation and dongle management panels
• Comprehensive test suite and demonstration scripts

🤖 Generated with [Claude Code](https://claude.ai/code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-07-10 12:58:47 +08:00
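
A compact sketch of the kind of topological sort with cycle detection this commit describes for ordering pipeline stages (Kahn's algorithm); the graph representation and stage names are hypothetical simplifications:

```python
from collections import deque

def topo_sort(nodes, edges):
    """Order pipeline stages so every stage runs after its inputs.

    nodes: iterable of stage ids
    edges: list of (src, dst) connections
    Raises ValueError if the graph contains a cycle.
    """
    indegree = {n: 0 for n in nodes}
    adj = {n: [] for n in nodes}
    for src, dst in edges:
        adj[src].append(dst)
        indegree[dst] += 1

    # Start from stages with no unresolved dependencies.
    ready = deque(n for n in nodes if indegree[n] == 0)
    order = []
    while ready:
        n = ready.popleft()
        order.append(n)
        for m in adj[n]:
            indegree[m] -= 1
            if indegree[m] == 0:
                ready.append(m)

    if len(order) != len(indegree):
        # Any leftover nodes sit on at least one cycle.
        cyclic = sorted(set(indegree) - set(order))
        raise ValueError(f"cycle detected among stages: {cyclic}")
    return order

# Example: camera -> preprocess -> infer -> postprocess
print(topo_sort(
    ["camera", "preprocess", "infer", "postprocess"],
    [("camera", "preprocess"), ("preprocess", "infer"), ("infer", "postprocess")],
))
```
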