UPSTREAM PR #16941: Model: add openPangu-Embedded#69
Conversation
|
Access the complete analysis in the LOCI Dashboard Based on my analysis of the performance data and code changes, here's the comprehensive performance impact assessment: Performance Analysis SummaryCritical Function Performance ChangesPrimary Performance Degradation
Secondary Performance Impact
KPI Impact Analysis1. Tokens Per Second ImpactStatus: No Direct Impact Expected Analysis: The degraded functions are located in regex processing components, not in core inference functions:
Conclusion: Based on the reference that 2ms slower 2. Power Consumption ImpactStatus: Minimal Impact Affected Binary:
Root Cause: Increased CPU cycles from inefficient STL container operations in regex processing. 3. Quantization EfficiencyStatus: No Impact Analysis: No changes detected in quantization-related functions:
4. Memory UsageStatus: Potential Indirect Impact Affected Areas:
No Direct Impact on core memory management functions:
5. Batch ProcessingStatus: No Impact Analysis: Core batch processing functions show no performance degradation:
Root Cause AnalysisAssembly-Level Issues
Code Changes ContextThe performance degradation appears unrelated to PR #69 (PanguEmbedded model addition), suggesting:
Action ItemsImmediate Code-Level Actions
Build System Actions
Performance Monitoring Focus
ConclusionThe identified performance regression is isolated to regex processing components and does not directly impact core inference performance metrics. The 0.169% power consumption increase in |
94381d7 to
0eeb29b
Compare
8a26d77 to
b1d9e01
Compare
Mirrored from ggml-org/llama.cpp#16941
Add a new model openPangu-Embedded-1/7B-V1.1.
Yu can get the the model from model path.