by elromulous on 6/23/24, 4:13 PM with 5 comments
by Nokinside on 6/23/24, 4:41 PM
Speedup is realistic. Multicore SMP or NUMA are not good for memory access patterns they optimize.
Their thick control flow model that should work for exclusive matrix-addition and log-prefix style memory access patterns. In comparison to the baseline the speedup is 150% in log-prefix algorithm, over 190% in fft-style butterfly algorithm, 50-100% in matrix addition and threshold filtering. silicon area and power consumption are estimated to be low.
Light reading material:
Optimizing Memory Access in TCF Processors with Compute-Update Operations Optimizing Memory Access in TCF Processors with Compute-Update Operations https://ieeexplore.ieee.org/document/9150423
The REPLICA on-chip network https://ieeexplore.ieee.org/document/7792877/
Preliminary Performance and Memory Access Scalability Study of Thick Control Flow Processors https://ieeexplore.ieee.org/document/10305463/
Realizing multioperations and multiprefixes in Thick Control Flow processors https://linkinghub.elsevier.com/retrieve/pii/S01419331230005...
by wmf on 6/24/24, 12:11 AM
by elromulous on 6/23/24, 4:13 PM
by bboreham on 6/23/24, 9:57 PM