An approach for processing compressed recommendation systems on ReRAM chip

10.30.24 | Higher Education Press


Random, sparse embedding lookup operations are the main performance bottleneck in recommendation systems. ReRAM-based processing-in-memory (PIM) can resolve this problem by processing embedding vectors where they are stored. However, the embedding table can easily exceed the capacity of a monolithic ReRAM-based PIM chip.
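To make the lookup bottleneck concrete, here is a minimal sketch of an embedding lookup in plain Python. The function name `embedding_bag`, the sum pooling, and the toy table sizes are illustrative assumptions and are not taken from the paper.

```python
import random

def embedding_bag(table, indices):
    """Sum-pool the rows of `table` selected by `indices` (pooling choice is an assumption)."""
    dim = len(table[0])
    pooled = [0.0] * dim
    for idx in indices:
        # Each index points at an unrelated row, so consecutive reads
        # land in scattered memory locations.
        row = table[idx]
        for d in range(dim):
            pooled[d] += row[d]
    return pooled

# Hypothetical toy table: 1000 rows of 4-dimensional embeddings.
rng = random.Random(0)
num_rows, dim = 1000, 4
table = [[float(r * dim + d) for d in range(dim)] for r in range(num_rows)]

# A sparse request touches only a few rows out of the whole table.
indices = rng.sample(range(num_rows), 3)
pooled = embedding_bag(table, indices)
```

Very little arithmetic is done per row fetched, so the cost is dominated by moving data rather than by computation. That is the situation in which processing the vectors where they are stored is attractive.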
To address these problems, a research team led by Hai Jin published new research on 15 October 2024 in Frontiers of Computer Science, co-published by Higher Education Press and Springer Nature.
The team deploys a decomposed (compressed) model on-chip and leverages the high computing efficiency of ReRAM to compensate for the performance loss caused by decompression. They propose ARCHER, a ReRAM-based PIM architecture that implements fully on-chip recommendation under resource constraints.
The team analyzes the access and computation patterns of decompression. Based on these observations, the operations of each layer of the decomposed model are unified into multiply-and-accumulate operations, and a hierarchical mapping scheme is proposed to maximize resource utilization. Under this unified computation and mapping strategy, the team coordinates the processing pipeline. Experimental results show that ARCHER can support large, practical recommendation models on a monolithic ReRAM chip while surpassing existing solutions in performance and energy savings.
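As a rough illustration of how a lookup on a decomposed table can be written purely as multiply-and-accumulate (MAC) steps, the sketch below assumes a simple two-factor low-rank decomposition, where the table is approximated by `factor_a` times `factor_b`. The article does not specify the decomposition ARCHER uses, so the factorization, the function names, and the toy values are all hypothetical.

```python
def mac(vec, mat):
    """Vector-matrix product written as explicit multiply-and-accumulate steps."""
    out = [0.0] * len(mat[0])
    for k, v in enumerate(vec):
        for d, w in enumerate(mat[k]):
            out[d] += v * w
    return out

def one_hot(idx, size):
    """Encode a row index as a one-hot vector so that selection is also a MAC."""
    return [1.0 if i == idx else 0.0 for i in range(size)]

def decompressed_lookup(factor_a, factor_b, idx):
    # Layer 1: one-hot times factor_a selects the compressed row.
    selected = mac(one_hot(idx, len(factor_a)), factor_a)
    # Layer 2: compressed row times factor_b reconstructs the embedding.
    return mac(selected, factor_b)

# Hypothetical factors: 3 rows x rank 2, and rank 2 x dimension 4.
factor_a = [[1.0, 0.0], [0.0, 2.0], [1.0, 1.0]]
factor_b = [[1.0, 2.0, 3.0, 4.0], [0.5, 0.5, 0.5, 0.5]]

row = decompressed_lookup(factor_a, factor_b, 2)
print(row)  # [1.5, 2.5, 3.5, 4.5]
```

Both layers reduce to the same vector-matrix primitive, which is the kind of operation a ReRAM crossbar performs in place. In this toy example the factors are not smaller than the dense table; under this assumed factorization, storage savings appear only when the number of rows is much larger than the rank.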
DOI: 10.1007/s11704-023-3397-x

Journal: Frontiers of Computer Science

Method of Research: Experimental study

Subject of Research: Not applicable

Article Title: ARCHER: a ReRAM-based accelerator for compressed recommendation systems

Article Publication Date: 15-Oct-2024

Contact Information

Rong Xie
Higher Education Press
xierong@hep.com.cn

How to Cite This Article

APA:
Higher Education Press. (2024, October 30). An approach for processing compressed recommendation systems on ReRAM chip. Brightsurf News. https://www.brightsurf.com/news/8X5O39Y1/an-approach-for-processing-compressed-recommendation-systems-on-reram-chip.html
MLA:
"An approach for processing compressed recommendation systems on ReRAM chip." Brightsurf News, Oct. 30 2024, https://www.brightsurf.com/news/8X5O39Y1/an-approach-for-processing-compressed-recommendation-systems-on-reram-chip.html.