Presentation Information

[SO-PS-02-31 (Late News)]Hybrid SLC/MLC ReRAM for Quantized KV Cache in Transformer-based LLM

〇Shota Suzuki1, Naoko Misawa1, Chihiro Matsui1, Ken Takeuchi1,2 (1. Department of Electrical Engineering and Information Systems, The University of Tokyo (Japan), 2. Systems Design Lab., Graduate School of Engineering, The University of Tokyo (Japan))