I. Introduction
As Data size increases and Moore's law slows down, alternative in-memory computing (IMC) architectures are being studied and used to efficiently process information. Conventional analog IMC often uses crossbar array architectures and nonvolatile memory (NVM) such as resistive random-access memory (RRAM) and has shown many applications in neural network acceleration [1], [2], [3], neuromorphic computing [4], statistical learning [5], signal processing [6], [7], scientific computing [8], [9], and other fields. However, there is a challenge extending these architectures to operations other than matrix dot multiplication. Existing IMC architectures rely on extensive analog to digital conversion (ADC) and digital to analog conversion (DAC) to switch between matrix multiplication and other computations such as activation functions [10]. The consequent conversion cost greatly dilutes efficiency in both power and speed; e.g., data converters can consume around 85% of the total energy in a typical RRAM-based neural network accelerator [11]. This energy overhead is especially critical for edge computing.