The compare operation is widely used in many applications, from fundamental sorting to primitive operations in the database and AI systems. We present SRAM-based 3-D-CAM circuit designs using a monolithic 3-D (M3D) integration process for realizing beyond-Boolean in-memory compare operation without any area overheads. We also fabricated a processing-in-memory (PiM) macro with the same 3-D-CAM circuit using M3D for performing massively parallel compare operations used in the database, machine learning, and scientific applications. We show various system designs with the 3-D-CAM supporting operations, such as data filtering, sorting, and sparse matrix–matrix multiplication (SpGEMM). Our systems exhibit up to <inline-formula> <tex-math notation="LaTeX">$272\times $ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$200\times $ </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">$226\times $ </tex-math></inline-formula> speedups and <inline-formula> <tex-math notation="LaTeX">$151\times $ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">$37\times $ </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">$156\times $ </tex-math></inline-formula> energy savings compared to systems using near memory compute for the data filtering, sorting, and SpGEMM applications, respectively.