Accepted Paper List 2025

FCCM 2025 Accepted Paper List

Paper IDFormatTitleAuthors
6ShortLLM4DV: Using Large Language Models for Hardware Test Stimuli GenerationZixi Zhang (University of Cambridge), Balint Szekely (Imperial College London), Pedro Gimenes (Imperial College London), Greg Chadwick (lowRISC), Hugo McNally (lowRISC), Jianyi Cheng (University of Edinburgh), Robert Mullins (University of Cambridge), Yiren Zhao (Imperial College London)
16LongMoyogi: A Memory-centric Accelerator for Low-Latency Random Forest Inference on Embedded DevicesAlessandro Verosimile (Politecnico di Milano), Francesco Peverelli (Politecnico di Milano), Marco D. Santambrogio (Politecnico di Milano)
49LongFPGA-based Approximate Multiplier for FP8Ruiqi Chen (Vrije Universiteit Brussel), Yangxintong Lyu (Vrije Universiteit Brussel), Han Bao (Vrije Universiteit Brussel), Jiayu Liu (University College London), Yanxiang Zhu (VeriMake Innovation Lab), Shidi Tang (Southeast University), Ming Ling (Southeast University), Bruno da Silva (Vrije Universiteit Brussel)
50LongBanked Memories for Soft SIMT Processorsmartin langhammer (Altera, Imperial College London), George A. Constantinides (Imperial College London, UK)
51LongHighWave: Large-scale High-Bandwidth Wave Simulations on FPGAsDimitrios Gourounas (The University of Texas at Austin), Austin G. James (The University of Texas at Austin), Bagus Hanindhito (The University of Texas at Austin), Arash Fathi (ExxonMobil), Lizy K. John (The University of Texas at Austin), Andreas Gerstlauer (The University of Texas at Austin)
60LongRealProbe: An Automated and Lightweight Performance Profiler for In-FPGA Execution of High-Level Synthesis DesignsJiho Kim (Georgia Institute of Technology), Cong (Callie) Hao (Georgia Institute of Technology)
74ShortHP-FFT: A General High-Performance FFT Generator Using High-Level SynthesisChengyue Wang (UCLA), Jiahao Zhang (UCLA), Yingquan Wu (Tenafe, Inc.), Jason Cong (UCLA)
81LongInTAR: Inter-Task Auto-Reconfigurable Accelerator Design for High Data Volume Variation in DNNsZifan He (University of California, Los Angeles), Anderson Truong (University of California, Los Angeles), Yingqi Cao (University of California, San Diego), Jason Cong (University of California, Los Angeles)
82LongEfficiency, Expressivity, and Extensibility in a Close-to-Metal NPU Programming InterfaceErika Hunhoff (University of Colorado, Boulder), Joseph Melber (AMD), Kristof Denolf (AMD), Andra Bisca (AMD), Samuel Bayliss (AMD), Stephen Neuendorffer (AMD), Jeff Fifield (AMD), Jack Lo (AMD), Phil James-Roxby (AMD), Eric Keller (University of Colorado, Boulder), Pranathi Vasireddy (AMD)
84LongIceSpy: Reconfigurable Edge Accelerator for Scalable and Private Structural Health MonitoringAlexandra Zhang Jiang (University of California, Irvine), Jonathan Ta (University of California, Irvine), Yuqiao Li (University of California, Irvine), Zhou Li (University of California, Irvine), Nalini Venkatasubramanian (University of California, Irvine), Monica D. Kohler (California Institute of Technology), Sang-Woo Jun (University of California Irvine)
99LongHBMex: An Attachment for Nonbursting Accelerators to Enhance HBM PerformanceCanberk Sonmez (EPFL), Mohamed Shahawy (EPFL), Paolo Ienne (EPFL)
102LongAutoNTT: Automatic Architecture Design and Exploration for Number Theoretic Transform Acceleration on FPGAsDilshan Sampath Kumarathunga Then Kuttiyage (Simon Fraser University), Qilin Hu (Hunan University), Zhenman Fang (Simon Fraser University)
103LongChronbench: An Incremental HDL Benchmark SuiteZakary Nafziger (University of British Columbia), Steve Wilton (University of British Columbia)
109LongN-TORC: Native Tensor Optimizer for Real-time ConstraintsSuyash Vardhan Singh (University of South Carolina), Iftakhar Ahmad (University of South Carolina), Miaoqing Huang (University of Arkansas), David Andrews (University of Arkansas), Austin Downey (University of South Carolina), Jason D. Bakos (University of South Carolina)
114LongHigh Throughput Matrix Transposition on HBM-Enabled FPGAsYang Yang (University of Southern California), Kyle Tseng (University of Southern California), Viktor Prasanna (University of Southern California), Rajgopal Kannan (DEVCOM Army Research Lab)
121ShortFREEDOM: FPGA-based Hardware Redaction EmulatorBenjamin Carrion Schaefer (The University of Texas at Dallas), Chaitali Gajanan Sathe (The University of Texas at Dallas), Yiorgos Makris (The University of Texas at Dallas)
129LongGuaranteed Yet Hard to Find: Uncovering FPGA Routing Convergence ParadoxShashwat Shrivastava (EPFL), Stefan Nikolić (University of Novi Sad), Sun Tanaka (The University of Tokyo), Chirag Ravishankar (AMD), Dinesh Gaitonde (AMD), Mirjana Stojilovic (EPFL)
141LongTransfer Learning on the Edge for a Wireless Application Using an SoC PlatformYiyue Jiang (Northeastern University), John Dooley (Maynooth University), Aidan Edward Colgan (Maynooth University), Zhilin Ren (Northeastern University), Jonathan Guimaraes Ribeiro (Maynooth University), Miriam Leeser (Northeastern University)
148LongEfficient and Distributed Computation of Electron Repulsion Integrals on AI EnginesJohannes Menzel (Paderborn University), Christian Plessl (Paderborn University)
158LongAn Efficient FPGA-based Hardware Accelerator of Fully Quantized Mamba-2Kailing Zhou (Sun Yat-sen University), Han Jiao (Sun Yat-sen University), Wenjin Huang (Sun Yat-sen University), Yihua Huang (Sun Yat-sen University)
173LongNeuraLUT-Assemble: Hardware-aware Assembling of Sub-Neural Networks for Efficient LUT InferenceMarta Andronic (Imperial College London, UK), George A. Constantinides (Imperial College London, UK), Marta Andronic (Imperial College London)
233LongITERA-LLM: Boosting Sub-8-Bit Large Language Model Inference Through Iterative Tensor DecompositionYinting Huang (Imperial College London), Keran Zheng (Imperial College London), Zhewen Yu (Imperial College London), Christos-Savvas Bouganis (Imperial College London)
235LongA partitioning-based CAD flow for interposer-based multi-die FPGAsMahesh A. Iyer (Altera), Andrew Kahng (University of California San Diego), Jason Luu (Intel), Bodhisatta Pramanik (University of California San Diego), Kristofer Vorwerk (Altera), Grace Zgheib (Altera)
246LongSMART: High-Performance SAR ATR through Model-Architecture Co-Design on FPGASachini Wickramasinghe (University of Southern California), Yi-Chien Lin (University of Southern California), Cauligi Raghavendra (University of Southern California), Viktor Prasanna (University of Southern California)
342ShortSoftCUDA: Running CUDA on Softcore GPUChihyo Ahn (Georgia Institute of Technology), Ruobing Han (Georgia Institute of Technology), Udit Subramanya (Georgia Institute of Technology), Jisheng Zhao (Georgia Institute of Technology), Blaise Tine (University of California, Los Angeles), Hyesoon Kim (Georgia Tech)
402LongNoH: NoC Compilation in High Level SynthesisJake Ke (UCLA), Sihao Liu (UCLA), Licheng Guo (RapidStream Design Automation, Inc.), Zifan He (University of California, Los Angeles), Suhail Basalama (University of California, Los Angeles), Linghao Song (Yale University), Yuze Chi (RapidStream Design Automation, Inc.), Jason Cong (UCLA), Tony Nowatzki (UCLA), Huifeng Ke (University of California, Los Angeles)