SASIMI 2016
The 20th Workshop on Synthesis And System Integration of Mixed Information Technologies
Technical Program

Remark: The presenter of each paper is marked with "*".

Technical Program: SIMPLE version DETAILED version with abstract

Author Index: HERE

Session Schedule

Monday, October 24, 2016


Registration - 9:00
Opening 9:00 - 9:20
K Keynote Speech 9:20 - 10:20
R1 POSTER I 10:20 - 11:50
Lunch 11:50 - 13:20
I1 Invited Talk 13:20 - 14:10
R2 POSTER II 14:10 - 15:40
20th Memorial Ceremony 15:40 - 15:55
D Panel 15:55 - 17:40

Tuesday, October 25, 2016


I2 Invited Talk 9:10 - 10:00
R3 POSTER III 10:00 - 11:50
Lunch 11:50 - 13:20
I3 Invited Talk 13:20 - 14:10
R4 POSTER IV 14:10 - 16:00
I4 Invited Talk 16:00 - 16:50
Closing 16:50 - 17:00

List of papers

Remark: The presenter of each paper is marked with "*".

Monday, October 24, 2016

Keynote Speech
Time: 9:20 - 10:20 Monday, October 24, 2016
Chair: Nagisa Ishiura (Kwansei Gakuin University, Japan)

K-1 (Time: 9:20 - 10:20)

Title	Cyber-Medical Systems: Requirements, Components and Design
Author	*Giovanni De Micheli (EPFL, Switzerland)
Page	p. 1
Abstract	We are continuously witnessing a relentless growth of computing power, storage capacity and communication bandwidth as well as a major trend in biomedical sciences to become more quantitative and amenable to benefit from the support of electronic systems. Moreover, societal and economic needs push us to develop and adopt health-management approaches that are more effective, less expensive and flexible enough to be personalized to individual and community needs. Within this frame, distributed data acquisition and control systems, i.e., cyberphysical systems, start playing an important role in health care. Examples include, but are not limited to, remote patient monitoring, emergency care as well as routine care. These examples benefit from organized and optimized means to quantify clinical data, handle large data sets as well as controlling and personalizing therapy and drug administration. Current electronic devices and systems need to grow in various directions to satisfy the quality needs for health care. Current semiconductor products have to incorporate bio-chemical interfaces, such as sensors, to perform data acquisition directly. The experience in electronic semicustom design and in platform-based design can be ported successfully to integrated sensing devices, where modularity and regularity can be key to reducing non-recurrent engineering costs. The fusion of sensing and microelectronic technologies, as well as the ability of volume production of integrated sensing systems that can be personalized in the very back end of the line or after fabrication is an important scientific and commercial goal. Field-programmable sensing arrays can enable inexpensive multi-panel sensing for various medical applications. Electronic design automation is a key technology to realize cyber-medical systems. Examples of specific EDA tools and methods encompass physical design of integrated sensors and their coupling to electronics, simulation of complex systems with bio-chemical stimuli, synthesis of decision making circuitry based on plurality of inexact inputs, policies design for therapies exploiting on-line data acquisition, and verification of life-critical applications under broadly-varying and unpredictable input conditions. Overall, cyber-medical systems represent an important and large market opportunity. EDA is a necessary underlying technology to realize the promises of better and less expensive care for everyone.

POSTER I
Time: 10:20 - 11:50 Monday, October 24, 2016
Chairs: Yoshinori Takeuchi (Osaka University, Japan), Kazuya Tanigawa (Hiroshima City University, Japan)

R1-1 (Time: 10:20 - 10:22)

Title	Detecting Missed Arithmetic Optimization in C Compilers by Differential Random Testing
Author	*Mitsuyoshi Iwatsuji, Atsushi Hashimoto, Nagisa Ishiura (Kwansei Gakuin University, Japan)
Page	pp. 2 - 3
Keyword	randomtest, compiler, optimization, differential testing
Abstract	In addition to the correctness of the generated code, it is one of the important test goals to verify that optimization modules work as intended. Besides the existing equivalence-based method, this paper presents a method of detecting missed optimization opportunities in C compilers by differential random testing. Randomly generated test programs are compiled by different compilers and the resulting codes are compared to detect missed optimization.

R1-2 (Time: 10:22 - 10:24)

Title	Symmetric Segmented Delta Encoding for Wireless Sensor Data Compression
Author	*Shu-Ping Liang, Yi-Yu Liu (Yuan Ze University, Taiwan)
Page	pp. 4 - 9
Keyword	Delta encoding, Huffman encoding
Abstract	Wireless sensor networks (WSNs) are utilized for various applications such as environmental monitoring, urban surveillance, home security, etc. Ideally, the massively deployed sensor nodes should be inexpensive, power-efficient, and reliable to maximize the functional lifetime. However, the data-transmission energy consumption has become one of the most challenging issues. To minimize the data-transmission energy cost in wireless applications, we propose a lossless Symmetric Segmented Delta encoding (SSD encoding) algorithm, which exploits high similarity of environmental sensing data in a short period of time. According to the experimental results, our encoding algorithm achieves better sensor data compression ratios and at the same time requires less hardware resource for wireless sensor data.

R1-3 (Time: 10:24 - 10:26)

Title	Register-Bridge Architecture and its Application to Multiprocessor Systems
Author	*Takafumi Fujii, Shinichi Nishizawa, Kazuhito Ito (Saitama University, Japan)
Page	pp. 10 - 15
Keyword	multiprocessor, register-bridge, data communication delay, parallel processing
Abstract	The interconnection delay in data transfer is becoming the dominant factor to restrain the improvement of the maximum clock frequency of LSIs. The regular distributed register (RDR) architecture is proposed where data transfer between the islands is separated from the computation and local data access, and distant data transfer is done using multiple clock cycles. In this paper a novel register-bridge (RB) architecture is proposed so that data transfer between adjacent islands is done through bridge registers in between the islands, thereby the necessary number of clock cycles for data transfer would be reduced. The experimental results show about 11 % reduction in the latency on average when example procedures are implemented on a multiprocessor system based on the RB architecture.

R1-4 (Time: 10:26 - 10:28)

Title	Hardware Accelerator of Convolutional Neural Network for Image Recognition and its Performance Evaluation Platform
Author	*Takayuki Ujiie, Masayuki Hiromoto, Takashi Sato (Kyoto University, Japan)
Page	pp. 16 - 17
Keyword	Image Recognition, Embedded Systems, Low Power Design
Abstract	Convolutional neural network (CNN) is widely used for realizing accurate image recognition systems. However, its computational cost is extremely high, which increases power consumption of embedded computer vision systems. This paper presents a hardware accelerator for CNN and its performance evaluation platform for co-designing efficient algorithm and hardware. Through the performance evaluation of two types of CNN algorithms, we demonstrated that our proposed platform is useful for making quantitative comparison of both algorithm and hardware performance among different CNN algorithms.

R1-5 (Time: 10:28 - 10:30)

Title	On a Radiation Resistant Data-Path and Controller Synthesis
Author	*Keisuke Inoue (Kanazawa Technical College, Japan)
Page	pp. 18 - 22
Keyword	high-level synthesis, radiation resistance
Abstract	This paper discusses a high-level design of an application specific integrated circuit with radiation resistance. A soft error occurs when a radiation event causes enough of a charge disturbance to reverse or flip the data state of a register. We introduce Error-Correcting Code register (ECC register), a type of data storage that can detect and correct data corruption. We consider data-path synthesis with ECC register, and controller synthesis with radiation resistance.

R1-6 (Time: 10:30 - 10:32)

Title	A Heuristic Decompositions Index Generation Functions with Many Variables
Author	Tsutomu Sasao, *Kyu Matsuura, Yukihiro Iguchi (Meiji University, Japan)
Page	pp. 23 - 28
Keyword	Logic design, Functional Decomposition, Heuristic, Monte Carlo
Abstract	This paper shows a heuristic method to decompose index generation functions with many variables. Three different measures are used to select the bound variables. Experimental results shows that this method finds fairly good decompositions in a short time. Comparison with Monte Carlo method is presented.

R1-7 (Time: 10:32 - 10:34)

Title	Debugging of Reconfigurable Single-Electron Transistor Arrays
Author	Wen-Chun Zeng, *Shih-Hsiang Liu, Yu-Da Chen, Yung-Chih Chen (Yuan Ze University, Taiwan)
Page	pp. 29 - 30
Keyword	Debugging, Satisfiability, Single-Electron Transistor Array
Abstract	This paper proposes an automatic debugging method for single-electron transistor arrays. The method iteratively calls a SAT solver to find a counterexample and analyzes the counterexample to identify errors. It can fix an incorrect SET array which can be corrected by changing an edge's configuration. The experimental results show that the proposed debugging method is efficient and effective. It finds all the possible corrections for an incorrect SET array within an average of 0.021 seconds.

R1-8 (Time: 10:34 - 10:36)

Title	Layer Assignment for Multi-Power-Mode 3D IC Designs with Power Distribution Networks Considered
Author	*Shih-Hsu Huang, Jian-Zhi Shen, Chun-Hua Cheng (Chung Yuan Christian University, Taiwan)
Page	pp. 31 - 35
Keyword	Layer Assignment, Multiple Power Modes, 3D IC, Footprint Area, High-Level Design Stage
Abstract	The design of multiple power modes has been recognized as an effective method to reduce the power consumption. However, the layer assignment of multi-power-mode designs based on 3D IC architecture has not been well studied. In this paper, we demonstrate that the layer assignment result has a significant impact on the areas of power distribution networks. As a result, there is a demand to take the area overheads of power distribution networks into account during the layer assignment stage. Based on this observation, we present the first work to perform the layer assignment of multi-power-mode 3D IC designs with the areas of power distribution networks considered. We use an integer linear programming (ILP) approach to formally draw up this problem. Different from previous works, our approach not only considers the areas of modules and through-silicon-vias (TSVs) but also considers the areas of power distribution networks. Benchmark data show that our approach can greatly reduce the footprint area (including module areas, TSV areas, and power distribution network areas) without affecting the circuit performance.

R1-9 (Time: 10:36 - 10:38)

Title	A Processor Architecture Integrating Voltage Scalable On-Chip Memories for Individual Tracking of Minimum Energy Points in Logic and Memory
Author	*Jun Shiomi, Tohru Ishihara, Hidetoshi Onodera (Kyoto University, Japan)
Page	pp. 36 - 41
Keyword	Dynamic Voltage and Frequency Scaling, Adaptive Body Basing, Minimum Energy Point Tracking, Standard-cell based memory
Abstract	A RISC processor with standard-cell based memories (SCMs) is designed to investigate a minimum energy point in a 65-nm FDSOI process technology. Measurement results show that the minimum operating voltage of the processor is 0.3V and the minimum energy consumption of 0.16nJ/cycle is achieved at 0.5V supply voltage. Simultaneous scaling of supply voltage and body bias is applied to a logic part and SCMs individually. As a result, the minimum energy consumption of the processor is reduced to 0.10nJ/cycle.

R1-10 (Time: 10:38 - 10:40)

Title	CORP: Control Routing for Paper-Based Digital Microfluidic Biochips
Author	Qin Wang, Hailong Yao (Tsinghua University, China), *Tsung-Yi Ho (National Tsing Hua University, Taiwan), Yici Cai (Tsinghua University, China)
Page	pp. 42 - 47
Keyword	Biochip, Paper-based, Control Routing, Microfluidic
Abstract	Paper-based digital microfluidic biochips (P-DMFBs) have recently emerged as a promising low-cost and fast-responsive platform for biochemical assays. In P-DMFBs, electrodes and control lines are printed on a piece of photo paper using inkjet printer and conductive ink of carbon nanotubes (CNTs). Compared with traditional digital microfluidic biochips (DMFBs), P-DMFBs enjoy notable advantages, such as much faster in-place fabrication with printer and ink, much lower costs, better disposability, etc. Because electrodes and CNT control lines are printed on the same layer of a paper, a new design challenge for P-DMFB is to prevent the unfavorable interactions between moving droplets and the voltages on CNT control lines. These interactions may result in unexpected droplet movements and thus incorrect assay outputs. This paper proposes the first COntrol line Routing method for P-DMFBs named CORP, which effectively eliminates the negative effects of control lines on droplets. Experimental results on real-life chips demonstrate the effectiveness of CORP.

R1-11 (Time: 10:40 - 10:42)

Title	Analysis of Body Bias Control for Real Time Systems
Author	*Carlos Cesar Cortes Torres, Hayate Okuhara, Akram Ben Ahmed, Nobuyuki Yamasaki, Hideharu Amano (Keio University, Japan)
Page	pp. 48 - 53
Keyword	Body-Bias, Real-Time, low-power, SOTB, Embedded
Abstract	In the past decade, Real-time Systems (RTSs) have been widely studied. RTSs should maintain time constraints to avoid catastrophic consequences and should also be energy efficient as it can be embedded in devices where the battery life is primordial. This paper is the first study of introducing Dynamic Body Biasing to RTSs, we investigate the energy efficiency of RTSs by analyzing the ability of BB on providing a satisfying tradeoff between performance and energy. The study was conducted using accurate parameters extracted from real chip measurements of a low-power microcontroller using Silicon On Thin Box (SOTB) technology; with such proposal we were able to achieve 46% energy reduction.

R1-12 (Time: 10:42 - 10:44)

Title	Performance-Driven Multi-Layer OARST Construction with Steiner-Point Pre-Selection and Bounded Maze Routing
Author	*Kuen-Wey Lin, Yeh-Sheng Lin, Yih-Lang Li (Institute of Computer Science and Engineering, National Chiao Tung University, Taiwan), Rung-Bin Lin (Computer Science and Engineering, Yuan Ze University, Taiwan), Wen-Hao Liu (Cadence Design Systems, U.S.A.)
Page	pp. 54 - 59
Keyword	Steiner tree, Physical design, Obstacle-avoidance, Performance-driven, Multi-layer
Abstract	An algorithm for performance-driven (PD) multi-layer obstacle-avoiding rectilinear Steiner tree construction with routing constraints is presented. We develop a PD selection of Steiner points. With the proposed bounded maze routing space, we identify a set of candidates for Steiner points; then, three policies are proposed to evaluate them. Experimental results demonstrate our algorithm generates tree topologies with 56.44% improvement in the PD metrics and 41.68% improvement in delay compared with the state-of-the-art heuristic while requiring less routing cost.

R1-13 (Time: 10:44 - 10:46)

Title	Performance Improvement of General-Synchronous Circuits by Variable Latency Technique using Dynamic Timing-Error Detection
Author	*Shimpei Sato, Hiroshi Nakatsuka, Atsushi Takahashi (Tokyo Institute of Technology, Japan)
Page	pp. 60 - 65
Keyword	general-synchronous circuit, variable latency, timing-error detection
Abstract	General-synchronous circuits have a better performance compared to a complete-synchronous circuit. The performance of them is expected to be further improved by allowing speculative execution. In this paper, a high performance general-synchronous circuits with speculative execution is realized as a variable latency circuit by adopting error-detection and correction mechanism. In our proposed method, a circuit is designed by combining clock scheduling, delay insertion, and speculation effectively. In experiments, we confirmed that 6.7% performance improvement is achieved compared to a general-synchronous circuit with fixed latency.

R1-14 (Time: 10:46 - 10:48)

Title	FPGA Prototyping of a Smart Card Platform for Evaluating Tamper Resistance of Cryptographic Circuits
Author	*Hiroyuki Kanbara (ASTEM RI, Japan), Naoya Ito, Hinata Takebayashi (Kwansei Gakuin University, Japan), Muneyuki Takenae (Ritsumeikan University, Japan), Takashi Tsukamoto (Information-technology Promotion Agency, Japan)
Page	pp. 66 - 70
Keyword	FPGA, Prototyping, Smart Card, Tamper Resistance, Cryptography
Abstract	This article presents a smart card platform to evaluate tamper resistance of cryptographic circuits. Tamper resistance means difficulty of revealing sensitive information like cryptographic keys of a cryptographic device tampered with in order to make the device behave abnormally. User of this platform can manipulate their own cryptographic circuits which are connected to a co-processor bus circuit and attempt to extract the key inside the circuit in non-invasive way called side-channel attacks. A RSA encryption/decryption Circuit, an AES encryption/decryption circuit and a random number generation circuit are designed as a reference and integrated with the platform. The platform with these circuits is implemented using Xilinx FPGA.

R1-15 (Time: 10:48 - 10:50)

Title	Convolutional Neural Network Layer Reordering for Acceleration
Author	*Vijay Daultani, Subhajit Chaudhury, Kazuhisa Ishizaka (NEC, Japan)
Page	pp. 71 - 76
Keyword	CNN, Deep learning, Activation layer, Layer Reordering, Acceleration
Abstract	We propose a new optimization technique to speed up performance of convolution neural networks. One of the challenges for CNN is large execution time. Many CNN models are found to have a repeatable layer pattern, i.e. convolution, activation and pooling layer in that order. We show that, for a class of functions, performed in activation layer and pooling layer, it is possible to reconfigure CNN, to reduce the number of operations performed in a network, without changing output of the network. Experimental results demonstrate that using the proposed reconfiguration, we can reduce the total time for VGG by almost 5% on CPU and time for activation layer, by almost 75% for CPU, and by a range of 20% to 67% for GPU, for 2x2 max pooling kernel.

R1-16 (Time: 10:50 - 10:52)

Title	System Design of Vision-based Framework for Senior Driver Assistance
Author	*Eric Aliwarga, Koichi Mitsunari, Jaehoon Yu, Takao Onoye (Osaka University, Japan), Toshitaka Azuma, Mitsuhiko Koga (Vehicle Information and Communication System Center, Japan)
Page	pp. 77 - 80
Keyword	image recognition, machine learning, BDT
Abstract	System design of a video-based framework for senior driver assistance has been constructed. By utilizing sophisticated machine-learning schemes, e.g. Aggregated Channel Features and Boosted Decision Tree, status of traffic signal, vehicles ahead and oncoming, and existence of pedestrian, are effectively recognized from driver view video stream with the use of the unified classifier. By adjusting the length of boosting classifiers from 128 to 2,048, the identical hardware can treat all types of objects. Hardware system organization of the proposed approach is also described.

R1-17 (Time: 10:52 - 10:54)

Title	An FPGA Implementation of SVM for Type Identification with Colorectal Endoscopic Images
Author	*Takumi Okamoto, Tetsushi Koide, Anh-Tuan Hoang, Tatsuya Shimizu, Koki Sugi, Toru Tamaki, Tsubasa Hirakawa, Bisser Raytchev, Kazufumi Kaneda (Hiroshima University, Japan), Shigeto Yoshida, Hiroshi Mieno (Hiroshima General Hospital of West Japan Railway Company, Japan), Shinji Tanaka (Hiroshima University Hospital, Japan)
Page	pp. 81 - 86
Keyword	Computer-Aided Diagnosis, Endoscopic Images, Type Identification, Support Vector Machine, FPGA
Abstract	With the increase of colorectal cancer patients in recent years, the needs of quantitative evaluation of colorectal cancer are increased, and the computer-aided diagnosis (CAD) system which supports doctor's diagnosis is essential. In this paper, a hardware design of type identification module in CAD system for colorectal endoscopic images with narrow band imaging (NBI) magnification is proposed for real-time processing of full high definition image (1920 x 1080 pixel). A pyramid style image segmentation with SVMs for multi-size scan windows, which can be implemented on an FPGA with small circuit area and achieve high accuracy, is proposed for actual complex colorectal endoscopic images.

Invited Talk
Time: 13:20 - 14:10 Monday, October 24, 2016
Chair: Mineo Kaneko (Japan Advanced Institute of Science and Technology, Japan)

I1-1 (Time: 13:20 - 14:10)

Title	Computing in the IoT Era, Opportunities and Challenges
Author	*David Chen (ARM, China)
Page	p. 87
Abstract	IoT or Internet of Things, there is nothing ground breaking in this definition, but why is this term gaining some much traction in the last few years? In this presentation, I will start with the plethora of architectures and technologies used in IoT, highlighting the challenges and opportunities. I will present these in the context of hardware, software and tools. And then, I will present the ARM ecosystem contribution, including these not only in the context of hardware, software and tools, but also in education which is an important element of the mix. Finally, a number of conclusions including a speculative view of the mid-to-long term future of IoT will be given.

POSTER II
Time: 14:10 - 15:40 Monday, October 24, 2016
Chairs: Masato Inagi (Hiroshima City University, Japan), Seiya Shibata (NEC Corp., Japan)

R2-1 (Time: 14:10 - 14:12)

Title	Random Testing of Back-end of Compiler Infrastructure LLVM
Author	*Kenji Tanaka, Nagisa Ishiura (Kwansei Gakuin University, Japan), Masanari Nishimura, Akiya Fukui (Renesas System Design, Japan)
Page	pp. 88 - 89
Keyword	LLVM, LLVM IR, random test, Compiler Infrastructure
Abstract	This paper presents a method of directly testing back-ends of the LLVM compiler infrastructure by randomly generated LLVM IR (intermediate representation). Using LLVM, a compiler for a new target can be developed only by implementing a machine dependent back-end, then the test of the back-end become a focusing issue. The proposed method generates random LLVM IR assembly codes containing arithmetic operations which are difficult to test by C programs.

R2-2 (Time: 14:12 - 14:14)

Title	Retention-aware Refresh Techniques for DRAM Refresh Power Reduction
Author	*Wei-Kai Cheng, Po-Yuan Shen (Chung Yuan Christian University, Taiwan)
Page	pp. 90 - 93
Keyword	DRAM, Refresh
Abstract	DRAM circuit requires periodic refresh operations to prevent data loss. However, DRAM refresh incurs extra power consumption and degrades system performance due to delaying of memory requests service. As DRAM density increases, DRAM refresh overhead is even worsened due to the increase of refresh cycle time. To address this problem, we propose a retention-aware auto-refresh (RAAR) technique by all-bank, partial-bank, and per-bank to reduce unnecessary refresh operations according to the DRAM cells’ retention time. Experimental results show that our RAAR technique not only reduces refresh power effectively, but also improves the memory access performance.

R2-3 (Time: 14:14 - 14:16)

Title	Stochastic Number Generation with Internal Signals of Logic Circuits
Author	*Naoya Kubota, Hideyuki Ichihara, Tsuyoshi Iwagaki, Tomoo Inoue (Hiroshima City University, Japan)
Page	pp. 94 - 95
Keyword	Stochastic Computing, Stochastic number, chi-square-value
Abstract	Stochastic computing (SC) is an approximate computation with random numbers. In this study, we propose a new SC scheme in which internal signals of logic circuits are exploited for generating random numbers. This scheme can eliminate random number generators (e.g., linear-feedback shift-registers), and accordingly reduce the hardware cost for SC without losing the accuracy.

R2-4 (Time: 14:16 - 14:18)

Title	A Branch-and-Bound Algorithm for Scheduling of Data-Parallel Tasks
Author	*Yang Liu, Lin Meng, Ittetsu Taniguchi, Hiroyuki Tomiyama (Ritsumeikan University, Japan)
Page	pp. 96 - 100
Keyword	task scheduling, multicore, data parallelism, branch-and-bound
Abstract	This paper studies a task scheduling problem which schedules a set of data-parallel tasks on multiple cores. Unlike most of previous literature where each task is assumed to run on a single core, this work allows individual tasks to run on multiple cores in a data-parallel fashion. Since the scheduling problem is NP-hard, a couple of heuristic algorithms which find near-optimal schedules in a short time were proposed so far. In some cases, however, exactly-optimal schedules are desired, for example, in order to evaluate heuristic algorithms. This paper proposes an exact algorithm to find optimal schedules in a reasonable time. The proposed algorithm is based on depth-first branch-and-bound search. In the experiments, the proposed algorithm could successfully find optimal schedules for task-sets of 50 tasks in a practical time.

R2-5 (Time: 14:18 - 14:20)

Title	Automatic Enable Candidate Extraction for Backward Sequential Clock Gating
Author	*Shinji Kimura, Tomoya Goto, Masao Yanagisawa (Waseda University, Japan)
Page	pp. 101 - 106
Keyword	CLock gating, Satisfiability
Abstract	Clock gating is a widely used dynamic power reduction method for LSI by stopping clock changes to registers. There needs an enable signal to apply clock gating, and automatic enable detection and selection has been worked hard. Recently clock gating considering the past or the future time steps has been paid attention as sequential clock gating. The paper proposes an automatic extraction method of candidates of backward sequential clock gating control using the satisfiability condition on time expanded circuit. The detected candidate can also be used for forward sequential clock gating.

R2-6 (Time: 14:20 - 14:22)

Title	A Decision Diagram to Analyze Probabilistic Behavior of Circuits
Author	*Kodai Abe, Shigeru Yamashita (Ritsumeikan University, Japan)
Page	pp. 107 - 112
Keyword	Decision Diagram, Probability Analysis, Binary Decision Diagram for Probabilities
Abstract	In near future, we may encounter serious problems in digital circuits due to probabilistic behaviors that are caused by many factors, e.g., soft-errors, variability of transistors, etc. Thus, we need to develop a good methodology to evaluate how logic circuit is dependable even in the logic design level. To do so, this paper proposes a framework to use what we call probability functions to calculate the error probability of the circuit. Then, we can naturally define an efficient data structure called Binary Decision Diagram for Probabilities (BDDP) to manipulate probability functions. To successfully utilize this new data structure, we carefully define the rule to maintain the canonical property of BDDPs, and also define recursive operations to calculate the AND/OR/NOT relations of probability functions. Our operations can be implemented efficiently as the conventional BDD operations. From our preliminary experiments we would expect a possibility of BDDPs which should be verified further in our future work.

R2-7 (Time: 14:22 - 14:24)

Title	Random Delay Elements for Tamper Resistant Asynchronous Circuits based on 2-phase Handshaking Protocol
Author	*Daiki Toyoshima, Tatsuya Ishikawa, Atsushi Kurokawa, Masashi Imai (Hirosaki University, Japan)
Page	pp. 113 - 118
Keyword	tamper resistant, asynchronous circuit, random delay, 2-phase handshaking protocol
Abstract	Side-channel-attacks have become one of the serious issues in the modern VLSI systems. In this paper, we propose a design method of tamper-resistant asynchronous circuits using random delay elements. We also present the circuit structure of random delay elements for asynchronous circuits based on the 2-phase handshaking protocol which does not distinguish between a rising edge and a falling edge. The delay cells are designed and evaluated using 0.18μm process technology and their characteristics are shown.

R2-8 (Time: 14:24 - 14:26)

Title	Optimization of Temperature Dependent Intentional Skew for Temperature Aware Timing Design
Author	*Makoto Soga, Mineo Kaneko (Japan Advanced Institute of Science and Technology, Japan)
Page	pp. 119 - 124
Keyword	clock skew, temperature, variation
Abstract	In this paper, the optimization of temperature dependent intentional skew for extending operating temperature range of LSI is proposed. Our discussion is based on the linear temperature dependency of delays, and assume a most general form of temperature dependency in intentional skews. The major contributions of this paper are (1) optimization algorithm for finding best parameter values of intentional skews, and (2) an optimum choice of temperature dependency of intentional skews for extending operating temperature range.

R2-9 (Time: 14:26 - 14:28)

Title	Performance Evaluation Platform for Programmable Interconnect Architecture Exploration
Author	Kohei Yamamoto, Toshiki Morioka, Tomoya Inoue, *Masataka Mori, Yukio Mitsuyama (Kochi University of Technology, Japan)
Page	pp. 125 - 128
Keyword	reconfigurable architecture, placement and routing, architecture exploration
Abstract	This paper proposed a performance evaluation platform for programmable interconnect architecture exploration. This platform enables repetitive modifications of programmable interconnect architecture, its performance evaluations based on application mapping results, and its RTL simulation with a configuration data obtained from an application mapping result, efficiently. In the platform, programmable interconnect architecture is defined by some parameters called wire-parameters, and the configuration files for placement and routing tools are automatically generated from the wire-parameters. We can evaluate the performance of reconfigurable architecture based on accurate path delay and area, which are estimated based on an application mapping result obtained by the placement and routing tools. Furthermore, this evaluation platform can generate RTL source codes of the reconfigurable array according to the wire-parameters, and enables an RTL simulation of the reconfigurable architecture with a configuration data obtained from application mapping result.

R2-10 (Time: 14:28 - 14:30)

Title	Efficient Standard Cell Layout Synthesis Algorithm Considering Various Driving Strengths
Author	*Hong-Yan Su, Bo-Shung Wang, Sin-Ye Hsieh, Yih-Lang Li (Department of Computer Science, National Chiao Tung University, Taiwan), I-Hsun Wu, Chang-Chung Wu, Wei-Chiang Shih (M31 Technology Corporation, Taiwan), Hidetoshi Onodera (Dept. of Communications and Computer Engineering, Kyoto University, Japan), Masanori Hashimoto (Department of Information Systems Engineering, Osaka University, Japan)
Page	pp. 129 - 134
Keyword	Cell layout synthesis, timing closure
Abstract	For reaching timing closure under distinct scenarios, such as different negative slacks, the required transistor sizes for a cell instance may not match any of the pre-designed cells in the library. In this paper we propose a timing–closure oriented cell layout synthesis algorithm to generate a high-driving cell layout accommodating demanded static folded large transistors. The proposed primitive-driving cell synthesis flow consists of two steps; (1) routability-aware dynamic-programming (DP) based transistor placement considering diffusion-shape constraint and pin-metal location for routability, and (2) resource-aware cell routing flow consisting of resource estimation scheme, simultaneous pattern routing and post-routing optimization. Higher-driving cells are synthesized based on the design of primitive-driving cells and static folded transistors with a LEGO-like assembling approach followed by the abovementioned resource-aware cell routing. Experimental results show that all the synthesized layouts have identical areas to handcrafted layouts of commercial 28nm cell library and use less metal 2 routing resource as well. Cell characterization outcome also demonstrates that the proposed cell synthesis algorithm can yield cell layouts with comparable performances to handicraft layouts of commercial cell library in terms of delay, leakage and power thanks to the proposed high-routability transistor placement and effective routing resource planning.

R2-11 (Time: 14:30 - 14:32)

Title	Fast Length-Matching Routing for Rapid Single Flux Quantum Circuits
Author	*Nobutaka Kito (Chukyo University, Japan), Kazuyoshi Takagi, Naofumi Takagi (Kyoto University, Japan)
Page	pp. 135 - 140
Keyword	routing, RSFQ circuits, simulated annealing, length-matching
Abstract	A fast routing method for Rapid Single Flux Quantum (RSFQ) digital circuits is proposed. The method deals with channel routing between adjacent columns of active devices. To perform routing considering difference between timing variabilities of active devices and those of passive transmission lines (PTLs) used for transmitting signals, the proposed method provides route of PTLs with specified additional length for delay insertion. The method is based on simulated annealing, and a special representation method of routing is proposed for searching solutions efficiently.

R2-12 (Time: 14:32 - 14:34)

Title	A Comparative Study on Multisource Clock Network Synthesis
Author	Wen-Hsin Chen, Chun-Kai Wang, *Hung-Ming Chen (NCTU Taiwan, Taiwan), Yih-Chih Chen, Cheng-Hong Tsai (Global Unichip Corp, Taiwan)
Page	pp. 141 - 145
Keyword	CTS, multisource, skew optimization
Abstract	Hybrid clock architecture offers a compromise between tree and mesh. While most of the relative works focus on tree-driven-mesh configuration, we are interested in the performance and optimization of multisource CTS flow provided by IC Compiler, which applies a coarse mesh with local sub-trees. Therefore, we analyze the QOR of conventional clock tree and multisource CTS on a real industrial design. We also propose several heuristic approaches to improving the performance of multisource CTS, especially for skew optimization. According to the experiment results, we reveal the benefits and drawbacks of each method, give some guidelines for determining the proper configuration for a design, and then summarize some future research directions.

R2-13 (Time: 14:34 - 14:36)

Title	Target Concentration Exploration for Reactant Minimization on Digital Microfluidic Biochips
Author	Yi-Ling Chen, Yung-Chun Lei, *Juinn-Dar Huang (National Chiao Tung University, Taiwan)
Page	pp. 146 - 151
Keyword	sample preparation, target concentration value determination, cost minimization, digital microfluidic biochip
Abstract	Lab-on-a-chip is one of the most sophisticated technologies, while sample preparation is the process that mixes buffer and reactant to achieve the desired target concentration. Since the reactant consumption takes a notable part of the total cost, minimizing its usage is preferable. Different from existing techniques, we propose two algorithms that can select a feasible concentration value within a given range for cost minimization. Experimental results show that the proposed methods can further reduce the reactant cost up to 20%.

R2-14 (Time: 14:36 - 14:38)

Title	Practical and Accurate SOC Estimation System for Lithium-Ion Batteries by EKF with Adaptive Noise Covariance Estimation
Author	*Lei Lin, Kiyotsugu Takaba, Masahiro Fukui (Ritsumeikan University, Japan)
Page	pp. 152 - 157
Keyword	Battery Management System, Lithium-Ion Battery, SOC Estimation, extended Kalman filter
Abstract	In this paper, we discuss a state-of-charge (SOC) estimation system for lithium-ion batteries using an extended Kalman filter (EKF) with adaptive noise covariance estimation. The EKF can affect a highly effective state estimation using the assuming that prior information about the stochastic characteristics of the system disturbances and observation noises is known. However, it is very difficult to obtain such prior information. The stationary noise covariance estimation method must estimate the noise covariance using the examination data in advance. The noise covariance estimation must use a large examination data. To remedy this problem, we propose herein an adaptive noise covariance estimation method for accurately estimating the SOC from observation data and for verifying the accuracy by examination. However, the noise is not stationary in real system. To resolute this problem in real system, we proposed a method to estimate the noise using moving window. In our experiment, the adaptive noise covariance estimation method’s SOC estimation error can approach to the SOC estimation error using the optimal stationary noise covariance setting.

R2-15 (Time: 14:38 - 14:40)

Title	True Random-Bit Generation Using a Continuous-Time Chaotic Oscillator
Author	*Chatchai Wannaboon, Masayoshi Tachibana (Kochi University of Technology, Japan)
Page	pp. 158 - 161
Keyword	Random-Bit Generator, Chaotic Oscillator
Abstract	This paper presents a true random-bit generation through a continuous-time chaotic oscillator, which provides automatically chaotic signals and is fully implemented on 0.18 CMOS standard technology. Chaotic dynamics of the oscillator are exhibited in terms of chaotic strange attractor in phase-space domain. In order to achieve true-random property, a simple designed of post-processing method is utilized. Finally, the quality of randomness is analyzed through 1,000,000 binary sequences which are verified by statistical test methods and NIST standard tests suite. The proposed system has offered a cost-effective and a compact random-bit generator for computer security applications.

R2-16 (Time: 14:40 - 14:42)

Title	Hardware Acceleration Technique for Radio-resource Scheduler in Ultra-high-density Distributed Antenna Systems
Author	*Yuki Arikawa, Hiroyuki Uzawa, Takeshi Sakamoto, Satoshi Shigematsu (NTT Device Innovation Center, Japan)
Page	pp. 162 - 163
Keyword	5G, Scheduler, Hardware acceleration
Abstract	This paper presents a hardware acceleration technique for the scheduling process in ultra-high-density distributed antenna systems for 5G mobile communications systems. In 5G systems, the overall system throughputs for a huge number of combinations of antennas and user equipment for communications have to be calculated in the scheduling process. In order to accelerate the calculation, the proposed technique calculates the throughputs of each UE simultaneously. Moreover, it obtains the system throughput for combinations at every clock cycle in the pipeline. Experimental results show that the proposed technique performs the calculation of system throughput 60 times faster than without the acceleration. The proposed technique enables a future practical 5G system.

R2-17 (Time: 14:42 - 14:44)

Title	An Overlay Architecture for FPGA-Based Industrial Control Systems Designed with Functional Block Diagrams
Author	*Taisei Segawa, Yuichiro Shibata, Yudai Shirakura, Kenichi Morimoto, Hidenori Maruta, Fujio Kurokawa (Nagasaki University, Japan), Masaharu Tanaka (Mitsubishi Heavy Industries, Japan), Masanori Nobe (Mitsubishi Hitachi Power Systems, Japan)
Page	pp. 164 - 169
Keyword	FPGA, HLS, FBD
Abstract	This paper discusses FPGA implementation of industrial control logic described in a function block diagram (FBD) language. First, we evaluate an approach where FBD descriptions are directly translated to FPGA hardware using a high level synthesis technique. Second, aiming at improving resource utilization efficiency, we proposed an overlay architecture which helps resource sharing of the same arithmetic structure utilized in different control logic sheets. Evaluation results show that the propose architecture can significantly reduce resource requirements per control logic sheet, at a cost of acceptable performance degradation.

Panel
Time: 15:55 - 17:40 Monday, October 24, 2016

D-1 (Time: 15:55 - 17:25)

Title	Past and Future 25 Years of Synthesis and System Integration
Author	Moderator: Isao Shirakawa (University of Hyogo, Japan), Panelists: Giovanni De Micheli (EPFL, Switzerland), Youn-Long Lin, Ren-Song Tsay (National Tsing Hua University, Taiwan), Peter Marwedel (Technical University of Dortmund, Germany), Organizer: Nagisa Ishiura (Kwansei Gakuin University, Japan)
Page	p. 170
Abstract	Since the foundation of SASIMI in 1989, we have experienced tremendous evolutions in the area of electronic design technology, as SASIMI also evolved from “Synthesis And SImulation Meeting and International interchange,” through “The workshop on Synthesis And System Integration of MIxed technologies” to “The workshop on Synthesis And System Integration of Mixed Information technologies.” What have really happened in the last years and what will come in the next 25 years? Starting with the enlightening short talks from the distinguished panelists, we would like to discuss the future of Synthesis and System Integration.

Tuesday, October 25, 2016

Invited Talk
Time: 9:10 - 10:00 Tuesday, October 25, 2016
Chair: Kiyoharu Hamaguchi (Shimane University, Japan)

I2-1 (Time: 9:10 - 10:00)

Title	Wide Bandgap Analog and Mixed-signal IC Design for Advanced Power Electronics
Author	*Alan Mantooth (University of Arkansas, U.S.A.)
Page	p. 171
Abstract	Economy and performance are benefits that come with high power density power electronics, just as in the case of VLSI electronics. High density power electronics require the heterogeneous integration of disparate technologies including power semiconductor devices, driver, protection and control circuitry, passives and voltage isolation techniques into single modules. Such integration activity was central to the Google Little Box Challenge competition conducted last year. One of the keys to advancing power electronic integration has been the commercial reality of wide bandgap power semiconductor devices made from silicon carbide and gallium nitride. The ability to design and manufacture wide bandgap integrated circuits as drivers, controllers, and protection circuitry allows them to be packaged in close proximity to the power device die to minimize parasitics that would adversely impact system performance. These impacts include excessive ringing, noise generation, power loss, and, potentially, self-destruction. This talk will describe the state of the art in wide bandgap analog and mixed-signal IC design including example circuits such as amplifiers, data converters, controllers, and protection circuits and their integration into power electronic platforms. A synthesis tool under development for heterogeneous power circuit layout will be briefly described as a capstone to the application space descriptions.

POSTER III
Time: 10:00 - 11:50 Tuesday, October 25, 2016
Chairs: Wenxing Zhu (Fuzhou University, China), Hideki Takase (Kyoto University, Japan)

R3-1 (Time: 10:00 - 10:02)

Title	Extending Distributed Control for High-Level Synthesis beyond Borders of Basic Blocks
Author	*Miho Shimizu, Nagisa Ishiura (Kwansei Gakuin University, Japan)
Page	pp. 172 - 177
Keyword	high-level synthesis, variable latency units, distributed controller
Abstract	This paper proposes an extension of distributed control, which enables efficient run-time scheduling of variable latency operations, to multiple dataflow graphs. Conventional high-level synthesis methods determine the execution schedule of operations statically assuming that their latencies are fixed. However, actual circuits contain so-called variable latency units whose execution cycles may vary depending on various run-time factors. Our method extends the Del Barrio's distributed control to handle multiple dataflow graphs. It enables dynamic scheduling of operations beyond the boundaries of basic blocks which results in fewer execution cycles than those by conventional centralized control with loop scheduling and trace scheduling.

R3-2 (Time: 10:02 - 10:04)

Title	Proposal of an Efficient Clock-Gating Mechanism for Multi-Core Processors to Reduce Power Supply Noise
Author	*Jun Kawabe, Yoshinori Takeuchi, Jaehoon Yu, Masaharu Imai (Osaka University, Japan)
Page	pp. 178 - 183
Keyword	multicore, reliability, clock gating, power supply noise
Abstract	This paper proposes an efficient clock-gating mechanism to reduce the power supply noise of multi-core processors. The intensity of power supply noise is proportional to the amount of current change and is a critical issue for multi-core processors due to the wide range of current fluctuation. The proposed mechanism is designed to suppress the current change by controlling the clock supply to each core. In experimental results, the proposed method reduced current fluctuation by 37.3% without processing performance degradation.

R3-3 (Time: 10:04 - 10:06)

Title	Automatic Netlist Transformation for WDF-Based Analog Emulator
Author	Hsu-Ping Yang, *Hsin-Ju Hsu, Chun Wang, Chien-Nan Jimmy Liu, Jing-Yang Jou (National Central University, Taiwan)
Page	pp. 184 - 189
Keyword	Wave Digital Filter, Analog Circuit Emulation
Abstract	System verification is still a big challenge for SOC designs with AMS circuits. Unlike well accepted FPGA emulation for digital circuits, there is still no practical solution for the emulation of mixed-signal circuits. Wave digital filter (WDF) based method is a possible approach to emulate analog circuits in digital environment [1]. Based on that technique, this paper proposes an automatic transformation flow to handle the translation from circuit netlist to its corresponding WDF structure, with optimized tree height and number of adaptors. As demonstrated in the circuit examples, the proposed algorithm is able to generate correct WDF structures and reduce the tree height, which can further increase the operation speed of the emulator.

R3-4 (Time: 10:06 - 10:08)

Title	Nonlinear Optimization Solver with Multiple Precision Arithmetic
Author	*Yuya Matsumoto, Hiroshige Dan (Kansai University, Japan)
Page	pp. 190 - 194
Keyword	Nonlinear programming, Multiple precision arithmetic
Abstract	Double precision arithmetic for nonlinear optimization problems (NLP) sometimes fails to solve some ill-posed problems. On the other hand, multiple precision arithmetic has attracted much attention recently as a brute-force method for avoiding numerical errors. In this research, we implemented an optimization solver for NLP by using multiple precision arithmetic and examined the advantage of multiple precision arithmetic for NLP through numerical results.

R3-5 (Time: 10:08 - 10:10)

Title	High Speed Cycle-Accurate Processor Simulation Through Ahead of Time Compilation
Author	*Lovic Gauthier (National Institute of Technology, Ariake College, Japan)
Page	pp. 195 - 200
Keyword	simulator, processor, ahead of time compilation, binary translation, just in time compilation
Abstract	This paper presents techniques for implementing fast cycle-accurate processor simulators based on ahead of time compilation (AoT). AoT is usually assumed to suffer from a large compilation overhead, and is difficult to implement due to the dynamic behavior of some instructions. The paper explains how to overcome these issues and presents experiments with MIPS processor simulators showing that our approach can surpass state of the art methods and can simulate more than one billion clock cycles per second.

R3-6 (Time: 10:10 - 10:12)

Title	Prototype Speed Limit Sign Recognition System Implementation on Rapid Prototyping Platform
Author	*Anh-Tuan Hoang, Takumi Okamoto, Tetsushi Koide (Research Institute for Nanodevice and Bio Systems, Hiroshima University, Japan)
Page	pp. 201 - 202
Keyword	ADAS, traffic sign recognition, FPGA, Rapid Prototyping Platform
Abstract	This paper introduce our prototype speed limit traffic sign recognition system implementation on Rapid Prototyping Platform. The system utilizes simple image feature such as area luminosity difference of grayscale image to detect traffic sign candidates and block histogram feature in binary image to recognize the speed. Combination of those simple traffic sign features helps our algorithm to achieve 100% of accuracy in recognizing speed limit traffic signs in daytime and over 90% in hard lightning condition such as rainy night. Simplicity in computation enables real-time processing (> 30 fps) and relatively small hardware occupied. The design occupies 11,713 slices LUTs (0.95%) and 5,060 slice registers (0.2%) of the hardware available on the Virtex xc7v2000t (1,221,600 slice LUTs and 2,443200 slice registers) on the Rapid Prototyping Platform Protium.

R3-7 (Time: 10:12 - 10:14)

Title	Reconfigurable Processor Array Architecture for Deep Convolutional Neural Networks
Author	*Kota Ando, Kentaro Orimo, Kodai Ueyoshi, Masayuki Ikebe, Tetsuya Asai, Masato Motomura (Hokkaido University, Japan)
Page	pp. 203 - 204
Keyword	Deep Learning, Neural Network, Reconfigurable Architecture, CNN
Abstract	A convolutional neural network (CNN) is a type of neural network that has achieved high accuracy on many tasks like image recognition. Because a CNN requires a large amount of computation, various types of accelerators have been invented. However, the difference between the two types of CNN layers decreases the availability of the accelerators. We propose a flexible and efficient accelerator, where the simple processing elements are parallelized and the data paths are controlled appropriately.

R3-8 (Time: 10:14 - 10:16)

Title	On Component Ratio of RECON Spare Cells for ECO-Friendly Design Style
Author	*Takeshi Sawai, Ayano Takezaki, Tetsuya Hirose, Nobutaka Kuroki, Masahiro Numa (Kobe University, Japan)
Page	pp. 205 - 210
Keyword	ECO, reconfigurable cell, error diagnosis, technology remapping
Abstract	This paper presents an approach to obtain suitable component ratio of 2T/4T/6T-RECON spare cells for the ECO-friendly design style in order to implement the changes caused by ECO’s suppressing increase in the maximum delay time. By using statistics on RECON cell types and logic functions to fix ECO’s, we can determine suitable component ratio of RECON spare cells. Experimental results have shown that the proposed approach is effective to fix post-mask ECO’s reducing increase in the maximum delay time.

R3-9 (Time: 10:16 - 10:18)

Title	Theorem-proving Verification of Multi-clock Synchronous Circuits on Multimodal Logic
Author	*Shunji Nishimura, Motoki Amagasaki, Toshinori Sueyoshi (Kumamoto University, Japan)
Page	pp. 211 - 212
Keyword	formal method, formal verification, theorem proving, multi-clock synchronous circuit
Abstract	Formal verification methods for synchronous circuits are widely used, but almost all of the methods are limited to single-clock synchronous circuits. In this paper, we propose a formal verification method for multi-clock synchronous circuits. The proposed verification method is in theorem-proving manner and based on multimodal logic. We also show an example of verification of a clock switching circuit by using the method.

R3-10 (Time: 10:18 - 10:20)

Title	Hardware Trojan Insertion Difficulties into Synchronous and Asynchronous Circuits
Author	*Masashi Imai (Hirosaki University, Japan), Tomohiro Yoneda (National Institute of Informatics, Japan)
Page	pp. 213 - 218
Keyword	hardware Trojan, asynchronous circuit, majority voted-enable latch
Abstract	Hardware trojan threats have become one of the serious issues in the modern VLSI systems. In this paper, we discuss the differences of hardware trojan insertions between synchronous circuits and asynchronous circuits. We propose a design method to tolerate a hardware trojan insertion in asynchronous circuits using voted-enable-latches and show some evaluation results using 130nm process technology.

R3-11 (Time: 10:20 - 10:22)

Title	A Delay Adjustment Method for Asynchronous Circuits with Bundled-data Implementation Considering a Latency Constraint
Author	*Kazumasa Yoshimi, Hiroshi Saito (The University of Aizu, Japan)
Page	pp. 219 - 224
Keyword	asynchronous circuits, timing constraints, delay adjustment
Abstract	In this paper, we propose a delay adjustment method for asynchronous circuits with bundled-data implementation considering a latency constraint. We modify delay adjustments for hold and idle constraints for an existing method. The experimental results show that the proposed method reduces energy consumption with the reduction of the number of delay adjustments and the reduction of the number of cells used for delay elements.

R3-12 (Time: 10:22 - 10:24)

Title	Path Grouping Approach for Efficient Candidate Selection of Replacing NBTI Mitigation Logic
Author	*Shumpei Morita, Song Bian, Michihiro Shintani, Masayuki Hiromoto, Takashi Sato (Kyoto University, Japan)
Page	pp. 225 - 230
Keyword	NBTI, NBTI mitigation, reliability, aging
Abstract	We propose a fast replacement method of the negative bias temperature instability mitigation logics. In our approach, to accelerate the exploration of a set of gates to be replaced, critical path candidates are partitioned into several groups based on similarity between paths, and only the representative paths of each group are evaluated. Experimental results show the proposed approach speeds up the computation time by 13.5 times while retaining the same level of the mitigation gain compared to the conventional approach.

R3-13 (Time: 10:24 - 10:26)

Title	Semi-Automated Analog Placement based on Margin Tolerances
Author	*Eric Lao, Marie-Minerve Louërat, Jean-Paul Chaput (Laboratoire d'informatique de Paris 6, France)
Page	pp. 231 - 235
Keyword	Analog layout, analog placement, slicing tree, analog automation
Abstract	Digital circuit design is extensively assisted by modern automation tool unlike analog design which is still a manual task because of the complexity of the interactions between devices. This paper presents a semi-automated analog placement based on margin tolerances controlled by the designer by creating analog circuits organized in row similar to digital circuits structure. The results show the ability of our tool at generating multiple layouts respecting designer’s constraints.

R3-14 (Time: 10:26 - 10:28)

Title	An Efficient Gaussian Mixture Reduction to Two Components
Author	Naoya Yokoyama, *Daiki Azuma, Shuji Tsukiyama (Chuo University, Japan), Masahiro Fukui (Ritsumeikan University, Japan)
Page	pp. 236 - 241
Keyword	Gaussian Mixture reduction, Gaussian mixture model, statistical static timing analysis
Abstract	In statistical methods, such as statistical static timing analysis, Gaussian mixture model (GMM) is a useful tool for representing a non-Gaussian distribution and handling correlation easily. In order to repeat various statistical operations such as summation and maximum for GMMs efficiently, the number of components should be restricted around two. In this paper, we propose a method for reducing the number of components of a given GMM to two (2-GMM) such that the mean and the variance of the 2-GMM are equal to those of original GMM and the normalized integral square error of 2-GMM PDF is minimized. In order to demonstrate the performance of the proposed methods, we show some experimental results.

R3-15 (Time: 10:28 - 10:30)

Title	Thermal Circuit Identification of Power MOSFETs through In-Situ Channel Temperature Estimation
Author	*Kazuki Oishi, Michihiro Shintani, Masayuki Hiromoto, Takashi Sato (Kyoto University, Japan)
Page	pp. 242 - 247
Keyword	thermal model, power MOSFET, device model, circuit simulation
Abstract	We propose a novel methodology for identifying thermal equivalent circuit of power MOSFETs. Drain current of the MOSFET is utilized to estimate channel temperature where heat actually generates. Two measurement methods, i.e., constant voltage (CV) method and constant power (CP) method, are proposed wherein waveforms of the input power for the target device are different. Through experiments using a commercial SiC power MOSFET, the accuracy of the proposed method is valideted.

R3-16 (Time: 10:30 - 10:32)

Title	Evaluation of PLL Layouts based on Transistor Array-style
Author	*Atsushi Nanri, Bo Liu, Yuki Miura, Shigetoshi Nakatake (The University of Kitakyushu, Japan)
Page	pp. 248 - 251
Keyword	Transistor array, analog layout, PLL
Abstract	This work proposes an analog characterization module to imitate (emulate) various analog components as essential technologies for analog-specific computation. This module converts an analog input signal into digital by an ADC circuit, makes use of digital signal processing technologies to characterize the signal as expected, and outputs the analog signal converting by a DAC. In this paper, characterizing the proposing module as a resistance, we demonstrate a use case of realizing a programmable gain amplifier by introducing the module.

R3-17 (Time: 10:32 - 10:34)

Title	Single Row Cell Placement Considering Self-aligned Double Patterning
Author	Ye-Hong Chen, *Ting-Chi Wang (National Tsing Hua University, Taiwan)
Page	pp. 252 - 257
Keyword	self-aligned double patterning, cell placement
Abstract	Self-aligned double patterning (SADP) has become one of the promising lithography techniques for advanced nodes. In SADP, overlay violation is a critical issue for fabrication, so how to minimize it is important. Most of existing works focus on layout decomposition and routing, while very few attempts are on placement. In this paper, we consider a single-row cell placement problem, where overlay violation is minimized by two placement techniques, white space insertion and cell flipping. Experimental results show our methods can effectively reduce overlay violation.

R3-18 (Time: 10:34 - 10:36)

Title	A Lithium Ion Battery Aging Simulator with Calibration Functions
Author	*Yukinori Hayakawa, Lei Lin, Masahiro Fukui (Ritsumeikan University, Japan)
Page	pp. 258 - 263
Keyword	Lithium Ion Battery, Aging Simulator, Calibration
Abstract	This paper discusses a practical degradation simulator for assembled battery of Lithium ion batteries. The degradation simulator has calibration function. The function is used to various conditions based on the measured vales. As a result, the accuracy of degradation simulator is improved.

R3-19 (Time: 10:36 - 10:38)

Title	A Full Charge Capacity Estimation Algorithm for Li-ion Batteries Based on Recursive Least-Squares Identification with Adaptive Forgetting Factor Tuning
Author	*Hironori Ono, Lei Lin, Masahiro Fukui, Kiyotsugu Takaba (Ritsumeikan University, Japan)
Page	pp. 264 - 267
Keyword	Battery Management System, Li-ion Battery, System Identification
Abstract	This paper discuss an Full Charge Capacity (FCC) estimation system for lithium ion batteries based on the recursive least-squares identification. The accuracy of the estimation is depended on accuracy of the state of charge (SOC) estimation. We have newly formulated the adaptive forgetting factor for recursive least-squares identification. As the result, the error rate of the SOC estimation has been improved. The evaluation shows that the new FCC estimation system can be used for various temperature conditions.

R3-20 (Time: 10:38 - 10:40)

Title	A Hardware Architecture to Perform K-means Clustering for Learning-Based Super-Resolution Combining Self-Learning and Prior-Learning Dictionaries
Author	*Daichi Murata, Ayumi Kiriyama, Tetsuya Hirose, Nobutaka Kuroki, Masahiro Numa (Kobe University, Japan)
Page	pp. 268 - 273
Keyword	learning-based super-resolution, self-learning dictionaries, hardware architecture, resource sharing, hardware system design
Abstract	This paper presents a hardware architecture to perform K-means clustering for learning-based super-resolution combining self-learning and prior-learning dictionaries. Using PCA (Principal Component Analysis) and a k-d tree is the key to save hardware resources and processing time. Moreover, we propose a technique to save resources by sharing computing elements. Experimental results have shown that our architecture reduces 63% ALUT (Adaptive Look Up Table) area, and speeds up more than 5.8 times as fast as a conventional circuit.

R3-21 (Time: 10:40 - 10:42)

Title	On-Chip Temperature Sensing using a Reconfigurable Ring Oscillator
Author	*Tadashi Kishimoto, Hidetoshi Onodera (Kyoto University, Japan)
Page	pp. 274 - 279
Keyword	Leakage Current, on-chip sensor, temperature, reconfigurable, MOSFET
Abstract	This paper proposes a temperature monitoring scheme using a reconfigurable ring oscillator that has been proposed to estimate process variation. New circuit configurations, whose delay characteristics are sensitive to leakage current, are proposed to exploit the exponential dependence of the leakage current to temperature. Based on transistor-level simulation assuming a 65 nm process technology, the oscillation frequency of the proposed circuit topology shows the temperature sensitivity of 5.0 %/℃ at 20 ℃ and 2.9 %/℃ at 80 ℃ and low voltage sensitivity of 0.28 ℃/10 mV at the supply voltage of 0.9 V and the temperature of 25 ℃.

R3-22 (Time: 10:42 - 10:44)

Title	A Shift HSV Algorithm for a Low-Power Monitoring System using an FPGA toward Internet of Things Agriculture
Author	Takahisa Kurose (Ehime University, Japan), *Hiroki Nakahara, Shimpei Sato (Tokyo Institute of Technology, Japan), Tetsuo Morimoto (Ehime University, Japan)
Page	pp. 280 - 281
Keyword	FPGA, IoT, Low Power Design
Abstract	An agriculture monitoring system observes growth of agricultural crops. It requires high-performance with a battery drivable system. To satisfy them, we use an FPGA, and realize a shift operation based HSV converter. Although the proposed shift-based HSV converter causes 8.4\% error compared with the original HSV one, its power consumption is 26.51 times smaller than the original one.

Invited Talk
Time: 13:20 - 14:10 Tuesday, October 25, 2016
Chair: Kiyoharu Hamaguchi (Shimane University, Japan)

I3-1 (Time: 13:20 - 14:10)

Title	The Challenges and Future of Electronic-System Level Design Automation
Author	*Ren-Song Tsay (National Tsing Hua University, Taiwan)
Page	p. 282
Abstract	In this talk, the speaker will introduce new concepts of software/hardware system abstractions for effective system modeling and design automation. As IC design automation reached its peak at the turn of the century, the industry unsurprisingly asks for upgrade to electronic system-level design automation. However, the complexity and diversity, particularly the inclusion of both software and hardware components, of system designs are of huge challenges. In contrast to the successful abstractions of transistor-, gate- and RTL-level designs, the traditional system-level transaction models and function-time models seem to be of limited use. It is concluded that proper characterizations of system behaviors, particularly inter-component interactions (shared-data accesses, signals, interrupts, etc.) and timing behaviors (bus contentions, cache misses, preemptions, etc.) are identified to be keys to system modeling and designs.

POSTER IV
Time: 14:10 - 16:00 Tuesday, October 25, 2016
Chairs: Tsuyoshi Matsumoto (Ishikawa Tech. College, Japan), Xin Jin (Tsinghua University, China)

R4-1 (Time: 14:10 - 14:12)

Title	Mathematical Algorithm Hardware Description Languages for System Level Modeling
Author	Ryo Hikawa, *Ryuji Kishimoto, Takashi Kambe (Kindai University, Japan)
Page	pp. 283 - 284
Keyword	mathematical algorithm description language, HDLMath, System Level Modeling
Abstract	Mathematical modeling is an important approach for both solving problems and visualizing the abstract concepts involved in system and/or products. Thus a mathematical algorithm description language (HDLMath) should be capable of describing and verifying the entire behavior using mathematical algorithms of electronic systems. In this paper, the functional requirements of HDLMaths are proposed and several current HDLMaths are compared from a design viewpoint.

R4-2 (Time: 14:12 - 14:14)

Title	High-Level Synthesis of Embedded Systems Controller from Erlang
Author	Hinata Takebayashi, Nagisa Ishiura, *Kagumi Azuma (Kwansei Gakuin University, Japan), Nobuaki Yoshida, Hiroyuki Kanbara (ASTEM RI, Japan)
Page	pp. 285 - 290
Keyword	high-level synthesis, Erlang, embedded system
Abstract	This article presents a method of specifying the behavior of embedded systems by a subset of Erlang, from which RTL hardware is synthesized. Assembly codes of the BEAM virtual machine compiled from Erlang programs are converted into CDFGs (control dataflow graphs), which are synthesized into Verilog HDL by the back-end of the high-level synthesizer ACAP. A prototype system based on the proposed method implemented in Perl5 has successfully synthesized a simple two-process Erlang program into logic-synthesizable Verilog HDL codes.

R4-3 (Time: 14:14 - 14:16)

Title	A Data Effect Aware Microcomponent-Based Estimation Approach for Accurate System-Level Memory Device Power Evaluation
Author	*Chi-Kang Chen (Industrial Technology Research Institute and National Tsing Hua University, Taiwan), Hsin-I Wu, Chi-Ting Hsiao, Ren-Song Tsay (National Tsing Hua University, Taiwan)
Page	pp. 291 - 296
Keyword	Memory, Power, DRAM, Microcomponent
Abstract	As memory is a major power dominant, we propose a highly efficient microcomponent-based approach with data-aware refinement for accurate system-level power estimation. The proposed method pre-calibrates the power consumption pattern of each identified microcomponent for power simulation. To achieve high accuracy, the data variation effect is considered and a simple interpolation technique is proposed to further boost accuracy. The proposed approach produces accurate results of less than 2% error rate in average for system-level power analysis.

R4-4 (Time: 14:16 - 14:18)

Title	Analysis of Co-Controlling Voltage/Frequency of Cores and DRAMs of Chip Multi-Processors with 3D-stacked DRAMs for Thermal Management
Author	*Yi-Jung Chen (National Chi Nan University, Taiwan), Chia-Lin Yang, Ping-Sheng Lin, Yi-Chang Lu (National Taiwan University, Taiwan)
Page	pp. 297 - 302
Keyword	3D-stacked DRAMs, Chip-MultiProcessors, Thermal Management
Abstract	Thermal control is a critical issue to Chip-Multiprocessors (CMPs) with 3D-stacked DRAMs due to its high power density. Existing thermal managements for 3D ICs all perform thermal control on cores only because lowering the power-level of cores can also lower DRAM access frequency. However, as the power consumption of single DRAM access increases with the number of DRAM stacks and the width of the vertical links, reducing the power consumption per DRAM access as well is also crucial. In this paper, we characterize the thermal and performance behavior of the target architecture when the voltage and frequency levels of cores and DRAMs are synergistically controlled.

R4-5 (Time: 14:18 - 14:20)

Title	Introducing Real Constraints in Partitioned ILP-Based Binding in High-Level Synthesis
Author	Nagisa Ishiura, *Yuuki Oosako (Kwansei Gakuin University, Japan)
Page	pp. 303 - 304
Keyword	high-level synthesis, binding, partitioned ILP, real linear constraints
Abstract	This paper presents an efficient ILP-based method of binding in high-level synthesis. The binding problem is formultated as partitioned ILP where linear inequations of the other unsolved portions without integer constraints are added.

R4-6 (Time: 14:20 - 14:22)

Title	A Framework for Automatic Generation of Application-Specific FPGA-based SoC
Author	*Tetsuo Miyauchi, Kiyofumi Tanaka (Japan Advanced Institute of Science and Technology, Japan)
Page	pp. 305 - 310
Keyword	real-Time, multicore, RTOS, application-specific, configuration
Abstract	As IoT or CPS devices/systems increase, efficient, cost effective real-time embedded systems are getting important. For providing various highly application-specific systems, we are developing a design environment based on a framework for automatic generation of application-specific FPGA-based SoC. Use of the environment makes it possible for designers to automatically generate a target system design which is highly adapted to the application. Our targets for optimization/customization are multicore processors, real-time operating systems, and acceleration hardware in FPGA.

R4-7 (Time: 14:22 - 14:24)

Title	Fast Song Searching by Simultaneous Execution of HiFP2.0 and Staged LSH
Author	*Masahiro Fukuda, Yasushi Inoguchi (Japan Advanced Institute of Science and Technology, Japan)
Page	pp. 311 - 316
Keyword	Audio Fingerprint, HiFP, Staged LSH, FPGA
Abstract	Fingerprinting techniques are generally used to search a song quickly. In this paper, the fingerprint generation method HiFP2.0 and the identification method Staged LSH are combined and executed almost simultaneously. This method reduced around 3600 clock cycles and was about 8.54 % faster than the sequential execution of them in the case that the song of the query was a bit distorted by the lossy compression of MP3.

R4-8 (Time: 14:24 - 14:26)

Title	An Error Diagnosis Technique Based on Averaged EPI Values to Extract Error Locations Sets
Author	*Ayano Takezaki, Takeshi Sawai, Hiroyuki Sakamoto, Tetsuya Hirose, Nobutaka Kuroki, Masahiro Numa (Kobe University, Japan)
Page	pp. 317 - 322
Keyword	error diagnosis, ECO, EPI, error location set
Abstract	This paper presents an error diagnosis technique based on averaged EPI (Error Possibility Index) values to extract error location sets. Each EPI corresponds to the controllability of a location to an inconsistent primary output for an input pattern. By averaging EPI values for each location, we can reduce the number of initial sets. Experimental results have shown that the proposed technique is effective in reducing error location sets and processing time for screening them.

R4-9 (Time: 14:26 - 14:28)

Title	Minimum Energy Point Tracking under a Wide Range of PVT Conditions
Author	*Shu Hokimoto, Tohru Ishihara, Hidetoshi Onodera (Kyoto University, Japan)
Page	pp. 323 - 328
Keyword	Minimum Energy Point Tracking, Dynamic Voltage and Frequency Scaling, Adaptive Body Basing
Abstract	Scaling the supply voltage (VDD) and threshold voltage (VTH) for minimizing the energy consumption of processors dynamically is highly desired for applications such as wireless sensor network and Internet of Things (IoT). In this paper, we refer to the pair of VDD and VTH , which minimizes the energy consumption of the processor under a given operating condition, as a minimum energy point (MEP in short). Since the MEP is heavily dependent on PVT (Process, Voltage and Temperature) conditions, it is not very easy to closely track the MEP at runtime. The voltage condition in this work is assumed to arise due to dynamic voltage and frequency scaling (DVFS) along with a change of the performance requirement. This paper proposes a simple but effective algorithm for dynamically tracking the MEP of a processor under a wide range of PVT conditions. Gate-level simulation for a 32-bit RISC processor designed with a 65nm process demonstrates that the proposed algorithm tracks the MEP under a situation that PVT conditions widely vary.

R4-10 (Time: 14:28 - 14:30)

Title	Comparison of Area-Delay-Energy Characteristics between General Purpose Processors and Dedicated Hardwares for Embedded Applications
Author	*Kei Yoshizawa, Tohru Ishihara, Hidetoshi Onodera (Kyoto University, Japan)
Page	pp. 329 - 334
Keyword	Dedicated hardware, General Purpose Processor, Comparison, Energy-efficiency
Abstract	We examine the performance difference between dedicated hardwares and a general purpose processor with respect to energy efficiency, area, and delay, using three applications. We show that the main difference comes from the number of cycles required for each application. In ADPCM, a dedicated hardware is 2,266 times energy-efficient and 2,302 times faster than a general purpose processor with a similar amount of area. However in DCT, which contains less branch codes, the advantage of a dedicated hardware in the energy efficiency and processing speed decreases to 127 times and 1,263 times, respectively. We also observed in ADPCM that the advantage of the dedicated hardware over the general purpose processor in energy efficiency and speed is reduced to 425 times and 431 times respectively if the processor is equipped with a multiplier. It is observed that the performance advantage of dedicated hardwares varies very much depends on applications and the hardware resources of a general purpose processor. Lastly, we analyze the processing cycles of the general purpose processor and it is revealed that a considerable amount of processing cycles are consumed by stalls and multiplications. For example in ADPCM, 39.7% of the processing cycles are lost by stalls and 43.4% by multiplications while the rest of 16.9% are consumed by sequential operations of the ADPCM algorithm that could be done in parallel in a dedicated hardware.

R4-11 (Time: 14:30 - 14:32)

Title	Finding Effective Simulation Patterns for Coverage-Driven Verification Using Deep Learning
Author	*Mami Miyamoto, Kiyoharu Hamaguchi (Shimane University, Japan)
Page	pp. 335 - 340
Keyword	RTL verification, Deep Learning, coverage-driven verification, automated testbench
Abstract	In Coverage-driven verification of RT/gate-level designs, one of important tasks is to cover each cover point with fairly a large number of new simulation patterns. We propose a method to learn features of simulation patterns by deep learning, and to find simulation patterns covering a hard-to-cover point based on reconstruction errors. The experimental results show that the proposed method is efficient in finding effective simulation patterns.

R4-12 (Time: 14:32 - 14:34)

Title	Static Timing Analysis of Rapid Single-Flux-Quantum Circuits
Author	*Takahiro Kawaguchi, Kazuyoshi Takagi, Naofumi Takagi (Kyoto University, Japan)
Page	pp. 341 - 345
Keyword	RSFQ, static timing analysis
Abstract	We propose a method for calculating pulse arrival timing at all gates in an RSFQ circuit and a new de nition of timing slacks of gates. In the pro- posed method, the total path delay, the total length of PTLs and the number of PTL transmitters/receivers on a path are also calculated.

R4-13 (Time: 14:34 - 14:36)

Title	Improved Method of Simulated Annealing for Unreachable Solution Space
Author	*Hiroyuki Nakano, Kunihiro Fujiyoshi (Tokyo University of Agriculture and Technology, Japan)
Page	pp. 346 - 351
Keyword	Simulated Annealing, Making Adjacent Solutions, Solution Space, Reachability
Abstract	Simulated Annealing is a universal probabilistic metaheuristic for optimization problems of locating a good approximation to the global minimum of given function in a large solution space. It is sometimes used for physical design problems. However, Simulated Annealing is known to be inefficient when it searches solution spaces containing infeasible solutions. In this paper, we propose two methods to make adjacent solutions for such solution spaces. Experimental comparisons indicate the effectiveness of the proposed methods.

R4-14 (Time: 14:36 - 14:38)

Title	Application of Monte-Carlo Tree Search to Traveling-Salesman Problem
Author	Masato Shimomura, *Yasuhiro Takashima (University of Kitakyushu, Japan)
Page	pp. 352 - 356
Keyword	Monte-Carlo Tree Search, Traveling-Salesman Problem, Optimization
Abstract	This paper shows an application of Monte-Carlo Tree Search (MCT) to Traveling-Salesman Problem (TSP). Compared with the simulated annealing, which is one of the general probabilistic optimization methods, MCT has very high ability of optimization with problem-aware implementation. Its efficiency is confirmed, empirically.

R4-15 (Time: 14:38 - 14:40)

Title	Analog Characterization Module with D/A Converter Configuration
Author	*Daishi Isogai, Bo Liu, Futa Yoshinaka, Shigetoshi Nakatake (The University of Kitakyushu, Japan)
Page	pp. 357 - 361
Keyword	FPAA, DAC, Opamp
Abstract	This work proposes an analog characterization module to imitate (emulate) various analog components as essential technologies for analog-specific computation. This module converts an analog input signal into digital by an ADC circuit, makes use of digital signal processing technologies to characterize the signal as expected, and outputs the analog signal converting by a DAC. In this paper, characterizing the proposing module as a resistance, we demonstrate a use case of realizing a programmable gain amplifier by introducing the module.

R4-16 (Time: 14:40 - 14:42)

Title	Range Limiter using Connection Bounding Box for SA-based Placement of Mixed-Grained Reconfigurable Architecture
Author	*Takashi Kishimoto (Ritsumeikan University, Japan), Wataru Takahashi, Kazutoshi Wakabayashi (NEC Corporation, Japan), Hiroyuki Ochi (Ritsumeikan University, Japan)
Page	pp. 362 - 367
Keyword	Simulated annealing
Abstract	In this paper, we propose a novel placement algorithm for mixed-grained reconfigurable architectures (MGRAs). MGRA consists of coarse-grained and fine-grained clusters, in order to implement a combined digital systems of high-speed data paths with multi-bit operands and random logic circuits for state machines and bit-wise operations. For accelerating simulated annealing based FPGA placement algorithm, range limiter has been proposed to control the distance of two blocks to be interchanged. However, it is not applicable to MGRAs due to the heterogeneous structure of MGRAs. Proposed range limiter using connection bounding box effectively keeps the size of range limiter to encourage moves across fine-grain blocks in non-adjacent clusters. From experimental results, the proposed method achieved 47.8% reduction of cost in the best case compared with conventional methods.

R4-17 (Time: 14:42 - 14:44)

Title	A Smart Hybrid Memetic Algorithm for Thermal-Aware Non-Slicing Floorplanning
Author	*Jianli Chen, Yan Liu, Ziran Zhu, Wenxing Zhu (Fuzhou University, China)
Page	pp. 368 - 373
Keyword	Floorplanning, memetic
Abstract	Floorplanning is a crucial design step in ASIC design flow. It provides valuable insights into the hardware decisions and estimates a floorplan with different cost metrics. In this paper, in order to handle a multi-objective non-slicing floorplanning problem efficiently, a smart hybrid memetic algorithm is presented to optimize the area, the total wirelength, and the maximum temperature and the average temperature of a chip. In the proposed method, an effective genetic algorithm is used to explore the search space, and an efficient modified hybrid simulated annealing algorithm is used to exploit information in the search region. The exploration and exploitation are balanced by a death probability strategy. In this strategy, according to the natural mechanisms, each individual in the population is endowed with an actual age and a dynamic survival age. Experimental results on the standard tested benchmarks show that the proposed algorithm is efficient to obtain floorplans, with decreasing the average and the peak temperature.

R4-18 (Time: 14:44 - 14:46)

Title	Hardware Acceleration of Rate-Distortion Optimized Quantization Algorithm
Author	*Yusuke Funayama, Takashi Kambe (Kindai University, Japan), Gen Fujita (Osaka Electro-Communication University, Japan)
Page	pp. 374 - 375
Keyword	Rate-distortion optimized quantization, Hardware Acceleration, video coding, high-level synthesis
Abstract	Rate-distortion optimized quantization (RDOQ) is an important technology for improving video coding performance. RDOQ is able to determine the optimal value among multiple quantization candidates based on rate-distortion (RD). We propose a hardware acceleration method to the algorithm to reduce its complexity by changing the bit-rate estimation method and by excluding low scored quantization candidates. The hardware design results are also evaluated.

R4-19 (Time: 14:46 - 14:48)

Title	Development of an Optimal Wireless Power Transfer System for Lithium-Ion Battery Charge
Author	*Yuto Honda, Lei Lin, Masahiro Fukui (Ritsumeikan University, Japan)
Page	pp. 376 - 381
Keyword	Wireless power transmission, charging, Lithium-ion battery
Abstract	We describe the development of the optimal wireless power transfer system for the Lithium-Ion battery charge. The proposed system has the DC-DC converter on both of the primary side and the secondary side. This system improves the transmission efficiency when the duty ration of the DC-DC converter on the primary side. This is because the value of the pretense load is optimum value. In addition, the DC-DC converter on the secondary side optimizes the charging current and voltage according to the state of the battery. Thus, this system can establish both of the improvement of transmission efficiency and optimum charging for the lithium-ion battery.

R4-20 (Time: 14:48 - 14:50)

Title	Design of a Fast Lock-in and Low-Power All-Digital Frequency Synthesizer with a Wide Tuning Range
Author	*Hao-Chiao Hong, Hung-Yi Wen (National Chiao Tung University, Taiwan), Hong-Yi Huang (National Taipei University, Taiwan)
Page	pp. 382 - 385
Keyword	ADFLL, Frequency synthesizer, DCO
Abstract	This paper presents the design of a low-power and fast-locking all-digital frequency lock loop (ADFLL) with a wide tuning range. The ADFLL adopts the Regula Falsi method to conduct the frequency acquisition to achieve a short lock-in time. The digitally control oscillator (DCO) comprises of an R-2R DAC driving a voltage-controlled ring oscillator to save the power consumption and to achieve a linear transfer function to facilitate the short lock-in time requirement of the Regula Falsi method. A test chip has been fabricated in 90-nm CMOS. Measurement results show that the output frequency of the ADFLL ranges from 1.20 GHz to 6.95 GHz. At an output frequency of 5.00 GHz, the frequency acquisition process takes only five reference clock cycles. Meanwhile, the measured rms and pk-pk jitters are 1.5 ps and 15 ps, respectively. The proposed ADFLL achieves a power efficiency of 1.88 mW/GHz at 5.00 GHz and 1 V.

R4-21 (Time: 14:50 - 14:52)

Title	A Method for Recognizing a Breaking Sound of a Window Glass for Realizing a Low-power Security Surveillance System Using FPGA
Author	*Ryo Terafuji, Hiroyuki Ochi (Ritsumeikan University, Japan)
Page	pp. 386 - 390
Keyword	sound recognition, wavelet transform, support vector machine, low power design, FPGA implementation
Abstract	This paper proposes a sound recognition method specialized to a breaking sound of a window glass to realize a security surveillance system that operates for a long time with battery. Our method analyzes the input sound using wavelet transform, and recognizes the sound using linear SVM. From simulation experiments, 90 % recognition ratio is achieved. We also propose an FPGA implementation of wavelet transform which enables us real-time processing at 3.3 MHz clock frequency using 6k-words ROM and 17 multipliers.

R4-22 (Time: 14:52 - 14:54)

Title	Electromagnetic Analysis for a Lightweight Block Cipher Simon
Author	*Yusuke Nozaki, Yoshiya Ikezaki, Masaya Yoshikawa (Meijo University, Japan)
Page	pp. 391 - 396
Keyword	Security, Electromagnetic analysis, Lightweight block cipher, Simon, Tamper resistance
Abstract	Lightweight block ciphers, which are suitable for IoT devices, have attracted attention. Regarding the security of cryptographic circuit, the risk of electromagnetic analysis is pointed out. However, electromagnetic analysis of lightweight block ciphers has barely been studied. Therefore, this study proposes a new dedicated electromagnetic analysis for Simon which is one of the most popular lightweight block ciphers. Experiments using a FPGA prove the validity of the proposed method and the vulnerability of Simon against electromagnetic analysis.

Invited Talk
Time: 16:00 - 16:50 Tuesday, October 25, 2016
Chair: Mineo Kaneko (Japan Advanced Institute of Science and Technology, Japan)

I4-1 (Time: 16:00 - 16:50)

Title	Qualitative-Modeling-Based Design for Silicon Neuronal Networks
Author	*Takashi Kohno (The University of Tokyo, Japan)
Page	p. 397
Abstract	Silicon neuronal network is a bottom-up approach to the neuro-mimetic systems, which are gaining prominence as a new approach to realize power-efficient, adaptive, and intelligent information processing systems with massively parallel architecture. It is a network of silicon neuron circuits which emulate the electrophysiological activities in neuronal cells in real time. A variety of neuronal cells with complex activities is observed in the nervous system. A class of neuron models that describes the dynamics of ionic particles near the cell membrane can precisely model these activities, but it is described by multivariable nonlinear differential equations. To realize power-efficient and simple silicon neuron circuits that reproduce the nonlinearity in these equations, we developed a new circuit design approach on the basis of nonlinear mathematical techniques that have been utilized in the qualitative neuronal modeling. This approach can also be applied to tune the dynamical properties of the neuronal activities after circuit fabrication and support a variety of neuronal activities.

SASIMI 2016 The 20th Workshop on Synthesis And System Integration of Mixed Information Technologies Technical Program

Session Schedule

List of papers

SASIMI 2016
The 20th Workshop on Synthesis And System Integration of Mixed Information Technologies
Technical Program