Publication Type:

Conference Paper


ICECT 2011 - 2011 3rd International Conference on Electronics Computer Technology, Volume 3, Kanyakumari, p.330-333 (2011)





Asynchronous sequential logic, Chip Multiprocessor, Clocks, Communication, control processor, Discrete cosine transforms, Discrete cosines, energy conservation, Energy utilization, First in first outs, Globally asynchronous locally synchronous, Interactive computer systems, Low Power, Microprocessor chips, multiple clock, Multiprocessing systems, Real time systems, Systems analysis, tri state buffer


Globally Asynchronous Locally Synchronous (GALS) Chip multiprocessors with separate clocks for separate modules inside the chip are highly suited for processing number crunching and processing jobs that have to be done with limited energy. The GALS uses clock and voltage scaling jointly in system sub-modules to achieve low energy consumption rates. An advantage here is feasibility to use different clock rates for different modules in the chip. GALS allows up to 25 % energy savings in addition to clock as well as voltage scalability. However, a chronic drawback of GALS is additional communication latency between the various clock domains. In addition the component processors consume power even when they are idle. Latency to a great extent is reduced by implementing large inter-processor FIFOs buffers [3]. This work proposes to enhance the latency minimization by alternately enhance one processor domain to optimally manage latency and power wastage. Real Time System algorithms are used for managing the inter clock domain communication. This technique can be used to either enhance the FIFO buffer technique if area is not a consideration where only the efficiency is considered or as a standalone manager for handling the inter clock domain communications efficiently with reduced area and increase resource handling capability. Tri state buffers are put to use to switch clock and supply voltage to individual clock domains. For performance evaluation, Component processors to find the DCT were implemented with FIFO in between the modules for communication between processors and in turn reduce latency. Comparison of performance of the latency management processor enhanced GALS chip with FIFO buffer chip revealed increase of 15% throughput and 40% energy savings approximately as compared to 10% and 25% for the latter. © 2011 IEEE.


cited By (since 1996)0; Conference of org.apache.xalan.xsltc.dom.DOMAdapter@2830f9ff ; Conference Date: org.apache.xalan.xsltc.dom.DOMAdapter@276ce8f6 Through org.apache.xalan.xsltc.dom.DOMAdapter@5f3282b2; Conference Code:85884

Cite this Research Publication

A. S. Menon, Gini, J. R., Aishwarya, B., Balaji, C. C. G., Jaswanth, R., and Krishnadas, A., “Optimization of GALS CMP architecture with DCT as case study”, in ICECT 2011 - 2011 3rd International Conference on Electronics Computer Technology, Kanyakumari, 2011, vol. 3, pp. 330-333.