Antonio Gonzalez

Research Interests:
Parallel Processing, Genetic Algorithm, and Loop Tiling

ABSTRACT Register files are becoming one of the critical components of current out-of-order processors in terms of delay and power consumption, since their potential to exploit instruction-level parallelism is closely related to the size... more

ABSTRACT Register files are becoming one of the critical components of current out-of-order processors in terms of delay and power consumption, since their potential to exploit instruction-level parallelism is closely related to the size and number of ports of the register file. In conventional register-renaming schemes, register releasing is conservatively done only after the instruction that redefines the same register is committed. Instead, we propose a scheme that releases registers as soon as the processor knows that there will be no further use of them. We present two early releasing hardware implementations with different performance/complexity trade-offs. Detailed cycle-level simulations show either a significant speedup for a given register file size, or a reduction in register file size for a given performance level.

Research Interests:
Distributed Computing

Research Interests:
Low Complexity

Download (.pdf)

Research Interests:
Computer Architecture, High Performance Computing, Computer Networks, Computer Hardware, Fault Tolerance, and 6 moreMicroarchitecture, Computer Languages, Prototypes, Fault Tolerant, System performance, and Electrical And Electronic Engineering

Download (.pdf)

Research Interests:
Compilers

ABSTRACT The trend of downsizing transistors and operating voltage scaling has made the processor chip more sensitive against radiation phenomena making soft errors an important challenge. New reliability techniques for handling soft... more

ABSTRACT The trend of downsizing transistors and operating voltage scaling has made the processor chip more sensitive against radiation phenomena making soft errors an important challenge. New reliability techniques for handling soft errors in the logic and memories that allow meeting the desired failures-in-time (FIT) target are key to keep harnessing the benefits of Moore&#39;s law. The failure to scale the soft error rate caused by particle strikes, may soon limit the total number of cores that one may have running at the same time. This paper proposes a light-weight and scalable architecture to eliminate silent data corruption errors (SDC) and detected unrecoverable errors (DUE) of a core. The architecture uses acoustic wave detectors for error detection. We propose to recover by confining the errors in the cache hierarchy, allowing us to deal with the relatively long detection latencies. Our results show that the proposed mechanism protects the whole core (logic, latches and memory arrays) incurring performance overhead as low as 0.60%.

ABSTRACT The vulnerability of the current and future processors towards transient errors caused by particle strikes is expected to increase rapidly because of exponential growth rate of on-chip transistors, the lower voltages and the... more

ABSTRACT The vulnerability of the current and future processors towards transient errors caused by particle strikes is expected to increase rapidly because of exponential growth rate of on-chip transistors, the lower voltages and the shrinking feature size. This encourages innovation in the direction of finding new techniques for providing robustness in logic and memories that allow meeting the desired failures in-time (FIT) budget in future chip multiprocessors (CMPs) present in embedded systems. In embedded systems two aspects of robustness, error detection and containment, are of paramount importance. This paper proposes a light-weight and scalable architecture that uses acoustic wave detectors for error detection and contains errors at the core level. We show how selectively applying error containment can reduce the number of detectors required for error containment. We observe that by using 17 detectors we can achieve error containment coverage of 97.8%.

ABSTRACT Evaluation of computing systems reliability must be accurate enough to provide hints for the required fault protection mechanisms that will guarantee correctness of operation at acceptance costs. To be useful, reliability... more

ABSTRACT Evaluation of computing systems reliability must be accurate enough to provide hints for the required fault protection mechanisms that will guarantee correctness of operation at acceptance costs. To be useful, reliability evaluation must be performed early enough in the design cycle when, however, the available details of the system are largely unknown. This inherent contradiction in terms: early vs. accurate, requires a cross-layer approach for reliability evaluation. Different layers of abstraction contribute differently in the overall system reliability; if this contribution can be assessed independently, the reliability of the system can be evaluated at the early stages of the design. We review the state-of-the-art in the area and discuss corresponding challenges .

Download (.pdf)

Research Interests:
Instruction Set Architecture and Data Dependence

Download (.pdf)

Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency and processor cycle time. These in-flight instructions are typically stored in... more

Current processors require a large number of in-flight instructions in order to look for further parallelism and hide the increasing gap between memory latency and processor cycle time. These in-flight instructions are typically stored in centralized structures called reorder buffer (ROB), which is a centerpiece to handle precise exceptions and recover a safe state in the event of a branch

Research Interests:
Cycle Time and Compression

Research Interests:
Code Optimization

... Intel Barcelona Research Center, Intel Labs, Universitat Politècnica de Catalunya, Barcelona (Spain) {martix.torrents, raul.martinez, pedro.lopez, josep.m ... This prefetcher divides the memory space in different regions and memorizes... more

... Intel Barcelona Research Center, Intel Labs, Universitat Politècnica de Catalunya, Barcelona (Spain) {martix.torrents, raul.martinez, pedro.lopez, josep.m ... This prefetcher divides the memory space in different regions and memorizes the memory misses in each one of them. ...

Research Interests:
Simulation

Research Interests:
Analytical Model, Indexation, and Communication Cost

Download (.pdf)

ABSTRACT This paper presents a novel parallel implementation of Prolog. The system is based on Multipath, a novel execution model for Prolog that implements a partial breadth-first search of the SLD-tree. The paper focusses on the type of... more

ABSTRACT This paper presents a novel parallel implementation of Prolog. The system is based on Multipath, a novel execution model for Prolog that implements a partial breadth-first search of the SLD-tree. The paper focusses on the type of parallelism inherent to the execution model, which is called path parallelism. This is a particular case of data parallelism that can be efficiently exploited in a SPMD architecture. A SPMD architecture oriented to the Multipath execution model is presented. A simulator of such system has been developed and used to assess the performance of path parallelism. Performance figures show that path parallelism is effective for non-deterministic programs

Research Interests: Parallel Processing, Genetic Algorithm, and Loop Tiling<div>()</div>

Research Interests: Distributed Computing<div>()</div>

Research Interests: Low Complexity<div>()</div>

Research Interests: Compilers<div>()</div>

Research Interests: Instruction Set Architecture and Data Dependence<div>()</div>

Research Interests: Cycle Time and Compression<div>()</div>

Research Interests: Code Optimization<div>()</div>

Research Interests: Simulation<div>()</div>

Research Interests: Analytical Model, Indexation, and Communication Cost<div>()</div>

Log In

Research Interests:
Parallel Processing, Genetic Algorithm, and Loop Tiling

Research Interests:
Distributed Computing

Research Interests:
Low Complexity

Research Interests:
Compilers

Research Interests:
Instruction Set Architecture and Data Dependence

Research Interests:
Cycle Time and Compression

Research Interests:
Code Optimization

Research Interests:
Simulation

Research Interests:
Analytical Model, Indexation, and Communication Cost