Author: Juan, Toni : Search

research-article

Open Access

Architecting a hardware-managed hybrid DIMM optimized for cost/performance

MEMSYS '18: Proceedings of the International Symposium on Memory SystemsPages 327–340https://doi.org/10.1145/3240302.3240303

Rapidly evolving workloads and exploding data volumes place great pressure on data-center compute, IO, and memory performance, and especially on memory capacity. Increasing memory capacity requires a commensurate reduction in memory cost per bit. DRAM ...

Article

Reusing cached schedules in an out-of-order processor with in-order issue logic

ICCD'09: Proceedings of the 2009 IEEE international conference on Computer designPages 246–253

The complex and powerful out-of-order issue logic dismisses the repetitive nature of the code, unlike what caches or branch predictors do. We show that 90% of the cycles, the group of instructions selected by the issue logic belongs to just 13% of the ...

research-article

Larrabee: A Many-Core x86 Architecture for Visual Computing

IEEE Micro (IMIC), Volume 29, Issue 1Pages 10–21https://doi.org/10.1109/MM.2009.9

The Larrabee many-core visual computing architecture uses multiple in-order x86 cores augmented by wide vector processor units, together with some fixed-function logic. This increases the architecture's programmability as compared to standard GPUs. The ...

research-article

Larrabee: a many-core x86 architecture for visual computing

SIGGRAPH '08: ACM SIGGRAPH 2008 papersArticle No.: 18, Pages 1–15https://doi.org/10.1145/1399504.1360617

This paper presents a many-core visual computing architecture code named Larrabee, a new software rendering pipeline, a manycore programming model, and performance analysis for several applications. Larrabee uses multiple in-order x86 CPU cores that are ...

research-article

Larrabee: a many-core x86 architecture for visual computing

ACM Transactions on Graphics (TOG), Volume 27, Issue 3Pages 1–15https://doi.org/10.1145/1360612.1360617

This paper presents a many-core visual computing architecture code named Larrabee, a new software rendering pipeline, a manycore programming model, and performance analysis for several applications. Larrabee uses multiple in-order x86 CPU cores that are ...

Article

Tarantula: a vector extension to the alpha architecture

ISCA '02: Proceedings of the 29th annual international symposium on Computer architecturePages 281–292

Tarantula is an aggressive floating point machine targeted at technical, scientific and bioinformatics workloads, originally planned as a follow-on candidate to the EV8 processor [6, 5]. Tarantula adds to the EV8 core a vector unit capable of 32 double-...

Also Published in:

ACM SIGARCH Computer Architecture News: Volume 30 Issue 2

research-article

Asim: A Performance Model Framework

Computer (COMP), Volume 35, Issue 2Pages 68–76https://doi.org/10.1109/2.982918

The longevity and usefulness of a microprocessor performance modelhas historically depended on the model writer's skills and discipline. However,at Compaq the models became extremely complex and unmanageablebecause designers lacked a structured way to ...

Article

Free

Dataflow analysis of branch mispredictions and its application to early resolution of branch outcomes

MICRO 31: Proceedings of the 31st annual ACM/IEEE international symposium on MicroarchitecturePages 59–68

Article

Free

Dynamic history-length fitting: a third level of adaptivity for branch prediction

ISCA '98: Proceedings of the 25th annual international symposium on Computer architecturePages 155–166https://doi.org/10.1145/279358.279379

Accurate branch prediction is essential for obtaining high performance in pipelined superscalar processors that execute instructions speculatively. Some of the best current predictors combine a part of the branch address with a fixed amount of global ...

Also Published in:

ACM SIGARCH Computer Architecture News: Volume 26 Issue 3

Article

Free

Reducing TLB power requirements

ISLPED '97: Proceedings of the 1997 international symposium on Low power electronics and designPages 196–201https://doi.org/10.1145/263272.263332

Article

Free

Data caches for superscalar processors

ICS '97: Proceedings of the 11th international conference on SupercomputingPages 60–67https://doi.org/10.1145/263580.263595

Article

Free

The difference-bit cache

ISCA '96: Proceedings of the 23rd annual international symposium on Computer architecturePages 114–120https://doi.org/10.1145/232973.232986

The difference-bit cache is a two-way set-associative cache with an access time that is smaller than that of a conventional one and close or equal to that of a direct-mapped cache. This is achieved by noticing that the two tags for a set have to differ ...

Also Published in:

ACM SIGARCH Computer Architecture News: Volume 24 Issue 2

Article

Free

Block algorithms for sparse matrix computations on high performance workstations

ICS '96: Proceedings of the 10th international conference on SupercomputingPages 301–308https://doi.org/10.1145/237578.237624

Article

Free

MOB forms: a class of multilevel block algorithms for dense linear algebra operations

ICS '94: Proceedings of the 8th international conference on SupercomputingPages 354–363https://doi.org/10.1145/181181.181561

Multilevel block algorithms exploit the data locality in linear algebra operations when executed in machines with several levels in the memory hierarchy. It is shown that the family we call Multilevel Orthogonal Block (MOB) algorithms is optimal and ...

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Caption

Architecting a hardware-managed hybrid DIMM optimized for cost/performance

Reusing cached schedules in an out-of-order processor with in-order issue logic

Larrabee: A Many-Core x86 Architecture for Visual Computing

Larrabee: a many-core x86 architecture for visual computing

Larrabee: a many-core x86 architecture for visual computing

Tarantula: a vector extension to the alpha architecture

Also Published in:

Asim: A Performance Model Framework

Dataflow analysis of branch mispredictions and its application to early resolution of branch outcomes

Dynamic history-length fitting: a third level of adaptivity for branch prediction

Also Published in:

Reducing TLB power requirements

Data caches for superscalar processors

The difference-bit cache

Also Published in:

Block algorithms for sparse matrix computations on high performance workstations

MOB forms: a class of multilevel block algorithms for dense linear algebra operations

Applied Filters

People

Names

Institutions

Authors

Reviewers

Publications

Journal/Magazine Names

Proceedings/Book Names

All Publications

Content Type

Supplemental Material Type

Media Formats

Publisher

Conferences

Sponsors

Conference Event

Proceedings Series

Publication Date

Save to Binder

Also Published in:

Also Published in:

Also Published in: