Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.5555/320080.320097acmconferencesArticle/Chapter ViewAbstractPublication PagesmicroConference Proceedingsconference-collections
Article
Free access

Low-cost branch folding for embedded applications with small tight loops

Published: 16 November 1999 Publication History

Abstract

Many portable and embedded applications are characterized by spending a large fraction of execution time on small program loops. To improve performance, many embeded systems use special instructions to handle program loop executions. These special instructions, however, consume opcode space, which is valuable in the embedded computing environments. In this paper, we propose a hardware technique for folding out branches when executing these small loops. This technique does not require any special branch instructions. It is based on the detection and utilization of certain short backward branch instructions (sbb). A sbb is any PC-relative branch instruction with a limited backward branch distance. Once a sbb is detected, its displacement field is used by the hardware to identify the actual program loop size. It does so by loading this negative displacement field into a counter and incrementing the counter for each instruction sequentially executed. As the count approaches zero, the hardware folds out the sbb by predicting that it is always taken. The hardware overhead for this technique is minimal. Using a 5-bit increment counter, the performance improvement over a set of embedded applications is about 7.5%.

References

[1]
ADSP-2106x SHARCTM User's Manual, Analog Devices Inc., 1998.
[2]
D. Ditzel, H. McLellan, "Branch Folding in the CRISP Microprocessor: Reducing Branch Delay to Zero," Proc. IEEE Int'l. Syrup. on Computer Architecture, 1987.
[3]
Kissell, MIPSI6: High-density MIPS for the Embedded Market, Silicon Graphics MIPS Group, 1997.
[4]
M. Kozuch and A. Wolfe, "Compression of Embedded System Programs," Proc. IEEE lnt'l Conf. on Computer Design, 1994.
[5]
J. Lee and A. J. Smith, "Branch Prediction Strategies and Branch Target Buffer Design," IEEE Computer, pp.6-22, January 1984.
[6]
B. Moyer, J. Arends, "RISC Gets Small," Byte Magazine, February 1998.
[7]
M. CORE Reference Manual, Motorola Inc., 1997.
[8]
PowerPC 603 RISC Microprocessor User's Manual, IBM Microelectronics/Motorola Incorp., 1990.
[9]
J. Scott, L. H. Lee, J. Arends and B. Moyer, "Designing the Low-Power M.CORE Architecture," Proc. Power Driven Microarchitecture Workshop, Barcelona, Spain, June 28th 1998.
[10]
J. Scott, L. H. Lee, A. Chin, J. Arends and B. Moyer, "Designing the MoCORE M3 CPU Architecture," Proc. IEEE Int'l Conf. on Computer Design, Austin, Texas, October 10- t 3, 1999.
[11]
J.E. Smith, "A Study of Branch Prediction Strategies,' Proc. IEEE lnt'l Syrup. on Comp. Architecture, May 1991.
[12]
SuperH SH-4 Hardware Manual, Hitachi Inc., 1998.
[13]
TMS320C2x User's Guide, Texas Instruments Inc., 1993.
[14]
TriCore Architecture Manual, Siemens Incorp., 1997.
[15]
A. Wolfe and A. Chanin, "Executing Compressed Programs on an Embedded RISC Architecture," Proc. IEEE Int'l Symp. on Microarchitecture, December 1992.

Cited By

View all
  • (2001)Speeding up control-dominated applications through microarchitectural customizations in embedded processorsProceedings of the 38th annual Design Automation Conference10.1145/378239.379014(512-517)Online publication date: 22-Jun-2001

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
MICRO 32: Proceedings of the 32nd annual ACM/IEEE international symposium on Microarchitecture
November 1999
299 pages
ISBN:076950437X

Sponsors

Publisher

IEEE Computer Society

United States

Publication History

Published: 16 November 1999

Check for updates

Qualifiers

  • Article

Conference

MICRO99
Sponsor:

Acceptance Rates

MICRO 32 Paper Acceptance Rate 27 of 131 submissions, 21%;
Overall Acceptance Rate 484 of 2,242 submissions, 22%

Upcoming Conference

MICRO '24

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)66
  • Downloads (Last 6 weeks)22
Reflects downloads up to 12 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2001)Speeding up control-dominated applications through microarchitectural customizations in embedded processorsProceedings of the 38th annual Design Automation Conference10.1145/378239.379014(512-517)Online publication date: 22-Jun-2001

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media