Arm Cortex M7 Processor Datasheet
Arm Cortex M7 Processor Datasheet
Arm Cortex M7 Processor Datasheet
Datasheet
Overview
The Arm Cortex-M7 processor is the highest-performing processor in the Cortex-M family
that enables the design of sophisticated MCUs and SoCs. The Cortex-M7 offers industry-
leading scalar performance of 5.01 CoreMarks/MHz, while maintaining the excellent
responsiveness and ease-of-use of the Armv7-M architecture. With built-in instruction and
data caches and tightly coupled memories (TCMs) this superscalar processor never has to
slow down even in the most demanding processing applications at the endpoint.
Features
Feature Description
Figure 1: Block diagram of the
Cortex-M7 processor Architecture Armv7E-M
ISA support Thumb/Thumb2
Pipeline 6-stage superscalar and branch prediction
Single-cycle 16/32-bit MAC
Single-cycle dual 16-bit MAC
DSP extension
8/16-bit SIMD arithmetic
Hardware divide
Optional single and double-precision FPU (choices of none,
FPU single-precision only, and single and double-precision)
IEEE 754 compliant
64-bit AMBA4 AXI, 32-bit AHB peripheral (AHBP) port
32-bit AHB slave (AHBS) port for external master
Interconnect
(such as DMA controller) to access Tightly-Coupled Memories
APB interface for CoreSight debug components
0 to 64 KB, 2-way associative with optional error correction
Instruction cache
code (ECC)
Data cache 0 to 64 KB, 4-way associative with optional ECC
1
Interrupt priority levels 8 to 256 priority levels
Wake-up interrupt
Optional
controller
Integrated WFI and WFE Instructions and Sleep-On
Exit capability
Sleep modes
Sleep and deep sleep signals
Optional retention mode with Arm Power Management Kit
Optional JTAG and serial wire debug ports. Up to
Debug
8 breakpoints and 4 watchpoints
Optional Embedded Trace Macrocell (ETM), Micro Trace
Buffer (MTB), Data Watchpoint and Trace (DWT), and
Trace
Instrumentation Trace (ITM)
Optional full data trace with ETM
Dual Core Lock-Step
Yes, DCLS configuration
support (DCLS)
The processor is highly configurable and is intended for a wide range of high-performance,
deeply embedded applications that require fast interrupt response features.
2
Block Diagram
Cortex-M7 processor components
Figure 2: Cortex-M7
processor components
Parallelized integer register file with six read ports and four write ports for large-scale
dual-issue
Two Arithmetic Logic Units (ALUs), with one ALU capable of executing SIMD operations
Single MAC pipeline capable of 32x32-bit + 64-bit → 64-bit with two cycle result
latency and one MAC per cycle throughput
Prefetch Unit
The Prefetch Unit (PFU) provides:
A Branch Target Address Cache (BTAC) for single-cycle turn-around of branch predictor
state and target address
3
A static branch predictor when no BTAC is specified
Forwarding of flags for early resolution of direct branches in the decoder and first
execution stages of the processor pipeline
Dual 32-bit load channels to TCM, data cache, and AXI master (AXIM) interface
for 64-bit load bandwidth and dual 32-bit load capability
Single 32-bit load channel to the AHB interface. Single 64-bit store channel
Store buffering to increase store throughput and minimize RAM contention with data
and instruction reads
Separate store buffering for TCM, AHBP and AXIM for quality of service (QoS) and
interface-specific optimizations
Floating-point Unit
The optional FPU provides:
Combined multiply and accumulate instructions for increased precision (Fused MAC)
Hardware support for denormals and all IEEE Standard 754-2008 rounding modes
Support for tail-chaining and late arrival of interrupts. This enables back-to-back interrupt
processing without the overhead of state saving and restoration between interrupts
4
Wake-up Interrupt Controller
The optional Wake-up Interrupt Controller (WIC) provides ultra-low power sleep mode support.
Memory System
The optional memory system includes:
A Bus Interface Unit (BIU) with a configurable AMBA 4 AXI interface that can support
a high-performance L2 memory system
A Tightly Coupled Interface Unit (TCU) with TCM interfaces that can support external
A Memory Built-in Self-Test (MBIST) interface. The interface supports MBIST operation
while the processor is running
ETM
The optional Embedded Trace Macrocell provides instruction-only or instruction and data
trace capabilities when configured. See the Arm CoreSight ETM-M7 Technical Reference
Manual for more information.
Configurable DWT unit for implementing watchpoints, data tracing, and system profiling
Optional ITM for support of printf() style debugging, using instrumentation trace
Interfaces suitable for:
– Passing on-chip data to a Trace Port Analyzer (TPA), including Single Wire
Output (SWO) mode
– Debugger access to all memory and registers in the system, including access
to memory mapped devices, access to internal core registers when the core
is halted, and access to debug control registers even when reset is asserted
5
Interfaces
The processor contains the following external interfaces:
AHBP interface
AHBS interface
AHBD interface
ATB interfaces
TCM interface
MBIST interface
AXIM interface
AHBP Interface
The AHBP interface provides access suitable for low latency system peripherals. It provides support
for unaligned memory accesses, write buffer for buffering of write data, and exclusive access
transfers for multiprocessor systems.
AHBS Interface
The AHBS interface enables system access to TCMs.
AHBD Interface
The AHB-Lite Debug (AHBD) interface provides debug access to the Cortex-M7 processor and
the complete memory map.
ATB Interfaces
The ATB interface output traces information used for debugging. The ATB interface is compatible
with the CoreSight architecture. See the Arm CoreSightTM Architecture Specification (v2.0) for
more information.
TCM Interface
The processor can have up to two TCM memory instances, Instruction TCM (ITCM) and Data
TCM (DTCM), each with a double word data width. Access to ITCM is through the ITCM 64-bit
wide interface. Access to DTCM is through the 32-bit D0TCM interface and the 32-bit wide
D1TCM interface. The DTCM accesses are split so that lower words always access D0TCM and
upper words always access D1TCM. The size of both TCM instances is configurable, 4KB-16MB in
powers of 2.
6
Cross Trigger Interface
The processor includes an optional Cross Trigger Interface Unit which includes an interface suitable
for connection to external CoreSight components using a Cross Trigger Matrix.
MBIST Interface
The MBIST Interface is used for testing the RAMs during production test. The Cortex-M7
processor also allows the RAMs to be tested using the MBIST interface during normal execution.
This is known as online MBIST.
AXIM Interface
The AXIM interface provides high-performance access to an external memory system. The AXIM
interface supports use of the Arm CoreLink L2C-310 Level 2 Cache Controller. L2C-310 exclusive
cache configuration is not supported.
Integer pipe:
– Dual shifters, dual ALUs, one MAC-capable
Float pipe:
– FP instructions can be dual issued with integer instructions
– BTAC and branch predictor boost performance
Figure 3: Cortex-M7
processor pipeline
7
Processor Configuration Options
The Cortex-M7 processor has configurable options that you can set during the
implementation and integration stages to match your functional requirements.
Feature Options
No floating-point
Floating-point Single-precision floating-point only
Single-precision and double-precision floating-point
No instruction TCM
Instruction TCM
4KB-16MB (powers of 2)
No data TCM
8
Instruction Set
Armv7-M
Armv6-M
Cortex-M0/M0+
Cortex-M3
Cortex-M4
Cortex-M7
9
Power, Performance and Area
DMIPS CoreMark/MHz
2.14 5.01
Typical configuration 0.213 (core) 72.2 (core) 0.108 (core) 41.4 (core) 0.055 (core) 26 (core)
(32 I/D cache, ECC 0.261 (memory) 46 (memory) 0.127 (memory) 15.8 (memory) 0.092 (memory) 16 (memory)
@100Mhz
10
Additional Technical documents
Glossary of Terms
11
PFU Prefetch Unit
PPB Private Peripheral Bus
QoS Quality of Service
ROM Read Only Memory
RTL Register Transfer Level
SIMD Single Instruction, Multiple Data
SRAM Static RAM
SRD Sub Region Disable
SWD SingkleSingle Wire Debug
SWO Single Wire Output
TCM Tightly Coupled Memory
TCU Tightly Coupled Interface Unit
TPA Trace Port Analyzer
TPIU Trace Port Interface Unit
WFE Wait for interrupt
WFI Wait for event
WIC Wake-up Interrupt Controller
WT/WB Write Through/Write Back
Contact details
UK Europe Japan Taiwan China
Salesinfo-eu@Arm.com Salesinfo-eu@Arm.com Salesinfo-eu@Arm.com Salesinfo-eu@Arm.com Salesinfo-eu@Arm.com
All brand names or product names are the property of their respective holders. Neither the whole nor any part of the
information contained in, or the product described in, this document may be adapted or reproduced in any material form except with
the prior written permission of the copyright holder. The product described in this document is subject to continuous developments
and improvements. All particulars of the product and its use contained in this document are given in good faith. All warranties implied
or expressed, including but not limited to implied warranties of satisfactory quality or fitness for purpose are excluded. This document
is intended only to provide information to the reader about the product. To the extent permitted by local laws Arm shall not be liable
for any loss or damage arising from the use of any information in this document or any error or omission in such information.
12