2. Memory and bus architecture

2.1 System architecture

In STM32F401xB/C and STM32F401xD/E, the main system consists of 32-bit multilayer AHB bus matrix that interconnects:

The bus matrix provides access from a master to a slave, enabling concurrent access and efficient operation even when several high-speed peripherals work simultaneously. This architecture is shown in Figure 1 .

Figure 1. System architecture

Figure 1. System architecture diagram showing the interconnection of masters and slaves via a Bus matrix-S. Masters include ARM Cortex-M4 (I-bus, D-bus, S-bus), GP DMA1 (DMA_PI, DMA_MEM1), and GP DMA2 (DMA_MEM2, DMA_P2). Slaves include Flash (ICode, DCode), SRAM1, AHB periph1, and AHB periph2. The diagram shows the Bus matrix-S with 6 slave ports (S0-S5) and 5 master ports (M0-M4). Connections are as follows: ARM Cortex-M4 I-bus to S0, D-bus to S1, S-bus to S2; GP DMA1 DMA_PI to S3, DMA_MEM1 to S4; GP DMA2 DMA_MEM2 to S4, DMA_P2 to S5. Slave ports are: M0 (ICode) to Flash, M1 (DCode) to Flash, M2 to SRAM1, M3 to AHB periph1, M4 to AHB periph2. AHB periph1 and AHB periph2 are connected to APB1 and APB2 bridges respectively. MS31420V1 is noted in the bottom right.
Figure 1. System architecture diagram showing the interconnection of masters and slaves via a Bus matrix-S. Masters include ARM Cortex-M4 (I-bus, D-bus, S-bus), GP DMA1 (DMA_PI, DMA_MEM1), and GP DMA2 (DMA_MEM2, DMA_P2). Slaves include Flash (ICode, DCode), SRAM1, AHB periph1, and AHB periph2. The diagram shows the Bus matrix-S with 6 slave ports (S0-S5) and 5 master ports (M0-M4). Connections are as follows: ARM Cortex-M4 I-bus to S0, D-bus to S1, S-bus to S2; GP DMA1 DMA_PI to S3, DMA_MEM1 to S4; GP DMA2 DMA_MEM2 to S4, DMA_P2 to S5. Slave ports are: M0 (ICode) to Flash, M1 (DCode) to Flash, M2 to SRAM1, M3 to AHB periph1, M4 to AHB periph2. AHB periph1 and AHB periph2 are connected to APB1 and APB2 bridges respectively. MS31420V1 is noted in the bottom right.
  1. 1. STM32F401xB/C: 128 KBytes / 256 KBytes Flash with 64 KBytes SRAM.
    STM32F401xD/E: 384 KBytes / 512KBytes Flash with 96 KBytes SRAM.

2.1.1 I-bus

This bus connects the Instruction bus of the Cortex ® -M4 with FPU core to the BusMatrix. This bus is used by the core to fetch instructions. The target of this bus is a memory containing code (internal flash memory/SRAM).

2.1.2 D-bus

This bus connects the databus of the Cortex ® -M4 with FPU to the BusMatrix. This bus is used by the core for literal load and debug access. The target of this bus is a memory containing code or data (internal flash memory/SRAM).

2.1.3 S-bus

This bus connects the system bus of the Cortex ® -M4 with FPU core to a BusMatrix. This bus is used to access data located in a peripheral or in SRAM. Instructions may also be fetched on this bus (less efficient than ICode). The targets of this bus are the internal SRAM, the AHB1 peripherals including the APB peripherals and the AHB2 peripherals.

2.1.4 DMA memory bus

This bus connects the DMA memory bus master interface to the BusMatrix. It is used by the DMA to perform transfer to/from memories. The targets of this bus are data memories: internal Flash memory, internal SRAM and additionally for S4 the AHB1/AHB2 peripherals including the APB peripherals.

2.1.5 DMA peripheral bus

This bus connects the DMA peripheral master bus interface to the BusMatrix. This bus is used by the DMA to access AHB peripherals or to perform memory-to-memory transfers. The targets of this bus are the AHB and APB peripherals plus data memories: Flash memory and internal SRAM.

2.1.6 BusMatrix

The BusMatrix manages the access arbitration between masters. The arbitration uses a round-robin algorithm.

2.1.7 AHB/APB bridges (APB)

The two AHB/APB bridges, APB1 and APB2, provide full synchronous connections between the AHB and the two APB buses, allowing flexible selection of the peripheral frequency.

Refer to the device datasheets for more details on APB1 and APB2 maximum frequencies, and to Table 1 for the address mapping of AHB and APB peripherals.

After each device reset, all peripheral clocks are disabled (except for the SRAM and flash memory interface). Before using a peripheral you have to enable its clock in the RCC_AHBxENR or RCC_APBxENR register.

Note: When a 16- or an 8-bit access is performed on an APB register, the access is transformed into a 32-bit access: the bridge duplicates the 16- or 8-bit data to feed the 32-bit vector.

2.2 Memory organization

Program memory, data memory, registers and I/O ports are organized within the same linear 4 Gbyte address space.

The bytes are coded in memory in little endian format. The lowest numbered byte in a word is considered the word's least significant byte and the highest numbered byte, the word's most significant.

For the detailed mapping of peripheral registers, please refer to the related chapters.

The addressable memory space is divided into 8 main blocks, each of 512 MB.

All the memory areas that are not allocated to on-chip memories and peripherals are considered "Reserved"). Refer to the memory map figure in the product datasheet.

2.3 Memory map

See the datasheet corresponding to your device for a comprehensive diagram of the memory map. Table 1 gives the boundary addresses of the peripherals available in STM32F401xB/C and STM32F401xD/E devices.

Table 1. STM32F401xB/C and STM32F401xD/E register boundary addresses

Boundary addressPeripheralBusRegister map
0x5000 0000 - 0x5003 FFFFUSB OTG FSAHB2Section 22.16.6: OTG_FS register map on page 755
0x4002 6400 - 0x4002 67FFDMA2Section 9.5.11: DMA register map on page 198
0x4002 6000 - 0x4002 63FFDMA1
0x4002 3C00 - 0x4002 3FFFFlash interface registerSection 3.8: Flash interface registers on page 60
0x4002 3800 - 0x4002 3BFFRCCSection 6.3.22: RCC register map on page 137
0x4002 3000 - 0x4002 33FFCRCSection 4.4.4: CRC register map on page 70
0x4002 1C00 - 0x4002 1FFFGPIOHSection 8.4.11: GPIO register map on page 164
0x4002 1000 - 0x4002 13FFGPIOE
0x4002 0C00 - 0x4002 0FFFGPIOF
0x4002 0800 - 0x4002 0BFFGPIOC
0x4002 0400 - 0x4002 07FFGPIOB
0x4002 0000 - 0x4002 03FFGPIOA

Table 1. STM32F401xB/C and STM32F401xD/E register boundary addresses (continued)

Boundary addressPeripheralBusRegister map
0x4001 4800 - 0x4001 4BFFTIM11APB2Section 14.5.12: TIM10/11 register map on page 420
0x4001 4400 - 0x4001 47FFTIM10
0x4001 4000 - 0x4001 43FFTIM9Section 14.4.13: TIM9 register map on page 410
0x4001 3C00 - 0x4001 3FFFEXTISection 10.3.7: EXTI register map on page 212
0x4001 3800 - 0x4001 3BFFSYSCFGSection 7.2.8: SYSCFG register map
0x4001 3400 - 0x4001 37FFSPI4Section 20.5.10: SPI register map on page 611
0x4001 3000 - 0x4001 33FFSPI1
0x4001 2C00 - 0x4001 2FFFSDIOSection 21.9.16: SDIO register map on page 667
0x4001 2000 - 0x4001 23FFADC1Section 11.12.16: ADC register map on page 240
0x4001 1400 - 0x4001 17FFUSART6Section 19.6.8: USART register map on page 558
0x4001 1000 - 0x4001 13FFUSART1
0x4001 0000 - 0x4001 03FFTIM1Section 12.4.21: TIM1 register map on page 314
0x4000 7000 - 0x4000 73FFPWRAPB1Section 5.5: PWR register map on page 90
0x4000 5C00 - 0x4000 5FFFI2C3Section 18.6.11: I2C register map on page 505
0x4000 5800 - 0x4000 5BFFI2C2
0x4000 5400 - 0x4000 57FFI2C1Section 19.6.8: USART register map on page 558
0x4000 4400 - 0x4000 47FFUSART2
0x4000 4000 - 0x4000 43FFI2S3extSection 20.5.10: SPI register map on page 611
0x4000 3C00 - 0x4000 3FFFSPI3 / I2S3
0x4000 3800 - 0x4000 3BFFSPI2 / I2S2
0x4000 3400 - 0x4000 37FFI2S2ext
0x4000 3000 - 0x4000 33FFIWDGSection 15.4.5: IWDG register map on page 426
0x4000 2C00 - 0x4000 2FFFWWDGSection 16.6.4: WWDG register map on page 433
0x4000 2800 - 0x4000 2BFFRTC & BKP RegistersSection 17.6.21: RTC register map on page 471
0x4000 0C00 - 0x4000 0FFFTIM5Section 13.4.21: TIMx register map on page 374
0x4000 0800 - 0x4000 0BFFTIM4
0x4000 0400 - 0x4000 07FFTIM3
0x4000 0000 - 0x4000 03FFTIM2

2.3.1 Embedded SRAM

STM32F401xB/C devices feature 64 Kbytes and STM32F401xD/E devices feature 96 Kbytes of system SRAM.

The embedded SRAM can be accessed as bytes, half-words (16 bits) or full words (32 bits). Read and write operations are performed at CPU speed with 0 wait state.

The CPU can access the embedded SRAM through the System bus or through the I-Code/D-Code buses when boot from SRAM is selected or when physical remap is selected ( Section 7.2.1: SYSCFG memory remap register (SYSCFG_MEMRMP) in the SYSCFG controller). To get the max performance on SRAM execution, physical remap should be selected (boot or software selection).

2.3.2 Flash memory overview

The flash memory interface manages CPU AHB I-Code and D-Code accesses to the flash memory. It implements the erase and program flash memory operations and the read and write protection mechanisms. It accelerates code execution with a system of instruction prefetch and cache lines.

The flash memory is organized as follows:

Refer to Section 3: Embedded flash memory interface for more details.

2.3.3 Bit banding

The Cortex®-M4 with FPU memory map includes two bit-band regions. These regions map each word in an alias region of memory to a bit in a bit-band region of memory. Writing to a word in the alias region has the same effect as a read-modify-write operation on the targeted bit in the bit-band region.

In the STM32F4xx devices both the peripheral registers and the SRAM are mapped to a bit-band region, so that single bit-band write and read operations are allowed. The operations are only available for Cortex®-M4 with FPU accesses, and not from other bus masters (e.g. DMA).

A mapping formula shows how to reference each word in the alias region to a corresponding bit in the bit-band region. The mapping formula is:

\[ bit\_word\_addr = bit\_band\_base + (byte\_offset \times 32) + (bit\_number \times 4) \]

where:

Example

The following example shows how to map bit 2 of the byte located at SRAM address 0x20000300 to the alias region:

\[ 0x22006008 = 0x22000000 + (0x300*32) + (2*4) \]

Writing to address 0x22006008 has the same effect as a read-modify-write operation on bit 2 of the byte at SRAM address 0x20000300.

Reading address 0x22006008 returns the value (0x01 or 0x00) of bit 2 of the byte at SRAM address 0x20000300 (0x01: bit set; 0x00: bit reset).

For more information on bit-banding, please refer to the Cortex®-M4 with FPU programming manual (see Related documents on page 1 ).

2.4 Boot configuration

Due to its fixed memory map, the code area starts from address 0x0000 0000 (accessed through the ICode/DCode buses) while the data area (SRAM) starts from address 0x2000 0000 (accessed through the system bus). The Cortex®-M4 with FPU CPU always fetches the reset vector on the ICode bus, which implies to have the boot space available only in the code area (typically, flash memory). STM32F4xx microcontrollers implement a special mechanism to be able to boot from other memories (like the internal SRAM).

In the STM32F4xx, three different boot modes can be selected through the BOOT[1:0] pins as shown in Table 2 .

Table 2. Boot modes

Boot mode selection pinsBoot modeAliasing
BOOT1BOOT0
x0Main flash memoryMain flash memory is selected as the boot space
01System memorySystem memory is selected as the boot space
11Embedded SRAMEmbedded SRAM is selected as the boot space

The values on the BOOT pins are latched on the 4th rising edge of SYSCLK after a reset. It is up to the user to set the BOOT1 and BOOT0 pins after reset to select the required boot mode.

BOOT0 is a dedicated pin while BOOT1 is shared with a GPIO pin. Once BOOT1 has been sampled, the corresponding GPIO pin is free and can be used for other purposes.

The BOOT pins are also resampled when the device exits the Standby mode. Consequently, they must be kept in the required Boot mode configuration when the device is in the Standby mode. After this startup delay is over, the CPU fetches the top-of-stack value from address 0x0000 0000, then starts code execution from the boot memory starting from 0x0000 0004.

Note: When the device boots from SRAM, in the application initialization code, you have to relocate the vector table in SRAM using the NVIC exception table and the offset register.

Embedded bootloader

The embedded bootloader mode is used to reprogram the flash memory using one of the following serial interfaces:

The USART peripherals operate at the internal 16 MHz oscillator (HSI) frequency, while the USB OTG FS require an external clock (HSE) multiple of 1 MHz (ranging from 4 to 26 MHz).

The embedded bootloader code is located in system memory. It is programmed by ST during production. For additional information, refer to application note AN2606.

Physical remap in STM32F401xB/C and STM32F401xD/E

Once the boot pins are selected, the application software can modify the memory accessible in the code area (in this way the code can be executed through the ICode bus in place of the System bus). This modification is performed by programming the Section 7.2.1: SYSCFG memory remap register (SYSCFG_MEMRMP) in the SYSCFG controller.

The following memories can thus be remapped:

Table 3. Memory mapping vs. Boot mode/physical remap in STM32F401xB/C

AddressesBoot/Remap in main Flash memoryBoot/Remap in embedded SRAMBoot/Remap in System memory
0x2000 0000 - 0x2000 FFFFSRAM1 (64 KB)SRAM1 (64 KB)SRAM1 (64 KB)
0x1FFF 0000 - 0x1FFF 77FFSystem memorySystem memorySystem memory
0x0804 0000 - 0x1FFE FFFFReservedReservedReserved
0x0800 0000 - 0x0803 FFFFFlash memoryFlash memoryFlash memory
0x0400 0000 - 0x07FF FFFFReservedReservedReserved
0x0000 0000 - 0x0003 FFFF (1)Flash (256 KB) AliasedSRAM1 (64 KB) AliasedSystem memory (30 KB) Aliased
  1. 1. Even when aliased in the boot memory space, the related memory is still accessible at its original memory space.
Table 4. Memory mapping vs. Boot mode/physical remap in STM32F401xD/E
AddressesBoot/Remap in main Flash memoryBoot/Remap in embedded SRAMBoot/Remap in System memory
0x2000 0000 - 0x2001 7FFFSRAM1 (96 KB)SRAM1 (96KB)SRAM1 (96KB)
0x1FFF 0000 - 0x1FFF 77FFSystem memorySystem memorySystem memory
0x0808 0000 - 0x1FFE FFFFReservedReservedReserved
0x0800 0000 - 0x0807 FFFFFlash memoryFlash memoryFlash memory
0x0400 000 - 0x07FF FFFFReservedReservedReserved
0x0000 0000 - 0x0007 FFFF (1)Flash (512 KB) AliasedSRAM1 (96 KB)
Aliased
System memory
(30 KB) Aliased
  1. 1. Even when aliased in the boot memory space, the related memory is still accessible at its original memory space.