2. Memory and bus architecture

2.1 System architecture

The main system consists of 32-bit multilayer AHB bus matrix that interconnects:

The bus matrix provides access from a master to a slave, enabling concurrent access and efficient operation even when several high-speed peripherals work simultaneously. This architecture is shown in Figure 1 .

Figure 1. System architecture

Figure 1. System architecture diagram showing the ARM Cortex-M3 core connected to a Bus matrix-S. The matrix has 8 slave ports (S0-S7) and 7 master ports (M0-M6). Masters include ARM Cortex-M3 (I-bus, D-bus, S-bus), GP DMA1 (DMA_P1, DMA_MEM1), GP DMA2 (DMA_MEM2, DMA_P2), MAC Ethernet (ETHERNET_M), and USB OTG HS (USB_HS_M). Targets include Flash memory (via ICODE/DCODE), SRAM 112 Kbyte, SRAM 16 Kbyte, AHB periph 1, AHB periph 2, and FSMC Static MemCtl. APB1 and APB2 are connected to AHB periph 1.

The diagram illustrates the system architecture of the RM0033 microcontroller. At the top, the ARM Cortex-M3 core is shown with its I-bus, D-bus, and S-bus connected to the Bus matrix-S. Below the core, two General Purpose DMA (GP DMA) units, DMA1 and DMA2, are shown with their respective master interfaces (DMA_P1, DMA_MEM1, DMA_MEM2, DMA_P2) connected to the matrix. A MAC Ethernet and a USB OTG HS are also connected to the matrix via their master interfaces (ETHERNET_M and USB_HS_M). The Bus matrix-S is a grid with 8 slave ports (S0 to S7) on the left and 7 master ports (M0 to M6) on the right. The slave ports are connected to the ARM Cortex-M3 (S0, S1, S2), GP DMA1 (S3), GP DMA2 (S4), MAC Ethernet (S5), and USB OTG HS (S6, S7). The master ports are connected to various system components: M0 (ICODE) and M1 (DCODE) lead to an ART ACCEL block, which is connected to Flash memory; M2 is connected to SRAM 112 Kbyte; M3 is connected to SRAM 16 Kbyte; M4 is connected to AHB periph 1; M5 is connected to AHB periph 2; and M6 is connected to FSMC Static MemCtl. APB1 and APB2 are shown as being connected to AHB periph 1. The diagram is labeled 'ai15963b' in the bottom right corner.

Figure 1. System architecture diagram showing the ARM Cortex-M3 core connected to a Bus matrix-S. The matrix has 8 slave ports (S0-S7) and 7 master ports (M0-M6). Masters include ARM Cortex-M3 (I-bus, D-bus, S-bus), GP DMA1 (DMA_P1, DMA_MEM1), GP DMA2 (DMA_MEM2, DMA_P2), MAC Ethernet (ETHERNET_M), and USB OTG HS (USB_HS_M). Targets include Flash memory (via ICODE/DCODE), SRAM 112 Kbyte, SRAM 16 Kbyte, AHB periph 1, AHB periph 2, and FSMC Static MemCtl. APB1 and APB2 are connected to AHB periph 1.

2.1.1 S0: I-bus

This bus connects the Instruction bus of the Cortex ® -M3 core to the BusMatrix. This bus is used by the core to fetch instructions. The target of this bus is a memory containing code (internal Flash memory/SRAM or external memories through the FSMC).

2.1.2 S1: D-bus

This bus connects the databus of the Cortex ® -M3 to the BusMatrix. This bus is used by the core for literal load and debug access. The target of this bus is a memory containing code or data (internal Flash memory /SRAM or external memories through the FSMC).

2.1.3 S2: S-bus

This bus connects the system bus of the Cortex ® -M3 core to a BusMatrix. This bus is used to access data located in a peripheral or in SRAM. Instructions may also be fetch on this bus (less efficient than ICode). The targets of this bus are the 112 KB & 16 KB internal SRAMs, the AHB1 peripherals including the APB peripherals, the AHB2 peripherals and the external memories through the FSMC.

2.1.4 S3, S4: DMA memory bus

This bus connects the DMA memory bus master interface to the BusMatrix. It is used by the DMA to perform transfer to/from memories. The targets of this bus are data memories: internal SRAM and external memories through the FSMC.

2.1.5 S5: DMA peripheral bus

This bus connects the DMA peripheral master bus interface to the BusMatrix. This bus is used by the DMA to access AHB peripherals or to perform memory-to-memory transfers. The targets of this bus are the AHB and APB peripherals plus data memories: internal SRAM and external memories through the FSMC.

2.1.6 S6: Ethernet DMA bus

This bus connects the Ethernet DMA master interface to the BusMatrix. This bus is used by the Ethernet DMA to load/store data to a memory. The targets of this bus are data memories: internal SRAM and external memories through the FSMC.

2.1.7 S7: USB OTG HS DMA bus

This bus connects the USB OTG HS DMA master interface to the BusMatrix. This bus is used by the USB OTG DMA to load/store data to a memory. The targets of this bus are data memories: internal SRAM and external memories through the FSMC.

2.1.8 BusMatrix

The BusMatrix manages the access arbitration between masters. The arbitration uses a round-robin algorithm.

2.1.9 AHB/APB bridges (APB)

The two AHB/APB bridges provide full synchronous connections between the AHB and the two APB buses, allowing flexible selection of the peripheral frequency:

Refer to Table 1 on page 51 for the address mapping of AHB and APB peripherals.

After each device reset, all peripheral clocks are disabled (except for the SRAM and Flash memory interface). Before using a peripheral you have to enable its clock in the RCC_AHBxENR or RCC_APBxENR register.

Note: When a 16- or an 8-bit access is performed on an APB register, the access is transformed into a 32-bit access: the bridge duplicates the 16- or 8-bit data to feed the 32-bit vector.

2.2 Memory organization

Program memory, data memory, registers and I/O ports are organized within the same linear 4 Gbyte address space.

The bytes are coded in memory in little endian format. The lowest numbered byte in a word is considered the word's least significant byte and the highest numbered byte, the word's most significant.

For the detailed mapping of peripheral registers, refer to the related chapters.

The addressable memory space is divided into 8 main blocks, each of 512 MB.

All the memory areas that are not allocated to on-chip memories and peripherals are considered "Reserved"). Refer to the memory map figure in the product datasheet.

2.3 Memory map

See the datasheet corresponding to your device for a comprehensive diagram of the memory map. Table 1 gives the boundary addresses of the peripherals available in all STM32F20x and STM32F21x devices.

Table 1. STM32F20x and STM32F21x register boundary addresses

Boundary addressPeripheralBusRegister map
0xA000 0000 - 0xA000 0FFFFSMC control registerAHB3Section 31.6.9: FSMC register map on page 1299
0x5006 0800 - 0x5006 0BFFRNGAHB2Section 20.4.4: RNG register map on page 539
0x5006 0400 - 0x5006 07FFHASHSection 21.4.8: HASH register map on page 559
0x5006 0000 - 0x5006 03FFCRYPSection 19.6.11: CRYP register map on page 533
0x5005 0000 - 0x5005 03FFDCMISection 12.8.12: DCMI register map on page 296
0x5000 0000 - 0x5003 FFFFUSB OTG FSSection 29.16.6: OTG_FS register map on page 1025
0x4004 0000 - 0x4007 FFFFUSB OTG HSSection 30.12.6: OTG_HS register map on page 1171
0x4002 9000 - 0x4002 93FFETHERNET MACSection 28.8.5: Ethernet register maps on page 935
0x4002 8C00 - 0x4002 8FFF
0x4002 8800 - 0x4002 8BFF
0x4002 8400 - 0x4002 87FF
0x4002 8000 - 0x4002 83FF
0x4002 6400 - 0x4002 67FFDMA2Section 9.5.11: DMA register map on page 205
0x4002 6000 - 0x4002 63FFDMA1
0x4002 4000 - 0x4002 4FFFBKPSRAM
0x4002 3C00 - 0x4002 3FFFFlash interface registerAHB1See Flash programming manual
0x4002 3800 - 0x4002 3BFFRCCSection 5.3.24: RCC register map on page 132
0x4002 3000 - 0x4002 33FFCRCSection 3.4.4: CRC register map on page 63
0x4002 2000 - 0x4002 23FFGPIOISection 6.4.11: GPIO register map on page 154
0x4002 1C00 - 0x4002 1FFFGPIOH
0x4002 1800 - 0x4002 1BFFGPIOG
0x4002 1400 - 0x4002 17FFGPIOF
0x4002 1000 - 0x4002 13FFGPIOE
0x4002 0C00 - 0x4002 0FFFGPIO_D
0x4002 0800 - 0x4002 0BFFGPIOC
0x4002 0400 - 0x4002 07FFGPIOB
0x4002 0000 - 0x4002 03FFGPIOA

Table 1. STM32F20x and STM32F21x register boundary addresses (continued)

Boundary addressPeripheralBusRegister map
0x4001 4800 - 0x4001 4BFFTIM11APB2Section 15.5.12: TIM10/11/13/14 register map on page 473
0x4001 4400 - 0x4001 47FFTIM10
0x4001 4000 - 0x4001 43FFTIM9Section 15.4.14: TIM9/12 register map on page 463
0x4001 3C00 - 0x4001 3FFFEXTISection 8.3.7: EXTI register map on page 172
0x4001 3800 - 0x4001 3BFFSYSCFGSection 7.2.8: SYSCFG register map on page 160
0x4001 3000 - 0x4001 33FFSPI1Section 25.5.10: SPI register map on page 723
0x4001 2C00 - 0x4001 2FFFSDIOSection 26.9.16: SDIO register map on page 778
0x4001 2000 - 0x4001 23FFADC1 - ADC2 - ADC3Section 10.13.18: ADC register map on page 250
0x4001 1400 - 0x4001 17FFUSART6Section 24.6.8: USART register map on page 671
0x4001 1000 - 0x4001 13FFUSART1
0x4001 0400 - 0x4001 07FFTIM8Section 13.4.21: TIM1 and TIM8 register map on page 366
0x4001 0000 - 0x4001 03FFTIM1

Table 1. STM32F20x and STM32F21x register boundary addresses (continued)

Boundary addressPeripheralBusRegister map
0x4000 7400 - 0x4000 77FFDACAPB1Section 11.5.15: DAC register map on page 273
0x4000 7000 - 0x4000 73FFPWRSection 4.4.3: PWR register map on page 83
0x4000 6800 - 0x4000 6BFFCAN2Section 27.9.5: bxCAN register map on page 819
0x4000 6400 - 0x4000 67FFCAN1
0x4000 5C00 - 0x4000 5FFFI2C3Section 23.6.10: I2C register map on page 619
0x4000 5800 - 0x4000 5BFFI2C2
0x4000 5400 - 0x4000 57FFI2C1
0x4000 5000 - 0x4000 53FFUART5Section 24.6.8: USART register map on page 671
0x4000 4C00 - 0x4000 4FFFUART4
0x4000 4800 - 0x4000 4BFFUSART3
0x4000 4400 - 0x4000 47FFUSART2
0x4000 3C00 - 0x4000 3FFFSPI3 / I2S3Section 25.5.10: SPI register map on page 723
0x4000 3800 - 0x4000 3BFFSPI2 / I2S2
0x4000 3000 - 0x4000 33FFIWDGSection 17.4.5: IWDG register map on page 491
0x4000 2C00 - 0x4000 2FFFWWDGSection 18.6.4: WWDG register map on page 498
0x4000 2800 - 0x4000 2BFFRTC & BKP RegistersSection 22.6.15: RTC register map on page 587
0x4000 2000 - 0x4000 23FFTIM14Section 15.5.12: TIM10/11/13/14 register map on page 473
0x4000 1C00 - 0x4000 1FFFTIM13
0x4000 1800 - 0x4000 1BFFTIM12Section 15.4.14: TIM9/12 register map on page 463
0x4000 1400 - 0x4000 17FFTIM7Section 16.4.9: TIM6 and TIM7 register map on page 486
0x4000 1000 - 0x4000 13FFTIM6
0x4000 0C00 - 0x4000 0FFFTIM5Section 14.4.21: TIMx register map on page 426
0x4000 0800 - 0x4000 0BFFTIM4
0x4000 0400 - 0x4000 07FFTIM3
0x4000 0000 - 0x4000 03FFTIM2

2.3.1 Embedded SRAM

The STM32F20x and STM32F21x feature 4 Kbytes of backup SRAM (see Section 4.1.2: Battery backup domain ) plus 128 Kbytes of system SRAM.

The system SRAM can be accessed as bytes, half-words (16 bits) or full words (32 bits). The start address of the SRAM is 0x2000 0000. Read and write operations are performed at CPU speed with 0 wait state.

The system SRAM is split up into two blocks, of 112 KB and 16 KB, with a capability for concurrent access from by the AHB masters (like the Ethernet or the USB OTG HS): for

instance, the Ethernet MAC can read/write from/to the 16 KB SRAM while the CPU is reading/writing from/to the 112 KB SRAM.

The CPU can access the system SRAM through the System Bus or through the I-Code/D-Code buses when boot from SRAM is selected or when physical remap is selected ( SYSCFG memory remap register (SYSCFG_MEMRMP) in the SYSCFG controller). To get the max performance on SRAM execution, physical remap should be selected (boot or software selection).

2.3.2 Bit banding

The Cortex ® -M3 memory map includes two bit-band regions. These regions map each word in an alias region of memory to a bit in a bit-band region of memory. Writing to a word in the alias region has the same effect as a read-modify-write operation on the targeted bit in the bit-band region.

In the STM32F20x and STM32F21x both the peripheral registers and the SRAM are mapped to a bit-band region, so that single bit-band write and read operations are allowed. The operations are only available for Cortex ® -M3 accesses, and not from other bus masters (e.g. DMA).

A mapping formula shows how to reference each word in the alias region to a corresponding bit in the bit-band region. The mapping formula is:

\[ bit\_word\_addr = bit\_band\_base + (byte\_offset \times 32) + (bit\_number \times 4) \]

where:

Example

The following example shows how to map bit 2 of the byte located at SRAM address 0x20000300 to the alias region:

\[ 0x22006008 = 0x22000000 + (0x300*32) + (2*4) \]

Writing to address 0x22006008 has the same effect as a read-modify-write operation on bit 2 of the byte at SRAM address 0x20000300.

Reading address 0x22006008 returns the value (0x01 or 0x00) of bit 2 of the byte at SRAM address 0x20000300 (0x01: bit set; 0x00: bit reset).

For more information on bit-banding, refer to the Cortex ® -M3 programming manual (see Related documents on page 1 ).

2.3.3 Embedded Flash memory

The Flash memory has the following main features:

The Flash memory is organized as follows:

The OTP area contains 16 additional bytes used to lock the corresponding OTP data block.

Table 2. Flash module organization

BlockNameBlock base addressesSize
Main memorySector 00x0800 0000 - 0x0800 3FFF16 Kbyte
Sector 10x0800 4000 - 0x0800 7FFF16 Kbyte
Sector 20x0800 8000 - 0x0800 BFFF16 Kbyte
Sector 30x0800 C000 - 0x0800 FFFF16 Kbyte
Sector 40x0801 0000 - 0x0801 FFFF64 Kbyte
Sector 50x0802 0000 - 0x0803 FFFF128 Kbyte
Sector 60x0804 0000 - 0x0805 FFFF128 Kbyte
...
...
Sector 110x080E 0000 - 0x080F FFFF128 Kbyte
System memory0x1FFF 0000 - 0x1FFF 77FF30 Kbyte
OTP0x1FFF 7800 - 0x1FFF 7A0F528 bytes
Option bytes0x1FFF C000 - 0x1FFF C00F16 bytes

2.3.4 Flash memory read interface

Relation between CPU clock frequency and Flash memory read time

To correctly read data from Flash memory, the number of wait states (LATENCY) must be correctly programmed in the Flash access control register (FLASH_ACR) according to the frequency of the Cortex®-M3 clock and the supply voltage of the device. Table 3 shows the correspondence between wait states and core clock frequency.

Table 3. Number of wait states according to Cortex ® -M3 clock frequency
Wait states (WS)
(LATENCY)
HCLK - Cortex ® -M3 clock frequency (MHz)
Voltage range
2.7 to 3.6 V
Voltage range
2.4 to 2.7 V
Voltage range
2.1 to 2.4 V
Voltage range
1.8 (1) to 2.1 V
0 WS (1 CPU cycle)0 < HCLK ≤ 300 < HCLK ≤ 240 < HCLK ≤ 180 < HCLK ≤ 16
1 WS (2 CPU cycles)30 < HCLK ≤ 6024 < HCLK ≤ 4818 < HCLK ≤ 3616 < HCLK ≤ 32
2 WS (3 CPU cycles)60 < HCLK ≤ 9048 < HCLK ≤ 7236 < HCLK ≤ 5432 < HCLK ≤ 48
3 WS (4 CPU cycles)90 < HCLK ≤ 12072 < HCLK ≤ 9654 < HCLK ≤ 7248 < HCLK ≤ 64
4 WS (5 CPU cycles)96 < HCLK ≤ 12072 < HCLK ≤ 9064 < HCLK ≤ 80
5 WS (6 CPU cycles)90 < HCLK ≤ 10880 < HCLK ≤ 96
6 WS (7 CPU cycles)108 < HCLK ≤ 12096 < HCLK ≤ 112
7 WS (8 CPU cycles)112 < HCLK ≤ 120

1. If IRROFF is set to VDD on STM32F20xx devices, this value can be lowered to 1.65 V when the device operates in a reduced temperature range.

After reset, the CPU clock frequency is 16 MHz and 0 wait state (WS) is configured in the FLASH_ACR register.

It is highly recommended to use the following software sequences to tune the number of wait states needed to access the Flash memory with the CPU frequency.

Increasing the CPU frequency

Decreasing the CPU frequency

Note: A change in CPU clock configuration or wait state (WS) configuration may not be effective straight away. To make sure that the current CPU clock frequency is the one you have

configured, you can check the AHB prescaler factor and clock source status values. To make sure that the number of WS you have programmed is effective, you can read the FLASH_ACR register.

The FLASH_ACR register is used to enable/disable the acceleration features and control the Flash memory access time according to CPU frequency. The tables below provides the bit map and bit descriptions for this register.

For complete information on Flash memory operations and register configurations, refer to the STM32F20x and STM32F21x Flash programming manual (PM0059).

Flash access control register (FLASH_ACR)

The Flash access control register is used to enable/disable the acceleration features and control the Flash memory access time according to CPU frequency.

Address offset: 0x00

Reset value: 0x0000 0000

Access: no wait state, word, half-word and byte access

31302928272625242322212019181716
Reserved
1514131211109876543210
ReservedDCRSTICRSTDCENICENPRFTENReservedLATENCY
rwwrwrwrwrwrwrw

Bits 31:11 Reserved, must be kept cleared.

Bit 12 DCRST : Data cache reset

This bit can be written only when the D cache is disabled.

Bit 11 ICRST : Instruction cache reset

This bit can be written only when the I cache is disabled.

Bit 10 DCEN : Data cache enable

Bit 9 ICEN : Instruction cache enable

Bit 8 PRFTEN : Prefetch enable

0: Prefetch is disabled

1: Prefetch is enabled

Bits 7:3 Reserved, must be kept cleared.

Bits 2:0 LATENCY : Latency

These bits represent the ratio of the CPU clock period to the Flash memory access time.

000: Zero wait state

001: One wait state

010: Two wait states

011: Three wait states

100: Four wait states

101: Five wait states

110: Six wait states

111: Seven wait states

2.3.5 Adaptive real-time memory accelerator (ART Accelerator™)

The ART Accelerator™ is a memory accelerator which is optimized for STM32 industry-standard Arm® Cortex®-M3 processors. It balances the inherent performance advantage of the Arm® Cortex®-M3 over Flash memory technologies, which normally requires the processor to wait for the Flash memory at higher operating frequencies. Thanks to the ART Accelerator™, the CPU can operate up to 120 MHz without wait states, thereby increasing the overall system speed and efficiency.

To release the processor full 150 DMIPS performance at this frequency the accelerator implements an instruction prefetch queue and branch cache, which enables program execution from Flash memory at up to 120 MHz without wait states.

2.4 Boot configuration

Due to its fixed memory map, the code area starts from address 0x0000 0000 (accessed through the ICode/DCode buses) while the data area (SRAM) starts from address 0x2000 0000 (accessed through the system bus). The Cortex®-M3 CPU always fetches the reset vector on the ICode bus, which implies to have the boot space available only in the code area (typically, Flash memory). STM32F20x and STM32F21x microcontrollers implement a special mechanism to be able to boot from other memories (like the internal SRAM).

In the STM32F20x and STM32F21x, three different boot modes can be selected through the BOOT[1:0] pins as shown in Table 4 .

Table 4. Boot modes

Boot mode selection pinsBoot modeAliasings
BOOT1BOOT0
x0Main Flash memoryMain Flash memory is selected as the boot space
01System memorySystem memory is selected as the boot space
11Embedded SRAMEmbedded SRAM is selected as the boot space

The values on the BOOT pins are latched on the 4th rising edge of SYSCLK after a reset. It is up to the user to set the BOOT1 and BOOT0 pins after reset to select the required boot mode.

BOOT0 is a dedicated pin while BOOT1 is shared with a GPIO pin. Once BOOT1 has been sampled, the corresponding GPIO pin is free and can be used for other purposes.

The BOOT pins are also resampled when the device exits the Standby mode. Consequently, they must be kept in the required Boot mode configuration when the device is in the Standby mode. After this startup delay is over, the CPU fetches the top-of-stack value from address 0x0000 0000, then starts code execution from the boot memory starting from 0x0000 0004.

Note: When the device boots from SRAM, in the application initialization code, you have to relocate the vector table in SRAM using the NVIC exception table and the offset register.

Physical remap

Once the boot pins are selected, the application software can modify the memory accessible in the code area (in this way the code can be executed through the ICode bus in place of the System bus). This modification is performed by programming the Section 7.2.1: SYSCFG memory remap register (SYSCFG_MEMRMP) in the SYSCFG controller.

The following memories can thus be remapped:

Table 5. Memory mapping vs. Boot mode/physical remap

AddressesBoot/Remap in main Flash memoryBoot/Remap in embedded SRAMBoot/Remap in System memoryRemap in FSMC
0x2001 C000 - 0x2001 FFFFSRAM2 (16 KB)SRAM2 (16 KB)SRAM2 (16 KB)SRAM2 (16 KB)
0x2000 0000 - 0x2001 BFFFSRAM1 (112 KB)SRAM1 (112 KB)SRAM1 (112 KB)SRAM1 (112 KB)
0x1FFF 0000 - 0x1FFF 77FFSystem memorySystem memorySystem memorySystem memory
0x0810 0000 - 0x0FFF FFFFReservedReservedReservedReserved
0x0800 0000 - 0x080F FFFFFlash memoryFlash memoryFlash memoryFlash memory
0x0400 0000 - 0x07FF FFFFReservedReservedReservedFSMC Bank1
NOR/PSRAM 2
(Aliased)
0x0000 0000 - 0x03FF FFFF (1)(2)Flash (1 MB) AliasedSRAM1 (112 KB) AliasedSystem memory (30 KB) AliasedFSMC Bank1
NOR/PSRAM 1
(Aliased)
  1. 1. When the FSMC is remapped at address 0x0000 0000, only the first two regions of Bank 1 memory controller (Bank1 NOR/PSRAM 1 and NOR/PSRAM 2) can be remapped. In remap mode, the CPU can access the external memory via ICode bus instead of System bus which boosts up the performance. However, in remap mode, the FSMC addressing is fixed to the remap address area only (Bank1 NOR/PSRAM 1 and NOR/PSRAM 2) and FSMC control registers are not accessible. The FSMC remap function must be disabled to allows addressing other memory devices through the FSMC and/or to access FSMC control registers.
  2. 2. Even when aliased in the boot memory space, the related memory is still accessible at its original memory space.

Embedded bootloader

The embedded bootloader mode is used to reprogram the Flash memory using one of the following serial interfaces:

The USART peripherals operate at the internal 16 MHz oscillator (HSI) frequency, while the CAN and USB OTG FS require an external clock (HSE) multiple of 1 MHz (ranging from 4 to 26 MHz).

The embedded bootloader code is located in system memory. It is programmed by ST during production. For additional information, refer to application note AN2606.