33. Debug support (DBG)

33.1 Overview

The STM32F3xx devices are built around a Cortex-M4 ® F core, which contains hardware extensions for advanced debugging features. The debug extensions allow the core to be stopped either on a given instruction fetch (breakpoint) or data access (watchpoint). When stopped, the core's internal state and the system's external state may be examined. Once the examination is complete, the core and the system may be restored and program execution resumed.

The debug features are used by the debugger host when connecting to and debugging the STM32F3xx MCUs.

Two interfaces for debug are available:

Figure 406. Block diagram of STM32 MCU and Cortex-M4 ® F-level debug support

Block diagram of STM32 MCU and Cortex-M4®F-level debug support. The diagram shows the internal architecture of the MCU, including the Cortex-M4 Core, Bus matrix, Data, AHB-AP, Bridge, NVIC, DWT, FPB, ITM, ETM, TPIU, and DBGMCU. External connections include JTAG/SWD pins (JTAGS/SWDIO, JTDI, JTDO/TRACESWO, NJTRST, JTCK/SWCLK) and a Trace port (TRACESWO, TRACECK, TRACED[3:0]).

The diagram illustrates the internal architecture of the STM32 MCU debug support. It is divided into two main sections: 'STM32 MCU debug support' and 'Cortex-M4 debug support'. The 'Cortex-M4 debug support' section contains the 'Cortex-M4 Core', a 'Bus matrix', and 'Data'. The 'STM32 MCU debug support' section includes an 'SWJ-DP' (Serial Wire Debug) block connected to the 'AHB-AP' (AHB Access Port). The 'AHB-AP' is connected to the 'Bus matrix' and the 'Internal private peripheral bus (PPB)'. The 'PPB' contains a 'Bridge', 'NVIC', 'DWT', 'FPB', and 'ITM'. The 'Bridge' is connected to the 'External private peripheral bus (PPB)', which in turn connects to the 'ETM' (Embedded Trace Macrocell) and 'TPIU' (Trace Port Interface Unit). The 'TPIU' is connected to the 'Trace port' (TRACESWO, TRACECK, TRACED[3:0]). The 'DBGMCU' (Debug Microcontroller Unit) is also connected to the 'TPIU'. External connections include JTAGS/SWDIO, JTDI, JTDO/TRACESWO, NJTRST, and JTCK/SWCLK.

Block diagram of STM32 MCU and Cortex-M4®F-level debug support. The diagram shows the internal architecture of the MCU, including the Cortex-M4 Core, Bus matrix, Data, AHB-AP, Bridge, NVIC, DWT, FPB, ITM, ETM, TPIU, and DBGMCU. External connections include JTAG/SWD pins (JTAGS/SWDIO, JTDI, JTDO/TRACESWO, NJTRST, JTCK/SWCLK) and a Trace port (TRACESWO, TRACECK, TRACED[3:0]).

Note: The debug features embedded in the Cortex-M4 core are a subset of the Arm ® CoreSight Design Kit.

The Cortex-M4 ® F core provides integrated on-chip debug support. It is composed of:

It also includes debug features dedicated to the STM32F3xx:

Note: For further information on the debug feature supported by the Cortex-M4 ® F core, refer to the Cortex ® -M4 with FPU-r0p1 Technical Reference Manual and to the CoreSight Design Kit-r0p1 TRM (see Section 33.2: Reference Arm documentation ).

33.2 Reference Arm documentation

33.3 SWJ debug port (serial wire and JTAG)

The STM32F3xx core integrates the Serial Wire / JTAG Debug Port (SWJ-DP). It is an Arm standard CoreSight debug port that combines a JTAG-DP (5-pin) interface and an SW-DP (2-pin) interface.

In the SWJ-DP, the two JTAG pins of the SW-DP are multiplexed with some of the five JTAG pins of the JTAG-DP.

Figure 407. SWJ debug port

Figure 407. SWJ debug port block diagram showing internal logic for JTAG and SWD debug ports.

The diagram illustrates the internal architecture of the SWJ debug port. It features two main debug interfaces: the JTAG-Debug Port (JTAG-DP) and the Serial Wire Debug Port (SW-DP). The JTAG-DP includes pins for TDO, TDI, nTRST, TCK, and TMS. The SW-DP includes pins for DBGDI, DBGDO, DBGDOEN, and DBGCLK. A 'SWD/JTAG select' block is used to switch between the two modes. The asynchronous trace output (TRACESWO) is multiplexed with the JTAG TDO pin. The nPOTRST pin is connected to a power-on reset signal. The diagram also shows various internal signals and logic gates (AND, OR) that control the operation of the debug ports. The identifier 'ai17139' is present in the bottom right corner of the diagram.

Figure 407. SWJ debug port block diagram showing internal logic for JTAG and SWD debug ports.

Figure 407 shows that the asynchronous TRACE output (TRACESWO) is multiplexed with TDO. This means that the asynchronous trace can only be used with SW-DP, not JTAG-DP.

33.3.1 Mechanism to select the JTAG-DP or the SW-DP

By default, the JTAG-Debug Port is active.

If the debugger host wants to switch to the SW-DP, it must provide a dedicated JTAG sequence on TMS/TCK (respectively mapped to SWDIO and SWCLK) which disables the JTAG-DP and enables the SW-DP. This way it is possible to activate the SWDP using only the SWCLK and SWDIO pins.

This sequence is:

  1. 1. Send more than 50 TCK cycles with TMS (SWDIO) = 1
  2. 2. Send the 16-bit sequence on TMS (SWDIO) = 0111100111100111 (MSB transmitted first)
  3. 3. Send more than 50 TCK cycles with TMS (SWDIO) = 1

33.4 Pinout and debug port pins

The STM32F3xx MCUs are available in various packages with different numbers of available pins. As a result, some functionality (ETM) related to pin availability may differ between packages.

33.4.1 SWJ debug port pins

Five pins are used as outputs from the STM32F3xx for the SWJ-DP as alternate functions of general-purpose I/Os. These pins are available on all packages.

Table 189. SWJ debug port pins

SWJ-DP pin nameJTAG debug portSW debug portPin assignment
TypeDescriptionTypeDebug assignment
JTMS/SWDIOIJTAG Test Mode SelectionIOSerial Wire Data Input/OutputPA13
JTCK/SWCLKIJTAG Test ClockISerial Wire ClockPA14
JTDIIJTAG Test Data Input--PA15
JTDO/TRACEWOOJTAG Test Data Output-TRACEWO if async trace is enabledPB3
NJTRSTIJTAG Test nReset--PB4

33.4.2 Flexible SWJ-DP pin assignment

After RESET (SYSRESETn or PORESETn), all five pins used for the SWJ-DP are assigned as dedicated pins immediately usable by the debugger host (note that the trace outputs are not assigned except if explicitly programmed by the debugger host).

However, it is possible to disable some or all of the SWJ-DP ports and so, to release (in gray in the table below) the associated pins for general-purpose I/O(GPIO) usage. For more details on how to disable SWJ-DP port pins, please refer to Section 11.3.2: I/O pin alternate function multiplexer and mapping .

Table 190. Flexible SWJ-DP pin assignment

Available debug portsSWJ IO pin assigned
PA13 / JTMS/ SWDIOPA14 / JTCK/ SWCLKPA15 / JTDIPB3 / JTDOPB4/ NJTRST
Full SWJ (JTAG-DP + SW-DP) - Reset StateXXXXX
Full SWJ (JTAG-DP + SW-DP) but without NJTRSTXXXX
JTAG-DP Disabled and SW-DP EnabledXX
JTAG-DP Disabled and SW-DP DisabledReleased

33.4.3 Internal pull-up and pull-down on JTAG pins

It is necessary to ensure that the JTAG input pins are not floating since they are directly connected to flip-flops to control the debug mode features. Special care must be taken with the SWCLK/TCK pin, which is directly connected to the clock of some of these flip-flops.

To avoid any uncontrolled IO levels, the device embeds internal pull-ups and pull-downs on the JTAG input pins:

Once a JTAG IO is released by the user software, the GPIO controller takes control again. The reset states of the GPIO control registers put the I/Os in the equivalent state:

The software can then use these I/Os as standard GPIOs.

Note: The JTAG IEEE standard recommends adding pull-ups on TDI, TMS, and nTRST but there is no special recommendation for TCK. However, for JTCK, the device needs an integrated pull-down.

Having embedded pull-ups and pull-downs removes the need to add external resistors.

33.4.4 Using serial wire and releasing the unused debug pins as GPIOs

To use the serial wire DP to release some GPIOs, the user software must change the GPIO (PA15, PB3, and PB4) configuration mode in the GPIO_MODER PB4, which releases PA15, PB3 and PB4, which now become available as GPIOs.

When debugging, the host performs the following actions:

Note: For user software designs, note that:

To release the debug pins, remember that they are first configured either in input-pull-up (nTRST, TMS, TDI) or pull-down (TCK) or output tristate (TDO) for a certain duration after reset until the instant when the user software releases the pins.

When debug pins (JTAG or SW or TRACE) are mapped, changing the corresponding IO pin configuration in the IOPORT controller has no effect.

33.5 STM32F3xx JTAG TAP connection

The STM32F3xx MCUs integrate two serially connected JTAG TAPs, the boundary scan TAP (IR is 5-bit wide) and the Cortex-M4 ® F TAP (IR is 4-bit wide).

To access the TAP of the Cortex-M4 ® F for debug purposes:

  1. 1. First, it is necessary to shift the BYPASS instruction of the boundary scan TAP.
  2. 2. Then, for each IR shift, the scan chain contains 9 bits (=5+4) and the unused TAP instruction must be shifted in using the BYPASS instruction.
  3. 3. For each data shift, the unused TAP, which is in BYPASS mode, adds 1 extra data bit in the data scan chain.

Note: Important: Once Serial-Wire is selected using the dedicated Arm JTAG sequence, the boundary scan TAP is automatically disabled (JTMS forced high).

Figure 408. JTAG TAP connections

Figure 408. JTAG TAP connections diagram showing the internal architecture of an STM32 MCU. The diagram illustrates the connection between the STM32 MCU pins (NJTRST, JTMS, JTDI, JTDO) and two internal TAP components: a Boundary scan TAP and a Cortex-M4 TAP. The Boundary scan TAP has TMS, nTRST, TDI, and TDO pins, with an IR width of 5 bits. The Cortex-M4 TAP also has TMS, nTRST, TDI, and TDO pins, with an IR width of 4 bits. The TDI pins are connected in series, and the TDO pins are connected in series. The TMS pins are connected to the JTMS pin through an AND gate, which is also connected to the SW-DP Selected signal. The nTRST pins are connected to the NJTRST pin. The JTDI pin is connected to the TDI pin of the Boundary scan TAP, and the JTDO pin is connected to the TDO pin of the Cortex-M4 TAP.
Figure 408. JTAG TAP connections diagram showing the internal architecture of an STM32 MCU. The diagram illustrates the connection between the STM32 MCU pins (NJTRST, JTMS, JTDI, JTDO) and two internal TAP components: a Boundary scan TAP and a Cortex-M4 TAP. The Boundary scan TAP has TMS, nTRST, TDI, and TDO pins, with an IR width of 5 bits. The Cortex-M4 TAP also has TMS, nTRST, TDI, and TDO pins, with an IR width of 4 bits. The TDI pins are connected in series, and the TDO pins are connected in series. The TMS pins are connected to the JTMS pin through an AND gate, which is also connected to the SW-DP Selected signal. The nTRST pins are connected to the NJTRST pin. The JTDI pin is connected to the TDI pin of the Boundary scan TAP, and the JTDO pin is connected to the TDO pin of the Cortex-M4 TAP.

33.6 ID codes and locking mechanism

There are several ID codes inside the STM32F3xx MCUs. STMicroelectronics strongly recommends tools designers to lock their debuggers using the MCU DEVICE ID code located in the external PPB memory map at address 0xE0042000.

33.6.1 MCU device ID code

The STM32F3xx MCUs integrate an MCU ID code. This ID identifies the ST MCU part-number and the die revision. It is part of the DBG_MCU component and is mapped on the external PPB bus (see Section 33.16 on page 1113 ). This code is accessible using the JTAG debug port (4 to 5 pins) or the SW debug port (two pins) or by the user software. It is even accessible while the MCU is under system reset.

DBGMCU_IDCODE

Address: 0xE004 2000

Only 32-bits access supported. Read-only

31302928272625242322212019181716
REV_ID
rrrrrrrrrrrrrrrr
1514131211109876543210
DEV_ID
ResResResResrrrrrrrrrrrr

Bits 31:16 REV_ID[15:0] Revision identifier

This field indicates the revision of the device.

0x1001: Rev Z

0x1003: Rev Y

Bits 15:12 Reserved, must be kept at reset value.

Bits 11:0 DEV_ID[11:0] : Device identifier

This field indicates the device and its revision.

The device ID is:

0x422 for STM32F303xB/C and STM32F358 devices.

0x438 for STM32F303x6/8 and STM32F328 devices.

0x446 for STM32F303xD/E and STM32F398xE devices.

33.6.2 Boundary scan TAP

JTAG ID code

The TAP of the STM32F3xx BSC (boundary scan) integrates a JTAG ID code equal to 0x06432041.

33.6.3 Cortex-M4 ® F TAP

The TAP of the Cortex-M4 ® F integrates a JTAG ID code. This ID code is the Arm ® default one and has not been modified. This code is only accessible by the JTAG Debug Port.

This code is 0x4BA00477 (corresponds to Cortex-M4 ® F r0p1, see Section 33.2: Reference Arm documentation ).

Only the DEV_ID(11:0) should be used for identification by the debugger/programmer tools.

33.6.4 Cortex-M4 ® F JEDEC-106 ID code

The Cortex-M4 ® F integrates a JEDEC-106 ID code. It is located in the 4KB ROM table mapped on the internal PPB bus at address 0xE00FF000_0xE00FFFFF.

This code is accessible by the JTAG Debug Port (4 to 5 pins) or by the SW Debug Port (two pins) or by the user software.

33.7 JTAG debug port

A standard JTAG state machine is implemented with a 4-bit instruction register (IR) and five data registers (for full details, refer to the Cortex-M4 ® Fr0p1 Technical Reference Manual (TRM), for references, please see Section 33.2: Reference Arm documentation ).

Table 191. JTAG debug port data registers

IR(3:0)Data registerDetails
1111BYPASS
[1 bit]
1110IDCODE
[32 bits]
ID CODE
0x3BA00477 (Cortex-M4 ® F r0p1 ID Code)

Table 191. JTAG debug port data registers (continued)

IR(3:0)Data registerDetails
1010DPACC
[35 bits]

Debug port access register

This initiates a debug port and allows access to a debug port register.

  • – When transferring data IN:
    Bits 34:3 = DATA[31:0] = 32-bit data to transfer for a write request
    Bits 2:1 = A[3:2] = 2-bit address of a debug port register.
    Bit 0 = RnW = Read request (1) or write request (0).
  • – When transferring data OUT:
    Bits 34:3 = DATA[31:0] = 32-bit data which is read following a read request
    Bits 2:0 = ACK[2:0] = 3-bit Acknowledge:
    010 = OK/FAULT
    001 = WAIT
    OTHER = reserved

Refer to Table 192 for a description of the A(3:2) bits

1011APACC
[35 bits]

Access port access register

Initiates an access port and allows access to an access port register.

  • – When transferring data IN:
    Bits 34:3 = DATA[31:0] = 32-bit data to shift in for a write request
    Bits 2:1 = A[3:2] = 2-bit address (sub-address AP registers).
    Bit 0 = RnW= Read request (1) or write request (0).
  • – When transferring data OUT:
    Bits 34:3 = DATA[31:0] = 32-bit data which is read following a read request
    Bits 2:0 = ACK[2:0] = 3-bit Acknowledge:
    010 = OK/FAULT
    001 = WAIT
    OTHER = reserved

There are many AP Registers (see AHB-AP) addressed as the combination of:

  • – The shifted value A[3:2]
  • – The current value of the DP SELECT register
1000ABORT
[35 bits]

Abort register

  • – Bits 31:1 = Reserved
  • – Bit 0 = DAPABORT: write 1 to generate a DAP abort.

Table 192. 32-bit debug port registers addressed through the shifted value A[3:2]

AddressA(3:2) valueDescription
0x000Reserved, must be kept at reset value.
0x401

DP CTRL/STAT register. Used to:

  • – Request a system or debug power-up
  • – Configure the transfer operation for AP accesses
  • – Control the pushed compare and pushed verify operations.
  • – Read some status flags (overrun, power-up acknowledges)

Table 192. 32-bit debug port registers addressed through the shifted value A[3:2] (continued)

AddressA(3:2) valueDescription
0x810DP SELECT register: Used to select the current access port and the active 4-words register window.
  • – Bits 31:24: APSEL: select the current AP
  • – Bits 23:8: reserved
  • – Bits 7:4: APBANKSEL: select the active 4-words register window on the current AP
  • – Bits 3:0: reserved
0xC11DP RDBUFF register: Used to allow the debugger to get the final result after a sequence of operations (without requesting new JTAG-DP operation)

33.8 SW debug port

33.8.1 SW protocol introduction

This synchronous serial protocol uses two pins:

The protocol allows two banks of registers (DPACC registers and APACC registers) to be read and written to.

Bits are transferred LSB-first on the wire.

For SWDIO bidirectional management, the line must be pulled-up on the board (100 k \( \Omega \) recommended by Arm).

Each time the direction of SWDIO changes in the protocol, a turnaround time is inserted where the line is not driven by the host nor the target. By default, this turnaround time is one bit time, however this can be adjusted by configuring the SWCLK frequency.

33.8.2 SW protocol sequence

Each sequence consists of three phases:

  1. 1. Packet request (8 bits) transmitted by the host
  2. 2. Acknowledge response (3 bits) transmitted by the target
  3. 3. Data transfer phase (33 bits) transmitted by the host or the target

Table 193. Packet request (8-bits)

BitNameDescription
0StartMust be “1”
1APnDP0: DP Access
1: AP Access
2RnW0: Write Request
1: Read Request
Table 193. Packet request (8-bits) (continued)
BitNameDescription
4:3A(3:2)Address field of the DP or AP registers (refer to Table 192 )
5ParitySingle bit parity of preceding bits
6Stop0
7ParkNot driven by the host. Must be read as “1” by the target because of the pull-up

Refer to the Cortex-M4 ® F r0p1 TRM for a detailed description of DPACC and APACC registers.

The packet request is always followed by the turnaround time (default 1 bit) where neither the host nor target drive the line.

Table 194. ACK response (3 bits)
BitNameDescription
0..2ACK001: FAULT
010: WAIT
100: OK

The ACK Response must be followed by a turnaround time only if it is a READ transaction or if a WAIT or FAULT acknowledge has been received.

Table 195. DATA transfer (33 bits)
BitNameDescription
0..31WDATA or RDATAWrite or Read data
32ParitySingle parity of the 32 data bits

The DATA transfer must be followed by a turnaround time only if it is a READ transaction.

33.8.3 SW-DP state machine (reset, idle states, ID code)

The State Machine of the SW-DP has an internal ID code, which identifies the SW-DP. It follows the JEP-106 standard. This ID code is the default Arm ® one and is set to 0x1BA01477 (corresponding to Cortex-M4 ® F r0p1).

Note: Note that the SW-DP state machine is inactive until the target reads this ID code.

Further details of the SW-DP state machine can be found in the Cortex-M4 ® F r0p1 TRM and the CoreSight Design Kit r0p1 TRM .

33.8.4 DP and AP read/write accesses

33.8.5 SW-DP registers

Access to these registers is initiated when APnDP=0

Table 196. SW-DP registers

A(3:2)R/WCTRLSEL bit of SELECT registerRegisterNotes
00Read-IDCODEThe manufacturer code is not set to ST code 0x2BA01477 (identifies the SW-DP)
00Write-ABORT-
01Read/Write0DP-CTRL/STATPurpose is to:
– request a system or debug power-up
– configure the transfer operation for AP accesses
– control the pushed compare and pushed verify operations.
– read some status flags (overrun, power-up acknowledges)
01Read/Write1WIRE CONTROLPurpose is to configure the physical serial port protocol (like the duration of the turnaround time)
10Read-READ RESENDEnables recovery of the read data from a corrupted debugger transfer, without repeating the original AP transfer.

Table 196. SW-DP registers (continued)

A(3:2)R/WCTRLSEL bit of SELECT registerRegisterNotes
10Write-SELECTThe purpose is to select the current access port and the active 4-words register window
11Read/Write-READ BUFFERThis read buffer is useful because AP accesses are posted (the result of a read AP request is available on the next AP transaction).
This read buffer captures data from the AP, presented as the result of a previous read, without initiating a new transaction

33.8.6 SW-AP registers

Access to these registers is initiated when APnDP=1

There are many AP Registers (see AHB-AP) addressed as the combination of:

33.9 AHB-AP (AHB access port) - valid for both JTAG-DP and SW-DP

Features:

The address of the 32-bits AHP-AP resisters are 6-bits wide (up to 64 words or 256 bytes) and consists of:

  1. Bits [7:4] = the bits [7:4] APBANKSEL of the DP SELECT register
  2. Bits [3:2] = the 2 address bits of A(3:2) of the 35-bit packet request for SW-DP.

The AHB-AP of the Cortex-M4 ® F includes 9 x 32-bits registers:

Table 197. Cortex-M4 ® F AHB-AP registers
Address offsetRegister nameNotes
0x00AHB-AP Control and Status WordConfigures and controls transfers through the AHB interface (size, hprot, status on current transfer, address increment type)
0x04AHB-AP Transfer Address-
0x0CAHB-AP Data Read/Write-
0x10AHB-AP Banked Data 0Directly maps the 4 aligned data words without rewriting the Transfer Address Register.
0x14AHB-AP Banked Data 1
0x18AHB-AP Banked Data 2
0x1CAHB-AP Banked Data 3
0xF8AHB-AP Debug ROM AddressBase Address of the debug interface
0xFCAHB-AP ID Register-

Refer to the Cortex-M4 ® F r0p1 TRM for further details.

33.10 Core debug

Core debug is accessed through the core debug registers. Debug access to these registers is by means of the Advanced High-performance Bus (AHB-AP) port. The processor can access these registers directly over the internal Private Peripheral Bus (PPB).

It consists of 4 registers:

Table 198. Core debug registers

RegisterDescription
DHCSRThe 32-bit Debug Halting Control and Status Register
This provides status information about the state of the processor enable core debug halt and step the processor
DCRSRThe 17-bit Debug Core Register Selector Register:
This selects the processor register to transfer data to or from.
DCRDRThe 32-bit Debug Core Register Data Register:
This holds data for reading and writing registers to and from the processor selected by the DCRSR (Selector) register.
DEMCRThe 32-bit Debug Exception and Monitor Control Register:
This provides Vector Catching and Debug Monitor Control. This register contains a bit named TRCENA which enable the use of a TRACE.

Note: Important: these registers are not reset by a system reset. They are only reset by a power-on reset.

Refer to the Cortex-M4 ® F r0p1 TRM for further details.

To Halt on reset, it is necessary to:

33.11 Capability of the debugger host to connect under system reset

The STM32F3xx MCUs' reset system comprises the following reset sources:

The Cortex-M4 ® F differentiates the reset of the debug part (generally PORRESETn) and the other one (SYSRESETn)

This way, it is possible for the debugger to connect under System Reset, programming the Core Debug Registers to halt the core when fetching the reset vector. Then the host can release the system reset and the core immediately halt without having executed any instructions. In addition, it is possible to program any debug features under System Reset.

Note: It is highly recommended for the debugger host to connect (set a breakpoint in the reset vector) under system reset.

33.12 FPB (Flash patch breakpoint)

The FPB unit:

The use of a Software Patch or a Hardware Breakpoint is exclusive.

The FPB consists of:

33.13 DWT (data watchpoint trigger)

The DWT unit consists of four comparators. They are configurable as:

The DWT also provides some means to give some profiling information. For this, some counters are accessible to give the number of:

33.14 ITM (instrumentation trace macrocell)

33.14.1 General description

The ITM is an application-driven trace source that supports printf style debugging to trace Operating System (OS) and application events, and emits diagnostic system information. The ITM emits trace information as packets, which can be generated as:

The packets emitted by the ITM are output to the TPIU (Trace Port Interface Unit). The formatter of the TPIU adds some extra packets (refer to TPIU) and then output the complete packets sequence to the debugger host.

The bit TRCEN of the Debug Exception and Monitor Control Register must be enabled before programming or using the ITM.

33.14.2 Time stamp packets, synchronization, and overflow packets

Time stamp packets encode time stamp information, generic control, and synchronization. It uses a 21-bit timestamp counter (with possible prescalers) which is reset at each time stamp packet emission. This counter can be either clocked by the CPU clock or the SWV clock.

A synchronization packet consists of 6 bytes equal to 0x80_00_00_00_00_00, emitted to the TPIU as 00 00 00 00 00 80 (LSB emitted first).

A synchronization packet is a timestamp packet control. It is emitted at each DWT trigger.

For this, the DWT must be configured to trigger the ITM: the bit CYCCNTENA (bit0) of the DWT Control Register must be set. In addition, the bit2 (SYNCENA) of the ITM Trace Control Register must be set.

Note: If the SYNENA bit is not set, the DWT generates synchronization triggers to the TPIU, which sends only TPIU synchronization packets and not ITM synchronization packets.

An overflow packet consists in a special timestamp packet, which indicates that data was written, but the FIFO was full.

Table 199. Main ITM registers

AddressRegisterDetails
@E0000FB0ITM lock accessWrite 0xC5ACCE55 to unlock Write Access to the other ITM registers
@E0000E80ITM trace controlBits 31-24 = Always 0
Bits 23 = Busy
Bits 22-16 = 7-bits ATB ID which identifies the source of the trace data.
Bits 15-10 = Always 0
Bits 9:8 = TSPrescale = Time Stamp Prescaler
Bits 7-5 = Reserved
Bit 4 = SWOENA = Enable SWV behavior (to clock the timestamp counter by the SWV clock).
Bit 3 = DWTENA: Enable the DWT Stimulus
Bit 2 = SYNCENA: this bit must be to 1 to enable the DWT to generate synchronization triggers so that the TPIU can then emit the synchronization packets.
Bit 1 = TSENA (Timestamp Enable)
@E0000E40ITM trace privilegeBit 0 = ITMENA: Global Enable Bit of the ITM
Bit 3: mask to enable tracing ports31:24
Bit 2: mask to enable tracing ports23:16
Bit 1: mask to enable tracing ports15:8
@E0000E00ITM trace enableBit 0: mask to enable tracing ports7:0
Each bit enables the corresponding Stimulus port to generate trace.
@E0000000-E000007CStimulus port registers 0-31Write the 32-bits data on the selected Stimulus Port (32 available) to be traced out.

Example of configuration

To output a simple value to the TPIU:

33.15 ETM (Embedded trace macrocell)

ETM is available on STM32F303xB/C/D/E, STM32F358xC and STM32F398xE devices only.

33.15.1 General description

The ETM enables the reconstruction of program execution. Data are traced using the Data Watchpoint and Trace (DWT) component or the Instruction Trace Macrocell (ITM) whereas instructions are traced using the Embedded Trace Macrocell (ETM).

The ETM transmits information as packets and is triggered by embedded resources. These resources must be programmed independently and the trigger source is selected using the Trigger Event Register (0xE0041008). An event could be a simple event (address match from an address comparator) or a logic equation between 2 events. The trigger source is one of the fourth comparators of the DWT module. The following events can be monitored:

For more information on the trigger resources, refer to Section 33.13: DWT (data watchpoint trigger) .

The packets transmitted by the ETM are output to the TPIU (Trace Port Interface Unit). The formatter of the TPIU adds some extra packets (refer to Section 33.17: TPIU (trace port interface unit) ) and then outputs the complete packet sequence to the debugger host.

33.15.2 Signal protocol, packet types

This part is described in the chapter 7 ETMV3 Signal Protocol of the Arm® IHI 0014N document.

33.15.3 Main ETM registers

For more information on registers, refer to the chapter 3 of the Arm® IHI 0014N specification.

Table 200. Main ETM registers

AddressRegisterDetails
0xE0041FB0ETM Lock AccessWrite 0xC5ACCE55 to unlock the write access to the other ETM registers.
0xE0041000ETM ControlThis register controls the general operation of the ETM, for instance how tracing is enabled.
0xE0041010ETM StatusThis register provides information about the current status of the trace and trigger logic.
0xE0041008ETM Trigger EventThis register defines the event that will control trigger.
0xE004101CETM Trace Enable ControlThis register defines which comparator is selected.
0xE0041020ETM Trace Enable EventThis register defines the trace enabling event.
0xE0041024ETM Trace Start/StopThis register defines the traces used by the trigger source to start and stop the trace, respectively.

33.15.4 Configuration example

To output a simple value to the TPIU:

33.16 MCU debug component (DBGMCU)

The MCU debug component helps the debugger provide support for:

33.16.1 Debug support for low-power modes

To enter low-power mode, the instruction WFI or WFE must be executed.

The MCU implements several low-power modes, which can either deactivate the CPU clock or reduce the power of the CPU.

The core does not allow FCLK or HCLK to be turned off during a debug session. As these are required for the debugger connection, during a debug, they must remain active. The MCU integrates special means to allow the user to debug software in low-power modes.

For this, the debugger host must first set some debug configuration registers to change the low-power mode behavior:

33.16.2 Debug support for timers, watchdog, bxCAN, and I 2 C

During a breakpoint, it is necessary to choose how the counter of timers and watchdog should behave:

For the bxCAN, the user can choose to block the update of the receive register during a breakpoint.

For the I 2 C, the user can choose to block the SMBUS timeout during a breakpoint.

33.16.3 Debug MCU configuration register

This register allows the configuration of the MCU under DEBUG. This concerns:

This DBGMCU_CR is mapped on the external PPB bus at address 0xE0042004.

It is asynchronously reset by the PORESET (and not the system reset). It can be written by the debugger under system reset.

If the debugger host does not support these features, it is still possible for the user software to write to these registers.

DBGMCU_CR

Address: 0xE004 2004

Only 32-bit access supported

POR Reset: 0x0000 0000 (not reset by system reset)

31302928272625242322212019181716
ResResResResResResResResResResResResResResResRes
1514131211109876543210
ResResResResResResResResTRACE_MODE [1:0]TRACE_IOENRes.ResDBG_STANDBYDBG_STOPDBG_SLEEP
rwrwrwrwrwrw

Bits 31:8 Reserved, must be kept at reset value.

Bits 7:5 TRACE_MODE[1:0] and TRACE_IOEN : Trace pin assignment control

Note: In STM32F303x6/8 and STM32F328x8 devices, synchronous trace is not available, thus bits 7:5 are reserved and must be kept at 0.

Bits 4:3 Reserved, must be kept at reset value.

Bit 2 DBG_STANDBY: Debug Standby mode

0: (FCLK=Off, HCLK=Off) The whole digital part is unpowered.

From software point of view, exiting from Standby is identical than fetching reset vector (except a few status bit indicated that the MCU is resuming from Standby)

1: (FCLK=On, HCLK=On) In this case, the digital part is not unpowered and FCLK and HCLK are provided by the internal RC oscillator which remains active. In addition, the MCU generate a system reset during Standby mode so that exiting from Standby is identical than fetching from reset

Bit 1 DBG_STOP: Debug Stop mode

0: (FCLK=Off, HCLK=Off) In STOP mode, the clock controller disables all clocks (including HCLK and FCLK). When exiting from STOP mode, the clock configuration is identical to the one after RESET (CPU clocked by the 8 MHz internal RC oscillator (HSI)). Consequently, the software must reprogram the clock controller to enable the PLL, the Xtal, etc.

1: (FCLK=On, HCLK=On) In this case, when entering STOP mode, FCLK and HCLK are provided by the internal RC oscillator which remains active in STOP mode. When exiting STOP mode, the software must reprogram the clock controller to enable the PLL, the Xtal, etc. (in the same way it would do in case of DBG_STOP=0)

Bit 0 DBG_SLEEP: Debug Sleep mode

0: (FCLK=On, HCLK=Off) In Sleep mode, FCLK is clocked by the system clock as previously configured by the software while HCLK is disabled.

In Sleep mode, the clock controller configuration is not reset and remains in the previously programmed state. Consequently, when exiting from Sleep mode, the software does not need to reconfigure the clock controller.

1: (FCLK=On, HCLK=On) In this case, when entering Sleep mode, HCLK is fed by the same clock that is provided to FCLK (system clock as previously configured by the software).

33.16.4 Debug MCU APB1 freeze register (DBGMCU_APB1_FZ)

The DBGMCU_APB1_FZ register is used to configure the MCU under DEBUG. It concerns the APB1 peripherals:

This DBGMCU_APB1_FZ is mapped on the external PPB bus at address 0xE0042008.

The register is asynchronously reset by the POR (and not the system reset). It can be written by the debugger under system reset.

Address: 0xE004 2008

Only 32-bit access is supported.

Power on reset (POR): 0x0000 0000 (not reset by system reset)

31302928272625242322212019181716
ResDBG_I2C3_SMBUS_TIMEOUTResResResResDBG_CAN_STOPResResDBG_I2C2_SMBUS_TIMEOUT (1)DBG_I2C1_SMBUS_TIMEOUTResResResResRes
rwrwrw

1514131211109876543210
ResResResDBG_IWDG_STOPDBG_WWDG_STOPDBG_RTC_STOPResResResResDBG_TIM7_STOPDBG_TIM6_STOPResDBG_TIM4_STOP (1)DBG_TIM3_STOPDBG_TIM2_STOP
rwrwrwrwrwrwrwrw

1. Only in STM32F303xB/C/D/E, STM32F358xC and STM32F398xE devices.

Bits 31 Reserved, must be kept at reset value.

Bit 30 DBG_I2C3_SMBUS_TIMEOUT : SMBUS timeout mode stopped when core is halted

0: Same behavior as in normal mode

1: The SMBUS timeout is frozen

Bit 25 DBG_CAN_STOP : Debug CAN stopped when core is halted

0: Same behavior as in normal mode

1: The CAN2 receive registers are frozen

Bits 24:23 Reserved, must be kept at reset value.

Bit 22 DBG_I2C2_SMBUS_TIMEOUT : SMBUS timeout mode stopped when core is halted (Available in STM32F303xB/C and STM32F358xC devices only)

0: Same behavior as in normal mode

1: The SMBUS timeout is frozen

Bit 21 DBG_I2C1_SMBUS_TIMEOUT : SMBUS timeout mode stopped when core is halted

0: Same behavior as in normal mode

1: The SMBUS timeout is frozen

Bits 20:13 Reserved, must be kept at reset value.

Bit 12 DBG_IWDG_STOP : Debug independent watchdog stopped when core is halted

0: The independent watchdog counter clock continues even if the core is halted

1: The independent watchdog counter clock is stopped when the core is halted

Bit 11 DBG_WWDG_STOP : Debug window watchdog stopped when core is halted

0: The window watchdog counter clock continues even if the core is halted

1: The window watchdog counter clock is stopped when the core is halted

Bit 10 DBG_RTC_STOP : Debug RTC stopped when core is halted

0: The clock of the RTC counter is fed even if the core is halted

1: The clock of the RTC counter is stopped when the core is halted

Bits 9:6 Reserved, must be kept at reset value.

Bit 5 DBG_TIM7_STOP : TIM7 counter stopped when core is halted

0: The counter clock of TIM7 is fed even if the core is halted

1: The counter clock of TIM7 is stopped when the core is halted

Bit 4 DBG_TIM6_STOP : TIM6 counter stopped when core is halted

0: The counter clock of TIM6 is fed even if the core is halted

1: The counter clock of TIM6 is stopped when the core is halted

Bit 3 Reserved, must be kept at reset value.

Bit 2 DBG_TIM4_STOP : TIM4 counter stopped when core is halted (Available on STM32F303xB/C and STM32F358xC devices only)

0: The counter clock of TIM4 is fed even if the core is halted

1: The counter clock of TIM4 is stopped when the core is halted

Bit 1 DBG_TIM3_STOP : TIM3 counter stopped when core is halted

0: The counter clock of TIM3 is fed even if the core is halted

1: The counter clock of TIM3 is stopped when the core is halted

Bit 0 DBG_TIM2_STOP : TIM2 counter stopped when core is halted

0: The counter clock of TIM2 is fed even if the core is halted

1: The counter clock of TIM2 is stopped when the core is halted

33.16.5 Debug MCU APB2 freeze register (DBGMCU_APB2_FZ)

The DBGMCU_APB2_FZ register is used to configure the MCU under DEBUG. It concerns APB2 peripherals:

This register is mapped on the external PPB bus at address 0xE004 200C

It is asynchronously reset by the POR (and not the system reset). It can be written by the debugger under system reset.

Address: 0xE004 200C

Only 32-bit access is supported.

POR: 0x0000 0000 (not reset by system reset)

31302928272625242322212019181716
ResResResResResResResResResResResResResResResRes
1514131211109876543210
ResResResResResResResResResResDBG_TIM20_STOP (1)DBG_TIM17_STOPDBG_TIM16_STOPDBG_TIM15_STOPDBG_TIM8_STOP (2)DBG_TIM1_STOP
r/wr/wr/wr/wr/w

1. Available only in STM32F303xD/E and STM32F398xE.

2. Only in STM32F303xB/C/D/E, STM32F358xC and STM32F398xE devices.

Bits 31:6 Reserved, must be kept at reset value.

Bits 5:0 DBG_TIMx_STOP : TIMx counter stopped when core is halted (x=1, 8,15..17)

0: The clock of the involved timer counter is fed even if the core is halted

1: The clock of the involved timer counter is stopped when the core is halted

Note: Bit1 and Bit 5 are reserved in STM32F303x6/8 and STM32F328x8.

33.17 TPIU (trace port interface unit)

33.17.1 Introduction

The TPIU acts as a bridge between the on-chip trace data from the ITM and the ETM.

The output data stream encapsulates the trace source ID, that is then captured by a trace port analyzer (TPA).

The core embeds a simple TPIU, especially designed for low-cost debug (consisting of a special version of the CoreSight™ TPIU).

Figure 409. TPIU block diagram

Figure 409. TPIU block diagram. The diagram shows the internal architecture of the TPIU. It is divided into two clock domains: CLK domain and TRACECLKIN domain. In the CLK domain, there are two input blocks, ETM and ITM, which feed into two separate Asynchronous FIFOs. These FIFOs then feed into a central 'TPIU formatter' block. The TPIU formatter is connected to a 'Trace out (serializer)' block. The Trace out block has four output pins: TRACECLKIN, TRACECK, TRACEDATA [3:0], and TRACESWO. An 'External PPB bus' is shown at the bottom, with arrows pointing to the Asynchronous FIFOs and the Trace out block. The entire TPIU block is labeled 'ai17114' at the bottom right.
Figure 409. TPIU block diagram. The diagram shows the internal architecture of the TPIU. It is divided into two clock domains: CLK domain and TRACECLKIN domain. In the CLK domain, there are two input blocks, ETM and ITM, which feed into two separate Asynchronous FIFOs. These FIFOs then feed into a central 'TPIU formatter' block. The TPIU formatter is connected to a 'Trace out (serializer)' block. The Trace out block has four output pins: TRACECLKIN, TRACECK, TRACEDATA [3:0], and TRACESWO. An 'External PPB bus' is shown at the bottom, with arrows pointing to the Asynchronous FIFOs and the Trace out block. The entire TPIU block is labeled 'ai17114' at the bottom right.

33.17.2 TRACE pin assignment

The asynchronous mode requires 1 extra pin and is available on all packages. It is only available if using Serial Wire mode (not in JTAG mode).

Table 201. Asynchronous TRACE pin assignment

TPUI pin nameTrace synchronous modeSTM32F3xx pin assignment
TypeDescription
TRACESWOOTRACE Async Data OutputPB3

The synchronous mode requires from 2 to 6 extra pins depending on the data trace size and is only available in the larger packages. In addition it is available in JTAG mode and in Serial Wire mode and provides better bandwidth output capabilities than asynchronous trace.

Table 202. Synchronous TRACE pin assignment

TPUI pin nameTrace synchronous modeSTM32F3xx pin assignment
TypeDescription
TRACECKOTRACE ClockPE2
TRACED[3:0]OTRACE Sync Data Outputs
Can be 1, 2 or 4.
PE[6:3]

TPUI TRACE pin assignment

By default, these pins are NOT assigned. They can be assigned by setting the TRACE_IOEN and TRACE_MODE bits in the MCU Debug component configuration register . This configuration has to be done by the debugger host.

In addition, the number of pins to assign depends on the trace configuration (asynchronous or synchronous).

To assign the TRACE pin, the debugger host must program the bits TRACE_IOEN and TRACE_MODE[1:0] of the Debug MCU configuration Register (DBGMCU_CR). By default the TRACE pins are not assigned.

This register is mapped on the external PPB and is reset by the PORESET (and not by the SYSTEM reset). It can be written by the debugger under SYSTEM reset.

Table 203. Flexible TRACE pin assignment

DBGMCU_CR registerPins assigned for:TRACE IO pin assigned
TRACE_IOENTRACE_MODE [1:0]PB3 / JTDO/ TRACESWOPE2 / TRACECKPE3 / TRACED[0]PE4 / TRACED[1]PE5 / TRACED[2]PE6 / TRACED[3]
0XXNo Trace (default state)Released (1)-
100Asynchronous TraceTRACESWO--Released (usable as GPIO)

Table 203. Flexible TRACE pin assignment (continued)

DBGMCU_CR registerPins assigned for:TRACE IO pin assigned
TRACE_IOENTRACE_MODE [1:0]PB3 / JTDO/ TRACESWOPE2 / TRACECKPE3 / TRACED[0]PE4 / TRACED[1]PE5 / TRACED[2]PE6 / TRACED[3]
101Synchronous Trace 1 bitTRACECKTRACED[0]---
110Synchronous Trace 2 bitReleased (1)TRACECKTRACED[0]TRACED[1]--
111Synchronous Trace 4 bitTRACECKTRACED[0]TRACED[1]TRACED[2]TRACED[3]

1. When Serial Wire mode is used, it is released. But when JTAG is used, it is assigned to JTDO.

Note: By default, the TRACECLKIN input clock of the TPIU is tied to GND. It is assigned to HCLK two clock cycles after the bit TRACE_IOEN has been set.

The debugger must then program the Trace Mode by writing the PROTOCOL[1:0] bits in the SPP_R (Selected Pin Protocol) register of the TPIU.

It then also configures the TRACE port size by writing the bits [3:0] in the CPSPS_R (Current Sync Port Size Register) of the TPIU:

33.17.3 TPIU formatter

The formatter protocol outputs data in 16-byte frames:

Note: Refer to the Arm ® CoreSight Architecture Specification v1.0 (Arm ® IHI 0029B) for further information

33.17.4 TPUI frame synchronization packets

The TPUI can generate two types of synchronization packets:

It is output periodically between frames.

In continuous mode, the TPA must discard all these frames once a synchronization frame has been found.

These packets are only generated in continuous mode and enable the TPA to detect that the TRACE port is in IDLE mode (no TRACE to be captured). When detected by the TPA, it must be discarded.

33.17.5 Transmission of the synchronization frame packet

There is no Synchronization Counter register implemented in the TPIU of the core. Consequently, the synchronization trigger can only be generated by the DWT . Refer to the registers DWT Control Register (bits SYNCTAP[11:10]) and the DWT Current PC Sampler Cycle Count Register.

The TPUI Frame synchronization packet (0x7F_FF_FF_FF) is emitted:

33.17.6 Synchronous mode

The trace data output size can be configured to 4, 2 or 1 pin: TRACED(3:0)

The output clock is output to the debugger (TRACECK)

Here, TRACECLKIN is driven internally and is connected to HCLK only when TRACE is used.

Note: In this synchronous mode, it is not required to provide a stable clock frequency.

The TRACE I/Os (including TRACECK) are driven by the rising edge of TRACLKIN (equal to HCLK). Consequently, the output frequency of TRACECK is equal to HCLK/2.

33.17.7 Asynchronous mode

This is a low-cost alternative to output the trace using only 1 pin: this is the asynchronous output pin TRACESWO. Obviously there is a limited bandwidth.

TRACESWO is multiplexed with JTDO when using the SW-DP pin. This way, this functionality is available in all STM32F3xx packages.

This asynchronous mode requires a constant frequency for TRACECLKIN. For the standard UART (NRZ) capture mechanism, 5% accuracy is needed. The Manchester encoded version is tolerant up to 10%.

33.17.8 TRACECLKIN connection inside the STM32F3xx

In the STM32F3xx, this TRACECLKIN input is internally connected to HCLK. This means that when in asynchronous trace mode, the application is restricted to use to time frames where the CPU frequency is stable.

Note:

Important: when using asynchronous trace: it is important to be aware that:

The default clock of the STM32F3xx MCUs is the internal RC oscillator. Its frequency under reset is different from the one after reset release. This is because the RC calibration is the default one under system reset and is updated at each system reset release.

Consequently, the trace port analyzer (TPA) should not enable the trace (with the TRACE_IOEN bit) under system reset, because a Synchronization Frame Packet will be issued with a different bit time than trace packets, which will be transmitted after reset release.

33.17.9 TPIU registers

The TPIU APB registers can be read and written only if the bit TRCENA of the Debug Exception and Monitor Control Register (DEMCR) is set. Otherwise, the registers are read as zero (the output of this bit enables the PCLK of the TPIU).

Table 204. Important TPIU registers

AddressRegisterDescription
0xE0040004Current port sizeAllows the trace port size to be selected:
Bit 0: Port size = 1
Bit 1: Port size = 2
Bit 2: Port size = 3, not supported
Bit 3: Port Size = 4
Only 1 bit must be set. By default, the port size is one bit. (0x00000001)
0xE00400F0Selected pin protocolAllows the Trace Port Protocol to be selected:
Bit1:0=
00: Sync Trace Port Mode
01: Serial Wire Output - manchester (default value)
10: Serial Wire Output - NRZ
11: reserved
0xE0040304Formatter and flush controlBit 31-9 = always '0
Bit 8 = TrigIn = always '1 to indicate that triggers are indicated
Bit 7-4 = always 0
Bit 3-2 = always 0
Bit 1 = EnFCont. In Sync Trace mode (Select_Pin_Protocol register bit1:0=00), this bit is forced to '1: the formatter is automatically enabled in continuous mode. In asynchronous mode (Select_Pin_Protocol register bit1:0 <> 00), this bit can be written to activate or not the formatter.
Bit 0 = always 0
The resulting default value is 0x102
Note: In synchronous mode, because the TRACECTL pin is not mapped outside the chip, the formatter is always enabled in continuous mode -this way the formatter inserts some control packets to identify the source of the trace packets).
0xE0040300Formatter and flush statusNot used in Cortex-M4 ® F, always read as 0x00000008

33.17.10 Example of configuration

33.18 DBG register map

Table 205. DBG register map and reset values

Addr.Register313029282726252423222120191817161514131211109876543210
0xE0042000DBGMCU_IDCODEREV_IDResResResResDEV_ID
Reset value (1)XXXXXXXXXXXXXXXXXXXXXXXXXXXX
0xE0042004DBGMCU_CRResResResResResResResResResResResResResResResResResResResResResResResResResTRACE_MODE[1:0]TRACE_IOENResResDBG_STANDBYDBG_STOPDBG_SLEEP
Reset value000000
0xE0042008DBGMCU_APB1_FZResDBG_I2C3_SMBUS_TIMEOUTResResResResDBG_CAN_STOPResResDBG_I2C2_SMBUS_TIMEOUTDBG_I2C1_SMBUS_TIMEOUTResResResResResResResResResDBG_IWDG_STOPDBG_WWDG_STOPDBG_RTC_STOPResResResDBG_TIM20_STOPDBG_TIM7_STOPDBG_TIM6_STOPResDBG_TIM4_STOPDBG_TIM3_STOP
Reset value000000000000
0xE004200CDBGMCU_APB2_FZResResResResResResResResResResResResResResResResResResResResResResResResResResResDBG_TIM17_STOPDBG_TIM16_STOPDBG_TIM15_STOPDBG_TIM8_STOPDBG_TIM1_STOP
Reset value00000

1. The reset value is product dependent. For more information, refer to Section 33.6.1: MCU device ID code .