How a PX4 boots up (bootloader, linker script, initialization)

junwoo0914 · July 31, 2023, 12:36am

About

After going through some hard-faults, flashing bootloader 100s of times, learning how board nuttx configurations are organized, and tweaking the vendor / product / board ID for the flight controller, I was left feeling unsatisfied with not being sure on how PX4 actually starts up & bootloader starts. So I decided to create a little guide on the information I gathered throughout the research.

Note: Still Work In Progress!

Basics of STM32 startup process

The basics of how a STM32 starts, when the power is supplied to the MCU is quite well explained here: Bare-Metal STM32: Exploring Memory-Mapped I/O And Linker Scripts | Hackaday

Also, the book on understanding STM32 helps a lot, especially the “STM32 Memory Model and Boot Sequence” section in Chapter 3: https://legacy.cs.indiana.edu/~geobrown/book.pdf

So the important bit to understand is that each board (with specific processor) has it’s basic memory layout defined in the linker script “script.ld”. And further down, the specific data that goes into each section (FLASH, RAM, SRAM, etc) are defined in detail.

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/nuttx-config/scripts/script.ld#L110-L123


      
          MEMORY
          {
          	ITCM_RAM  (rwx) : ORIGIN = 0x00000000, LENGTH =   64K
          	FLASH      (rx) : ORIGIN = 0x08020000, LENGTH = 1920K
          
          	DTCM1_RAM (rwx) : ORIGIN = 0x20000000, LENGTH =   64K
          	DTCM2_RAM (rwx) : ORIGIN = 0x20010000, LENGTH =   64K
          	AXI_SRAM  (rwx) : ORIGIN = 0x24000000, LENGTH =  512K /* D1 domain AXI bus */
          	SRAM1     (rwx) : ORIGIN = 0x30000000, LENGTH =  128K /* D2 domain AHB bus */
          	SRAM2     (rwx) : ORIGIN = 0x30020000, LENGTH =  128K /* D2 domain AHB bus */
          	SRAM3     (rwx) : ORIGIN = 0x30040000, LENGTH =   32K /* D2 domain AHB bus */
          	SRAM4     (rwx) : ORIGIN = 0x38000000, LENGTH =   64K /* D3 domain */
          	BKPRAM    (rwx) : ORIGIN = 0x38800000, LENGTH =    4K
          }

The ‘specific data’ that is being referred to are in fact referenced in formats like “.text”, “.bss”, etc. Which are standard conventions for the types of data (for .bss, for example, un-initialized static or global variables) for the program.

You can read more about it here: text, data and bss: Code and Data Size Explained | MCU on Eclipse

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/nuttx-config/scripts/script.ld#L136-L160


      
          SECTIONS
          {
          	.text : {
          		_stext = ABSOLUTE(.);
          		*(.vectors)
          		. = ALIGN(32);
          		/*
          		This signature provides the bootloader with a way to delay booting
          		*/
          		_bootdelay_signature = ABSOLUTE(.);
          		FILL(0xffecc2925d7d05c5)
          		. += 8;
          		*(.text .text.*)
          		*(.fixup)
          		*(.gnu.warning)
          		*(.rodata .rodata.*)
          		*(.gnu.linkonce.t.*)
          		*(.glue_7)
          		*(.glue_7t)
          		*(.got)

This file has been truncated. show original

You can also notice that the start of each section are marked by the variable ‘_s****’ (e.g. _sdata), and are referenced in the code I will show below.

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/nuttx-config/scripts/script.ld#L185


      
          
          	__exidx_start = ABSOLUTE(.);
          	.ARM.exidx : {
          		*(.ARM.exidx*)
          	} > FLASH
          	__exidx_end = ABSOLUTE(.);
          
          	_eronly = ABSOLUTE(.);
          
          	.data : {
          		_sdata = ABSOLUTE(.);
          		*(.data .data.*)
          		*(.gnu.linkonce.d.*)
          		CONSTRUCTORS
          		_edata = ABSOLUTE(.);
          
          		/* Pad out last section as the STM32H7 Flash write size is 256 bits. 32 bytes */
          		. = ALIGN(16);
          		FILL(0xffff)
          		. += 16;
          	} > AXI_SRAM AT > FLASH  = 0xffff

Checking memory layout with Bloaty

In fact, you can check exactly how the sections of data are arranged in the binary built via examining it’s .elf file. To do that, simply execute “bloaty build//.elf”, after building the target (via “make ”).

That should show something like this:

    FILE SIZE        VM SIZE    
 --------------  -------------- 
  52.9%  14.3Mi   0.0%       0    .debug_info
  12.7%  3.44Mi   0.0%       0    .debug_loc
  11.8%  3.17Mi   0.0%       0    .debug_line
   5.9%  1.60Mi   0.0%       0    .debug_str
   5.4%  1.46Mi  97.1%  1.46Mi    .text
   4.2%  1.12Mi   0.0%       0    .debug_abbrev
   3.0%   830Ki   0.0%       0    .debug_ranges
   1.3%   354Ki   0.0%       0    .symtab
   1.1%   305Ki   0.0%       0    .strtab
   0.9%   261Ki   0.0%       0    .debug_frame
   0.4%   104Ki   0.0%       0    [Unmapped]
   0.3%  89.7Ki   0.0%       0    .debug_aranges
   0.0%       0   2.6%  40.7Ki    .bss
   0.0%  3.39Ki   0.2%  3.39Ki    .data
   0.0%     800   0.0%       0    [ELF Section Headers]
   0.0%     211   0.0%       0    .shstrtab
   0.0%     136   0.0%     136    .init_section
   0.0%     128   0.0%       0    [ELF Program Headers]
   0.0%      76   0.0%       0    .comment
   0.0%      60   0.0%       8    [2 Others]
   0.0%      53   0.0%       0    .ARM.attributes
 100.0%  26.9Mi 100.0%  1.50Mi    TOTAL

As explained in the article, for us the most relevant sections are:

.bss: where non-initialized static allocated variables value are stored, read more here: .bss - Wikipedia)
.data: where initialization values for static allocated variables are stored
.text: what ends up in the FLASH memory (e.g. constants, functions, vector table)

Apart from that, actually when I execute the bloaty command above with -v flag, I get the following 2 extra sections that gets included into the VM section (which gets actually into the final binary for the target, read more here: bloaty/doc/using.md at main · google/bloaty · GitHub):

.init_section - 136 bytes (placed right after the .text section in memory, in FLASH)
.ARM.exidx: 8 bytes (placed right after .init_section section in memory, in FLASH)

It’s quite interesting how they are placed exactly how the linker script has asked them to be. For example, the .data section gets first placed in SRAM (which in this case, for MATEK H743 mini, there were plenty of space), but then possibly would be written in FLASH, if it needed to (I think).

Here’s the whole output result in case you are curious. Feel free to compare the address range and check where each sections are located:

FILE MAP:
0000000-0000034	         52		[ELF Header]
0000034-00000b4	        128		[ELF Program Headers]
00000b4-0010000	      65356		[Unmapped]
0010000-0185ce0	    1531104		.text
0185ce0-0185d68	        136		.init_section
0185d68-0185d70	          8		.ARM.exidx
0185d70-0190000	      41616		[Unmapped]
0190000-0190d90	       3472		.data
0190d90-0190ddc	         76		.comment
0190ddc-0190e11	         53		.ARM.attributes
0190e11-02afa0b	    1174522		.debug_abbrev
02afa0b-10f247d	   14953074		.debug_info
10f247d-141ce70	    3320307		.debug_line
141ce70-1433530	      91840		.debug_aranges
1433530-15cccaa	    1677178		.debug_str
15cccaa-193c2b5	    3601931		.debug_loc
193c2b5-1a0be50	     850843		.debug_ranges
1a0be50-1a4d5a8	     268120		.debug_frame
1a4d5a8-1aa60c8	     363296		.symtab
1aa60c8-1af28a5	     313309		.strtab
1af28a5-1af2978	        211		.shstrtab
1af2978-1af2c98	        800		[ELF Section Headers]

VM MAP:
00000000-08020000	  134348800		[-- Nothing mapped --]
08020000-08195ce0	    1531104		.text
08195ce0-08195d68	        136		.init_section
08195d68-08195d70	          8		.ARM.exidx
08195d70-24000000	  468099728		[-- Nothing mapped --]
24000000-24000d90	       3472		.data
24000d90-24000dc0	         48		[-- Nothing mapped --]
24000dc0-2400b078	      41656		.bss

PX4 bootloader

So we now have rough idea on how important the linker script is for defining the overall memory structure. But which code actually then gets executed when the MCU powers up?

Setting up environment

First big role of a bootloader is to first make sure we move the data from the .data, .bss sections into the RAM appropriately. This is all handled by the NuttX itself for it’s own start-up sequence, and is explained very well here: https://cwiki.apache.org/confluence/display/NUTTX/NuttX+Initialization+Sequence

Implementation of how STM32H7 chip’s start sequence is handled can be found here:

github.com

PX4/NuttX/blob/00a68b7668d393ed69025e5ef0949f2d8f40d968/arch/arm/src/stm32h7/stm32_start.c#L165-L273


      
          /****************************************************************************
           * Name: __start
           *
           * Description:
           *   This is the reset entry point.
           *
           ****************************************************************************/
          
          void __start(void)
          {
            const uint32_t *src;
            uint32_t *dest;
          
          #ifdef CONFIG_ARMV7M_STACKCHECK
            /* Set the stack limit before we attempt to call any functions */
          
            __asm__ volatile("sub r10, sp, %0" : :
                             "r"(CONFIG_IDLETHREAD_STACKSIZE - 64) :);
          #endif

This file has been truncated. show original

Initializing the board

After the RAM copying is complete, NuttX then initializes the clock, Floating Point Unit, etc. Then, it calls the “stm32_boardinitialize” function, which is implemented in the PX4 domain.

So this is the part in the bootloader of the Matek H743 mini board that gets executed, which only configures the USB connection (as that’s the only thing we need while in bootloader, to re-flash the board):

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/src/bootloader_main.c#L55-L59


      
          __EXPORT void stm32_boardinitialize(void)
          {
          	/* configure USB interfaces */
          	stm32_usbinitialize();
          }

LED management

And since for this board, the timer hook is enabled in the NuttX defconfig for the bootloader:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/nuttx-config/bootloader/defconfig#L78


      
          CONFIG_START_DAY=30
          CONFIG_START_MONTH=11
          CONFIG_STDIO_BUFFER_SIZE=32
          CONFIG_STM32H7_BKPSRAM=y
          CONFIG_STM32H7_DMA1=y
          CONFIG_STM32H7_OTGFS=y
          CONFIG_STM32H7_PROGMEM=y
          CONFIG_STM32H7_SERIAL_DISABLE_REORDERING=y
          CONFIG_STM32H7_TIM1=y
          CONFIG_STM32H7_USART1=y
          CONFIG_SYSTEMTICK_HOOK=y
          CONFIG_SYSTEM_CDCACM=y
          CONFIG_TASK_NAME_SIZE=24
          CONFIG_TTY_SIGINT=y
          CONFIG_TTY_SIGINT_CHAR=0x03
          CONFIG_TTY_SIGTSTP=y
          CONFIG_USART1_RXBUFSIZE=600
          CONFIG_USART1_TXBUFSIZE=300
          CONFIG_USBDEV=y
          CONFIG_USBDEV_BUSPOWERED=y
          CONFIG_USBDEV_MAXPOWER=500

The “board_timehook” implemented gets called every timer interrupt in bootloader:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/src/bootloader_main.c#L71-L75


      
          extern void sys_tick_handler(void);
          void board_timerhook(void)
          {
          	sys_tick_handler();
          }

Which then controls the LED to show whether the bootloader is active:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/common/bl.c#L423-L437


      
          void
          sys_tick_handler(void)
          {
          	unsigned i;
          
          	for (i = 0; i < NTIMERS; i++)
          		if (timer[i] > 0) {
          			timer[i]--;
          		}
          
          	if ((_led_state == LED_BLINK) && (timer[TIMER_LED] == 0)) {
          		led_toggle(LED_BOOTLOADER);
          		timer[TIMER_LED] = 50;
          	}
          }

Bootloader main

However, the actual bootloader main function is in a totally separate place (although, I agree it is confusing to have bootloader related NuttX function implementations in “bootloadeR_main.c” under targets haha).

First, the fact that we use the bootloader_main function for initialization entry point is defined in the NuttX defconfig for bootloader:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/nuttx-config/bootloader/defconfig#L49


      
          CONFIG_DEBUG_FULLOPT=y
          CONFIG_DEBUG_SYMBOLS=y
          CONFIG_DEBUG_TCBINFO=y
          CONFIG_DEFAULT_SMALL=y
          CONFIG_EXPERIMENTAL=y
          CONFIG_FDCLONE_DISABLE=y
          CONFIG_FDCLONE_STDIO=y
          CONFIG_HAVE_CXX=y
          CONFIG_HAVE_CXXINITIALIZE=y
          CONFIG_IDLETHREAD_STACKSIZE=750
          CONFIG_INIT_ENTRYPOINT="bootloader_main"
          CONFIG_INIT_STACKSIZE=3094
          CONFIG_LIBC_FLOATINGPOINT=y
          CONFIG_LIBC_LONG_LONG=y
          CONFIG_LIBC_STRERROR=y
          CONFIG_MEMSET_64BIT=y
          CONFIG_MEMSET_OPTSPEED=y
          CONFIG_PREALLOC_TIMERS=50
          CONFIG_PTHREAD_STACK_MIN=512
          CONFIG_RAM_SIZE=245760
          CONFIG_RAM_START=0x20010000

So it is in fact, this function defined in “platforms/nuttx/src/bootloader/stm/stm32_common/main.c”:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/stm/stm32_common/main.c#L636-L828


      
          int
          bootloader_main(void)
          {
          	bool try_boot = true;			/* try booting before we drop to the bootloader */
          	unsigned timeout = BOOTLOADER_DELAY;	/* if nonzero, drop out of the bootloader after this time */
          
          	/* Enable the FPU before we hit any FP instructions */
          	SCB_CPACR |= ((3UL << 10 * 2) | (3UL << 11 * 2)); /* set CP10 Full Access and set CP11 Full Access */
          
          #if defined(BOARD_POWER_PIN_OUT)
          
          	/* Here we check for the app setting the POWER_DOWN_RTC_SIGNATURE
          	 * in this case, we reset the signature and wait to die
          	 */
          	if (board_get_rtc_signature() == POWER_DOWN_RTC_SIGNATURE) {
          		board_set_rtc_signature(0);
          
          		while (1);
          	}

This file has been truncated. show original

Here, we really get into the details, but few important steps are:

board general init for GPIO pins
Clock initialization (implementation by NuttX)

And then, it will consider all the possible firmware-update related scenarios, which are:

Checking for Force-bootloader pin status
Check USB connection
Check USART pin status

And if any of them indicate that there may be an entity trying to update the firmware, it will call the bootloader function in “bl.c”, and if it times out, the launching of the normal firmware will continue.

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/stm/stm32_common/main.c#L800-L801


      
          		/* run the bootloader, come back after an app is uploaded or we time out */
          		bootloader(timeout);

And this is the final function that handles all the firmware update protocol part:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/common/bl.c#L609-L1089


      
          void
          bootloader(unsigned timeout)
          {
          	bl_type = NONE; // The type of the bootloader, whether loading from USB or USART, will be determined by on what port the bootloader recevies its first valid command.
          	volatile uint32_t  bl_state = 0; // Must see correct command sequence to erase and reboot (commit first word)
          	uint32_t  address = board_info.fw_size; /* force erase before upload will work */
          	uint32_t  first_word = 0xffffffff;
          
          	/* (re)start the timer system */
          	arch_systic_init();
          
          	/* if we are working with a timeout, start it running */
          	if (timeout) {
          		timer[TIMER_BL_WAIT] = timeout;
          	}
          
          	/* make the LED blink while we are idle */
          	led_set(LED_BLINK);
          
          	while (true) {

This file has been truncated. show original

Here you can see how the full chip erase command gets processed, for example:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/common/bl.c#L725-L775


      
          		// erase and prepare for programming
          		//
          		// command:   ERASE/EOC
          		// success reply: INSYNC/OK
          		// erase failure: INSYNC/FAILURE
          		//
          		case PROTO_CHIP_ERASE:
          
          			/* expect EOC */
          			if (!wait_for_eoc(2)) {
          				goto cmd_bad;
          			}
          
          #if defined(TARGET_HW_PX4_FMU_V4)
          
          			if (check_silicon()) {
          				goto bad_silicon;
          			}
          
          #endif

This file has been truncated. show original

Post-bootloader startup sequence

So that’s all cool and all but then how does the PX4 jump to the main function when there’s no-one trying to upgrade the firmware? That would happen if either the upgrade conditions written above were not met, or the timeout has been reached without any effective command reaching the board.

The answer to that is in the bootloader main function again:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/stm/stm32_common/main.c#L822-L823


      
          		/* look to see if we can boot the app */
          		jump_to_app();

It calls the “jump_to_app”, which is also defined in the “bl.c” file:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/common/bl.c#L293-L419


      
          void
          jump_to_app()
          {
          	const uint32_t *app_base = (const uint32_t *)APP_LOAD_ADDRESS;
          	const uint32_t *vec_base = (const uint32_t *)app_base;
          
          	/*
          	 * We refuse to program the first word of the app until the upload is marked
          	 * complete by the host.  So if it's not 0xffffffff, we should try booting it.
          	 */
          	if (app_base[0] == 0xffffffff) {
          		return;
          	}
          
          #ifdef BOOTLOADER_USE_TOC
          
          #ifdef BOOTLOADER_USE_SECURITY
          	crypto_init();
          #endif

This file has been truncated. show original

Here the intricate checking of the validity of the APP’s base address, and checks whether the Table of Contents (TOC) saved in the Flash section (details can be fine tuned via the “hw_config.h” file under the board directory: PX4-Autopilot/boards/matek/h743/src/hw_config.h at 95b30056794b47bb415f4c1d96028ec77a567446 · PX4/PX4-Autopilot · GitHub), etc.

Then after de-initializing the clocks, and the board, the actual jump to the app is made:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/bootloader/stm/stm32_common/main.c#L619-L634


      
          /* Make the actual jump to app */
          void
          arch_do_jump(const uint32_t *app_base)
          {
          	/* extract the stack and entrypoint from the app vector table and go */
          	uint32_t stacktop = app_base[0];
          	uint32_t entrypoint = app_base[1];
          
          	asm volatile(
          		"msr msp, %0  \n"
          		"bx %1  \n"
          		: : "r"(stacktop), "r"(entrypoint) :);
          
          	// just to keep noreturn happy
          	for (;;) ;
          }

Actual entrance into the PX4 code

So the actual PX4 starting function (at least part of it), in terms of initialization of PX4 system can be found here:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/platforms/nuttx/src/px4/common/px4_init.cpp#L95-L194


      
          int px4_platform_init()
          {
          
          #if !defined(CONFIG_BUILD_FLAT)
          	cxx_initialize();
          
          	/* initialize userspace-kernelspace call gate interface */
          	kernel_ioctl_initialize();
          #endif
          
          	int ret = px4_console_buffer_init();
          
          	if (ret < 0) {
          		return ret;
          	}
          
          	// replace stdout with our buffered console
          	int fd_buf = open(CONSOLE_BUFFER_DEVICE, O_WRONLY);
          
          	if (fd_buf >= 0) {

This file has been truncated. show original

And that, seems to be called from the “board_app_initialize” function of per-target implementation in “init.c” like here:

github.com

PX4/PX4-Autopilot/blob/95b30056794b47bb415f4c1d96028ec77a567446/boards/matek/h743-mini/src/init.c#L149-L152


      
          __EXPORT int board_app_initialize(uintptr_t arg)
          {
          	/* Need hrt running before using the ADC */
          	px4_platform_init();

However, I wasn’t able to definitely come up with a clear sequence of commands that leads to this yet. It seems to be somehow related with “nsh_initialize”, the NuttX console, but I am hesitant to believe that’s the case, since the shell doesn’t seem like a necessity for PX4 (at least it can run without the shell, I think).

I guess I can cover that in a follow up post / edit this!

Bhagyashree_Dhumal1 · July 17, 2024, 1:52pm

I want to add SHA256 implementation in the bootloader code so I can match the firmware’s hash to the stored has. What would be the right place for this implementation?