Dynamic Loading and Linking

Basic Concepts
Working Principles
Development Guidelines
- Available APIs
- How to Develop

Basic Concepts

The OpenHarmony dynamic loading and linking mechanism includes a kernel loader and a dynamic linker. The kernel loader loads application programs and the dynamic linker. The dynamic linker loads the shared library on which the application programs depend, and performs symbol relocation for the application programs and shared libraries. Compared with static linking, dynamic linking is a mechanism for delaying the linking of applications and dynamic libraries to run time.

Advantages of Dynamic Linking

Dynamic linking allows multiple applications to share code. The minimum loading unit is page. It saves disk and memory space than static linking.
When a shared library is upgraded, the new shared library overwrites the earlier version (the APIs of the shared library are downward compatible). You do not need to re-link the shared library.
The loading address can be randomized to prevent attacks and ensure security.

Working Principles

Figure 1 Dynamic loading process

The kernel maps the PT_LOAD section in the ELF file of the application to the process space. For files of the ET_EXEC type, fixed address mapping is performed based on p_vaddr in the PT_LOAD section. For files of the ET_DYN type (position-independent executable programs, obtained through the compile option -fPIE), the kernel selects the base address via mmap for mapping (load_addr = base + p_vaddr).
If the application is statically linked (static linking does not support the compile option -fPIE), after the stack information is set, the system redirects to the address specified by e_entry in the ELF file of the application and runs the application. If the program is dynamically linked, the application ELF file contains the PT_INTERP section, which stores the dynamic linker path information (ET_DYN type). The dynamic linker of musl is a part of the libc-musl.so. The entry of libc-musl.so is the entry of the dynamic linker. The kernel selects the base address for mapping via the mmap API, sets the stack information, redirects to the base + e_entry (entry of the dynamic linker) address, and runs the dynamic linker.
The dynamic linker bootstraps and searches for all shared libraries on which the application depends, relocates the imported symbols, and finally redirects to the e_entry (or base + e_entry) of the application to run the application.

Figure 2 Program execution process

The loader and linker call mmap to map the PT_LOAD section.
The kernel calls map_pages to search for and map the existing page cache.
If the memory does not have the required code or data when the program is executed, a page missing interrupt is triggered to read the content of the ELF file into the memory and add the memory block to the page cache.
Map the memory blocks of the file read to the virtual address region.
The program continues to run.

The program is executed with continuous page missing interrupts.

Development Guidelines

Available APIs

LOS_DoExecveFile

Function prototype:

INT32 LOS_DoExecveFile(const CHAR *fileName, CHAR * const *argv, CHAR * const *envp);

Function: Executes a new user program based on fileName.

Parameter Description

Parameter	Description
fileName	Name of a binary executable file. It can be a path name.
argv	Parameters, ended with NULL, required for program execution. If no parameter is required, set this parameter to NULL.
envp	New environment variables, ended with NULL, required for program execution. If no new environment variable is required, set this parameter to NULL.

How to Develop

The kernel cannot directly call the LOS_DoExecveFile API to start a new process. This API is generally called through the execve API to create a new process using the system call mechanism.