Debugging a LinuxThreads-based application with GDB

What's New ?

May 6th, 1999: Added a new link to the TDI site.
February 25th, 1999: Fixed the Debug Malloc Library related site.
October 16th, 1998: Updated the related sites.
July 7th, 1998: The glibc changes will be added to the linuxthread add-on for the glibc-2.1.
June 22nd, 1998: The page is made publicly available.

As part of its Java-related projects, Silicomp Research Institute has made the required enhancements to gdb-4.17 to debug GNU/Linux multithreaded applications. The basic Linus Torvald's intuition that "all we need in the kernel is clone()" has been kept within this development, and all the work done by GDB is made at user level. Therefore, no Linux kernel extension is needed and all tests have been made on a standard Linux-2.0.32 kernel.

I. Multithreading-awareness of the GDB architecture

By looking into the debugged process memory, GDB scans the internal structures of the LinuxThreads and is able to report all threads status to its standard user interface. There is always one current thread which reports an event to the user interface, and the command typed by the user always apply to the current thread. But it is always possible to switch current thread with another and to apply commands to the new one.

The main problem with this architecture is that the internal structures of the LinuxThreads are assumed to be never broken by any user program, in which cases GDB may report permanent errors that may require GDB and the application threads to be killed by hand; fortunately, these cases remain very unfrequent.

II. The new behaviour of gdb-4.17

Before the first thread is created, there is no difference in the behaviour of GDB with a single-threaded program; but as soon as it is created, GDB switches into a multithreaded mode which allows to get access to thread-related commands (see your GDB manual). In the list of threads, the thread number 1 is the LinuxThreads internal manager thread and never executes user code. It should be ignored when an application is debugged, and only be used when the LinuxThreads library is itself debugged.

There are two new messages that will appear asynchronously on the screen:

"[New Thread xxxxx]" is output when the new thread xxxxx is created,
"[Switching to Thread xxxxx]" is output when the current thread is now thread xxxxx.

The different threads have completely independent behaviour from a debugging point of view. This means that when an event has to be reported for a current thread, GDB scans the thread list and stops individually each of them, thus introducing asynchronism between them (the status reported for non-current threads is the one that may have occured long after the time of the current one). The primary consequence is that more than one thread may have hit breakpoints at a given time, but only the first one is reported at the beginning. As soon as a continuation command is reported, then event of other threads can be reported safely to the user.

A multithreaded application can be run directly from inside GDB, or attached dynamically by the PID of any application thread. GDB will first stop the LinuxThreads manager (thus stopping any new thread from being created), and then all others before the application threads can be analyzed. The intr character (usually ^C) can be hit in the start window of the application in order to stop it. But, if it has been dynamically attached by GDB, it will receive as many SIGINTR signals as there are application threads, since it is the kernel standard behaviour (which is not the case for processes run within GDB).

III. Other enhancements included

As part of the LinuxThreads debugging patches, there are two other enhancements included that are specific to the the i386 target:

GDB can recognize the Linux signal trampoline, and is able to manage correctly step/next commands through a signal handler.
GDB can recognize the longjmp() family procedures and is able to stop the current process executing step/next command at the instruction following the last longjmp() one.

IV. Patches curently available

There are two patch files that are required to be applied in order to access to the Linux multithreading enhancements of GDB:

Against glibc:
- The latest one against glibc-2.0.6 is glibc-2.0.6.patch.gz
Against gdb:
- The latest one against gdb-4.17 is gdb-4.17.patch.gz

V. Current status

Although most of this stuff should be machine-independent, all the code available for GDB has only been tested on Linux-2.0.32 for i386 uniprocessor systems. Nobody has yet reported that it worked on other targets, nor on i386 multiprocessor systems.

In the same idea, the only multithreaded programs that have been debugged with GDB have been written in C programming language. Opinion of people having debugged multithreaded programs written in other programming languages (like C++) is welcome.

Discussion on the LinuxThreads debugging is welcome in the regular places where LinuxThreads are discussed: either in the newsgroup comp.programming.threads or in the mailing list [email protected] (which you can subscribe to by writing at [email protected]).

VI. TODO list

As long as there is no support for multithreaded corefile, it will be not possible for GDB to debug corefiles generated by multithreaded applications. We have some ideas on how to generate such beasts in user mode, but no time (yet) to propose an enhancement to the LinuxThread implementation to do so.
The GDB multithreaded enhancements do not deal with thread priorities, so that the thread stop/restart commands are done in any order and scheduling differences may appear between different program runs.
Specific to the i386, it would be nice to dynamicly detect that %ebp is not used as the Frame Pointer register by a particular procedure, in order to avoid giving wrong stack trace due to erroneous interpretation of %ebp.

VII. Related sites

The LinuxThreads library is the original POSIX1003.1c Linux implementation.
The Cygnus GDB home page contains information on how to contribute to GDB.
The Data Display Debugger is a popular graphical user interface for Unix/Java debuggers.
The Debug Malloc Library provides memory management routines with thread-safe powerful debugging features.
The Thread Debug Interface (TDI) provides a generic debug interface for the POSIX Threads (Pthreads) Standard.

[an error occurred while processing this directive]