Fix a rare segfault in apr_global_mutex_child_init() #34

Open

monkburger wants to merge 1 commit intoapache:trunk from

monkburger:trunk

Open

Fix a rare segfault in apr_global_mutex_child_init() #34
monkburger wants to merge 1 commit intoapache:trunk from
monkburger:trunk

Conversation

@monkburger

Copy link

@monkburger monkburger commented Feb 25, 2022

Under specific circumstances, apr_global_mutex_child_init can be called with NULL mutex. We've seen this behavior under certain modules, specifically mod_lsapi and a few others under high load/IO wait during graceful restarts:

 #0 apr_proc_mutex_child_init (mutex=0x8, fname=0x0, pool=0x5566e972f128) at locks/unix/proc_mutex.c:1570
 #1 0x00002b9a5730cd2c in apr_global_mutex_child_init (mutex=<optimized out>, fname=<optimized out>, pool=<optimized out>) at locks/unix/global_mutex.c:89

With this patch, it shows the following - as well as no more segfaults:

[Fri Feb 25 01:13:58.924341 2022] [:emerg] [pid 92020] (20009)No lock was provided and one was required.: [host test.test] mod_lsapi: apr_global_mutex_child_init error for pipe mutex


 Under certain circumstances, Apache will crash because apr_global_mut...

78567c7

...ex_child_init() is being passed a NULL
mutex. We've seen this behavior under certain modules, specifically mod_lsapi and a few others under high load/IO wait during graceful restarts:
 #0 apr_proc_mutex_child_init (mutex=0x8, fname=0x0, pool=0x5566e972f128) at locks/unix/proc_mutex.c:1570
 apache#1 0x00002b9a5730cd2c in apr_global_mutex_child_init (mutex=<optimized out>, fname=<optimized out>, pool=<optimized out>) at locks/unix/global_mutex.c:89
With this patch, it shows the following - as well as no more segfaults:
[Fri Feb 25 01:13:58.924341 2022] [:emerg] [pid 92020] (20009)No lock was provided and one was required.: [host test.test] mod_lsapi: apr_global_mutex_child_init error for pipe mutex

@wrowe

Copy link

Member

wrowe commented Mar 15, 2022

Is ignoring these the right answer? Or should we set ourselves in a 5-retry loop? With sleeps?

wrowe

wrowe reviewed

Mar 15, 2022

View reviewed changes

locks/unix/global_mutex.c

if (*mutex == NULL) {

return APR_ENOLOCK;

}

Copy link

Member

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks to me like an APR contract change, that we should docx the new return code and suggest to the author they can set up the appropriate retry schema, and put it back on mod_isapi.

Copy link

Member

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Specifically, for APR 2.0

@monkburger

Copy link

Author

monkburger commented Mar 15, 2022

Is ignoring these the right answer? Or should we set ourselves in a 5-retry loop? With sleeps?

I was thinking about that, maybe a retry loop (hybrid-ish spinlock?)

@ylavic

Copy link

Member

ylavic commented Mar 15, 2022 •

edited

Loading

Why is the *mutex NULL in the first place? And why would retrying work?

@wrowe

Copy link

Member

wrowe commented Mar 31, 2022

Why is the *mutex NULL in the first place? And why would retrying work?

Short of some ACL problem that the process doesn't have permission to create a mutex, this almost always is going to be a resource contention problem, too many mutexes held. Ultimately, if they are 'forgotten' kernel resources, the process (and perhaps the system) is going down in flames, but sometimes, we are just in an overbooked state, and a retry after a significant (1s) pause will be enough for other threads to release some mutexes back to the pool of resources.

@crrodriguez

Copy link

crrodriguez commented May 2, 2023

Is ignoring these the right answer? Or should we set ourselves in a 5-retry loop? With sleeps?

I was thinking about that, maybe a retry loop (hybrid-ish spinlock?)

0006-thread_mutex-APR_THREAD_MUTEX_DEFAULT-should-map-to-.patch

I do not have code for other platforms, this patch is a tested (had it on file for quite a while) linux-glibc only approach.

@crrodriguez

Copy link

crrodriguez commented May 2, 2023

Is ignoring these the right answer? Or should we set ourselves in a 5-retry loop? With sleeps?

I was thinking about that, maybe a retry loop (hybrid-ish spinlock?)

0006-thread_mutex-APR_THREAD_MUTEX_DEFAULT-should-map-to-.patch

I do not have code for other platforms, this patch is a tested (had it on file for quite a while) linux-glibc only approach.

the adaptative_np mutex spins glibc.pthread.mutex_spin_count (tunable at runtime, default 100) times before calling into the kernel..

@ylavic

Copy link

Member

ylavic commented May 4, 2023

apr_global_mutex_child_init() is not supposed to be called with *mutex == NULL (it should have been allocated with apr_global_mutex_create() first), so this is a usage/user error (i.e. don't do that)?
Also, if something has to be done with the new APR_ENOLOCK error from this PR, it should be possible to do the same by testing *mutex == NULL before in the caller, so I don't get what this PR is fixing, looks like defensive programming only to me.

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a rare segfault in apr_global_mutex_child_init() #34

Fix a rare segfault in apr_global_mutex_child_init() #34
monkburger wants to merge 1 commit intoapache:trunk from
monkburger:trunk

Conversation

@monkburger monkburger commented Feb 25, 2022

Uh oh!

wrowe commented Mar 15, 2022

Uh oh!

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

Uh oh!

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

Uh oh!

monkburger commented Mar 15, 2022

Uh oh!

ylavic commented Mar 15, 2022 •

edited

Loading

Uh oh!

Uh oh!

wrowe commented Mar 31, 2022

Uh oh!

crrodriguez commented May 2, 2023

Uh oh!

crrodriguez commented May 2, 2023

Uh oh!

ylavic commented May 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

Conversation

@monkburger monkburger commented Feb 25, 2022

Uh oh!

wrowe commented Mar 15, 2022

Uh oh!

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

Uh oh!

@wrowe wrowe Mar 15, 2022

Choose a reason for hiding this comment

Uh oh!

monkburger commented Mar 15, 2022

Uh oh!

ylavic commented Mar 15, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

wrowe commented Mar 31, 2022

Uh oh!

crrodriguez commented May 2, 2023

Uh oh!

crrodriguez commented May 2, 2023

Uh oh!

ylavic commented May 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Comments

ylavic commented Mar 15, 2022 •

edited

Loading