Search code, repositories, users, issues, pull requests...

Zend/zend_alloc.c Show resolved Hide resolved

Zend/zend_alloc.c Outdated Show resolved Hide resolved

Zend/zend_alloc.c

#if ZEND_MM_HEAP_SPRAYING_PROTECTION

# define ZEND_MM_ZONES 2

Copy link

Contributor

@jvoisin jvoisin May 28, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change

# define ZEND_MM_ZONES 2

# define ZEND_MM_ZONES 2/* one zone for trusted data, one for user-controlled ones.*/

Copy link

Member Author

@arnaud-lb arnaud-lb May 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm hesitating on this one, as I feel that how zones are used belongs to upper abstraction levels (that's also why I've defined ZEND_MM_ZONE_INPUT elsewhere). In any case I agree that a comment would help.

Copy link

Contributor

@jvoisin jvoisin Nov 4, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Fair, but I also agree that a comment would be nice :)

Copy link

Member

@dstogov dstogov Nov 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2M * 100 workers = waster of 200MB

Copy link

Member Author

@arnaud-lb arnaud-lb Nov 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This consumes an additional 2MiB of VMA per worker, but I expect the overhead of actually committed memory to be much less than that in practice, as not all pages of the extra chunk are touched.

For example, at the end of a symfony-demo request, we have this:

base: VmRSS:	 65772 kB
mm-zones: VmRSS:	 66020 kB
Diff: 249 kB

With USE_ZEND_ALLOC_HUGE_PAGES=1, the RSS actually increases by 2MiB, but this is something to expect with huge pages, and this is not enabled by default.

Zend/zend_alloc.c Outdated Show resolved Hide resolved

@arnaud-lb arnaud-lb changed the title ~~(削除) Remote heap feng chui / heap spraying protection (削除ここまで)~~ (追記) Remote heap feng shui / heap spraying protection (追記ここまで)

May 29, 2024

Copy link

Contributor

jvoisin commented Jul 10, 2024

Now that #14054 was merged, can you please rebase this one, so we can get it landed?

Copy link

Contributor

jvoisin commented Nov 4, 2024

@arnaud-lb ping :)

Copy link

Member Author

arnaud-lb commented Nov 4, 2024

Sorry I didn't see your previous comment. I will get back at this PR soon.

@arnaud-lb arnaud-lb force-pushed the mm-zones branch from 0d05928 to ee191aa Compare

November 4, 2024 17:22

@github-actions github-actions bot added the ABI break label

Nov 4, 2024

@arnaud-lb arnaud-lb force-pushed the mm-zones branch from ee191aa to b9f761d Compare

November 4, 2024 17:24

@github-actions github-actions bot added the Extension: spl label

Nov 4, 2024

@arnaud-lb arnaud-lb marked this pull request as ready for review

November 7, 2024 14:00

@arnaud-lb arnaud-lb requested review from dstogov, bukka and Girgias as code owners

November 7, 2024 14:00

Girgias

Girgias reviewed

Nov 8, 2024

Copy link

Member

@Girgias Girgias left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If this lands, please add a note to UPGRADING in other changes about the increase in memory requirements.

dstogov

dstogov reviewed

Nov 11, 2024

php-src/Zend/zend_alloc.c

Copy link

Member

@dstogov dstogov left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

At the first look, I don't like this. What kind of attacks will this complicate?

Zend/zend_alloc.c Show resolved Hide resolved

Zend/zend_alloc.c

#if ZEND_MM_HEAP_SPRAYING_PROTECTION

# define ZEND_MM_ZONES 2

Copy link

Member

@dstogov dstogov Nov 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2M * 100 workers = waster of 200MB

Zend/zend_alloc.c

Comment on lines -1435 to +1547

zend_mm_set_next_free_slot(heap, bin_num, p, heap->free_slot[bin_num]);

heap->free_slot[bin_num] = p;

zend_mm_set_next_free_slot(heap, bin_num, p, ZEND_MM_FREE_SLOT_EX(heap, chunk, bin_num));

ZEND_MM_FREE_SLOT_EX(heap, chunk, bin_num) = p;

Copy link

Member

@dstogov dstogov Nov 11, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This make regression for each deallocation.

Copy link

Member Author

@arnaud-lb arnaud-lb Nov 11, 2024 •

edited

Loading

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This adds an additional fetch of chunk->zone_free_slot, but this field is just next to chunk->heap, and is likely in cache as we fetch it here:

Lines 2701 to 2702 in 53df3ae

ZEND_MM_CHECK(chunk->heap == AG(mm_heap), "zend_mm_heap corrupted"); \

zend_mm_free_small(AG(mm_heap), ptr, _num); \

Also, benchmark results show very little overhead (0% wall time, 0.04% valgrind)

Copy link

Member Author

arnaud-lb commented Nov 11, 2024 •

edited

Loading

What kind of attacks will this complicate?

An attacker can precisely control the layout of the heap by sending GET or POST parameters. For example, the query string ?a=aaa...&b=bbb...&c=ccc...&b= allocates 4 blocks of arbitrary size, frees the second one, and puts it at the top of the freelist. So the attacker is able to precisely control what is allocated where during application execution, and what is next to it in memory. This greatly facilitates the exploitation of all kinds of memory safety issues. See https://www.youtube.com/watch?v=-FXvUe0tySM&t=1233s (starting at 18:28) for example.

By isolating the user inputs in separate chunks, we prevent grooming the application heap via GET/POST/etc. This is useful in the context of a remote attacker with no ability to execute arbitrary code.

As the overhead is very small (0% wall time / 0.04% valgrind on the symfony-demo benchmark), I believe this is worth it.

Copy link

Member

dstogov commented Nov 11, 2024

@arnaud-lb didn't you already add the protection against heap buffer overflow/underflow through the shadow pointers?
I thought, it shouldn't be possible to corrupt the free-list now.
why do we need this patch on top of shadow pointers?

do you have the source of the exploit used in the presentation?

Personally, I think that the better approach would be filtering input data and just stopping the request in case of unexpected/dangerous data detection (e.g. GET/POST with duplicate names).

Copy link

Member Author

arnaud-lb commented Nov 11, 2024

didn't you already add the protection against heap buffer overflow/underflow through the shadow pointers?

Yes, but that's not the only way to exploit an out of bound write. If an attacker can arrange the heap to override the first bytes of an arbitrary block, there are other things they can attack. Protecting against freelist corruption specifically was worth it because it's an easy and powerful target, but there are other targets that are made practicable by arranging the layout of the heap.

do you have the source of the exploit used in the presentation?

I've tried to find it, but without success

Personally, I think that the better approach would be filtering input data and just stopping the request in case of unexpected/dangerous data detection (e.g. GET/POST with duplicate names).

This will prevent controlling the order of elements in the freelist, but this is not absolutely necessary to control block placement in the heap. For instance, an attacker can control the order of runs in a chunk (without unsetting a GET/POST element), as well as how much they are filled, so they can arrange for a block of size N allocated by the application to be at the end of a run, and just before an other run of their choice.

Copy link

Member

dstogov commented Nov 11, 2024

do you have the source of the exploit used in the presentation?

I've tried to find it, but without success

@cfreal could you please share the sources of exploit used in your presentation? (better to email to my and @arnaud-lb public github email addresses)

Copy link

Contributor

jvoisin commented Nov 11, 2024 •

edited

Loading

Similar exploits are available here, with a detailed write-up about the heap shaping here

@cfreal

Copy link

cfreal commented Nov 11, 2024

What @jvoisin said. Relevant section here. I can provide the adminer exploit as well tomorrow if you find it useful.

Copy link

Member

dstogov commented Nov 12, 2024

@jvoisin @cfreal thank you very much. I'll need to play with this.

@m4p1e

Copy link

m4p1e commented Nov 18, 2024 •

edited

Loading

Future scope would be to activate the input zone in more places (e.g. when parsing json, during unserialize, etc), or to create more zones for various purposes.

Also, do not abuse the userinput zone, as it may introduce a new attack surface. For example, if there is a potential vulnerability in the unserialize process and unserialize-related operations are added to the userinput zone, an attacker could use an HTTP request to arrange an ideal memory layout without needing to account for PHP internals.

Copy link

Contributor

jvoisin commented Nov 18, 2024

Also, do not abuse the userinput zone, as it may introduce a new attack surface. For example, if there is a potential vulnerability in the unserialize process and unserialize-related operations are added to the userinput zone, an attacker could use an HTTP request to arrange an ideal memory layout without needing to account for PHP internals.

Unserialized is already providing enough control to an attacker, this wouldn't change much :D But more seriously, partitioning memory by types (whether primitive types/size, or usage-types) is part of #14083's roadmap :)

Copy link

Contributor

jvoisin commented Apr 8, 2025

@dstogov did you have time to take a look at this?

Copy link

Member

dstogov commented Apr 14, 2025

@arnaud-lb please discuss this with @nielsdos, @iluuu1994 and take the decision.
I don't like this over-complication, but I also don't like to be a blocker.

Copy link

Contributor

jvoisin commented Jul 29, 2025

Hey @nielsdos and @iluuu1994, did you have time to look at this?

@nielsdos

Copy link

Member

nielsdos commented Jul 29, 2025

No I didn't look at this yet in detail.
I need to make time for this in the weekend.

nielsdos

nielsdos reviewed

Aug 3, 2025

Copy link

Member

@nielsdos nielsdos left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't quite see where the initial call for zend_mm_userinput_begin is?

Zend/zend_alloc.c Show resolved Hide resolved

Copy link

Member Author

arnaud-lb commented Aug 4, 2025

@nielsdos it happens in shutdown_memory_manager(full_shutdown: false), which is called before every request

@nielsdos

Copy link

Member

nielsdos commented Aug 9, 2025 •

edited

Loading

I see. I'm neutral to it, I like the protection but it is also limited in scope and doubles the minimum ~~(削除) chunk size (削除ここまで)~~ allocated memory (although not by a lot and it could be overcommitted).

iluuu1994

iluuu1994 reviewed

Aug 26, 2025

Copy link

Member

@iluuu1994 iluuu1994 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is_zend_ptr() looks like it will also need adjustments to loop through all zones.

The concept makes sense to me. Though I'd expect quite a lot of string operations can move user-controlled inputs to the default zone.

Zend/zend_alloc.c Outdated Show resolved Hide resolved

arnaud-lb added 3 commits

August 27, 2025 11:34


 Remote heap spraying / feng shui protection

8759ad6

Isolate request environment / input in separate chunks to makes it more
difficult to remotely control the layout of the heap.


 Fix zend_mm_chunk.reserve

f0ad377


 Fix non-ZEND_MM_HEAP_PROTECTION build

4ac02f7

@github-actions GitHub Actions

Copy link

github-actions bot commented Aug 27, 2025

AWS x86_64 (c7i.24xl)

Attribute	Value
Environment	aws
Runner	host
Instance type	c7i.metal-24xl (dedicated)
Architecture	x86_64
CPU	48 cores
CPU settings	disabled deeper C-states, disabled turbo boost, disabled hyper-threading
RAM	188 GB
Kernel	6.1.147-172.266.amzn2023.x86_64
OS	Amazon Linux 2023年8月20日250818
GCC	11.5.0
Time	2025年08月27日 09:53:07 UTC

Laravel 12.2.0 demo app - 100 consecutive runs, 50 warmups, 100 requests (sec)

PHP	Min	Max	Std dev	Rel std dev %	Mean	Mean diff %	Median	Median diff %	Skew	P-value	Instr count	Memory
PHP - baseline@0ce1	0.46244	0.47158	0.00118	0.25%	0.46914	0.00%	0.46902	0.00%	-3.185	0.999	177372416	43.28 MB
PHP - mm-zones	0.47070	0.47510	0.00060	0.13%	0.47374	0.98%	0.47371	1.00%	-1.074	0.000	177817819	43.57 MB

Symfony 2.7.0 demo app - 100 consecutive runs, 50 warmups, 100 requests (sec)

PHP	Min	Max	Std dev	Rel std dev %	Mean	Mean diff %	Median	Median diff %	Skew	P-value	Instr count	Memory
PHP - baseline@0ce1	0.74021	0.75287	0.00184	0.25%	0.74235	0.00%	0.74199	0.00%	3.685	0.999	291652548	39.84 MB
PHP - mm-zones	0.74407	0.75144	0.00154	0.21%	0.74643	0.55%	0.74622	0.57%	1.784	0.000	292300931	39.84 MB

Wordpress 6.2 main page - 100 consecutive runs, 20 warmups, 20 requests (sec)

PHP	Min	Max	Std dev	Rel std dev %	Mean	Mean diff %	Median	Median diff %	Skew	P-value	Instr count	Memory
PHP - baseline@0ce1	0.57916	0.58261	0.00057	0.10%	0.58025	0.00%	0.58020	0.00%	1.061	0.999	1129906527	43.43 MB
PHP - mm-zones	0.58203	0.58488	0.00063	0.11%	0.58329	0.52%	0.58316	0.51%	0.431	0.000	1133164084	43.40 MB

bench.php - 100 consecutive runs, 10 warmups, 2 requests (sec)

PHP	Min	Max	Std dev	Rel std dev %	Mean	Mean diff %	Median	Median diff %	Skew	P-value	Instr count	Memory
PHP - baseline@0ce1	0.43064	0.44237	0.00182	0.42%	0.43344	0.00%	0.43324	0.00%	2.640	0.999	2031002294	26.50 MB
PHP - mm-zones	0.43231	0.44449	0.00154	0.35%	0.43472	0.30%	0.43456	0.30%	2.735	0.000	2031419569	27.16 MB

arnaud-lb added 4 commits

August 27, 2025 12:53


 Adjust iz_zend_ptr()

e685306


 Fix new tests after rebase

f600414


 Revise zend_mm_check_in_userinput()

e931ad3


 Allow to opt-out of userinput isolation

5e7fb2c

@arnaud-lb arnaud-lb force-pushed the mm-zones branch from ff21aab to 5e7fb2c Compare

August 27, 2025 10:55

@arnaud-lb arnaud-lb requested a review from devnexen as a code owner

August 27, 2025 10:55

@github-actions github-actions bot added the Extension: gd label

Aug 27, 2025