7d7735a21622ded09e1064fcdecb981009b3927b
1287 Commits
| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|
Riccardo Pittau
|
7d7735a216 |
Fix API URL reachability test to use full URL with port
The _test_ip_reachability method was only using the hostname/IP address when testing reachability, ignoring the port number from the API URL. This caused LookupAgentIPError when the Ironic API was running on a non-standard port (e.g., 6385).
This change modifies _test_ip_reachability to:
- Accept the full API URL instead of just an IP address
- Use the complete URL (including protocol and port) when testing
The _find_routable_addr method now passes the full api_url to _test_ip_reachability instead of just the hostname, ensuring the port is included in reachability tests.
Assisted-By: Claude Sonnet 4.5 Change-Id: Ibb407255cfcd5cf9617f040338561fd494e8b41f Signed-off-by: Riccardo Pittau <elfosardo@gmail.com> |
||
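The fix described in 7d7735a216 can be illustrated with a minimal sketch: take the host and the port from the full API URL and test a connection against both, instead of probing the hostname alone. The helper names `host_port_from_url` and `is_reachable` are hypothetical and are not the agent's own methods, and the plain TCP connect is an assumption about how reachability might be tested.

```python
import socket
from urllib.parse import urlsplit

# Fallback ports when the URL does not name one explicitly.
_DEFAULT_PORTS = {"http": 80, "https": 443}


def host_port_from_url(api_url):
    """Return (host, port) for a URL, falling back to the scheme default."""
    parts = urlsplit(api_url)
    if not parts.hostname:
        raise ValueError(f"No host in URL: {api_url!r}")
    port = parts.port or _DEFAULT_PORTS.get(parts.scheme)
    if port is None:
        raise ValueError(f"Cannot determine port for URL: {api_url!r}")
    return parts.hostname, port


def is_reachable(api_url, timeout=3.0):
    """Try a TCP connection to the host and port named in the URL."""
    host, port = host_port_from_url(api_url)
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

With this shape, `http://192.0.2.10:6385/v1` is probed on port 6385 rather than on whatever port a hostname-only test would assume.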
|
Riccardo Pittau
|
bae591a808 |
Fix RuntimeError when stopping heartbeater in rescue mode
In rescue mode, the agent attempts to stop the heartbeater thread even though it was never started, causing a RuntimeError. This fix adds checks to ensure the heartbeater thread is alive before attempting to stop it. Assisted-By: Claude Sonnet 4.5 Change-Id: I3e97b10f2c7f3c454f0db2a3c3c8efb61ffeda5a Signed-off-by: Riccardo Pittau <elfosardo@gmail.com> |
||
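The failure mode in bae591a808 is easy to reproduce with the standard library: joining a `threading.Thread` that was never started raises `RuntimeError`. The sketch below guards the stop with `is_alive()`. The `Heartbeater` class and `stop_heartbeater` helper here are simplified stand-ins, not the agent's actual code.

```python
import threading


class Heartbeater(threading.Thread):
    """Stand-in for a heartbeat thread; hypothetical, not the agent's class."""

    def __init__(self):
        super().__init__(daemon=True)
        self.stop_event = threading.Event()

    def run(self):
        # Wait until asked to stop; a real heartbeater would send heartbeats.
        self.stop_event.wait()

    def force_stop(self):
        self.stop_event.set()
        self.join()  # raises RuntimeError if the thread was never started


def stop_heartbeater(heartbeater):
    """Stop the thread only if it is running; return True if it was stopped."""
    if not heartbeater.is_alive():
        return False
    heartbeater.force_stop()
    return True
```

Calling `stop_heartbeater` on a thread that was never started simply returns `False`, which corresponds to the rescue mode case in the commit message.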
|
Riccardo Pittau
|
2c6cf7cf1f |
Test advertised ip reachability before assigning it
The advertised IP for the Ironic API is only checked for being routable, but it could still be unreachable, so we need to check actual connectivity before assigning it. Assisted-By: Claude Sonnet 4 Change-Id: I0adca5ad00ba419a7e2aa6883b3690b4507c25e5 Signed-off-by: Riccardo Pittau <elfosardo@gmail.com> |
||
|
Morten Stephansen
|
bb4b4fdb38 |
Fix for matching hints with lists of strings
Added logic for matching hints with lists of WWN/Serial. These lists appear when both lsblk and udev are used to fetch the information about a device. One consequence of this is that it allows a device on the skip list to be used as root device, thus overwriting the protected data. This has previously been handled before matching the hints, e.g. the removed section in hardware.py. This patch aims to fix the problem globally by handling the issue inside the find_devices_by_hints function. Closes-bug: #2130410 Change-Id: I28129f2ededb37474025f35164d5dc9ece21ec8e Signed-off-by: Morten Stephansen <morten.kaastrup.stephansen@cern.ch> Signed-off-by: Jakub Jelinek <jakub.jelinek@cern.ch> |
||
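As a rough illustration of the matching problem in bb4b4fdb38, the sketch below treats a device field as either a single string or a list of strings and matches a hint if any reported value equals it. The function names and the dictionary layout are hypothetical, and matching is simplified to case-insensitive equality; the real find_devices_by_hints logic is more involved.

```python
def _as_list(value):
    """Normalize a scalar or a collection of values into a list of strings."""
    if value is None:
        return []
    if isinstance(value, (list, tuple, set)):
        return [str(v) for v in value]
    return [str(value)]


def device_matches_hint(device, hint_name, hint_value):
    """Return True if any value reported for hint_name equals hint_value."""
    wanted = str(hint_value).strip().lower()
    return any(v.strip().lower() == wanted
               for v in _as_list(device.get(hint_name)))
```

A device that reports `serial` as `["S1", "abc123"]` then matches a hint of `ABC123`, whereas a comparison that assumed a plain string would miss it.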
|
Zuul
|
f0888131c1 | Merge "Fix skip block devices for RAID arrays" | ||
|
kubajj
|
d43913453b |
Fix skip block devices for RAID arrays
The original implementation of the skip block devices for RAID arrays: https://review.opendev.org/c/openstack/ironic-python-agent/+/852999 introduced several bugs that went uncaught:
1. Key error when a holder disk contains just logical disks on the skip list.
2. RAID arrays on the skip list throw "Failed to remove partitions" because they are not removed from the list of remaining RAID devices when running wipefs
3. list_block_devices_check_skip_list does not match volume names to RAID arrays
4. MD superblock wrongly checked (detail instead of examine)
5. Partition tables are being created when a partition is on a skip list
6. EFI partition handling in a scenario when a partition on the same physical disk is not deleted
Closes-bug: #2080871 Signed-off-by: Jakub Jelinek <jakub.jelinek@cern.ch> Signed-off-by: Morten Stephansen <morten.kaastrup.stephansen@cern.ch> Change-Id: I59b65c6b69af2385ed8a5dcd427e4d9c91f90abe |
||
|
Jakub Jelinek
|
f14c187a64 |
Fix erasable devices check
There is a conditional which is supposed to check whether there are any erasable devices. However, in the current state, the conditional is wrong as the call is missing the node as a parameter. Signed-off-by: Jakub Jelinek <jakub.jelinek@cern.ch> Signed-off-by: Morten Stephansen <morten.kaastrup.stephansen@cern.ch> Change-Id: I38768b9ba3dc1bb5160e5841865450a8d7df5466 |
||
|
Zuul
|
ffafce66ca | Merge "Support transport type as a root device hint" | ||
|
Morten Stephansen
|
41a00b2465 |
Removed redundant side-effects
In test_create_configuration_with_different_disks_skip_list there are redundant side-effects which are never used. Change-Id: I7b6acbd6583831ba82d6a68ce0d8410ee0bd18a2 Signed-off-by: Morten Stephansen <morten.kaastrup.stephansen@cern.ch> |
||
|
Zuul
|
0fa82ee2ee | Merge "Fix for motherboards where efibootmgr returns UTF-8." | ||
|
Morten Stephansen
|
f302c1c73e |
Fix for motherboards where efibootmgr returns UTF-8.
On some motherboards, efibootmgr returns UTF-8 output. This results in IPA not being able to remove duplicate records, etc. This change implements a check for UTF-16 compatibility and tries to decode efibootmgr output as UTF-8 otherwise. Closes-bug: #2072336 Co-Authored-By: Jakub Jelinek <jakub.jelinek@cern.ch> Change-Id: I35432773826d13edb1dc9dd25f99bb0907a8fa0d Signed-off-by: Morten Stephansen <morten.kaastrup.stephansen@cern.ch> Signed-off-by: Jakub Jelinek <jakub.jelinek@cern.ch> |
||
|
Zuul
|
f46f56decc | Merge "Hard stop on image download duration threshold" | ||
|
Zuul
|
07f8c07674 | Merge "Hint at sector sizes when reporting an invalid written image" | ||
|
Kaifeng Wang
|
2e4172a024 |
Support transport type as a root device hint
Adds a tran field to the block device and allows using it as a root device hint. Change-Id: I3fc83730a6100abb2b2aa98fc894713ecbbe3043 Closes-Bug: #2100951 Signed-off-by: Kaifeng Wang <kaifeng.w@gmail.com> |
||
|
Zuul
|
5a96e0a937 | Merge "Vendor own option for tls cert file and key file" | ||
|
Zuul
|
c2c72eef97 | Merge "Provide usable __repr__ for serializable objects" | ||
|
Afonne-CID
|
e1a31eb97a |
Hard stop on image download duration threshold
Adds a wall timeout `image_download_max_timeout` to enforce an upper bound on total download duration. While the per-chunk timeout protects against stalled reads, downloads that trickle in just under the timeout threshold (e.g., due to heavy TCP retransmits) can hang for longer than intended. Now, if the total allowed time is exceeded, the download is aborted with a non-retryable `ImageDownloadTimeoutError` regardless of per-chunk retry or connection success. A value of 0 (the default) disables this feature. Closes-Bug: #2115995 Change-Id: I3b56d21abae0488853bfed14072ba21116d47baf Signed-off-by: Afonne-CID <afonnepaulc@gmail.com> |
||
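The difference between a per-chunk timeout and a wall timeout, as described in e1a31eb97a, can be sketched as a generator that checks the total elapsed time before handing out each chunk. The function name `read_with_deadline` and the injectable `clock` parameter are assumptions made for illustration, and the exception class is a local stand-in for the `ImageDownloadTimeoutError` named in the commit.

```python
import time


class ImageDownloadTimeoutError(Exception):
    """Local stand-in for the non-retryable error named in the commit."""


def read_with_deadline(chunks, max_timeout=0, clock=time.monotonic):
    """Yield chunks, aborting once max_timeout seconds have elapsed in total.

    A max_timeout of 0 disables the check, mirroring the documented default.
    """
    start = clock()
    for chunk in chunks:
        # Each chunk may arrive within its own timeout, yet the total can
        # still exceed the overall budget; that is what this check catches.
        if max_timeout and clock() - start > max_timeout:
            raise ImageDownloadTimeoutError(
                f"Download exceeded {max_timeout} seconds")
        yield chunk
```

Using a monotonic clock keeps the check immune to system time adjustments, and passing the clock in makes the behaviour testable without sleeping.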
|
Dmitry Tantsur
|
0ad2685ce4 |
Fix wrong result of filter_device in the base interface
Returning None means "exclude any device". It still works if the only hardware manager is GenericHardwareManager, but enabling any more without overriding filter_device causes all devices to get filtered. Change-Id: I25eb028baa1b9182caee07a0b935e9fa107999a8 Closes-Bug: #2117234 Signed-off-by: Dmitry Tantsur <dtantsur@protonmail.com> |
||
|
Zuul
|
c3ef9a563d | Merge "Fix software RAID creation on different physical devices" | ||
|
Zuul
|
a7d8afde40 | Merge "Trivial: avoid root logger in modules" | ||
|
Zuul
|
4b0a3c162d | Merge "Log efibootmgr output as part of the system logs" | ||
|
kubajj
|
46af01c82f |
Log efibootmgr output as part of the system logs
Create a file efibootmgr with the verbose output of the efibootmgr command when collecting the system logs. This can be used for debugging of boot order. Change-Id: Ic957024d19bb01a45a3014dc2a5e4492d087e893 Signed-off-by: Jakub Jelinek <vilouskubajj@gmail.com> |
||
|
Dmitry Tantsur
|
9e9334fa00 |
Hint at sector sizes when reporting an invalid written image
Change-Id: Iee7f22e9958b68f263d267c434678478b9a38648 Signed-off-by: Dmitry Tantsur <dtantsur@protonmail.com> |
||
|
Dmitry Tantsur
|
1840e987d5 |
Trivial: avoid root logger in modules
Change-Id: I1eb8c9637c38496b5d4dd6493be43f414f35680c Signed-off-by: Dmitry Tantsur <dtantsur@protonmail.com> |
||
|
Takashi Kajinami
|
a2739f7e56 |
Vendor own option for tls cert file and key file
... instead of using oslo.service. Current usage of oslo.service is too limited to add the dependency, because
- oslo.service registers multiple options but only two of these are used
- the wrap implementation from oslo.service is not actually used
Change-Id: I4e8f18951d73e329a54cf6546344c5704fe4aa90 Signed-off-by: Takashi Kajinami <kajinamit@oss.nttdata.com> |
||
|
Dmitry Tantsur
|
9db3cd1e4d |
Graceful way for hardware managers to ignore certain devices
My use case for this feature is to exclude network devices that use the cdc_ether driver. These USB network interfaces often cause all sorts of issues. For example, some models have the same hardcoded MAC address, which breaks inspection.
Currently, to exclude a certain device, a hardware manager must override the entire listing function (in my case, list_interfaces). Not only is it tedious, but it also requires constantly updating the hardware managers to match the implementation in GenericHardware. Realistically, it will cause hardware manager authors to inherit GenericHardware, which is the opposite of how hardware managers should be written.
Note that the node-level skip list only affects root device selection and cleaning for block devices. This feature affects everything that uses list_block_devices and is applied before the node-level skip list.
This change adds a new hardware manager call filter_device. For each network, block or USB device, it allows a hardware manager to do one of four things:
1. Delegate the decision to a lower level hardware manager by raising IncompatibleHardwareMethodError
2. Remove the device by returning None
3. Change the device by returning a modified instance
4. Return the device unchanged to keep it in the listing.
Note that I'm removing debug logging when IncompatibleHardwareMethodError is raised. Not only is the log message incorrect (the error does not necessarily mean that the method is not implemented at all), it already takes up noticeable space in the logs, and with this change it would become very noisy.
Change-Id: I5437343af6c6157882bcf0600dd89bd20478c948 Signed-off-by: Dmitry Tantsur <dtantsur@protonmail.com> |
||
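The four outcomes of filter_device listed in 9db3cd1e4d can be sketched with a small dispatch loop. Everything below is illustrative: the manager classes, the `apply_filters` helper and the dictionary-based devices are hypothetical, the exception is a local stand-in for the agent's `IncompatibleHardwareMethodError`, and keeping a device when every manager delegates is an assumption rather than documented behaviour.

```python
class IncompatibleHardwareMethodError(Exception):
    """Local stand-in for the agent's error of the same name."""


class CdcEtherFilter:
    """Hypothetical manager that drops interfaces using the cdc_ether driver."""

    def filter_device(self, device):
        if device.get("driver") == "cdc_ether":
            return None  # remove the device from the listing
        raise IncompatibleHardwareMethodError()  # delegate the decision


class KeepEverything:
    """Hypothetical lowest-level manager that keeps devices unchanged."""

    def filter_device(self, device):
        return device


def apply_filters(devices, managers):
    """Filter devices via the first manager that makes a decision for each."""
    result = []
    for device in devices:
        for manager in managers:  # assumed ordered from most to least specific
            try:
                device = manager.filter_device(device)
            except IncompatibleHardwareMethodError:
                continue  # this manager has no opinion; ask the next one
            break
        if device is not None:
            result.append(device)
    return result
```

With `[CdcEtherFilter(), KeepEverything()]` as the manager list, an interface whose driver is `cdc_ether` disappears from the result while every other interface passes through untouched, without either manager overriding a whole listing function.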
|
Dmitry Tantsur
|
9426df9ab3 |
Split hardware manager initialize out of evaluate_hardware_support
The current code in GenericHardware.evaluate_hardware_support ends up using hardware manager calls, which then use partly initialized hardware manager list and can even cause a recursion. This change introduces a new optional call initialize() which is guaranteed to run:
1) After all hardware managers have been evaluated
2) After the hardware manager cache is populated
3) In the order of the support level of hardware managers
Change-Id: I068d3d73483c161062aa3b48f3154a2d99941382 Signed-off-by: Dmitry Tantsur <dtantsur@protonmail.com> |
||
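The three guarantees in 9426df9ab3 amount to a two-phase load: evaluate every manager first, then initialize. The sketch below is a simplified model with a hypothetical `Manager` class and `load_managers` helper; treating a support level of 0 as "unsupported" and ordering from highest to lowest support are assumptions, since the commit message only says that the order follows the support level.

```python
class Manager:
    """Hypothetical hardware manager that records when it is initialized."""

    def __init__(self, name, support, log):
        self.name = name
        self.support = support
        self.log = log

    def evaluate_hardware_support(self):
        # Must not call other managers: the cache does not exist yet.
        return self.support

    def initialize(self):
        self.log.append(self.name)


def load_managers(candidates):
    """Evaluate all managers, build the cache, then run initialize()."""
    # Phase 1: evaluate every candidate before anything else happens.
    evaluated = [(m.evaluate_hardware_support(), m) for m in candidates]
    # Phase 2: populate the cache, ordered by support level (highest first).
    ordered = sorted(evaluated, key=lambda pair: pair[0], reverse=True)
    cache = [m for support, m in ordered if support > 0]
    # Phase 3: only now run the optional initialize() hook.
    for manager in cache:
        initialize = getattr(manager, "initialize", None)
        if callable(initialize):
            initialize()
    return cache
```

Because initialize() runs only after the cache is complete, code inside it can safely dispatch to other managers, which is the situation that caused recursion when the same work lived in evaluate_hardware_support.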
|
Dmitry Tantsur
|
ddeed6fa9e |
Provide usable __repr__ for serializable objects
These objects are frequently logged or compared in unit tests. It's very helpful to be able to inspect their content. Change-Id: Ib725dcd5f54f4492205f95974d887b8b42c74039 |
||
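To show why a readable `__repr__` helps in logs and test failures, here is a minimal sketch built on the same dictionary that serialization would use. The `Serializable` base class, its `serializable_fields` attribute and the output format are modelled loosely for illustration and are not claimed to match the agent's implementation.

```python
class Serializable:
    """Hypothetical base class for objects that serialize to a dict."""

    serializable_fields = ()

    def serialize(self):
        return {name: getattr(self, name) for name in self.serializable_fields}

    def __repr__(self):
        # Reuse the serialized form so the repr always shows the same fields.
        fields = ", ".join(f"{k}={v!r}" for k, v in self.serialize().items())
        return f"{type(self).__name__}({fields})"


class BlockDevice(Serializable):
    serializable_fields = ("name", "size")

    def __init__(self, name, size):
        self.name = name
        self.size = size
```

Without the override, `repr(BlockDevice("/dev/sda", 1024))` would show only the class name and a memory address; with it, the output is `BlockDevice(name='/dev/sda', size=1024)`.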
|
Dmitry Tantsur
|
521811cbcc |
Fix software RAID creation on different physical devices
When creating multiple software RAID logical disks that use different sets of physical devices, the partition indices were incorrectly shared across all devices. This caused the second RAID array creation to fail because it tried to use partition indices that didn't exist on those specific devices. This change fixes the issue by tracking partition indices separately for each physical device, ensuring that each device's partitions are numbered correctly starting from their first available index. Closes-Bug: #2115211 Change-Id: I440db4654f3d1d54274d1eee8c4b21c2b0a18d22 Signed-off-by: Mohammed Naser <mnaser@vexxhost.com> |
||
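The bug fixed in 521811cbcc comes down to one shared counter versus one counter per device. A minimal sketch of the per-device approach follows; the `assign_partition_indices` helper and the `physical_disks` key are hypothetical names used only for this example.

```python
from collections import defaultdict


def assign_partition_indices(logical_disks, first_index=1):
    """Give each logical disk the next free partition index on each device."""
    # One counter per physical device instead of a single shared counter.
    next_index = defaultdict(lambda: first_index)
    plan = []
    for disk in logical_disks:
        partitions = {}
        for device in disk["physical_disks"]:
            partitions[device] = next_index[device]
            next_index[device] += 1
        plan.append(partitions)
    return plan
```

With a shared counter, a second array on `/dev/sdc` and `/dev/sdd` would be asked to use partition 2 even though those disks have no partition 1 yet; with per-device counters it starts at the first available index on its own disks.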
|
Zuul
|
b64a41372d | Merge "trivial: Fix missing extra space in logging" | ||
|
Zuul
|
b51cc75ff3 | Merge "netutils: Use ethtool ioctl to get permanent mac address" | ||
|
Nicolas Belouin
|
48422a532f |
netutils: Use ethtool ioctl to get permanent mac address
Fetching the permanent MAC address of the interface instead of the default one makes it possible to get the right one in case it was changed during setup (likely with a bonding setup). In order to fetch the permanent MAC address of a given interface, one can either use Netlink (either rtnetlink or ethtool), or use ethtool ioctl. The use of ioctl feels simpler and requires no additional dependency. The implementation falls back to older behavior should an error occur. Closes-Bug: #2103450 Change-Id: I54151990e396ddcf775128ca24d3db08e45c256d Signed-off-by: Nicolas Belouin <nicolas.belouin@suse.com> |
||
|
cid
|
c03021fee2 |
Remove eventlet from Ironic Python Agent
This change removes several usages of eventlet from IPA:
- Upgrades all requirements on oslo library versions to new ones that support non-eventlet use.
- Removes use of the eventlet wsgi server (via oslo_service.wsgi) and replaces it with the cheroot wsgi server.
- Removes explicit patching of python modules with eventlet
Note that some oslo libraries still use ``eventlet`` to detect and work around its use. This means that it is still installed in environments alongside IPA, even if it's not used or patched into any modules.
Depends-On: https://review.opendev.org/c/openstack/requirements/+/947727 Change-Id: I9accab2d5e9529a88ef5d3db85e76901f14114eb |
||
|
Zuul
|
b20ed83d78 | Merge "Remove unnecessary +x mode" | ||
|
Takashi Kajinami
|
9b03ca7d52 |
Remove unnecessary +x mode
These files are not actually executable. Change-Id: Id4208a91c005b8199d62320882c4f14dcd7f7761 |
||
|
Julia Kreger
|
f359c81c28 |
trivial: Fix missing extra space in logging
Change-Id: Id85563015b3ea9e2802baa7b8ab1ca1d858568d5 |
||
|
Zuul
|
53349cc7cf | Merge "Remove agent_token_required upgrade knob" | ||
|
Zuul
|
88b6e914ad | Merge "Run UT of metricis_lib" | ||
|
satoshi-sh
|
7e6bc5cb85 |
Implement manual cleaning for ContainerHardwareManager
- Implemented manual cleaning - Refactored the code Change-Id: I301aaf9dfd6aff90f505148b65e75033f5043553 |
||
|
satoshi-sh
|
c3e9266f92 |
Ironic Config Intergration for IPA
Update container-based cleaning hardware manager to use ironic conductor config. Note:
- Moved conf variables out of __init__ and evaluate_hardware_support, as the config is overwritten after those steps
- Utilized getattr instead of making methods beforehand. The methods created don't stick for a new instance.
Partial-Bug: #2100556 Change-Id: I53d5a4f112fbed455d5574840611ef6ea2db3eae |
||
|
satoshi-sh
|
7efe3dfc04 |
Fix test_timeout on Python3.13
- Python 3.13 uses time.time_ns for logging https://github.com/python/cpython/blob/main/Lib/logging/__init__.py#L303 Change-Id: I3de44cc0fda662f3d5b1c6ea8add973cf2ed3bd9 |
||
|
luocanhui
|
228033e9b6 |
Run UT of metricis_lib
Change-Id: I54dc23d8e8cec12a4685a82c4807ddd6a2267533 |
||
|
Zuul
|
c4f62beedb | Merge "Remove eventlet greendns patching" | ||
|
Jay Faulkner
|
5d3192e93a |
Remove eventlet greendns patching
Eventlet effectively patches two things in socket:
- create_connection
- various greendns things
By adding this environment variable, we're going to disable a large portion of the greened module code in IPA, which will be a boost to our migration off eventlet.
Change-Id: I1f94238c8d83f9e7cb0f7e096172ffb7c20c862b |
||
|
satoshi-sh
|
a3a2b83594 |
Add ContainerHardwareManager
Implement container-based cleaning process Partial-Bug: #2100556 Change-Id: I39b92462d1454df888fc413e0aac439b9df199f7 |
||
|
Zuul
|
1e73cba5cb | Merge "Remove pre-victoria cycle agent token transition upgrade support" | ||
|
Zuul
|
03afbfe4bc | Merge "Fix the way qemu-img is called with prlimits" | ||
|
Zuul
|
20e75c2480 | Merge "Missing __init__.py file" | ||
|
Zuul
|
0067239074 | Merge "Lockout agent command results if a token is received" | ||
|
Thomas Goirand
|
14009b994f |
Missing __init__.py file
Change-Id: I916fe8187fb9d8513852b620df133309fbc92af1 |