4bf88b204f27489fef0b97c66232a9e34a8d0806
1070 Commits
| Author | SHA1 | Message | Date | |
|---|---|---|---|---|
|
Zuul
|
f89d54f4b8 | Merge "Improve function list_block_devices_check_skip_list" | ||
|
Jakub Jelinek
|
1ac61e1dbd |
Improve function list_block_devices_check_skip_list
Fix minor issues suggested by dtantsur Add an example of skip list specification to the documentation A follow-up patch to I3bdad3cca8acb3e0a69ebb218216e8c8419e9d65 Change-Id: Ic94a33b7bc0572a1cc8f92b330474ec63a173e81 |
||
|
Zuul
|
3a4baa637f | Merge "Enable skipping disks for cleaning" | ||
|
Jakub Jelinek
|
0212337bd5 |
Enable skipping disks for cleaning
Introduce a field skip_block_devices in properties - this is a list of dictionaries Create a helper function list_block_devices_check_skip_list Update tests of erase_devices_express to use node when calling _list_erasable_devices Add tests covering various options of the skip list definition Use the helper function in get_os_install_device when node is cached Story: 2009914 Change-Id: I3bdad3cca8acb3e0a69ebb218216e8c8419e9d65 |
||
|
Zuul
|
eb2215090a | Merge "Use lsblk json output for safety_check_block_device" | ||
|
Jakub Jelinek
|
e196fdfb62 |
Remove unused lines of code
The 5 lines of code were extracted from erase_devices_metadata to _list_erasable_devices, but now are duplicated in both functions. The variable block_devices is not used in erase_devices_metadata. Change-Id: I89f56c69d90fb0eb61907d6667266fbd57d333af |
||
|
Riccardo Pittau
|
b5fac66bc3 |
Use lsblk json output for safety_check_block_device
Change-Id: Ibfc2e203287d92e66567c33dc48f59392852b88e |
||
|
Zuul
|
21b21a5f15 | Merge "Guard shared device/cluster filesystems" | ||
|
Julia Kreger
|
beb7484858 |
Guard shared device/cluster filesystems
Certain filesystems are sometimes used in specialty computing environments where a shared storage infrastructure or fabric exists. These filesystems allow for multi-host shared concurrent read/write access to the underlying block device by *not* locking the entire device for exclusive use. Generally ranges of the disk are reserved for each interacting node to write to, and locking schemes are used to prevent collissions. These filesystems are common for use cases where high availability is required or ability for individual computers to collaborate on a given workload is critical, such as a group of hypervisors supporting virtual machines because it can allow for nearly seamless transfer of workload from one machine to another. Similar technologies are also used for cluster quorum and cluster durable state sharing, however that is not specifically considered in scope. Where things get difficult is becuase the entire device is not exclusively locked with the storage fabrics, and in some cases locking is handled by a Distributed Lock Manager on the network, or via special sector interactions amongst the cluster members which understand and support the filesystem. As a reult of this IO/Interaction model, an Ironic-Python-Agent performing cleaning can effectively destroy the cluster just by attempting to clean storage which it percieves as attached locally. This is not IPA's fault, often this case occurs when a Storage Administrator forgot to update LUN masking or volume settings on a SAN as it relates to an individual host in the overall computing environment. The net result of one node cleaning the shared volume may include restoration from snapshot, backup storage, or may ultimately cause permenant data loss, depending on the environment and the usage of that environment. Included in this patch: - IBM GPFS - Can be used on a shared block device... apparently according to IBM's documentation. The standard use of GPFS is more Ceph like in design... however GPFS is also a specially licensed commercial offering, so it is a red flag if this is encountered, and should be investigated by the environment's systems operator. - Red Hat GFS2 - Is used with shared common block devices in clusters. - VMware VMFS - Is used with shared SAN block devices, as well as local block devices. With shared block devices, ranges of the disk are locked instead of the whole disk, and the ranges are mapped to virtual machine disk interfaces. It is unknown, due to lack of information, if this will detect and prevent erasure of VMFS logical extent volumes. Co-Authored-by: Jay Faulkner <jay@jvf.cc> Change-Id: Ic8cade008577516e696893fdbdabf70999c06a5b Story: 2009978 Task: 44985 |
||
|
Dmitry Tantsur
|
6a1334a068 |
Drop support for instance netboot
Change-Id: I2b4c543537dac8904028fdcdb590c1c214238e10 |
||
|
Zuul
|
5129eb4933 | Merge "Fix passing kwargs in clean steps" | ||
|
Zuul
|
ccf4ee31cf | Merge "Gather details about bond interfaces if present" | ||
|
Zuul
|
7d15efd7a6 | Merge "Remove oslo.serialization dependency" | ||
|
Zuul
|
0cf5959f67 | Merge "Collect udev properties in the ramdisk logs" | ||
|
waleedm
|
eb07839bd4 |
Fix passing kwargs in clean steps
Pass kwargs to dispatch_to_managers method in execute_clean_step Change-Id: Ida4ed4646659b2ee3f8f92b0a4d73c0266dd5a99 Story: 2010123 Task: 45705 |
||
|
Zuul
|
a9de7f80cc | Merge "Use json for lsblk output" | ||
|
Zuul
|
312e1527ab | Merge "Warn when smartctl not found" | ||
|
Zuul
|
2d486d9061 | Merge "Remove importlib-metadata from requirements" | ||
|
Mark Goddard
|
b68fa6b2e1 |
Warn when smartctl not found
Currently, if smartctl is not found by IPA, it will silently skip ATA secure erase and proceed to shred (if enabled). This is supposedly for backwards compatibility, but is quite hard to diagnose. This change adds a warning message to make it more obvious what is happening. TrivialFix Change-Id: I03a381e99de79f201ec7e9a388777c3d48457e93 |
||
|
Riccardo Pittau
|
1356157ec8 |
Remove importlib-metadata from requirements
We don't need it anymore as we don't support python < 3.8 Also it was removed from global requirements so it breaks the requirements check. Change-Id: Ia12cbef3515f823fdd627a36020cf7801bf6d734 |
||
|
Derek Higgins
|
7e4fe3bf6a |
Gather details about bond interfaces if present
If present gather information about bonded interfaces. Story: #2010093 Task: #45637 Change-Id: I394187640b4788ebec21c3391d33ed728fb72ffa |
||
|
Dmitry Tantsur
|
a98675890f |
Collect udev properties in the ramdisk logs
Change-Id: Ifcf3dfff00b604dec1e2f430369ab8053f50f137 |
||
|
Riccardo Pittau
|
64ffd2ee80 |
Remove oslo.serialization dependency
Use pure json instead of jsonutils. Borrow encode function from oslo.serialization to be used in the utils module. Change-Id: Ied9a2259a4329a86b4f0853bd1fb187563c0a036 |
||
|
Dmitry Tantsur
|
69e2254503 |
Fix discovering WWN/serial for devicemapper devices
UDev prefix is DM_ not ID_ for them. On top of that, they don't have short serials (or at least don't always have). Change-Id: I5b6075fbff72201a2fd620f789978acceafc417b |
||
|
Riccardo Pittau
|
09ea41c83d |
Use json for lsblk output
The lsblk output is available in json format since version 2.27 of util-linux [1] https: //mirrors.edge.kernel.org/pub/linux/utils/util-linux/v2.27/v2.27-ReleaseNotes Change-Id: I0c5812736b7a320cc4ecc333f80db70eb78cc76d |
||
|
Julia Kreger
|
014d37743a |
Multipath Hardware path handling
Removes multipath base devices from consideration by default, and instead allows the device-mapper device managed by multipath to be picked up and utilized instead. In effect, allowing us to ignore standby paths *and* leverage multiple concurrent IO paths if so offered via ALUA. In reality, anyone who has previously built IPA with multipath tooling might not have encountered issues previously because they used Active/Active SAN storage environments. They would have worked because the IO lock would have been exchanged between controllers and paths. However, Active/Passive environments will block passive paths from access, ultimately preventing new locks from being established without proper negotiation. Ultimately requiring multipathing *and* the agent to be smart enough to know to disqualify underlying paths to backend storage volumes. An additional benefit of this is active/active MPIO devices will, as long as ``multipath`` is present inside the ramdisk, no longer possibly result in duplicate IO wipes occuring accross numerous devices. Story: #2010003 Task: #45108 Resolves: rhbz#2076622 Resolves: rhbz#2070519 Change-Id: I0fd6356f036d5ff17510fb838eaf418164cdfc92 |
||
|
Zuul
|
6b8f387498 | Merge "Collect a full lsblk output in the ramdisk logs" | ||
|
Zuul
|
979eea621e | Merge "Do not try to guess EFI partition path by its number" | ||
|
Dmitry Tantsur
|
f09f6c9f1a |
Do not try to guess EFI partition path by its number
The logic of adding a partition number to the device path does not work for devicemapper devices (e.g. a multipath storage device). Change-Id: I9a445e847d282c50adfa4bad5e7136776861005d |
||
|
Dmitry Tantsur
|
65c4de903a |
Use a pre-defined partition UUID to detect configdrive on GPT
Using partition numbers is currently broken for devicemapper devices. Fortunately, GPT has partition UUIDs, so we can just generate one and use it for lookup. Change-Id: I41ffe4f8e4c6e43182090b5aa2a2b4b34f32efd5 |
||
|
Dmitry Tantsur
|
424e649bed |
Collect a full lsblk output in the ramdisk logs
The existing lsblk call is very handy for an overview, but there a lot more useful pairs to collect. Collect them in a machine-readable format to be able to use in debugging and further development. Change-Id: Ib27843524421944ee93de975d275e93276a5597a |
||
|
Riccardo Pittau
|
8111475eb0 |
Use Werkzeug modern version
Request class from Werkzeug now includes json capability by default. See [1] and [2] for more info. [1] |
||
|
Zuul
|
a247fbcc8c | Merge "Refactor efi_utils for easier maintaining and debugging" | ||
|
Zuul
|
f08f70134d | Merge "Improve efficiency of storage cleaning in mixed media envs" | ||
|
Jacob Anders
|
c5f7f18bcb |
Improve efficiency of storage cleaning in mixed media envs
https://storyboard.openstack.org/#!/story/2008290 added support for NVMe-native storage cleaning, greatly improving storage clean times on NVMe-based nodes as well as reducing device wear. This is a follow up change which aims to make further improvements to cleaning efficiency in mixed NVMe-HDD environments. This is achieved by combining NVMe-native cleaning methods on NVMe devices with traditional metadata clean on non-NVMe devices. Story: 2009264 Task: 43498 Change-Id: I445d8f4aaa6cd191d2e540032aed3148fdbff341 |
||
|
Zuul
|
de28b7bfdc | Merge "Create fstab entry with appropriate label" | ||
|
Julia Kreger
|
99ca1086db |
Create fstab entry with appropriate label
Depending on the how the stars align with partition images being written to a remote system, we *may* end up with *either* a Partition UUID value, or a Partition's UUID value. Which are distinctly different. This is becasue the value, when collected as a result of writing an image to disk *falls* back and passes the value to enable partition discovery and matching. Later on, when we realized we ought to create an fstab entry, we blindly re-used the value thinking it was, indeed, always a Partition's UUID and not the Partition UUID. Obviously, the label type is quite explicit, either UUID or PARTUUID respectively, when initial ramdisk utilities such as dracut are searching and mounting filesystems. Adds capability to identify the correct label to utilize based upon the current state of the block devices on disk. Granted, we are likely only exposed to this because of IO race conditions under high concurrecy load operations. Normally this would only be seen on test VMs, but systems being backed by a Storage Area Network *can* exibit the same IO race conditions as virtual machines. Change-Id: I953c936cbf8fad889108cbf4e50b1a15f511b38c Resolves: rhbz#2058717 Story: #2009881 Task: 44623 |
||
|
Zuul
|
59c02f48cc | Merge "Run partx in verbose mode to simplify debugging" | ||
|
Zuul
|
63171c7f38 |
Merge "Add mount and parted -l to the collected commands"
|
||
|
Zuul
|
bcd5d11d9a | Merge "Rescan device after filesystem creation" | ||
|
Riccardo Pittau
|
697fa6f3b6 |
Use utf-16-le if BOM not present
In case no BOM is present in the CSV file the utf-16 codec won't work. We fail over to utf-16-le as Little Endian is commonly used. Change-Id: I3e25ce4997f5dd3df87caba753daced65838f85a |
||
|
Dmitry Tantsur
|
f1ee454a0e |
Add mount and parted -l to the collected commands
Change-Id: I1c759552220291890704d0002a62ea3f51701691 |
||
|
Dmitry Tantsur
|
3d3df17e5a |
Refactor efi_utils for easier maintaining and debugging
* Move irrelevant code from inside the giant try..except block * Do not bother removing the (empty) temporary mountpoint * Fix log messages according to the actual code * Fix some code duplication * Add missing unit tests for failure case Change-Id: Id7b557419d513375816d73901e2ab6f139d765ad |
||
|
Dmitry Tantsur
|
4d16ea413f |
Run partx in verbose mode to simplify debugging
Otherwise the actual failure cause is not recorded. Change-Id: If66ee97016ddf0e5c3f40ad9400ff3bc6fdebedc |
||
|
Arne Wiebalck
|
a83f38479e |
Move prepare_boot_partitions_for_softraid to raid_utils
prepare_boot_partitions_for_softraid() is used in BIOS and UEFI
modes to prepare the partitions for the bootloader. Move it from
the image extensions to raid_utils to reflect this and avoid the
import of an extension to efi_utils.
Follow-up to
|
||
|
Vanou Ishii
|
fa70a1909b |
Rescan device after filesystem creation
In work_on_disk function, IPA runs mkfs commands without following device rescan operation. This leads to incorrect content of uuids_to_return to be returned. These mkfs commands modify partition label but IPA fails to catch such changes because of no following device rescan operation. This commit adds call of device rescan function before uuids_to_return construction. Change-Id: I4e8b30deb5e2247f51ce8f10bd3271f64a264089 |
||
|
Dmitry Tantsur
|
6ebf041704 |
Use canonical device name for RAID device for ESP
It seems like tinyIPA silently replaces /dev/md/esp with /dev/md127. Find the next free /dev/md device and use it instead. Also rescan the resulting device before copying files. Change-Id: Ie04f530be434c4b1561e75f387b9da679e4607e0 Depends-On: https://review.opendev.org/c/openstack/ironic/+/827129/ |
||
|
Arne Wiebalck
|
62c5674a60 |
SoftwareRAID: Use efibootmgr (and drop grub2-install)
Move the software RAID code path from grub2-install to efibootmgr: - remove the UEFI efibootmgr exception for software RAID - create and populate the ESPs on the holder disks - update the NVRAM with all ESPs (the component devices of the ESP mirror, use unique labels to avoid unintentional deduplication of entries in the NVRAM) Story: #2009794 Change-Id: I7ed34e595215194a589c2f1cd0b39ff0336da8f1 |
||
|
Zuul
|
e06dd22e78 | Merge "Burn-in: Dynamic network pairing" | ||
|
Arne Wiebalck
|
7f15455d8d |
Burn-in: Dynamic network pairing
Pair nodes dynamically via a distributed coordination backend for network burn-in. The algorithm uses a group to pair nodes: after acquiring a lock, a first node joins the group, releases the lock, waits for a second node, then they both leave, and release the lock for the next pair. Story: #2007523 Task: #42796 Change-Id: I572093b144bc90a49cd76929c7e8685ed45d9f6e |