1f15a10c6a48cff3e0384505fada13012ba5d5e0
Commit Graph

2069 Commits

Author SHA1 Message Date
Riccardo Pittau
1f15a10c6a Run dib ipa src jobs on ubuntu focal
Also disabling dstat in all focal based jobs because of
https://bugs.launchpad.net/ubuntu/+source/dstat/+bug/1866619
Also includes an increase of allocated memory to improve the job
success rate; original change from [1]
And re-enabling ipa-tempest-bios-ipmi-direct-src and
ipa-tempest-bios-ipmi-iscsi-src
[1] https://review.opendev.org/759765
Co-authored-by: Julia Kreger <juliaashleykreger@gmail.com>
Change-Id: Ia154745b71c17063eeaaabeb56950ab1cc9ce927
2020-10-27 21:31:05 +01:00
Julia Kreger
a67807b9b6 Mark standalone job non-voting/remove from gate
The standalone job at present has a high chance of failure
due to two separate things occurring:
1) The deployed nodes from raid tests can be left in a dirty state
 as the raid configuration remains and is chosen as the root
 device for the next deployment. If this is chosen by any job,
 such as rescue or a deployment test that attempts to log in,
 then the job fails with unable to ssh. The fix for this is
 in the ironic-tempest-plugin but we need to get other fixes
 in to stabilize the gate first.
 https://review.opendev.org/#/c/757141/
2) Long running scenarios run in cleaning such as deployment with
 RAID in the standalone suite can encounter conditions where
 the conductor tries to send the next command along before the
 present configuration command has completed. An example is when
 downloading the image is still running, while a heartbeat
 has occurred in the background and the conductor then seeks
 to perform a second action. This then causes the entire
 deployment to fail, even though it was transitory.
 This should be a relatively easy fix.
 https://review.opendev.org/759906
Also disabling ipa-tempest-bios-ipmi-direct-src and
ipa-tempest-bios-ipmi-iscsi-src for the time being, will revert
in https://review.opendev.org/757812
Change-Id: I08c8c258587dd3786dfa92e7879a80c80971d7df
2020-10-27 21:28:08 +01:00
Zuul
81ba64a5ed Merge "Set safe version of hacking" 2020-10-22 08:46:43 +00:00
Riccardo Pittau
7bcddeee09 Set safe version of hacking
Versions of hacking from 3.1.0 up to, but not including, 4.0.0 are supposed
to require a safe version of flake8, so apart from the usual issues related
to minor version upgrades, we can safely use those versions.
Also forcing pycodestyle versions to be compatible with flake8 installed
by hacking.
Change-Id: Ifa2af7404f4cd85cffd29ed725278d04a7c5f8c6
2020-10-19 13:47:04 +00:00
Arne Wiebalck
c7f6baf7f4 [trivial] Remove redundant list conversion
Follow-up to https://review.opendev.org/#/c/756300/
Change-Id: Ibc6c044e24dde82928f19a9b9a7eaf68be53fb0e
2020-10-13 08:29:53 +02:00
Zuul
80b0a9a132 Merge "Software RAID: Re-add missing devices" 2020-10-12 12:24:24 +00:00
Zuul
0e6e023574 Merge "Use focal for tinyipa src jobs used by ipa-builder" 2020-10-10 16:19:28 +00:00
Dmitry Tantsur
420ebc0d73 Do not silently swallow errors in the write_image deploy step
Calling join() does not raise, we need to explicitly check the result.
Change-Id: I81d3d727af220c2b50358edab8139f07874611f0
Story: #2008240
Task: #41083 
2020-10-09 11:24:12 +02:00
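The point made in commit 420ebc0d73 can be illustrated with a minimal sketch. It uses a plain `threading.Thread` subclass rather than the agent's own command result classes, and the names `Worker`, `failing_write` and `run_and_check` are hypothetical, introduced only for this illustration. It shows that `join()` returns normally even when the background work failed, so the caller has to inspect the stored outcome itself.

```python
import threading


class Worker(threading.Thread):
    """Hypothetical background task that records its outcome."""

    def __init__(self, func):
        super().__init__()
        self._func = func
        self.result = None
        self.error = None

    def run(self):
        try:
            self.result = self._func()
        except Exception as exc:
            # Keep the failure so the caller can inspect it after join().
            self.error = exc


def failing_write():
    """Stand-in for a step that fails while writing an image."""
    raise RuntimeError("image write failed")


def run_and_check(func):
    """Run func in the background and surface its error explicitly."""
    worker = Worker(func)
    worker.start()
    worker.join()  # returns normally even if func raised
    if worker.error is not None:
        raise worker.error
    return worker.result
```

Without the explicit `worker.error` check, `run_and_check(failing_write)` would return `None` and the failure would go unnoticed.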
Zuul
bd127d193b Merge "Reduce the duration of retries in the inspector tests" 2020-10-08 23:04:24 +00:00
Zuul
35d2292aa4 Merge "Log a warning of target_boot_mode does not match current boot mode" 2020-10-07 17:01:51 +00:00
Zuul
6706e02691 Merge "Use bionic nodeset for DIB centos src jobs" 2020-10-07 14:39:28 +00:00
Dmitry Tantsur
62672de131 Reduce the duration of retries in the inspector tests
Currently the test takes 5*5=25 seconds. Re-arrange the code so
that it's possible to change the retry delay in tests.
Change-Id: Ia559dad4bc656f8ad6b2cb8cb0137a97e2614db7
2020-10-07 12:39:01 +02:00
Dmitry Tantsur
1a67dddde7 Log a warning of target_boot_mode does not match current boot mode
This is not a normal situation and is likely to cause problems.
Change-Id: Id0668fd160ac0539d85997e985f8c43d9da75c90
2020-10-07 12:30:23 +02:00
Dmitry Tantsur
fc4e0eed6a Don't try to call GRUB when root UUID is not provided
We don't have a really working way to detect root UUID for whole
disk images at the moment, which results in an ignored traceback
every time install_bootloader is called with whole disk images in
UEFI mode. Avoid it by skipping GRUB2 if root UUID is unknown.
Change-Id: I84245538f59c664b72d1cafbca8d61be0978f489
2020-10-07 12:06:42 +02:00
Zuul
abd9f91813 Merge "Add basic retries for inspection" 2020-10-06 17:07:20 +00:00
Riccardo Pittau
5c99f60083 Use focal for tinyipa src jobs used by ipa-builder
We can use focal here but we need to disable dstat because of
https://bugs.launchpad.net/ubuntu/+source/dstat/+bug/1866619
Change-Id: I2c013e871fc56aa0077656dfc76de9aad6bed83f
2020-10-06 17:43:03 +02:00
Riccardo Pittau
13de98bd98 Use bionic nodeset for DIB centos src jobs
We'll keep using bionic as nodeset for certain jobs because of limitations
in DIB when building centos images.
DIB uses yum-tools to prepare the chroot environment, but the package is
not present in ubuntu focal because of lack of Python 2.x support.
Change-Id: I8635b7dfa6910c1721b89739d4d51fa7df488432
2020-10-06 15:16:28 +02:00
Arne Wiebalck
253b4887d5 Software RAID: Re-add missing devices
Upon md device creation, component devices are sometimes removed
immediately again due to a "disk failure". The disks seem healthy,
though. This patch re-adds component devices in such cases to
prevent the md device from remaining in a degraded state (which
would cause issues later, e.g. during ESP creation).
Story: #2008164
Task: #40914
Change-Id: I2ac7cb4a546de84686d5c3435e850c14b3f6c1d7
2020-10-06 14:00:57 +02:00
3ddca46131 Add Python3 wallaby unit tests
This is an automatically generated patch to ensure unit testing
is in place for all of the tested runtimes for wallaby.
See also the PTI in governance [1].
[1]: https://governance.openstack.org/tc/reference/project-testing-interface.html
Change-Id: Ib5cfc3be530851be8e426693c668f3ebcab2ecb3
2020-10-01 18:42:43 +00:00
fb45e58d1c Update master for stable/victoria
Add file to the reno documentation build to show release notes for
stable/victoria.
Use pbr instruction to increment the minor version number
automatically so that master versions are higher than the versions on
stable/victoria.
Change-Id: Ia3696da8663c140504924b0a1cd23f9aaa517f0a
Sem-Ver: feature
2020-10-01 18:42:40 +00:00
Zuul
99dee5067e Merge "Software RAID: Get component devices by md UUID" 6.4.0 2020-09-30 18:30:56 +00:00
Zuul
faeb9441d3 Merge "Simplify heartbeating by removing use of select()" 2020-09-29 15:47:08 +00:00
Arne Wiebalck
044c64dbc0 Software RAID: Get component devices by md UUID
Scanning the output of mdadm commands for RAID members will
miss component devices which are currently not part of the
RAID. For proper cleaning it is better to scan block devices
for a signature of the md device for which we would like to
get the components.
Story: #2008186
Task: #40947
Change-Id: Ib46612697851e36a16d272ccaeb0115106253863
2020-09-29 17:08:40 +02:00
Arne Wiebalck
c7aec775ff Software RAID: Don't delete partitions too early
Partitions on the holder disk should only be deleted after
all RAID devices have been deleted. Otherwise, super blocks
on partitions which reside on the same disks cannot be cleaned.
Story: #2008199
Task: #40979
Change-Id: I19293f5b992cd1fa68957d6f306dcec8f3b7a820
2020-09-28 10:35:12 +02:00
Zuul
6cb781b24e Merge "Remove unused jobs" 2020-09-23 22:32:42 +00:00
Zuul
5e61ad18e3 Merge "When reporting that agent is busy, report the executed command" 2020-09-23 22:00:15 +00:00
Zuul
afd8cf3d95 Merge "Document ramdisk TLS and update existing TLS docs" 2020-09-23 21:37:27 +00:00
Zuul
c7ff931fe6 Merge "Fix: make Intel CNA hardware manager none generic" 2020-09-23 14:57:40 +00:00
Dmitry Tantsur
565d596dae Document ramdisk TLS and update existing TLS docs
Story: #2007214
Task: #40945
Change-Id: I1a930a0e52ab860edcd597df4d95a4e4eb51da96
2020-09-23 15:07:49 +02:00
Zuul
11a87365fb Merge "Generate a TLS certificate and send it to ironic" 2020-09-23 12:14:38 +00:00
Qianbiao.NG
4b0ef13d08 Fix: make Intel CNA hardware manager none generic
Currently, IntelCnaHardwareManager inherits GenericHardwareManager
which makes it a new "GenericHardwareManager" with "MAINLINE" priority.
This causes all other hardware managers with lower priority than
"MAINLINE" to never be used. To fix this, make IntelCnaHardwareManager
inherit from the basic HardwareManager.
Change-Id: I28b665d8841b0b2e83b132e1f25df95e03e7ba10
Story: 2008142
Task: 40882
2020-09-23 18:24:26 +08:00
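The inheritance problem described in commit 4b0ef13d08 can be sketched with stand-in classes. Everything below is a simplified assumption made for illustration: the class bodies, the numeric priorities, the `SiteManager` and `CnaManager*` names and the `dispatch` helper are hypothetical and are not the agent's real implementation. The sketch only shows why a high-priority manager that inherits every generic method hides lower-priority managers.

```python
class HardwareManager:
    """Stand-in base class: provides no hardware methods of its own."""
    priority = 0


class GenericHardwareManager(HardwareManager):
    """Stand-in generic manager implementing a common method."""
    priority = 1  # illustrative "generic" priority

    def erase_devices(self):
        return "generic erase"


class SiteManager(HardwareManager):
    """Hypothetical custom manager ranked between generic and mainline."""
    priority = 2

    def erase_devices(self):
        return "site-specific erase"


MAINLINE = 3  # illustrative value, higher than SiteManager


class CnaManagerBroken(GenericHardwareManager):
    """Inherits every generic method at mainline priority."""
    priority = MAINLINE

    def configure_cna(self):
        return "cna configured"


class CnaManagerFixed(HardwareManager):
    """Inherits only the empty base, so it exposes just its own method."""
    priority = MAINLINE

    def configure_cna(self):
        return "cna configured"


def dispatch(managers, method):
    """Call method on the highest-priority manager that implements it."""
    for manager in sorted(managers, key=lambda m: m.priority, reverse=True):
        if hasattr(manager, method):
            return getattr(manager, method)()
    raise LookupError(method)
```

With `CnaManagerBroken` loaded, `dispatch` resolves `erase_devices` to the inherited generic implementation and `SiteManager` is never consulted. With `CnaManagerFixed`, the same call reaches `SiteManager`.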
Zuul
b81df96821 Merge "CI: Lower memory usage of VMs/Increase swap" 2020-09-22 18:46:23 +00:00
Jay Faulkner
a01646f56b Simplify heartbeating by removing use of select()
Heartbeating in IPA has used select.poll() for years to work around
a bug where changing the time in the ramdisk could cause heartbeats
to stop and never resume.
Now that IPA syncs time at start and exit, this workaround is no
longer needed. So instead, we'll revert to using threading.Event()
in order to make the code simpler and easier to understand.
Since we need this to be an eventlet-event, and not a standard-thread
event, also monkey_patch threading.
Additionally, there were a few completely unused backoff interval
values set, that were never applied. In respect of maintaining the
5+ years old behavior of not doing error backoffs, that code was
removed instead of being made to work.
Change-Id: Ibcde99de64bb7e95d5df63a42a4ca4999f0c4c9b
2020-09-22 16:59:47 +00:00
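A minimal sketch of an event-driven heartbeat loop of the kind commit a01646f56b describes, using only the standard library. The `Heartbeater` class, its `beat` callback and the `interval` parameter are assumptions made for this illustration, and the eventlet monkey patching mentioned in the commit message is left out.

```python
import threading


class Heartbeater(threading.Thread):
    """Hypothetical heartbeat thread driven by threading.Event."""

    def __init__(self, beat, interval):
        super().__init__(daemon=True)
        self._beat = beat
        self._interval = interval
        self._stop_event = threading.Event()

    def run(self):
        # wait() returns False when the timeout expires and True once
        # stop() has set the event, which ends the loop.
        while not self._stop_event.wait(self._interval):
            self._beat()

    def stop(self):
        """Ask the loop to exit and wait for the thread to finish."""
        self._stop_event.set()
        self.join()
```

Because `Event.wait()` doubles as the sleep between heartbeats, `stop()` wakes the loop immediately instead of waiting for the interval to elapse.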
Julia Kreger
afb521d18e CI: Lower memory usage of VMs/Increase swap
The IPA images we presently build/use for centos are
now smaller than their peak size a few months ago.
As such, we should be able to reduce that memory footprint
and still execute all of the necessary tests without any
issues.
Also increases the amount of swap to 8GB as the infra
default is now 1GB.
Change-Id: I5705f1f2504803a049ed88a0d8b1d6d0bbccf9e4
2020-09-22 06:09:07 -07:00
Dmitry Tantsur
8f4975763b Documentation: fix incorrect step names
I can never remember if it's "device" or "devices".
Change-Id: Ib1388f0425ec4c02fbeacb26ad55bc90b0f3483c
2020-09-21 18:16:01 +02:00
Dmitry Tantsur
fe6b687968 When reporting that agent is busy, report the executed command
Also make this API return a proper HTTP code (409 instead of 500).
Change-Id: I5d86878b5ed6142ed2630adee78c0867c49b663f
2020-09-18 17:52:49 +02:00
Arne Wiebalck
9dd968fa30 [doc] Clarify custom hardware manager inheritance
Change-Id: I94657967763f058fccd4f19656925d02cd8973b8
2020-09-15 11:34:11 +02:00
Julia Kreger
bb27badf76 Add basic retries for inspection
A transitory connection problem, such as a port being held
down for traffic forwarding, can cause intermittent
connectivity failures which result in failed
introspections.
Now the agent retries.
Change-Id: I72c5e3aca000d3854a17f8a461b1a2935e5c0d9b
2020-09-14 22:38:18 +00:00
Zuul
f45d2cc0e7 Merge "Move ironic-standalone-ipa-src to Ubuntu Focal" 2020-09-14 12:52:42 +00:00
Zuul
42df6c174f Merge "Fix backup node lookup" 2020-09-14 09:11:21 +00:00
Zuul
d43dc1ee36 Merge "Refactor API version negotiation code" 2020-09-12 15:24:42 +00:00
Zuul
a3b10db95a Merge "Replace oslo's loopingcall with tenacity" 2020-09-12 15:24:41 +00:00
Zuul
a597117541 Merge "Prepare to move ipa-builder jobs" 2020-09-11 21:07:58 +00:00
Dmitry Tantsur
021e0a6a46 Generate a TLS certificate and send it to ironic
Adds a new flag (on by default) that enables generating a TLS
certificate and sending it to ironic via heartbeat. Whether
ironic supports auto-generated certificates is determined by
checking its API version.
Change-Id: I01f83dd04cfec2adc9e2a6b9c531391773ed36e5
Depends-On: https://review.opendev.org/747136
Depends-On: https://review.opendev.org/749975
Story: #2007214
Task: #40604 
2020-09-11 17:46:52 +02:00
Dmitry Tantsur
6a8056414e Refactor API version negotiation code
Makes sure heartbeats can send versions higher than one required for
tokens while also making sure we never send a version we don't know.
Also makes code easier to understand.
Change-Id: Ice1e7d45ea90c9fd8220c4b94e691b6015e23074
2020-09-11 17:45:37 +02:00
Zuul
67f053d94c Merge "Fix l-c" 2020-09-11 06:37:28 +00:00
Iury Gregory Melo Ferreira
08fdae9081 Fix l-c
Change-Id: I98e087f85e166da02890828054ed46ec9177eeb5
2020-09-10 15:26:16 +02:00
Julia Kreger
3426963552 Fix backup node lookup
The node lookup code added in change
I27201319f31cdc01605a3c5ae9ef4b4218e4a3f6
was slightly broken in that we call a method
with a keyword argument which doesn't exist.
uuid versus node_uuid.
It happens, it is a quick fix!
Spotted on a metalsmith job:
[-] Agent is requesting to perform an explicit node cache update.
 This is to pickup any chanages in the cache before deployment.
[-] Failed to update node cache. Error lookup_node() got an
 unexpected keyword argument 'uuid'
Change-Id: I59ecec65707a2f03918b233f1925395ebe59b8c4
2020-09-09 15:19:38 -07:00
Dmitry Tantsur
9b75453339 Fix and run the correct functional tests job
Apparently, functional-py36 just runs unit tests.
Fix the test that has regressed in the meantime and make it voting
so that we don't regress again.
Change-Id: Id5efe89a12a00c27e6299380a51cdb840285d691
2020-09-04 17:10:41 +02:00
Zuul
e73b7220c4 Merge "If listen_tls is true, enable TLS on wsgi server" 2020-09-03 18:59:48 +00:00