Primary motivator: 'curl -v' format changes in f42
Drive-bys:
* 127.0.0.1, not localhost
* use wait_for_port, not sleep
* show curl commands and their output, to ease debugging failures
* better failure assertions
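For illustration, a minimal sketch of the pattern these drive-bys push toward (helper names such as wait_for_port and assert are the system-test helpers; the container name, port, and URL are made up):

```bash
# start a container serving on a published port (names/ports are placeholders)
run_podman run -d --name $cname -p $hport:80 $IMAGE
wait_for_port 127.0.0.1 $hport       # poll for readiness instead of sleeping

cmd="curl -sS -v http://127.0.0.1:$hport/index.txt"
echo "\$ $cmd"                       # show the curl command...
output=$($cmd 2>&1)
echo "$output"                       # ...and its output, to ease debugging failures

assert "$output" =~ "HTTP/.* 200"    # a specific assertion, not just exit status
```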
Signed-off-by: Ed Santiago <santiago@redhat.com>
These flags can affect the output of the HealthCheck log. Currently, when a container is configured with a HealthCheck, the output from the HealthCheck command is only logged to the container status file, which is accessible via `podman inspect`.
It is also limited to the last five executions and the first 500 characters per execution.
This makes debugging past problems very difficult, since the only information available about the failure of the HealthCheck command is the generic `healthcheck service failed` record.
- The `--health-log-destination` flag sets the destination of the HealthCheck log.
  - `none`: (default behavior) `HealthCheckResults` are stored in overlay containers. (For example: `$runroot/healthcheck.log`)
  - `directory`: creates a log file named `<container-ID>-healthcheck.log` with JSON `HealthCheckResults` in the specified directory.
  - `events_logger`: The log will be written with the logging mechanism set by `events_logger`. It also saves the log to a default directory, for performance on a system with a large number of logs.
- The `--health-max-log-count` flag sets the maximum number of attempts in the HealthCheck log file.
  - A value of `0` indicates an infinite number of attempts in the log file.
  - The default value is `5` attempts in the log file.
- The `--health-max-log-size` flag sets the maximum length of the log stored.
  - A value of `0` indicates an infinite log length.
  - The default value is `500` log characters.
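A hypothetical invocation combining the three flags (the way a directory destination is specified is assumed here; the path, limits, and image are example values, not defaults):

```bash
podman run -d --name web \
    --health-cmd "curl -f http://localhost/ || exit 1" \
    --health-log-destination /var/log/container-healthchecks \
    --health-max-log-count 0 \
    --health-max-log-size 1024 \
    $IMAGE

# with a directory destination as above, each run would be appended as JSON to
# /var/log/container-healthchecks/<container-ID>-healthcheck.log;
# count 0 keeps every attempt, and each entry is capped at 1024 characters
```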
Add --health-max-log-count flag
Signed-off-by: Jan Rodák <hony.com@seznam.cz>
Add --health-max-log-size flag
Signed-off-by: Jan Rodák <hony.com@seznam.cz>
Add --health-log-destination flag
Signed-off-by: Jan Rodák <hony.com@seznam.cz>
The various pasta port forwarding tests run a socat server inside a
container, then connect to it from a socat client on the host. Currently
we have the server bind to the same specific address within the container
as we connect to on the host.
That's not quite what we want. For "tap" tests where the traffic goes over
pasta's L2 link to the container it's fine, though unnecessary. For
"loopback" tests where traffic is forwarded by pasta at the L4 socket
level, however, it's not quite right. In this case the address used is
either 127.0.0.1 or ::. That's correct and as needed for the host side
address we're connecting to. However on the container side, this only
works because of an odd and arguably undesirable behaviour of pasta: we use
the fact that we have an L4 socket within the container to make such
"spliced" L4 connections appear as if they come from loopback within the
container. A container will generally expect its loopback address to be
only accessible from within the container, and this odd behaviour may be
changed in pasta in future.
In any case, the binding of the container side server is unnecessary, so
simply remove it.
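For illustration, roughly what this means for the container-side server command (simplified, not the exact test code):

```bash
# before: bind the in-container server to the same specific address
# we connect to on the host
socat -u TCP4-LISTEN:$port,bind=127.0.0.1 STDOUT

# after: no bind option -- the server simply listens on all addresses
# inside the container
socat -u TCP4-LISTEN:$port STDOUT
```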
Link: https://github.com/containers/podman/issues/24045
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
Mostly just switch to safename. Rewrite setup() to guarantee
unique service file names, atomically created.
* IMPORTANT NOTE: enabling parallelization on these tests
triggers #24010 ("fragment file" flake), but only on my
f40 laptop. I have never seen the flake in Cirrus despite
many many runs in #23275. I am submitting this for review
and merging because even though _something_ is broken,
this breakage is unlikely to affect our CI.
Signed-off-by: Ed Santiago <santiago@redhat.com>
Any test that uses --events-backend=file cannot be run in parallel
due to #23750. This seems to be a hard block, unfixable.
All other tests, enable ci:parallel.
And, bring in timing fixes from #23600. Thanks, @Honny1!
Signed-off-by: Ed Santiago <santiago@redhat.com>
The format test flakes when quay is down, because we've
been doing 'podman search $IMAGE', which is a quay image.
Solution: check if local registry is running, and use it.
We don't need a real image.
Signed-off-by: Ed Santiago <santiago@redhat.com>
(where possible; not all tests are parallelizable).
And, refactor two complicated tests into one. This one
is hard to review, sorry.
Signed-off-by: Ed Santiago <santiago@redhat.com>
Use os.ReadDir recursively instead of filepath.WalkDir
Use map instead of list to easily find looped Symlinks
Update existing tests and add a more elaborate one
Update the man page
Signed-off-by: Ygal Blum <ygal.blum@gmail.com>
The netns dir has special logic to bind mount itself and make itself
shared. This code here didn't do that, which led to a catastrophic bug during
netns unmounting: we were unable to unmount the netns as the mount got
duplicated and had the wrong parent mount. This caused us to loop forever
trying to remove the file.
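For illustration, the shell equivalent of the special setup the netns dir normally gets (the path is an example):

```bash
# bind mount the netns dir onto itself so it is its own mount point...
mount --bind "$XDG_RUNTIME_DIR/netns" "$XDG_RUNTIME_DIR/netns"
# ...and mark that mount shared, which the netns mount/unmount logic relies on
mount --make-shared "$XDG_RUNTIME_DIR/netns"
```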
Fixes https://issues.redhat.com/browse/RHEL-59620
Fixes #23685
Signed-off-by: Paul Holzinger <pholzing@redhat.com>
The "rm on stopping containers" test is flaking under high load,
probably because I bumped up two timeouts in the healthcheck
container that it relies on. Bump up this test's timeout as well.
Signed-off-by: Ed Santiago <santiago@redhat.com>
...not just when running parallel Bats, because Bats
does not provide any way to know if we're parallel.
Signed-off-by: Ed Santiago <santiago@redhat.com>
...of high system load (such as when running parallel tests).
Allow time for services to reach desired state, by retrying
a few times in a loop.
Signed-off-by: Ed Santiago <santiago@redhat.com>
There is no reason to disallow exposed sctp ports at all. As root we can
publish them fine and as rootless it should error later anyway.
And for the case mentioned in the issue it doesn't make sense, as the
port is not even published; thus it is just part of the metadata, which is
totally fine in all cases.
Fixes #23911
Signed-off-by: Paul Holzinger <pholzing@redhat.com>
Like we do in system tests now check for netns leaks in e2e as well. Now
because things run in parallel and this dir is shared we cannot check
after each test, only once per suite. This will be a PITA to debug if
leaks happen as the netns files do not contain the container ID and are
just random bytes (maybe we should change this?)
Fixes #23715
Signed-off-by: Paul Holzinger <pholzing@redhat.com>
This fixes the problem where even when running rootless we check the netns files from
root. But in order to catch any rootless bugs we must check the rootless
files from $XDG_RUNTIME_DIR/netns.
Signed-off-by: Paul Holzinger <pholzing@redhat.com>
This test is currently disabled due to several issues, only some of which
are described in the existing comments. Add some more details to clarify
the situation.
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
This name for the tests is misleading, since in the default configuration
podman will already configure a forwarding address, which could forward
to either another local forwarder or an external nameserver on the host
side. What this test is really about is explicitly configuring the pasta
DNS forwarding address. Rename accordingly.
The IPv4 version of the test doesn't use the podman --dns option, only
the pasta --dns-forward option. This exercises the podman behaviour that
pasta --dns-forward options are added to /etc/resolv.conf automatically.
However there could also be other things in /etc/resolv.conf, so the
nslookup might not use the custom forwarding address for the lookup.
To fix that, split the test into two parts: one verifying that the custom
address is in /etc/resolv.conf and another performing the nslookup with an
explicit server address to make sure we exercise the pasta side as well.
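Sketch of the two resulting checks, run inside the container (the forwarding address is an example value):

```bash
# 1) the custom --dns-forward address was written to /etc/resolv.conf
grep 'nameserver 198.51.100.1' /etc/resolv.conf

# 2) the lookup really exercises pasta: query that server explicitly
nslookup some.host.example 198.51.100.1
```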
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
In both the "Basic nameserver lookup" and "Local forwarder, IPv4" pasta
tests, we check whether DNS resolution is working by running "nslookup
127.0.0.1" in the container and checking if 1.0.0.127.in-addr.arpa is in
the output.
1.0.0.127.in-addr.arpa isn't the expected result of the resolution though,
it's just the DNS name that nslookup will translate 127.0.0.1 into. The
test mostly works, because nslookup echoes that on successful lookups.
However, it could also echo it in certain sorts of failure, so it's not a
very reliable test.
Furthermore, resolving 127.0.0.1 from a nameserver is a rather strange
thing to do. It's done that way because RFC1912[0] suggests it should
always resolve, even for nameservers on a disconnected network. But, this
doesn't really appear to be true in practice: a number of resolvers return
NXDOMAIN. That works by accident because nslookup seems to echo the
name above as part of the error message.
Change to instead looking up one of the root servers by name. This does
now rely on access to the global DNS during tests, but other podman tests
attempt to resolve google.com, so that should be ok. One of the root
servers is about as close to universal resolvability as it's possible to
get.
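For example (illustrative):

```bash
# resolve one of the root servers by name; any working resolver can answer this
nslookup a.root-servers.net
```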
[0] https://datatracker.ietf.org/doc/html/rfc1912#section-4.1
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
The idea behind the "External resolver" tests is simply to check that we
can contact a nameserver, regardless of configuration. To this end
the "IPv4" version looks up 127.0.0.1 which RFC1912[0] suggests should
always be resolvable.
The IPv6 version instead looks up [::1]. While it makes sense for
that to be resolvable in a similar way, there appear to be quite a few
nameservers which do not resolve it, making this test flaky.
Furthermore the idea behind resolving [::1] is that it should make
nslookup prefer to resolve over IPv6. That appears to be very
unreliable at best. Since making a different query doesn't actually
exercise anything different in pasta, drop the test.
The remaining IPv4 test isn't really specific to an "external" resolver,
it's simply checking that we can contact some sort of resolver with the
default podman configuration. Rename accordingly, and run it regardless of
IPv4 connectivity on the host: we can still query a nameserver about an
IPv4 address, even if we only have IPv6 connectivity ourselves.
[0] https://datatracker.ietf.org/doc/html/rfc1912#section-4.1
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
The "Local forwarder, IPv4" pasta test, amongst other things, checks that
podman's default DNS forwarding address - 169.254.0.1 - appears in the
container's /etc/resolv.conf. That's not really related to anything else
going on in that test (which is about _changing_ that default address).
So, move it into its own test case.
Signed-off-by: David Gibson <david@gibson.dropbear.id.au>
...or at least as much as possible. Some tests cannot
be run in parallel due to #23750: "--events-backend=file"
does not actually work the way a naïve user would intuit.
Stop/die events are asynchronous, and can be gathered
by *ANY OTHER* podman process running after it, and if
that process has the default events-backend=journal,
that's where the event will be logged. See #23987 for
further discussion.
Signed-off-by: Ed Santiago <santiago@redhat.com>
When running parallel, multiple tests could be trying to start
the registry at once. Make this parallel-safe.
Also, use a safer port range for the registry. Something
outside of /proc/sys/net/ipv4/ip_local_port_range
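For reference (the range shown is the usual Linux default):

```bash
# ports in this range can be grabbed as ephemeral ports by any outgoing
# connection, so the registry should listen outside of it
cat /proc/sys/net/ipv4/ip_local_port_range    # typically: 32768  60999
```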
Sorry, I'm including a FIXME section that I haven't investigated
deeply enough.
Signed-off-by: Ed Santiago <santiago@redhat.com>
Add a few best-practices examples, and add a whole section
describing the dos and don'ts of writing parallel-safe tests.
Signed-off-by: Ed Santiago <santiago@redhat.com>
For tests run in parallel, show file number as |nnn| (vs [nnn])
Teach logformatter to distinguish the two, adding 'p' to anchors
in parallel tests. Necessary because in this scheme we run bats
twice, thus see 'ok 1' twice, and we want to differentiate them.
Signed-off-by: Ed Santiago <santiago@redhat.com>
Workaround for #23292, where simultaneous 'pod create' commands
will all start a podman-build of the pause image, but only
one of them will be tagged, and the others will leak <none>
images.
Signed-off-by: Ed Santiago <santiago@redhat.com>
The issue is closed and I recently fixed a number of races (bf74797c69)
in the remote attach API that sound exactly like the error
that was mentioned in issue #9597.
As such I think this works; if it starts flaking again we can revert this
or, better, fix the actual bug.
Signed-off-by: Paul Holzinger <pholzing@redhat.com>
As it turns out, things are not so simple after all...
In podman-py it was reported[1] that waiting might hang. Per our docs, wait
on multiple conditions should exit once the first one is hit, not all
of them. However, because the new wait logic never checked if the context
was cancelled, the goroutine kept running until conmon exited, and because
we used a waitgroup to wait for all of them to finish, it blocked until
that happened.
First, we can remove the waitgroup, as we only need to wait for one of
them anyway via the channel. While this alone fixes the hang, it would
still leak the other goroutine. As there is no way to cancel a goroutine,
all the code must check for a cancelled context in the wait loop so it does
not leak.
Fixes 8a943311db ("libpod: simplify WaitForExit()")
[1] https://github.com/containers/podman-py/issues/425
Signed-off-by: Paul Holzinger <pholzing@redhat.com>