podman

mirror of https://github.com/containers/podman.git synced 2025-05-17 15:18:43 +08:00

Author	SHA1	Message	Date
Jan Kaluza	701aade262	Add --env and --unsetenv to podman update. The --env is used to add new environment variable to container or override the existing one. The --unsetenv is used to remove the environment variable. It is done by sharing "env" and "unsetenv" flags between both "update" and "create" commands and later handling these flags in the "update" command handler. The list of environment variables to add/remove is stored in newly added variables in the ContainerUpdateOptions. The Container.Update API call is refactored to take the ContainerUpdateOptions as an input to limit the number of its arguments. The Env and UnsetEnv lists are later handled using the envLib package and the Container is updated. The remote API is also extended to handle Env and EnvUnset. Fixes: #24875 Signed-off-by: Jan Kaluza <jkaluza@redhat.com>	2025-03-21 13:15:44 +01:00
Jan Rodák	fff42ac232	Fix HealthCheck log destination, count, and size defaults GoLang sets unset values to the default value of the type. This means that the destination of the log is an empty string and the count and size are set to 0. However, this means that size and count are unbounded, and this is not the default behavior. Fixes: https://github.com/containers/podman/issues/25473 Fixes: https://issues.redhat.com/browse/RHEL-83262 Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2025-03-12 21:27:00 +01:00
Paul Holzinger	47a743bba2	report healthcheck start errors When starting a container consider healthcheck errors fatal. That way user know when systemd-run failed to setup the timer to run the healthcheck and we don't get into a state where the container is running but not the healthcheck. This also fixes the broken error reporting from the systemd-run exec, if the binary could not be run the output was just empty leaving the users with no idea what failed. Fixes #25034 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2025-03-04 16:48:50 +01:00
Matt Heon	ce8813dc8d	Remove persist directory when cleaning up Conmon files This seems to have been added as part of the cleanup of our handling of OOM files, but code was never added to remove it, so we leaked a single directory with an exit file and OOM file per container run. Apparently have been doing this for a while - I'd guess since March of '23 - so I'm surprised more people didn't notice. Fixes #25291 Signed-off-by: Matt Heon <mheon@redhat.com>	2025-02-11 14:51:34 -05:00
Matt Heon	46d874aa52	Refactor graph traversal & use for pod stop First, refactor our existing graph traversal code to improve code sharing. There still isn't much sharing between inward traversal (stop, remove) and outward traversal (start) but stop and remove are sharing most of their code, which seems a positive. Second, add a new graph-traversal function to stop containers. We already had start and remove; stop uses the newly-refactored inward-traversal code which it shares with removal. Third, rework the shared stop/removal inward-traversal code to add locking. This allows parallel execution of stop and removal, which should improve the performance of `podman pod rm` and retain the performance of `podman pod stop` at about what it is right now. Fourth and finally, use the new graph-based stop when possible to solve unordered stop problems with pods - specifically, the infra container stopping before application containers, leaving those containers without a working network. Fixes https://issues.redhat.com/browse/RHEL-76827 Signed-off-by: Matt Heon <mheon@redhat.com>	2025-02-06 18:28:12 -05:00
Matt Heon	06fa617f61	Lock pod while starting and stopping containers The intention behind this is to stop races between `pod stop\|start` and `container stop\|start` being run at the same time. This could result in containers with no working network (they join the still-running infra container's netns, which is then torn down as the infra container is stopped, leaving the container in an otherwise unused, nonfunctional, orphan netns. Locking the pod (if present) in the public container start and stop APIs should be sufficient to stop this. Signed-off-by: Matt Heon <mheon@redhat.com>	2025-02-03 11:19:20 -05:00
Jan Rodák	511c8b249d	Remove timer for HealthCheck when container is paused. If is unpaused then new timer for HealthCheck is created. Fixes: https://issues.redhat.com/browse/RUN-2468 Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2025-01-29 13:34:26 +01:00
Jan Rodák	8f1266c717	Fix overwriting of LinuxResources structure in the database with defaults values when changes configuration with podman update. The new LinuxResource structure does not represent the current unchanged configuration, which was not affected by the change. Fixes: https://issues.redhat.com/browse/RUN-2375 Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2024-12-04 13:16:32 +01:00
Jan Rodák	a1249425bd	Configure HealthCheck with `podman update` New flags in a `podman update` can change the configuration of HealthCheck when the container is started, without having to restart or recreate the container. This can help determine why a given container suddenly started failing HealthCheck without interfering with the services it provides. For example, reconfigure HealthCheck to keep logs longer than the usual last X results, store logs to other destinations, etc. Fixes: https://issues.redhat.com/browse/RHEL-60561 Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2024-11-19 19:44:14 +01:00
Jan Rodák	de856dab99	Add --health-max-log-count, --health-max-log-size, --health-log-destination flags These flags can affect the output of the HealtCheck log. Currently, when a container is configured with HealthCheck, the output from the HealthCheck command is only logged to the container status file, which is accessible via `podman inspect`. It is also limited to the last five executions and the first 500 characters per execution. This makes debugging past problems very difficult, since the only information available about the failure of the HealthCheck command is the generic `healthcheck service failed` record. - The `--health-log-destination` flag sets the destination of the HealthCheck log. - `none`: (default behavior) `HealthCheckResults` are stored in overlay containers. (For example: `$runroot/healthcheck.log`) - `directory`: creates a log file named `<container-ID>-healthcheck.log` with JSON `HealthCheckResults` in the specified directory. - `events_logger`: The log will be written with logging mechanism set by events_loggeri. It also saves the log to a default directory, for performance on a system with a large number of logs. - The `--health-max-log-count` flag sets the maximum number of attempts in the HealthCheck log file. - A value of `0` indicates an infinite number of attempts in the log file. - The default value is `5` attempts in the log file. - The `--health-max-log-size` flag sets the maximum length of the log stored. - A value of `0` indicates an infinite log length. - The default value is `500` log characters. Add --health-max-log-count flag Signed-off-by: Jan Rodák <hony.com@seznam.cz> Add --health-max-log-size flag Signed-off-by: Jan Rodák <hony.com@seznam.cz> Add --health-log-destination flag Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2024-09-25 14:01:35 +02:00
Paul Holzinger	f93fcf7dee	bump go to 1.22 Many dependencies started using go 1.22 which means we have to follow in order to update. Disable the now depracted exportloopref linter as it was replaced by copyloopvar as go fixed the loop copy problem in 1.22[1] Another new chnage in go 1.22 is the for loop syntax over ints, the intrange linter chacks for this but there a lot of loops that have to be converted so I didn't do it here and disable th elinter for now, th eold syntax is still fine. [1] https://go.dev/blog/loopvar-preview Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-09-03 15:14:15 +02:00
Paul Holzinger	30eb6b6aae	libpod: do not stop pod on init ctr exit Init containers are meant to exit early before other containers are started. Thus stopping the infra container in such case is wrong. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-08-15 11:07:27 +02:00
Paul Holzinger	ecf88f17b6	libpod: reset state error on init If we manage to init/start a container successfully we should unset any previously stored state errors. Otherwise a user might be confused why there is an error in the state about some old error even though the container works/runs. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-08-12 14:30:48 +02:00
openshift-merge-bot[bot]	69862b7251	Merge pull request #23460 from Luap99/cleanup-term libpod: inhibit SIGTERM during cleanup()	2024-08-06 11:38:27 +00:00
Giuseppe Scrivano	3ae1568933	libpod: fix volume copyup with idmap if idmap is specified for a volume, reverse the mappings when copying up from the container, so that the original permissions are maintained. Closes: https://github.com/containers/podman/issues/23467 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-08-01 22:49:27 +02:00
Giuseppe Scrivano	61def05cd9	libpod: avoid hang on errors Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-08-01 22:49:27 +02:00
Paul Holzinger	7610cedc80	libpod: inhibit SIGTERM during cleanup() The network cleanup can handle it when it is killed half way through as it spits out a bunch of error in that case on the next cleanup attempt. Try to avoid getting into such a state and ignore sigterm during this section. Of course we stil can get SIGKILL so we should work on fixing the underlying problems in network cleanup but let's see if this helps us with the CI flakes in the meantime. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-07-31 19:00:16 +02:00
Giuseppe Scrivano	b59918e536	libpod: force rootfs for OCI path with idmap when a --rootfs is specified with idmap, always use the specified rootfs since we need a new mount on top of the original directory. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-07-27 19:25:10 +02:00
Paul Holzinger	3280da0500	fix race conditions in start/attach logic The current code did something like this: lock() getState() unlock() if state != running lock() getState() == running -> error unlock() This of course is wrong because between the first unlock() and second lock() call another process could have modified the state. This meant that sometimes you would get a weird error on start because the internal setup errored as the container was already running. In general any state check without holding the lock is incorrect and will result in race conditions. As such refactor the code to combine both StartAndAttach and Attach() into one function that can handle both. With that we can move the running check into the locked code. Also use typed error for this specific error case then the callers can check and ignore the specific error when needed. This also allows us to fix races in the compat API that did a similar racy state check. This commit changes slightly how we output the result, previously a start on already running container would never print the id/name of the container which is confusing and sort of breaks idempotence. Now it will include the output except when --all is used. Then it only reports the ids that were actually started. Fixes #23246 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-07-12 15:11:34 +02:00
Giuseppe Scrivano	49eb5af301	libpod: intermediate mount if UID not mapped into the userns if the current user is not mapped into the new user namespace, use an intermediate mount to allow the mount point to be accessible instead of opening up all the parent directories for the mountpoint. Closes: https://github.com/containers/podman/issues/23028 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-06-21 18:01:26 +02:00
Giuseppe Scrivano	08a8429459	libpod: avoid chowning the rundir to root in the userns so it is possible to remove the code to make the entire directory world accessible. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-06-21 18:01:26 +02:00
Paul Holzinger	e8ea1e7632	libpod: do not leak systemd hc startup unit timer This fixes a regression added in commit 4fd84190b8, because the name was overwritten by the createTimer() timer call the removeTransientFiles() call removed the new timer and not the startup healthcheck. And then when the container was stopped we leaked it as the wrong unit name was in the state. A new test has been added to ensure the logic works and we never leak the system timers. Fixes #22884 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-06-04 18:03:46 +02:00
Matthew Heon	046c0e5fc2	Only stop chowning volumes once they're not empty When an empty volume is mounted into a container, Docker will chown that volume appropriately for use in the container. Podman does this as well, but there are differences in the details. In Podman, a chown is presently a one-and-done deal; in Docker, it will continue so long as the volume remains empty. Mount into a dozen containers, but never add content, the chown occurs every time. The chown is also linked to copy-up; it will always occur when a copy-up occurred, despite the volume now not being empty. This PR changes our logic to (mostly) match Docker's. For some reason, the chowning also stops if the volume is chowned to root at any point. This feels like a Docker bug, but as they say, bug for bug compatible. In retrospect, using bools for NeedsChown and NeedsCopyUp was a mistake. Docker isn't actually tracking this stuff; they're just doing a copy-up and permissions change unconditionally as long as the volume is empty. They also have the two linked as one operation, seemingly, despite happening at very different times during container init. Replicating that in our stateful system is nontrivial, hence the need for the new CopiedUp field. Basically, we never want to chown a volume with contents in it, except if that data is a result of a copy-up that resulted from mounting into the current container. Tracking who did the copy-up is the easiest way to do this. Fixes #22571 Signed-off-by: Matthew Heon <matthew.heon@pm.me>	2024-05-22 17:47:01 -04:00
Giuseppe Scrivano	b06c58b4a5	libpod: wait for healthy on main thread wait for the healthy status on the thread where the container lock is held. Otherwise, if it is performed from a go routine, a different thread is used (since the runtime.LockOSThread() call doesn't have any effect), causing pthread_mutex_unlock() to fail with EPERM. Closes: https://github.com/containers/podman/issues/22651 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-05-14 22:55:02 +02:00
openshift-merge-bot[bot]	0c09421f85	Merge pull request #22641 from mheon/handle_stopping_loop Ensure that containers do not get stuck in stopping	2024-05-13 12:32:40 +00:00
Giuseppe Scrivano	8433a01aa2	Revert "container stop: kill conmon" This reverts commit 909ab594191ce964529398bcf7600edff9540d71. The workaround was added almost 5 years ago to workaround an issue with old conmon releases. It is safe to assume such ancient conmon releases are not used anymore. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-05-09 22:49:14 +02:00
Matt Heon	3fa8e98a31	Ensure that containers do not get stuck in stopping The scenario for inducing this is as follows: 1. Start a container with a long stop timeout and a PID1 that ignores SIGTERM 2. Use `podman stop` to stop that container 3. Simultaneously, in another terminal, kill -9 `pidof podman` (the container is now in ContainerStateStopping) 4. Now kill that container's Conmon with SIGKILL. 5. No commands are able to move the container from Stopping to Stopped now. The cause is a logic bug in our exit-file handling logic. Conmon being dead without an exit file causes no change to the state. Add handling for this case that tries to clean up, including stopping the container if it still seems to be running. Fixes #19629 Signed-off-by: Matt Heon <mheon@redhat.com>	2024-05-09 11:17:24 -04:00
Matt Heon	4fd84190b8	Add a random suffix to healthcheck unit names Systemd dislikes it when we rapidly create and remove a transient unit. Solution: If we change the name every time, it's different enough that systemd is satisfied and we stop having errors trying to restart the healthcheck. Generate a random 32-bit integer, and add it (formatted as hex) to the end of the unit name to do this. As a result, we now have to store the unit name in the database, but it does make backwards compat easy - if the unit name in the DB is empty, we revert to the old behavior because the timer was created by old Podman. Should resolve RHEL-26105 Signed-off-by: Matt Heon <mheon@redhat.com>	2024-05-03 11:45:05 -04:00
Paul Holzinger	83dbbc3a51	Replace golang.org/x/exp/slices with slices from std Use "slices" from the standard library, this package was added in go 1.21 so we can use it now. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-04-23 11:16:40 +02:00
openshift-merge-bot[bot]	c2cadfb5c5	Merge pull request #22322 from mheon/update_the_config Make `podman update` changes persistent	2024-04-22 07:50:48 +00:00
Giuseppe Scrivano	5656ad40b1	libpod: use fileutils.(Le\|E)xists Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2024-04-19 09:52:14 +02:00
Matt Heon	482ef7bfcf	Add support for updating restart policy This is something Docker does, and we did not do until now. Most difficult/annoying part was the REST API, where I did not really want to modify the struct being sent, so I made the new restart policy parameters query parameters instead. Testing was also a bit annoying, because testing restart policy always is. Signed-off-by: Matt Heon <mheon@redhat.com>	2024-04-17 08:23:51 -04:00
Matt Heon	be3f075402	Make `podman update` changes persistent The logic here is more complex than I would like, largely due to the behavior of `podman inspect` for running containers. When a container is running, `podman inspect` will source as much as possible from the OCI spec used to run that container, to grab up-to-date information on things like devices. We don't want to change this, it's definitely the right behavior, but it does make updating a running container inconvenient: we have to rewrite the OCI spec as part of the update to make sure that `podman inspect` will read the correct resource limits. Also, make update emit events. Docker does it, we should as well. Signed-off-by: Matt Heon <mheon@redhat.com>	2024-04-17 08:23:50 -04:00
lvyaoting	59ee130048	chore: fix function names in comment Signed-off-by: lvyaoting <lvyaoting@outlook.com>	2024-04-08 11:36:50 +08:00
Paul Holzinger	15b8bb72a8	libpod: restart always reconfigure the netns Always teardown the network, trying to reuse the netns has caused a significant amount of bugs in this code here. It also never worked for containers with user namespaces. So once and for all simplify this by never reusing the netns. Originally this was done to have a faster restart of containers but with netavark now we are much faster so it shouldn't be that noticeable in practice. It also makes more sense to reconfigure the netns as it is likely that the container exited due some broken network state in which case reusing would just cause more harm than good. The main motivation for this change was the pasta change to use --dns-forward by default. As the restarted contianer had no idea what nameserver to use as pasta just kept running. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-03-19 12:21:18 +01:00
Urvashi Mohnani	667311c7d5	Use persist dir for oom file Conmon writes the exit file and oom file (if container was oom killed) to the persist directory. This directory is retained across reboots as well. Update podman to create a persist-dir/ctr-id for the exit and oom files for each container to be written to. The oom state of container is set after reading the files from the persist-dir/ctr-id directory. The exit code still continues to read the exit file from the exits directory. Signed-off-by: Urvashi Mohnani <umohnani@redhat.com>	2024-02-12 09:13:39 -05:00
Matt Heon	72f1617fac	Bump Go module to v5 Moving from Go module v4 to v5 prepares us for public releases. Move done using gomove [1] as with the v3 and v4 moves. [1] https://github.com/KSubedi/gomove Signed-off-by: Matt Heon <mheon@redhat.com>	2024-02-08 09:35:39 -05:00
openshift-merge-bot[bot]	5e081e47aa	Merge pull request #21332 from rhatdan/timezone Reuse timezone code from containers/common	2024-02-08 14:13:40 +00:00
openshift-merge-bot[bot]	8a6165e592	Merge pull request #21522 from Luap99/restart-userns fix userns + restart policy with slirp4netns	2024-02-08 10:41:54 +00:00
Matt Heon	3cf2f8ccf4	Handle more states during refresh We were preserving ContainerStateExited, which is better than nothing, but definitely not correct. A container that ran at any point during the last boot should be moved to Exited state to preserve the fact that they were run at least one. This means we have to convert Running, Stopped, Stopping, Paused containers to exited as well. Signed-off-by: Matt Heon <mheon@redhat.com>	2024-02-07 08:33:56 -05:00
Paul Holzinger	7d15bc2efb	fix userns + restart policy with slirp4netns Currently we deadlock in the slirp4netns setup code as we try to configure an non exissting netns. The problem happens because we tear down the netns in the userns case correctly since commit bbd6281ecc but that introduces this slirp4netns problem. The code does a proper new network setup later so we should only use the short cut when not in a userns. Fixes #21477 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-02-06 13:50:07 +01:00
Daniel J Walsh	fcae702205	Reuse timezone code from containers/common Replaces: https://github.com/containers/podman/pull/21077 [NO NEW TESTS NEEDED] Existing tests should handle this. Signed-off-by: Sohan Kunkerkar <sohank2602@gmail.com> Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2024-02-06 07:09:16 -05:00
Oleksandr Redko	8bdf77aa20	Refactor: replace StringInSlice with slices.Contains Signed-off-by: Oleksandr Redko <Oleksandr_Redko@epam.com>	2024-01-05 16:25:56 +02:00
Oleksandr Redko	2a2d0b0e18	chore: delete obsolete // +build lines Signed-off-by: Oleksandr Redko <Oleksandr_Redko@epam.com>	2024-01-04 11:53:38 +02:00
Dan Čermák	5c7f745468	Remove deprecated field ContainerState.NetworkStatusOld This field drags in a dependency on CNI and thereby blocks us from disabling CNI support via a build tag [NO NEW TESTS NEEDED] Signed-off-by: Dan Čermák <dcermak@suse.com>	2023-12-12 17:09:39 +01:00
Daniel J Walsh	c8f262fec9	Use idtools.SafeChown and SafeLchown everywhere If we get an error chowning a file or directory to a UID/GID pair for something like ENOSUP or EPERM, then we should ignore as long as the UID/GID pair on disk is correct. Fixes: https://github.com/containers/podman/issues/20801 [NO NEW TESTS NEEDED] Since this is difficult to test and existing tests should be sufficient to ensure no regression. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2023-11-27 20:41:56 -05:00
Daniel J Walsh	ddd6cdfd77	Ignore SELinux relabel on unsupported file systems We were ignoreing relabel requests on certain unsupported file systems and not on others, this changes to consistently logrus.Debug ENOTSUP file systems. Fixes: https://github.com/containers/podman/discussions/20745 Still needs some work on the Buildah side. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2023-11-22 09:25:38 -05:00
openshift-ci[bot]	77d2658201	Merge pull request #20369 from cgiradkar/Issue-16759-docs Define better error message for container name conflicts with external storage	2023-10-30 10:22:00 +00:00
Valentin Rothberg	e966c86d98	container.conf: support attributed string slices All `[]string`s in containers.conf have now been migrated to attributed string slices which require some adjustments in Buildah and Podman. [NO NEW TESTS NEEDED] Signed-off-by: Valentin Rothberg <vrothberg@redhat.com>	2023-10-27 12:44:33 +02:00
Paul Holzinger	bad25da92e	libpod: add !remote tag This should never be pulled into the remote client. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2023-10-24 12:11:34 +02:00

1 2 3 4 5 ...

496 Commits