podman

mirror of https://github.com/containers/podman.git synced 2025-05-30 07:04:03 +08:00

Author	SHA1	Message	Date
Giuseppe Scrivano	fe919e4914	oci: propagate NOTIFY_SOCKET on runtime start with https://github.com/opencontainers/runc/pull/1807 we moved the systemd notify initialization from "create" to "start", so that the OCI runtime doesn't hang while waiting on reading from the notify socket. This means we also need to set the correct NOTIFY_SOCKET when start'ing the container. Closes: https://github.com/containers/libpod/issues/746 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-11-28 16:25:12 +01:00
OpenShift Merge Robot	effd63d6d5	Merge pull request #1848 from adrianreber/master Add tcp-established to checkpoint/restore	2018-11-28 07:00:24 -08:00
OpenShift Merge Robot	d346996e15	Merge pull request #1849 from giuseppe/report-rootless-netmode rootless: add new netmode "slirp4netns"	2018-11-28 06:18:28 -08:00
Giuseppe Scrivano	0365f57371	rootless: fix cleanup The conmon exit command is running inside of a namespace where the process is running with uid=0. When it launches again podman for the cleanup, podman is not running in rootless mode as the uid=0. Export some more env variables to tell podman we are in rootless mode. Closes: https://github.com/containers/libpod/issues/1859 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-11-28 10:19:13 +01:00
Giuseppe Scrivano	95f22a2ca0	network: allow slirp4netns mode also for root containers Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-11-28 09:21:59 +01:00
Adrian Reber	03c88a3deb	Added tcp-established to checkpoint/restore CRIU can checkpoint and restore processes/containers with established TCP connections if the correct option is specified. To implement checkpoint and restore with support for established TCP connections with Podman this commit adds the necessary options to runc during checkpoint and also tells conmon during restore to use 'runc restore' with '--tcp-established'. For this Podman feature to work a corresponding conmon change is required. Example: $ podman run --tmpfs /tmp --name podman-criu-test -d docker://docker.io/yovfiatbeb/podman-criu-test $ nc `podman inspect -l \| jq -r '.[0].NetworkSettings.IPAddress'` 8080 GET /examples/servlets/servlet/HelloWorldExample Connection: keep-alive 1 GET /examples/servlets/servlet/HelloWorldExample Connection: keep-alive 2 $ # Using HTTP keep-alive multiple requests are send to the server in the container $ # Different terminal: $ podman container checkpoint -l criu failed: type NOTIFY errno 0 $ # Looking at the log file would show errors because of established TCP connections $ podman container checkpoint -l --tcp-established $ # This works now and after the restore the same connection as above can be used for requests $ podman container restore -l --tcp-established The restore would fail without '--tcp-established' as the checkpoint image contains established TCP connections. Signed-off-by: Adrian Reber <areber@redhat.com>	2018-11-28 08:00:38 +01:00
Adrian Reber	0592558289	Use also a struct to pass options to Restore() This is basically the same change as ff47a4c2d5485fc49f937f3ce0c4e2fd6bdb1956 (Use a struct to pass options to Checkpoint()) just for the Restore() function. It is used to pass multiple restore options to the API and down to conmon which is used to restore containers. This is for the upcoming changes to support checkpointing and restoring containers with '--tcp-established'. Signed-off-by: Adrian Reber <areber@redhat.com>	2018-11-28 08:00:37 +01:00
OpenShift Merge Robot	e679e768f1	Merge pull request #1832 from giuseppe/always-make-explicit-tty-to-exec exec: always make explicit the tty value	2018-11-27 04:08:03 -08:00
Adrian Reber	b0572d6229	Added option to keep containers running after checkpointing CRIU supports to leave processes running after checkpointing: -R\|--leave-running leave tasks in running state after checkpoint runc also support to leave containers running after checkpointing: --leave-running leave the process running after checkpointing With this commit the support to leave a container running after checkpointing is brought to Podman: --leave-running, -R leave the container running after writing checkpoint to disk Now it is possible to checkpoint a container at some point in time without stopping the container. This can be used to rollback the container to an early state: $ podman run --tmpfs /tmp --name podman-criu-test -d docker://docker.io/yovfiatbeb/podman-criu-test $ curl 10.88.64.253:8080/examples/servlets/servlet/HelloWorldExample 3 $ podman container checkpoint -R -l $ curl 10.88.64.253:8080/examples/servlets/servlet/HelloWorldExample 4 $ curl 10.88.64.253:8080/examples/servlets/servlet/HelloWorldExample 5 $ podman stop -l $ podman container restore -l $ curl 10.88.64.253:8080/examples/servlets/servlet/HelloWorldExample 4 So after checkpointing the container kept running and was stopped after some time. Restoring this container will restore the state right at the checkpoint. Signed-off-by: Adrian Reber <areber@redhat.com>	2018-11-20 17:25:44 +01:00
Giuseppe Scrivano	fd01402930	exec: always make explicit the tty value otherwise runc will take by default the value used for creating the container. Setting it explicit overrides its default value and we won't end up trying to use a terminal when not available. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1625876 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-11-20 17:08:57 +01:00
Šimon Lukašík	223d102ec7	Lint: Do not ignore errors from docker run command when selinux enabled Redefining err by := operator within block makes this err variable block local. Addressing lint: libpod/oci.go:368:3⚠️ ineffectual assignment to err (ineffassign) Signed-off-by: Šimon Lukašík <slukasik@redhat.com>	2018-11-10 10:52:24 +01:00
Matthew Heon	0f45403c9b	Fix misspelling Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-11-07 11:36:01 -05:00
Matthew Heon	3286b0185d	Retrieve container PID from conmon Instead of running a full sync after starting a container to pick up its PID, grab it from Conmon instead. Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-11-07 11:36:01 -05:00
Matthew Heon	94763a47a6	If a container ceases to exist in runc, set exit status When we scan a container in runc and see that it no longer exists, we already set ContainerStatusExited to indicate that it no longer exists in runc. Now, also set an exit code and exit time, so PS output will make some sense. Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-11-07 11:36:01 -05:00
Matthew Heon	140f87c474	EXPERIMENTAL: Do not call out to runc for sync When syncing container state, we normally call out to runc to see the container's status. This does have significant performance implications, though, and we've seen issues with large amounts of runc processes being spawned. This patch attempts to use stat calls on the container exit file created by Conmon instead to sync state. This massively decreases the cost of calling updateContainer (it has gone from an almost-unconditional fork/exec of runc to a single stat call that can be avoided in most states). Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-11-07 11:36:01 -05:00
baude	318e33ce2c	read conmon output and convert to json in two steps when reading the output from conmon using the JSON methods, it appears that JSON marshalling is higher in pprof than it really is because the pipe is "waiting" for a response. this gives us a clearer look at the real CPU/time consumers. Signed-off-by: baude <bbaude@redhat.com>	2018-10-23 13:21:33 -05:00
Giuseppe Scrivano	fc89065a80	oci: cleanup process status I've seen a runc zombie process hanging around, it is caused by not cleaning up the "$OCI status" process. Also adjust another location that has the same issue. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-10-23 09:32:44 +02:00
Daniel J Walsh	2444ac9926	Move rootless directory handling to the libpod/pkg/util directory This should allow us to share this code with buildah. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2018-10-22 09:43:59 -04:00
OpenShift Merge Robot	094b8b7350	Merge pull request #1570 from giuseppe/fix-gvisor podman: allow usage of gVisor as OCI runtime	2018-10-04 13:24:57 -07:00
Adrian Reber	f7c8fd8a3d	Add support to checkpoint/restore containers runc uses CRIU to support checkpoint and restore of containers. This brings an initial checkpoint/restore implementation to podman. None of the additional runc flags are yet supported and container migration optimization (pre-copy/post-copy) is also left for the future. The current status is that it is possible to checkpoint and restore a container. I am testing on RHEL-7.x and as the combination of RHEL-7 and CRIU has seccomp troubles I have to create the container without seccomp. With the following steps I am able to checkpoint and restore a container: # podman run --security-opt="seccomp=unconfined" -d registry.fedoraproject.org/f27/httpd # curl -I 10.22.0.78:8080 HTTP/1.1 403 Forbidden # <-- this is actually a good answer # podman container checkpoint <container> # curl -I 10.22.0.78:8080 curl: (7) Failed connect to 10.22.0.78:8080; No route to host # podman container restore <container> # curl -I 10.22.0.78:8080 HTTP/1.1 403 Forbidden I am using CRIU, runc and conmon from git. All required changes for checkpoint/restore support in podman have been merged in the corresponding projects. To have the same IP address in the restored container as before checkpointing, CNI is told which IP address to use. If the saved network configuration cannot be found during restore, the container is restored with a new IP address. For CRIU to restore established TCP connections the IP address of the network namespace used for restore needs to be the same. For TCP connections in the listening state the IP address can change. During restore only one network interface with one IP address is handled correctly. Support to restore containers with more advanced network configuration will be implemented later. v2: * comment typo * print debug messages during cleanup of restore files * use createContainer() instead of createOCIContainer() * introduce helper CheckpointPath() * do not try to restore a container that is paused * use existing helper functions for cleanup * restructure code flow for better readability * do not try to restore if checkpoint/inventory.img is missing * git add checkpoint.go restore.go v3: * move checkpoint/restore under 'podman container' v4: * incorporated changes from latest reviews Signed-off-by: Adrian Reber <areber@redhat.com>	2018-10-03 21:41:39 +02:00
Giuseppe Scrivano	c5546729b8	oci: split the stdout and stderr pipes read the OCI status from stdout, not the combined stdout+stderr stream. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-10-03 21:39:35 +02:00
Giuseppe Scrivano	c21e85e5f4	oci: always set XDG_RUNTIME_DIR Fix an issue when using gVisor that couldn't start the container since the XDG_RUNTIME_DIR env variable used for the "create" and "start" commands is different. Set the environment variable for each command so that the OCI runtime gets always the same value. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-10-03 21:39:34 +02:00
OpenShift Merge Robot	a3c4ce6717	Merge pull request #1531 from mheon/add_exited_state Add ContainerStateExited and OCI delete() in cleanup()	2018-10-03 06:06:14 -07:00
Matthew Heon	b7c5fa70ab	Fix Wait() to allow Exited state as well as Stopped Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-10-02 14:26:19 -04:00
baude	4f825f2e07	Add container runlabel command Execute the command as described by a container image. The value of the label is processed into a command by: 1. Ensuring the first argument of the command is podman. 2. Substituting any variables with those defined by the environment or otherwise. If no label exists in the container image, nothing is done. podman container runlabel LABEL IMAGE extra_args Signed-off-by: baude <bbaude@redhat.com>	2018-09-28 14:14:13 -05:00
Matthew Heon	95a374100b	Add a way to disable port reservation We've increased the default rlimits to allow Podman to hold many ports open without hitting limits and crashing, but this doesn't solve the amount of memory that holding open potentially thousands of ports will use. Offer a switch to optionally disable port reservation for performance- and memory-constrained use cases. Signed-off-by: Matthew Heon <matthew.heon@gmail.com>	2018-09-13 14:42:47 -04:00
Giuseppe Scrivano	46acded58d	rootless, exec: use the new function to join the userns since we have a way for joining an existing userns use it instead of nsenter. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1371 Approved by: rhatdan	2018-08-29 16:25:20 +00:00
Giuseppe Scrivano	8b5823a62d	rootless: don't use kill --all The OCI runtime might use the cgroups to see what PIDs are inside the container, but that doesn't work with rootless containers. Closes: https://github.com/containers/libpod/issues/1337 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1331 Approved by: rhatdan	2018-08-26 07:22:42 +00:00
Giuseppe Scrivano	c5753f57c1	rootless: exec handle processes that create an user namespace Manage the case where the main process of the container creates and joins a new user namespace. In this case we want to join only the first child in the new hierarchy, which is the user namespace that was used to create the container. Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1331 Approved by: rhatdan	2018-08-26 07:22:42 +00:00
Giuseppe Scrivano	720eb85ba5	rootless: fix exec We cannot re-exec into a new user namespace to gain privileges and access an existing as the new namespace is not the owner of the existing container. "unshare" is used to join the user namespace of the target container. The current implementation assumes that the main process of the container didn't create a new user namespace. Since in the setup phase we are not running with euid=0, we must skip the setup for containers/storage. Closes: https://github.com/containers/libpod/issues/1329 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1331 Approved by: rhatdan	2018-08-26 07:22:42 +00:00
Daniel J Walsh	d20f3a5146	switch projectatomic to containers Need to get some small changes into libpod to pull back into buildah to complete buildah transition. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #1270 Approved by: mheon	2018-08-16 17:12:36 +00:00
Valentin Rothberg	e9b23f7cca	oci.go: syslog: fix debug formatting Signed-off-by: Valentin Rothberg <vrothberg@suse.com> Closes: #1242 Approved by: rhatdan	2018-08-09 12:24:24 +00:00
Matthew Heon	b01ddc7b09	Pass newly-added --log-level flag to Conmon Signed-off-by: Matthew Heon <matthew.heon@gmail.com> Closes: #1232 Approved by: rhatdan	2018-08-08 19:23:41 +00:00
Giuseppe Scrivano	cfcd928476	network: add support for rootless network with slirp4netns slirp4netns is required to setup the network namespace: https://github.com/rootless-containers/slirp4netns Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1156 Approved by: rhatdan	2018-07-31 13:39:29 +00:00
Giuseppe Scrivano	9ae7b1a5b1	oci: keep exposed ports busy and leak the fd into conmon Bind all the specified TCP and UDP ports so that another process cannot reuse them. The fd of the listener is then leaked into conmon so that the socket is kept busy until the container exits. Closes: https://github.com/projectatomic/libpod/issues/210 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1100 Approved by: mheon	2018-07-19 13:21:50 +00:00
Matthew Heon	028374b99e	Record whether the container has exited Use this to supplement exit codes returned from containers, to make sure we know when exit codes are invalid (as the container has not yet exited) Signed-off-by: Matthew Heon <mheon@redhat.com>	2018-07-13 14:28:41 -04:00
Giuseppe Scrivano	340becf542	rootless: propagate errors from GetRootlessRuntimeDir() Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2018-07-11 09:38:04 +02:00
W. Trevor King	b2344b83ed	pkg/ctime: Factor libpod/finished* into a separate package This removes some boilerplate from the libpod package, so we can focus on container stuff there. And it gives us a tidy sub-package for focusing on ctime extraction, so we can focus on unit testing and portability of the extraction utility there. For the unsupported implementation, I'm falling back to Go's ModTime [1]. That's obviously not the creation time, but it's likely to be closer than the uninitialized Time structure from cc6f0e85 (more changes to compile darwin, 2018-07-04, #1047). Especially for our use case in libpod/oci, where we're looking at write-once exit files. The test is more complicated than I initially expected, because on Linux filesystem timestamps come from a truncated clock without interpolation [2] (and network filesystems can be completely decoupled [3]). So even for local disks, creation times can be up to a jiffie earlier than 'before'. This test ensures at least monotonicity by creating two files and ensuring the reported creation time for the second is greater than or equal to the reported creation time for the first. It also checks that both creation times are within the window from one second earlier than 'before' through 'after'. That should be enough of a window for local disks, even if the kernel for those systems has an abnormally large jiffie. It might be ok on network filesystems, although it will not be very resilient to network clock lagging behind the local system clock. [1]: https://golang.org/pkg/os/#FileInfo [2]: https://groups.google.com/d/msg/linux.kernel/mdeXx2TBYZA/_4eJEuJoAQAJ Subject: Re: Apparent backward time travel in timestamps on file creation Date: Thu, 30 Mar 2017 20:20:02 +0200 Message-ID: <tqMPU-1Sb-21@gated-at.bofh.it> [3]: https://groups.google.com/d/msg/linux.kernel/mdeXx2TBYZA/cTKj4OBuAQAJ Subject: Re: Apparent backward time travel in timestamps on file creation Date: Thu, 30 Mar 2017 22:10:01 +0200 Message-ID: <tqOyl-36A-1@gated-at.bofh.it> Signed-off-by: W. Trevor King <wking@tremily.us> Closes: #1050 Approved by: mheon	2018-07-06 17:54:32 +00:00
baude	cc6f0e85f9	more changes to compile darwin this should represent the last major changes to get darwin to compile. again, the purpose here is to get darwin to compile so that we can eventually implement a ci task that would protect against regressions for darwin compilation. i have left the manual darwin compilation largely static still and in fact now only interject (manually) two build tags to assist with the build. trevor king has great ideas on how to make this better and i will defer final implementation of those to him. Signed-off-by: baude <bbaude@redhat.com> Closes: #1047 Approved by: rhatdan	2018-07-05 16:05:12 +00:00
Giuseppe Scrivano	77758a6c9f	rootless: set XDG_RUNTIME_DIR also for state and exec Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #1048 Approved by: mheon	2018-07-05 13:30:15 +00:00
baude	b96be3af1b	changes to allow for darwin compilation Signed-off-by: baude <bbaude@redhat.com> Closes: #1015 Approved by: baude	2018-06-29 20:44:09 +00:00
Daniel J Walsh	7fc1a329bd	Add `podman container cleanup` to CLI When we run containers in detach mode, nothing cleans up the network stack or the mount points. This patch will tell conmon to execute the cleanup code when the container exits. It can also be called to attempt to cleanup previously running containers. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #942 Approved by: mheon	2018-06-29 15:25:21 +00:00
Daniel J Walsh	c9eddd22eb	conmon no longer writes to syslog If the caller sets up the app to be in logrus.DebugLevel, then we will add the --syslog flag to conmon to get all of the messages. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #1014 Approved by: TomSweeneyRedHat	2018-06-29 08:22:27 +00:00
Giuseppe Scrivano	4415bad6fe	oci: set XDG_RUNTIME_DIR to the runtime from GetRootlessRuntimeDir() Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #936 Approved by: rhatdan	2018-06-27 14:07:17 +00:00
Giuseppe Scrivano	399c3a5e4b	oci: do not set the cgroup path in Rootless mode Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #871 Approved by: mheon	2018-06-15 14:53:18 +00:00
Giuseppe Scrivano	ca03627a80	oci: pass XDG_RUNTIME_DIR down to the OCI runtime Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com> Closes: #871 Approved by: mheon	2018-06-15 14:53:18 +00:00
Daniel J Walsh	dedc7cc329	Remove SELinux transition rule after conmon is started. We have an issue where iptables command is being executed by podman and attempted to run with a different label. This fix changes podman to only change the label on the conmon command and then set the SELinux interface back to the default. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #906 Approved by: giuseppe	2018-06-06 18:23:37 +00:00
Daniel J Walsh	d6b8f62dd6	Catch does not exist error There was a new line at the end of does not exist which was causing this to fail. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #863 Approved by: baude	2018-05-31 19:28:00 +00:00
Daniel J Walsh	7c6034e161	We need to change the SELinux label of the conmon process to s0 If SELinux is enabled, we are leaking in pipes into the container owned by conmon. The container processes are not allowed to use these pipes, if the calling process is fully ranged. By changing the level of the conmon process to s0, this allows container processes to use the pipes. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com> Closes: #854 Approved by: mheon	2018-05-31 13:51:11 +00:00
Matthew Heon	20bceb787d	Use container cleanup() functions when removing Instead of manually calling the individual functions that cleanup uses to tear down a container's resources, just call the cleanup function to make sure that cleanup only needs to happen in one place. Signed-off-by: Matthew Heon <matthew.heon@gmail.com> Closes: #790 Approved by: rhatdan	2018-05-17 18:55:59 +00:00

1 2

84 Commits