Mirror/k3s - k3s - Tyler's Private Git Repository

Mirror/k3s

mirror of https://github.com/k3s-io/k3s.git synced 2024-06-07 19:41:36 +00:00

Author	SHA1	Message	Date
Brad Davidson	c0d129003b	Handle loadbalancer port in TIME_WAIT If the port wanted by the client load balancer is in TIME_WAIT, startup will fail. Set SO_REUSEPORT so that it can be listened on again immediately. The configurable Listen call wants a context, so plumb that through as well. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-03-08 17:05:25 -08:00
Brad Davidson	7cdfaad6ce	Always use static ports for client load-balancers (#3026 ) * Always use static ports for the load-balancers This fixes an issue where RKE2 kube-proxy daemonset pods were failing to communicate with the apiserver when RKE2 was restarted because the load-balancer used a different port every time it started up. This also changes the apiserver load-balancer port to be 1 below the supervisor port instead of 1 above it. This makes the apiserver port consistent at 6443 across servers and agents on RKE2. Additional fixes below were required to successfully test and use this change on etcd-only nodes. * Actually add lb-server-port flag to CLI * Fix nil pointer when starting server with --disable-etcd but no --server * Don't try to use full URI as initial load-balancer endpoint * Fix etcd load-balancer pool updates * Update dynamiclistener to fix cert updates on etcd-only nodes * Handle recursive initial server URL in load balancer * Don't run the deploy controller on etcd-only nodes	2021-03-06 02:29:57 -08:00
Hussein Galal	c26b737b24	Mark disable components flags as experimental (#3018 ) Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-05 00:05:20 +02:00
Brian Downs	4d1f9eda9d	Etcd Snapshot/Restore to/from S3 Compatible Backends (#2902 ) * Add functionality for etcd snapshot/restore to and from S3 compatible backends. * Update etcd restore functionality to extract and write certificates and configs from snapshot.	2021-03-03 11:14:12 -07:00
Hussein Galal	1bf04b6a50	Merge pull request #3003 from galal-hussein/fix_etcd_only_nodes Fix etcd only nodes	2021-03-02 02:16:02 +02:00
Brad Davidson	4fb073e799	Log clearer error on startup if NPC cannot be started Servers should always be upgraded before agents, but generally this isn't required because things are compatible between versions. In this case we're OK with failing closed if the user upgrades out of order, but we should give a clearer message about what steps are required to fix the issue. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-03-01 14:23:59 -08:00
galal-hussein	ef999f0b4f	change error to warn when removing self from etcd members Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-02 00:19:57 +02:00
galal-hussein	d6124981d5	remove etcd member if disable etcd is passed Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-03-01 23:50:50 +02:00
Erik Wilson	4e5218b62c	Apply suggestions from code review Logging cleanup Co-authored-by: Brad Davidson <brad@oatmail.org>	2021-03-01 10:44:24 -07:00
Erik Wilson	4aac6b6bd0	Update to Traefik 2.4.2 and combine manifests	2021-03-01 10:44:24 -07:00
Erik Wilson	54a35505f0	Remove Traefik v1 migration	2021-03-01 10:44:24 -07:00
Chin-Ya Huang	cc96f8140a	Allow download traefik static file and rename Allow writing static files regardless of the version. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2021-03-01 10:44:24 -07:00
Chin-Ya Huang	10e0328977	Traefik v2 integration K3s upgrade via watch over file change of static file and manifest and triggers helm-controller for change. It seems reasonable to only allow upgrade traefik v1->v2 when there is no existing custom traefik HelmChartConfig in the cluster to avoid any incompatibility. Here also separate the CRDs and put them into a different chart to support CRD upgrade. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2021-03-01 10:44:23 -07:00
Brad Davidson	f970e49b7d	Wait for apiserver to become healthy before starting agent controllers It is possible that the apiserver may serve read requests but not allow writes yet, in which case flannel will crash on startup when trying to configure the subnet manager. Fix this by waiting for the apiserver to become fully ready before starting flannel and the network policy controller. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-26 19:28:53 -08:00
Brad Davidson	9b39c1c117	Hide the airgap-extra-registry flag Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-26 16:08:49 -08:00
Brad Davidson	88dd601941	Limit zstd decoder memory Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Brad Davidson	ae5b93a264	Use HasSuffixI utility function Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Brad Davidson	ec661c67d7	Add support for retagging images on load from tarball Adds support for retagging images to appear to have been sourced from one or more additional registries as they are imported from the tarball. This is intended to support RKE2 use cases with system-default-registry where the images need to appear to have been pulled from a registry other than docker.io. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-17 11:48:03 -08:00
Hussein Galal	5749f66aa3	Add disable flags for control components (#2900 ) * Add disable flags to control components Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * golint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes to disable flags Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Add comments to functions Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * Fix joining problem Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * golint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix ticker Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix role labels Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2021-02-12 17:35:57 +02:00
Brian Downs	21d1690d5d	update usage text (#2926 ) update to the --cluster-init usage flag to indicate it's for Etcd	2021-02-10 15:54:04 -07:00
Brad Davidson	6e768c301e	Use appropriate response codes for authn/authz failures Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-09 16:28:20 -08:00
Brad Davidson	374271e9a0	Collect IPs from all pods before deciding to use internal or external addresses (#2909 ) * Collect IPs from all pods before deciding to use internal or external addresses @Taloth correctly noted that the code that iterates over ServiceLB pods to collect IP addresses was failing to add additional internal IPs once the map contained ANY entry from a previous node. This may date back to when ServiceLB used a Deployment instead of a DaemonSet, so there was only ever a single pod. The new behavior is to collect all internal and external IPs, and then construct the address list of a single type - external if there are any, otherwise internal. https://github.com/k3s-io/k3s/issues/1652#issuecomment-774497788 Signed-off-by: Brad Davidson <brad.davidson@rancher.com> Co-authored-by: Brian Downs <brian.downs@gmail.com>	2021-02-09 16:26:57 -08:00
Brad Davidson	e06119729b	Improve handling of comounted cpu,cpuacct controllers (#2911 ) * Improve handling of comounted cpu,cpuacct controllers Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-09 16:12:58 -08:00
Brad Davidson	ad5e504cf0	Allow joining clusters when the server CA is trusted by the OS CA bundle (#2743 ) * Add tests to clientaccess/token * Fix issues in clientaccess/token identified by tests * Update tests to close coverage gaps * Remove redundant check turned up by code coverage reports * Add warnings if CA hash will not be validated Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-08 22:28:57 -08:00
Brad Davidson	6c472b5942	Use zstd instead of gzip for embedded tarball Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-08 21:08:35 -08:00
Brad Davidson	c5e2676d5c	Update local-path-provisioner and helper busybox (#2885 ) * Update local-path-provisioner and helper busybox Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-04 10:49:25 -08:00
Brad Davidson	65c78cc397	Replace options.KubeRouterConfig with config.Node and remove metrics/waitgroup stuff Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	07256cf7ab	Add ServiceIPRange and ServiceNodePortRange to agent config Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	95a1a86847	Spell check upstream code Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Brad Davidson	29483d0651	Initial update of netpol and utils from upstream Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-02-03 10:41:51 -08:00
Akihiro Suda	f3c41b7650	fix cgroup2 support Fix issue 900 cgroup2 support was introduced in PR 2584, but got broken in `f3de60ff31` It was failing with "F1210 19:13:37.305388 4955 server.go:181] cannot set feature gate SupportPodPidsLimit to false, feature is locked to true" Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-01-25 22:45:07 -08:00
Akihiro Suda	728ebcc027	rootless: remove rootful /run/{netns,containerd} symlinks Since a recent commit, rootless mode was failing with the following errors: ``` E0122 22:59:47.615567 21 kuberuntime_manager.go:755] createPodSandbox for pod "helm-install-traefik-wf8lc_kube-system(9de0a1b2-e2a2-4ea5-8fb6-22c9272a182f)" failed: rpc error: code = Unknown desc = failed to create network namespace for sandbox "285ab835609387f82d304bac1fefa5fb2a6c49a542a9921995d0c35d33c683d5": failed to setup netns: open /var/run/netns/cni-c628a228-651e-e03e-d27d-bb5e87281846: permission denied ... E0122 23:31:34.027814 21 pod_workers.go:191] Error syncing pod 1a77d21f-ff3d-4475-9749-224229ddc31a ("coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)"), skipping: failed to "CreatePodSandbox" for "coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)" with CreatePodSandboxError: "CreatePodSandbox for pod \"coredns-854c77959c-w4d7g_kube-system(1a77d21f-ff3d-4475-9749-224229ddc31a)\" failed: rpc error: code = Unknown desc = failed to create containerd task: io.containerd.runc.v2: create new shim socket: listen unix /run/containerd/s/8f0e40e11a69738407f1ebaf31ced3f08c29bb62022058813314fb004f93c422: bind: permission denied\n: exit status 1: unknown" ``` Remove symlinks to /run/{netns,containerd} so that rootless mode can create their own /run/{netns,containerd}. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2021-01-22 19:51:43 -08:00
Brad Davidson	071de833ae	Fix typo in field tag Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 19:38:37 -08:00
Brad Davidson	8011697175	Only container-runtime-endpoint wants RuntimeSocket path as URI Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 18:56:30 -08:00
Yuriy	06fda7accf	Add functionality to bind custom IP address for Etcd metrics endpoint (#2750 ) * Add functionality to bind custom IP address for Etcd metrics endpoint Signed-off-by: yuriydzobak <yurii.dzobak@lotusflare.com>	2021-01-22 17:40:48 -08:00
Brad Davidson	f152f656a0	Replace k3s cloud provider wrangler controller with core node informer (#2843 ) * Replace k3s cloud provider wrangler controller with core node informer Upstream k8s has exposed an interface for cloud providers to access the cloud controller manager's node cache and shared informer since Kubernetes 1.9. This is used by all the other in-tree cloud providers; we should use it too instead of running a dedicated wrangler controller. Doing so also appears to fix an intermittent issue with the uninitialized taint not getting cleared on nodes in CI. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2021-01-22 16:59:48 -08:00
Brian Downs	13229019f8	Add ability to perform an etcd on-demand snapshot via cli (#2819 ) * add ability to perform an etcd on-demand snapshot via cli	2021-01-21 14:09:15 -07:00
Waqar Ahmed	3ea696815b	Do not validate snapshotter argument if docker is enabled Problem: While using ZFS on debian and K3s with docker, I am unable to get k3s working as the snapshotter value is being validated and the validation fails. Solution: We should not validate snapshotter value if we are using docker as it's a no-op in that case. Signed-off-by: Waqar Ahmed <waqarahmedjoyia@live.com>	2021-01-20 12:25:28 -08:00
Erik Wilson	c71060f288	Merge pull request #2744 from erikwilson/rke2-node-password-bootstrap Bootstrap node password with local file	2021-01-11 09:51:30 -07:00
MonzElmasry	86f68d5d62	change etcd dir permission if it exists Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2021-01-08 23:47:36 +02:00
Erik Wilson	4245fd7b67	Return http.StatusOK instead of 0 Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 16:55:47 -07:00
Erik Wilson	2fb411fc83	Fix spelling mistake Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 15:08:07 -07:00
Erik Wilson	09eb44ba53	Bootstrap node password with local file Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-23 15:08:06 -07:00
JenTing Hsiao	57041f0239	Add codespell CI test and fix codespell error (#2740 ) * Add codespell CI test * Fix codespell error	2020-12-22 12:35:58 -08:00
Brad Davidson	8936cf577f	Bump coredns to 1.8.0 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-17 15:20:19 -08:00
Chris Kim	332fd73d46	Add support for both config-file and data-dir at a global level in the self-extracting wrapper for K3s (#2594 ) * Add support for both config-file and data-dir at a global level in the self-extracting wrapper for K3s Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-16 09:27:57 -08:00
Erik Wilson	1230d7b7df	Fix HA server initialization Signed-off-by: Erik Wilson <Erik.E.Wilson@gmail.com>	2020-12-15 16:08:28 -08:00
Brad Davidson	8e4d3e645b	Restore legacy master role for etcd nodes Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-15 15:15:46 -08:00
Chris Kim	61ef2ce95e	use version.Program Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-09 12:34:13 -08:00
Chris Kim	48925fcb88	Simplify checkCgroups function call Co-authored-by: Brian Downs <brian.downs@gmail.com>	2020-12-09 11:59:54 -08:00
Chris Kim	a3f87a81bd	Independently set kubelet-cgroups and runtime-cgroups, and detect if we are running under a systemd scope Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-09 11:39:33 -08:00
Brad Davidson	c5aad1b5ed	Disable the ServiceAccountIssuerDiscovery feature-gate. We're not setting ``--service-account-issuer` to a https URL, which causes an error message at startup when the feature gate is enabled. From the docs on that flag: > If this option is not a valid URI per the OpenID Discovery 1.0 spec, the > ServiceAccountIssuerDiscovery feature will remain disabled, even if the > feature gate is set to true. It is highly recommended that this value > comply with the OpenID spec: > https://openid.net/specs/openid-connect-discovery-1_0.html. In practice, > this means that service-account-issuer must be an https URL. It is also > highly recommended that this URL be capable of serving OpenID discovery > documents at {service-account-issuer}/.well-known/openid-configuration. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	63f2211b31	deprecate the "node-role.kubernetes.io/master" label / taint Related to https://github.com/kubernetes/kubernetes/pull/95382 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	c6950d2cb0	Update Kubernetes to v1.20.0-k3s1 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 22:51:34 -08:00
Brad Davidson	cd27c6fcbe	Bump coredns to 1.7.1 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-12-08 15:58:17 -08:00
Erik Wilson	0ae7f2d5ae	Merge pull request #2407 from erikwilson/node-passwd-cleanup Use secrets for node-passwd entries	2020-12-08 16:25:13 -07:00
Chris Kim	3d1e40eaa3	Handle the case when systemd lives under `/init.scope` Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-08 10:26:54 -08:00
Chris Kim	e71e11fed0	Merge pull request #2642 from Oats87/issues/k3s/2548-cgroup Set a cgroup if containerized	2020-12-08 10:05:21 -08:00
Chris Kim	f3de60ff31	When there is a defined cgroup for PID 1, assume we are containerized and set a root Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-12-07 13:15:15 -08:00
Hussein Galal	fadc5a8057	Add tombstone file to etcd and catch errc etcd channel (#2592 ) * Add tombstone file to embedded etcd Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod update Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more fixes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * more changes Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * gofmt and goimports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod update Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go lint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go lint Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go mod tidy Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-12-07 22:30:44 +02:00
Chin-Ya Huang	3f0f2b342e	Show go version when executes with --version. Signed-off-by: Chin-Ya Huang <chin-ya.huang@suse.com>	2020-12-04 12:51:15 -08:00
transhapHigsn	87a43c69e1	Problem: CoreDNS getting preempted by other pods Solution: Set priorityClassName to system-node-critical of traefik, metrics-server, local storage and coredns deployment Signed-off-by: transhapHigsn <fet.prashantsingh@gmail.com>	2020-12-04 12:50:12 -08:00
Akihiro Suda	eb72d509ce	pkg/agent/config: validate containerd snapshotter value Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 11:00:00 -08:00
Akihiro Suda	05f6255437	add fuse-overlayfs snapshotter (mainly for rootless mode) Ubuntu and Debian kernels support mounting real overlayfs inside userns, but the vanilla kernel still does not allow it. OTOH fuse-overlayfs can be mounted inside userns with the vanilla kernel (>= 4.18). Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 11:00:00 -08:00
Akihiro Suda	43f7eaedf8	rootless: fix "stat /run/user/1000: no such file or directory" on `kubectl run` k3s was mounting a tmpfs on `/run` by itself, so it was hiding RootlessKit's `/run`. Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 10:31:21 -08:00
Akihiro Suda	67410d2757	rootless: validate sysctl before starting up Fix #2420 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-12-01 09:21:39 -08:00
Jacob Blain Christen	3647654fe4	[migration k3s-io] update helm-controller dependency (#2569 ) rancher/helm-controller ➡️ k3s-io/helm-controller Part of https://github.com/rancher/k3s/issues/2189 Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-12-01 08:59:10 -07:00
Akihiro Suda	0b45e32486	Support cgroup v2 Signed-off-by: Akihiro Suda <akihiro.suda.cz@hco.ntt.co.jp>	2020-11-30 22:57:37 -08:00
Jacob Blain Christen	36230daa86	[migration k3s-io] update kine dependency (#2568 ) rancher/kine ➡️ k3s-io/kine Part of https://github.com/rancher/k3s/issues/2189 Signed-off-by: Jacob Blain Christen <jacob@rancher.com>	2020-11-30 16:45:22 -07:00
Brad Davidson	b873d3a03b	Explicitly set agent paths within --data-dir Removing the cfg.DataDir mutation in `3e4fd7b` did not break anything, but did change some paths in unwanted ways. Rather than mutating the user-supplied command-line flags, explicitly specify the agent subdirectory as needed. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-11 09:26:41 -08:00
Brad Davidson	58b5b21f0d	Don't pass cloud-provider flag to controller-manager As per documentation, the cloud-provider flag should not be passed to controller-manager when using cloud-controller. However, the legacy cloud-related controllers still need to be explicitly disabled to prevent errors from being logged. Fixing this also prevents controller-manager from creating the cloud-controller-manager service account that needed extra RBAC. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-09 13:55:09 -08:00
Brad Davidson	3e4fd7b41f	Respect --data-dir path for crictl.yaml Related to rancher/rke2#474 Note that anyone who customizes the data-dir path will have to set CRI_CONFIG_FILE to the correct path when using the wrapped binaries (crictl, etc). This is better than dropping files in the incorrect location. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	f50e3140f9	Disable configure-cloud-routes and external service/route programming support when using k3s stub cloud controller Resolves warning 3 from #2471 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	31575e407a	Add Cluster ID support to k3s stub cloud controller Resolves warning 2 from #2471. As per https://github.com/kubernetes/cloud-provider/issues/12 the ClusterID requirement was never really followed through on, so the flag is probably going to be removed in the future. One side-effect of this is that the core k8s cloud-controller-manager also wants to watch nodes, and needs RBAC to do so. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	5b318d093f	Fix containerd sock path warning Resolves warning 1 from #2471 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Brad Davidson	d1424626ac	Disable containerd experimental snapshot labels Related to #2455 and containerd/containerd#4684 These were not meant to be enabled by default, break images with many layers, and will be disabled by default on the next containerd release. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-05 15:51:10 -08:00
Erik Wilson	992ca52c31	Enable go test in ci	2020-11-05 09:48:53 -07:00
Erik Wilson	92d04355f4	Use secrets for node-passwd entries and cleanup	2020-11-05 09:48:53 -07:00
Brad Davidson	3b8ec74049	Update disables list when building with no_stage The --disable/--no-deploy flags actually turn off some built-in controllers, in addition to preventing manifests from getting loaded. Make it clear which controllers can still be disabled even when the packaged components are ommited by the no_stage build tag. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-11-04 13:39:45 -08:00
Menna Elmasry	523ccaf3f2	Merge pull request #2448 from MonzElmasry/new_b Make etcd use node private ip	2020-10-29 00:23:56 +02:00
MonzElmasry	e8436cc76b	Make etcd use node private ip Signed-off-by: MonzElmasry <menna.elmasry@rancher.com>	2020-10-28 23:45:24 +02:00
Chris Kim	7b8a147a1b	Merge pull request #2408 from Oats87/rpm-install-selinux Add auto-install capability to install.sh for k3s-selinux	2020-10-28 14:24:09 -04:00
Hussein Galal	fcd18d1b6e	skip node delete from removed member (#2413 ) * skip node delete from removed member Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use grpc errors Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go imports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * exit if node is the etcd that being removed Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-10-28 18:32:51 +02:00
Chris Kim	96fc4c4b21	Add iptable_nat to modprobe list Signed-off-by: Chris Kim <oats87g@gmail.com>	2020-10-27 14:22:14 -04:00
Brad Davidson	de18528412	Make etcd voting members responsible for managing learners (#2399 ) * Set etcd timeouts using values from k8s instead of etcdctl Fix for one of the warnings from #2303 * Use etcd zap logger instead of deprecated capsnlog Fix for one of the warnings from #2303 * Remove member self-promotion code paths * Add learner promotion tracking code * Fix RaftAppliedIndex progress check * Remove ErrGRPCKeyNotFound check This is not used by v3 API - it just returns a response with 0 KVs. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-10-27 11:06:26 -07:00
Erik Wilson	6b11d86037	Merge pull request #2377 from erikwilson/no-proxy-fix Use no_proxy env, add .svc and cluster domains	2020-10-12 13:46:22 -07:00
Erik Wilson	56e077eb29	Use no_proxy env, add .svc and cluster domains	2020-10-12 11:02:07 -07:00
Erik Wilson	114b5ccad1	Merge pull request #2363 from erikwilson/netpol-informers Add event handlers to network policy controller	2020-10-12 08:53:39 -07:00
Erik Wilson	e26e333b7e	Add network policy controller CacheSyncOrTimeout	2020-10-07 12:35:44 -07:00
Erik Wilson	045cd49ab5	Add event handlers to network policy controller	2020-10-07 12:10:27 -07:00
Erik Wilson	ce0da0a0f4	Add file verification for data directory	2020-10-06 10:29:27 -07:00
Erik Wilson	66d29148f7	Add Release function for flock	2020-10-06 10:29:27 -07:00
Erik Wilson	360d82d20e	Add flock from k8s.io/kubernetes/pkg/util/flock	2020-10-06 10:29:26 -07:00
Brad Davidson	c3c983198f	Add temporary fix for issue with interrupted etcd promote This is a minimal fix for https://github.com/rancher/rke2/issues/392 Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-30 11:45:58 -07:00
Hussein Galal	373449ec0a	Allow for multiple etcd snapshot restoration (#2307 ) * add reset tmp file Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * go imports Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix multiple lines string Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * fix typo Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com> * use resetFile function Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>	2020-09-30 02:53:31 +02:00
Brad Davidson	8262e23169	Revert removal of EndpointName hooks (#2319 ) * Revert "Remove dead EndpointName code" This reverts commit `8025da5a8d`. * Fix docstrings based on proper understanding of use	2020-09-28 18:13:55 -07:00
Brad Davidson	360b0f1ee5	Add timeout to clientaccess http client The default http client does not have an overall request timeout, so connections to misbehaving or unavailable servers can stall for an excessive amount of time. At the moment, just attempting to join an unavailable cluster takes 2 minutes and 40 seconds to timeout. Resolve that by setting a reasonable request timeout. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:27 -07:00
Brad Davidson	cdfc6cfa1a	Split clientaccess token/kubeconfig code Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:27 -07:00
Brad Davidson	45dd4afe50	Simplify token parsing Improves readability, reduces round-trips to the join server to validate certs. Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:26:24 -07:00
Brad Davidson	9074da7405	Fix misc nits and missing/unused imports Signed-off-by: Brad Davidson <brad.davidson@rancher.com>	2020-09-27 03:10:00 -07:00

1 2 3 4 5 ...

658 Commits