Commit Graph

710 Commits

Author SHA1 Message Date
Brian Downs
864e800896
[Release-1.21] All bootstrap backport (#4452)
Add ability to reconcile bootstrap data between datastore and disk (#3398)
2021-11-10 16:20:46 -07:00
Brad Davidson
df033fa248 Fix log/reap reexec
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-11-01 14:59:08 -07:00
Brad Davidson
254d2f696e Fix other uses of NewForConfigOrDie in contexts where we could return err
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 3da1bb3af2)
2021-10-29 16:43:47 -07:00
Brad Davidson
388963440d Watch the local Node object instead of get/sleep looping
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 5acd0b9008)
2021-10-29 16:43:47 -07:00
Brad Davidson
afa1981f1d Block scheduler startup on untainted node when using embedded CCM
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 3fe460d080)
2021-10-29 16:43:47 -07:00
Brad Davidson
bb50c45a6f Revert "Backport bootstrap release 1.21 (#4313)"
This reverts commit
f0ea0a0946

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-27 14:17:22 -07:00
galal-hussein
d413f97146 Update peer address when running cluster-reset
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-25 15:45:51 -07:00
Brian Downs
f0ea0a0946
Backport bootstrap release 1.21 (#4313) 2021-10-25 13:03:21 -07:00
Derek Nola
50fb1ce065
Added configuration input to etcd-snapshot (#4280) (#4282)
Signed-off-by: dereknola <derek.nola@suse.com>
2021-10-22 13:08:22 -07:00
Manuel Buil
11dce34b4e Update to the newest flannel
Fixes issues such as the internode connectivity

Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-21 13:02:18 +02:00
Manuel Buil
41b0997e31 Add dual-stack support
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-21 10:29:03 +02:00
Brad Davidson
a18c2efb4c Refactor log and reaper exec to omit MAINPID
Using MAINPID breaks systemd's exit detection, as it stops watching the
original pid, but is unable to watch the new pid as it is not a child
of systemd itself. The best we can do is just notify when execing the child
process.

We also need to consolidate forking into a sigle place so that we don't
end up with multiple levels of child processes if both redirecting log
output and reaping child processes.

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit dc18ef2e51)
2021-10-20 14:36:54 -07:00
Brad Davidson
504e249a5e Add containerd ready channel to delay etcd node join
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 88178ae65e)
2021-10-20 11:06:12 -07:00
Brad Davidson
e814850eef Fix premature etcd shutdown when joining an existing cluster
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 086ca8ba6a)
2021-10-20 11:06:12 -07:00
Brad Davidson
557d425010 Minor cleanup on cribbed function
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 57377d2cd4)
2021-10-20 11:06:12 -07:00
Brad Davidson
4f28561e34 Wait for apiserver readyz instead of healthz
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 3449d5b9f9)
2021-10-20 11:06:12 -07:00
Manuel Buil
89f5721a3a Fix race condition in cloud provider
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-10-19 10:25:34 +02:00
Brian Downs
4aa9553978
[Release-1.21] - Add etcd s3 timeout (#4207) (#4228) 2021-10-18 10:45:38 -07:00
Hussein Galal
22f7f1c41a
Make sure there are no duplicates in etcd member list (#4025) (#4213)
* Make sure there are no duplicates in etcd member list

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fix node names with hyphens

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* use full server name for etcd node name

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-14 23:39:24 +02:00
Hussein Galal
e7bf7b141f
Display cluster tls error only in debug mode (#4201)
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-10-13 01:10:54 +02:00
Brian Downs
aa5a0a8c78
set transport to skip verify if se skip flag passed (#4102) (#4104) 2021-10-07 12:47:32 -07:00
Derek Nola
3ee5098225
Add "etcd-" prefix to etcd-snapshot commands as aliases (#4161) (#4171)
* Add "etcd-" prefix to etcd-snapshot commands as alias

Signed-off-by: dereknola <derek.nola@suse.com>
2021-10-07 12:26:36 -07:00
Brad Davidson
69a9f46bce Don't evacuate the root cgroup when rootless
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-10-01 16:19:07 -07:00
Brad Davidson
38ddda587a Properly handle operation as init process
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-28 11:33:45 -07:00
Manuel Buil
15f3a2ebfb Enable the inheritance of settings for ipv6
Signed-off-by: Manuel Buil <mbuil@suse.com>
2021-09-28 18:05:33 +02:00
dereknola
8c2f7ac41c Remove experimental from cluster commands
Signed-off-by: dereknola <derek.nola@suse.com>
2021-09-17 15:37:45 -07:00
Chris Kim
acad8ef840
[release-1.21] Update Kubernetes to v1.21.5 (#4032)
* Update to v1.21.5

Signed-off-by: Chris Kim <oats87g@gmail.com>
2021-09-17 09:21:32 -07:00
Chris Kim
6acee2e2f5
No-op when etcd member was already removed and use existing name for etcd controller (#4015)
Signed-off-by: Chris Kim <oats87g@gmail.com>
2021-09-15 07:36:10 -07:00
Chris Kim
863512e055
Initial leader elected etcd member management controller (#4010)
Signed-off-by: Chris Kim <oats87g@gmail.com>
2021-09-14 10:19:42 -07:00
Brad Davidson
37caf87d6d Add exposed metrics listener instead of replacing loopback listener
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-10 10:03:49 -07:00
Brad Davidson
a8a6edfb0d Add missing node name entry to apiserver SAN list
Also honor node-ip when adding the node address to the SAN list, instead
of hardcoding the autodetected IP address.

Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-09-01 13:22:50 -07:00
Hussein Galal
659307d327
Fix condition for adding kubernetes endpoints (#3941) (#3946)
* Fix condition for adding kubernetes endpoints

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* Fix condition for adding kubernetes endpoints

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-08-31 19:37:42 +02:00
Brad Davidson
7cf85c235a Fix issue where addon checksum was never stored
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-08-27 10:26:38 -07:00
Hussein Galal
656c190629
Reset load balancer state during restoraion (#3878)
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

Reset load balancer state during restoraion

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-08-18 18:59:03 +02:00
Malte Starostik
e45726f610 Fix URL pruning when joining an etcd member
* Fix URL pruning when joining an etcd member

Problem:
Existing member clientURLs were checked if they contain the joining
node's IP. In some edge cases this would prune valid URLs when the
joining IP is a substring match of the only existing member's IP.
Because of this, it was impossible to e.g. join 10.0.0.2 to an existing
node that has an IP of 10.0.0.2X or 10.0.0.2XX:

level=fatal msg="starting kubernetes: preparing server: start managed database:
joining etcd cluster: etcdclient: no available endpoints"

Solution:
Fixed by properly parsing the URLs and comparing the IPs for equality
instead of substring match.

Signed-off-by: Malte Starostik <info@stellaware.de>
(cherry picked from commit b23955e835)
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
2021-08-13 11:49:26 -07:00
Brian Downs
18bc38d838
account for an s3 folder when listing objects (#3807) (#3812)
* account for an s3 folder when listing objects
2021-08-09 17:35:14 -07:00
Hussein Galal
12ec437605
fix Node stuck at deletion (#3775)
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-08-05 23:23:34 +02:00
Hussein Galal
cc694b1f09
Notify systemd for etcd only node (#3733)
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-07-30 00:41:52 +02:00
Derek Nola
e6247d583c
[Backport 1.21] Cannot write data to local PVC (#3721)
* Fix to allow non-root users access to storage volumes. (#3714)

* Fix to prevent non-root users from accessing storage directory, while allowing non-root users access to subdirectories.

Signed-off-by: dereknola <derek.nola@suse.com>
2021-07-28 13:47:22 -07:00
galal-hussein
786f91b997 Fix multiple bootstrap keys found
Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

migrate empty string key properly

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

simplifying the code

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

migrate old token key format

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

fix warning msg

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

replace error with warn in delete

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

more fixes

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

more fixes

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-07-21 22:54:25 +02:00
Derek Nola
c15259d925
Fix to allow prune to correctly cleanup custom named snapshots (#3649) (#3672)
Signed-off-by: dereknola <derek.nola@suse.com>
2021-07-20 12:17:10 -07:00
Hussein Galal
9859ec7a81
[release-1.21] - Backport Fix storing bootstrap data with empty token string (#3514)
* Fix storing bootstrap data with empty token string (#3422)

* Fix storing bootstrap data with empty token string

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* delete node password secret after restoration

fixes to bootstrap key

vendor update

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fix comment

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fix typo

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* more fixes

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fixes

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fixes

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* typos

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* Removing dynamic listener file after restoration

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* go mod tidy

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* fix a runtime core panic

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* update kine

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>

* Fix calling delete in kine

Signed-off-by: galal-hussein <hussein.galal.ahmed.11@gmail.com>
2021-07-13 22:28:38 +02:00
Brad Davidson
5a88b5b3ea Emit events for AddOn lifecycle
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit a7d1159ba6)
2021-07-12 12:12:11 -07:00
Brad Davidson
ab0520f44e Add comments, clean up imports and function names
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit ea2cd6d727)
2021-07-12 12:12:11 -07:00
Brad Davidson
411d7e6753 Tidy up function calls with many args
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 6e48ca9b53)
2021-07-12 12:12:11 -07:00
Brad Davidson
ff0451c4dd Add nodename to UA string for deploy controller
Signed-off-by: Brad Davidson <brad.davidson@rancher.com>
(cherry picked from commit 6ef000091a)
2021-07-09 14:27:29 -07:00
Brian Downs
c3d134a405
prevent snapshot save when snapshots are disabled (#3475) (#3610)
* prevent snapshot save when snapshots are disabled
2021-07-09 12:09:35 -07:00
Chris Kim
42ab13a869
Update etcd snapshot error message to be more informative when etcd database is not found (#3592)
Signed-off-by: Chris Kim <oats87g@gmail.com>
2021-07-08 15:01:05 -07:00
Derek Nola
0c5577a8ec
[Backport 1.21] Update embedded kube-router (#3557) (#3595)
* Update embedded kube-router

Signed-off-by: dereknola <derek.nola@suse.com>
2021-07-07 15:04:23 -07:00
Derek Nola
0c2d8376d0
Changes local storage pods to have 700 permissions (#3537) (#3548)
* Changes local storage pods to have 700 permissions

Signed-off-by: dereknola <derek.nola@suse.com>
2021-06-30 09:22:44 -07:00