Skip to content

Update versions #2304

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 42 commits into from
Jul 1, 2021
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
42 commits
Select commit Hold shift + click to select a range
c9099f0
Update eksctl to v0.54.0
deliahu Jun 25, 2021
9c7297e
Update kubernetes to v1.20
deliahu Jun 25, 2021
e5d147f
Update Istio to 1.10.2
deliahu Jun 28, 2021
221c8a4
Update go version and modules
deliahu Jun 28, 2021
e6c69d8
Update aws-iam-authenticator
deliahu Jun 28, 2021
5a61818
Update nvidia device driver
deliahu Jun 28, 2021
f45a186
Update metrics server
deliahu Jun 29, 2021
c71d5ae
Update neuron device plugin, add neuron scheduler, rename nvidia devi…
deliahu Jun 29, 2021
5a7ce00
Revert to go v1.15.13
deliahu Jun 29, 2021
21e4cf1
Rename num_hugepages
deliahu Jun 29, 2021
b20eb88
Update cluster autoscaler to 1.20.0
deliahu Jun 29, 2021
d2815b7
Update fluent-bit to 2.16.1
deliahu Jun 29, 2021
6696d96
Update kubernetes-event-exporter
deliahu Jun 29, 2021
3339e0b
Update alpine version
deliahu Jun 29, 2021
69809b8
Update kubectl
deliahu Jun 29, 2021
611e469
Update versions.md
deliahu Jun 29, 2021
f5289fa
Istio fixes
deliahu Jun 29, 2021
22cedd8
Update fluentbit resources
deliahu Jun 30, 2021
5106809
Decrease istiod cpu request
deliahu Jun 30, 2021
097c090
Add cluster-configure make command
deliahu Jun 30, 2021
fda1f2e
eksctl nits
deliahu Jun 30, 2021
54e38fe
Update eksctl to 0.55.0-rc.0
deliahu Jun 30, 2021
985bcb3
Upgrade go
deliahu Jul 1, 2021
7729372
Update prometheus
RobertLucian Jul 1, 2021
96f7dac
Update statsd exporter
RobertLucian Jul 1, 2021
7ffd6fa
Update kube-state-metrics and dcgm exporter
RobertLucian Jul 1, 2021
92c0f9f
Nit for kube-state-metrics
RobertLucian Jul 1, 2021
55c7907
Update kubelet exporter
RobertLucian Jul 1, 2021
a35391c
Update node exporter
RobertLucian Jul 1, 2021
084c2df
Update grafana
RobertLucian Jul 1, 2021
91c511e
Fix dcgm/node exporters
RobertLucian Jul 1, 2021
c0e0517
Fix istio metrics
RobertLucian Jul 1, 2021
5a19320
Update Python client dependencies
deliahu Jul 1, 2021
742a64a
Update Makefile
deliahu Jul 1, 2021
2617d28
Revert eksctl
deliahu Jul 1, 2021
5fd4685
Update create.md
deliahu Jul 1, 2021
b7e7775
Update eksctl to 0.51.0
deliahu Jul 1, 2021
711b2dd
Merge branch 'master' into update-versions
RobertLucian Jul 1, 2021
64d5801
Updating the dashboards
RobertLucian Jul 1, 2021
d6cb62e
Update eksctl verbose flag
deliahu Jul 1, 2021
c88ecc7
Update registry.sh
deliahu Jul 1, 2021
54267cd
Remove inferentia scheduler node selector
deliahu Jul 1, 2021
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 2 additions & 2 deletions .circleci/config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -10,8 +10,8 @@ commands:
name: Install Go
command: |
sudo rm -rf /usr/local/go
wget https://dl.google.com/go/go1.15.12.linux-amd64.tar.gz
sudo tar -C /usr/local -xzf go1.15.12.linux-amd64.tar.gz
wget https://dl.google.com/go/go1.16.5.linux-amd64.tar.gz
sudo tar -C /usr/local -xzf go1.16.5.linux-amd64.tar.gz
rm -rf go*.tar.gz
echo 'export PATH=$PATH:/usr/local/go/bin' >> $BASH_ENV
echo 'export PATH=$PATH:~/go/bin' >> $BASH_ENV
Expand Down
10 changes: 3 additions & 7 deletions CONTRIBUTING.md
Original file line number Diff line number Diff line change
Expand Up @@ -22,10 +22,10 @@ To install Go on linux, run:

```bash
mkdir -p ~/bin && \
wget https://dl.google.com/go/go1.14.7.linux-amd64.tar.gz && \
sudo tar -xvf go1.14.7.linux-amd64.tar.gz && \
wget https://dl.google.com/go/go1.16.5.linux-amd64.tar.gz && \
sudo tar -xvf go1.16.5.linux-amd64.tar.gz && \
sudo mv go /usr/local && \
rm go1.14.7.linux-amd64.tar.gz && \
rm go1.16.5.linux-amd64.tar.gz && \
echo 'export PATH="/usr/local/go/bin:$HOME/go/bin:$PATH"' >> $HOME/.bashrc
```

Expand Down Expand Up @@ -135,10 +135,6 @@ export AWS_REGION="***" # you can use any AWS region you'd like, e.g. "us-west-
export AWS_ACCESS_KEY_ID="***" # alternatively, you can remove this to use the default credentials chain on your machine
export AWS_SECRET_ACCESS_KEY="***" # alternatively, you can remove this to use the default credentials chain on your machine
export DEFAULT_USER_ARN="arn:aws:iam::<ACCOUNT_ID>:<AWS IAM ENTITY>" # (e.g. arn:aws-us-gov:iam::123456789:user/foo)

# If you are working on govcloud, you will need to set credentials from a public (non govcloud) account to access inferentia ECR
# export NORMAL_REGION_AWS_ACCESS_KEY_ID="***"
# export NORMAL_REGION_AWS_SECRET_ACCESS_KEY="***"
```

Create the ECR registries:
Expand Down
17 changes: 15 additions & 2 deletions Makefile
Original file line number Diff line number Diff line change
Expand Up @@ -51,14 +51,13 @@ operator-local-dbg:

# configure kubectl to point to the cluster specified in dev/config/cluster.yaml
kubectl:
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && eksctl utils write-kubeconfig --cluster="$$CORTEX_CLUSTER_NAME" --region="$$CORTEX_REGION" | (grep -v "saved kubeconfig as" | grep -v "using region" | grep -v "eksctl version" || true); eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && eksctl utils write-kubeconfig --cluster="$$CORTEX_CLUSTER_NAME" --region="$$CORTEX_REGION" --verbose=0 | (grep -v "saved kubeconfig as" || true); eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN

cluster-up:
@$(MAKE) images-all
@$(MAKE) cli
@kill $(shell pgrep -f rerun) >/dev/null 2>&1 || true
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && sleep 10 && ./bin/cortex cluster up ./dev/config/cluster.yaml --configure-env="$$CORTEX_CLUSTER_NAME"; eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN

@$(MAKE) kubectl

cluster-up-y:
Expand All @@ -68,6 +67,20 @@ cluster-up-y:
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && sleep 10 && ./bin/cortex cluster up ./dev/config/cluster.yaml --configure-env="$$CORTEX_CLUSTER_NAME" --yes; eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN
@$(MAKE) kubectl

cluster-configure:
@$(MAKE) images-manager-skip-push
@$(MAKE) cli
@kill $(shell pgrep -f rerun) >/dev/null 2>&1 || true
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && sleep 10 && ./bin/cortex cluster configure ./dev/config/cluster.yaml; eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN
@$(MAKE) kubectl

cluster-configure-y:
@$(MAKE) images-manager-skip-push
@$(MAKE) cli
@kill $(shell pgrep -f rerun) >/dev/null 2>&1 || true
@eval $$(python3 ./manager/cluster_config_env.py ./dev/config/cluster.yaml) && eval $$(python3 ./dev/create_user.py $$CORTEX_CLUSTER_NAME $$AWS_ACCOUNT_ID $$CORTEX_REGION) && sleep 10 && ./bin/cortex cluster configure ./dev/config/cluster.yaml --yes; eksctl create iamidentitymapping --region $$CORTEX_REGION --cluster $$CORTEX_CLUSTER_NAME --arn $$DEFAULT_USER_ARN --group system:masters --username $$DEFAULT_USER_ARN
@$(MAKE) kubectl

cluster-down:
@$(MAKE) images-manager-skip-push
@$(MAKE) cli
Expand Down
4 changes: 0 additions & 4 deletions build/build-image.sh
Original file line number Diff line number Diff line change
Expand Up @@ -27,10 +27,6 @@ image=$3
is_multi_arch=$4
arch=$5

if [ "$image" == "inferentia" ]; then
aws ecr get-login-password --region us-west-2 | docker login --username AWS --password-stdin 790709498068.dkr.ecr.us-west-2.amazonaws.com
fi

if [ "$is_multi_arch" = "true" ]; then
tag="manifest-${CORTEX_VERSION}-$arch"
else
Expand Down
2 changes: 1 addition & 1 deletion build/generate_ami_mapping.go
Original file line number Diff line number Diff line change
Expand Up @@ -222,7 +222,7 @@ func main() {
json.Unmarshal(jsonBytes, &k8sVersionMap)
}

k8sVersion := "1.18"
k8sVersion := "1.20"

if k8sVersionMap[k8sVersion] == nil {
k8sVersionMap[k8sVersion] = map[string]map[string]string{}
Expand Down
5 changes: 3 additions & 2 deletions build/images.sh
Original file line number Diff line number Diff line change
Expand Up @@ -45,8 +45,9 @@ non_dev_images=(
"grafana"
"event-exporter"
"metrics-server"
"inferentia"
"nvidia"
"nvidia-device-plugin"
"neuron-device-plugin"
"neuron-scheduler"
"kubexit"
)

Expand Down
19 changes: 3 additions & 16 deletions dev/registry.sh
Original file line number Diff line number Diff line change
Expand Up @@ -84,19 +84,6 @@ function registry_login() {
if [ "$is_registry_logged_in" = "false" ]; then
blue_echo "Logging in to ECR..."
aws ecr get-login-password --region $AWS_REGION | docker login --username AWS --password-stdin $registry_push_url

blue_echo "Logging in to 790709498068.dkr.ecr.us-west-2.amazonaws.com for inferentia..."
set +e
echo "$AWS_REGION" | grep "us-gov"
is_gov_cloud=$?
set -e
if [ "$is_gov_cloud" == "0" ]; then
# set NORMAL_REGION_AWS_ACCESS_KEY_ID and NORMAL_REGION_AWS_SECRET_ACCESS_KEY credentials from a regular AWS account (non govcloud) in your dev/config/env.sh
AWS_ACCESS_KEY_ID=$NORMAL_REGION_AWS_ACCESS_KEY_ID AWS_SECRET_ACCESS_KEY=$NORMAL_REGION_AWS_SECRET_ACCESS_KEY aws ecr get-login-password --region us-west-2 | docker login --username AWS --password-stdin 790709498068.dkr.ecr.us-west-2.amazonaws.com
else
aws ecr get-login-password --region us-west-2 | docker login --username AWS --password-stdin 790709498068.dkr.ecr.us-west-2.amazonaws.com
fi

is_registry_logged_in="true"
green_echo "Success\n"
fi
Expand Down Expand Up @@ -152,15 +139,15 @@ function build_and_push() {
docker buildx build $ROOT -f $dir/Dockerfile -t $AWS_ACCOUNT_ID.dkr.ecr.$AWS_REGION.amazonaws.com/cortexlabs/$image:$tag --platform $platforms $push_or_not_flag

if [ "$include_arm64_arch" = "true" ]; then
green_echo "$finished_operation $image:$tag (amd64 and arm64)..."
green_echo "$finished_operation $image:$tag (amd64 and arm64)"
else
green_echo "$finished_operation $image:$tag (amd64)..."
green_echo "$finished_operation $image:$tag (amd64)"
fi

if [[ " $images_that_can_run_locally " =~ " $image " ]] && [[ "$include_arm64_arch" == "false" ]]; then
blue_echo "Exporting $image:$tag to local docker..."
docker buildx build $ROOT -f $dir/Dockerfile -t cortexlabs/$image:$tag -t $AWS_ACCOUNT_ID.dkr.ecr.$AWS_REGION.amazonaws.com/cortexlabs/$image:$tag --platform $platforms --load
green_echo "Exported $image:$tag to local docker..."
green_echo "Exported $image:$tag to local docker"
fi
}

Expand Down
Loading