Canary Release with Docker, Kubernetes, GitLab and Traefik
About me
From Fortaleza - CE
10+ years coding
CTO/Co-founder @Agendor
Java, JavaScript, Android, PHP, Node.js, Ruby, React, Infra, Linux, AWS, MySQL, Business, Product.

Tulio Monte Azul
@tuliomonteazul
tulio@agendor.com.br


Agendor is the salesperson's main tool.
(A "GitHub for salespeople")
Web and mobile versions, with free and paid plans.
Before
After


- SSH + ./deploy.sh
- Deploys at 7 p.m. + editing the ELB
- Rollback? Huh?
- Fear
- Auto recovery
- Auto scaling
- Canary Release
- Rollback
Context
Solutions
Challenges
Context
- Legacy PHP project
- Migrating to an SPA with Rails + React
- Lean team (6 devs)
- Competitive market
- 25k active users
- 5k rpm
Solutions



credits: https://martinfowler.com/bliki/CanaryRelease.html
"Eu acho que eu vi um bug"


CI/CD

Easy deploy/rollback

Automation
Cost




stages:
  - test
  - release
  - deploy

image: azukiapp/docker-kubernetes-ci:17.04

services:
  - docker:dind

variables:
  DOCKER_DRIVER: overlay

before_script:
  - docker info
  - docker-compose --version
  # Install kubectl and generate configurations
  # More info in https://gitlab.com/agendorteam/agendor-infra#configure-kubectl-context-in-ci
  - kube-config-generator
  - kubectl cluster-info
  - helm version

# Test
###########
test:
  stage: test
  tags:
    - docker
  script:
    - docker-compose build test
    - docker-compose run test

# Release
###########
.release: &release
  stage: release
  script:
    # REFERENCES: https://docs.gitlab.com/ce/ci/variables/
    # - ./scripts/release.sh agendor-web master
    - ./scripts/release.sh $CI_PROJECT_NAME $IMAGE_TAG

release_current_staging:
  <<: *release
  tags:
    - docker
  variables:
    DEPLOY_TYPE: staging
    IMAGE_TAG: stg-$CI_BUILD_REF_SLUG-$CI_BUILD_REF
  except:
    - master

release_current_test:
  <<: *release
  tags:
    - docker
  variables:
    DEPLOY_TYPE: test
    IMAGE_TAG: tst-$CI_BUILD_REF_SLUG-$CI_BUILD_REF
  except:
    - master

release_current_production:
  <<: *release
  tags:
    - docker
  variables:
    DEPLOY_TYPE: production
    IMAGE_TAG: $CI_BUILD_REF_SLUG-$CI_BUILD_REF
  only:
    - master

release_latest_production:
  <<: *release
  tags:
    - docker
  variables:
    DEPLOY_TYPE: production
    IMAGE_TAG: latest
  only:
    - master

# Deploy
###########
.deploy_template: &deploy_definition
  stage: deploy
  tags:
    - docker
  script:
    # ./scripts/deploy.sh [PROJECT_NAME [REF_NAME [HELM_NAME]]]
    - ./scripts/deploy.sh $CI_PROJECT_NAME $IMAGE_TAG
    - ./scripts/deploy.sh $CI_PROJECT_NAME $IMAGE_TAG agendor-api-mobile

deploy_staging:
  <<: *deploy_definition
  environment:
    name: staging
  variables:
    DEPLOY_TYPE: staging
    IMAGE_TAG: stg-$CI_BUILD_REF_SLUG-$CI_BUILD_REF
  only:
    - develop

deploy_canary:
  <<: *deploy_definition
  environment:
    name: canary
  variables:
    DEPLOY_TYPE: canary
    IMAGE_TAG: $CI_BUILD_REF_SLUG-$CI_BUILD_REF
  only:
    - master

# manuals
deploy_staging_manual:
  <<: *deploy_definition
  environment:
    name: staging
  variables:
    DEPLOY_TYPE: staging
    IMAGE_TAG: stg-$CI_BUILD_REF_SLUG-$CI_BUILD_REF
  when: manual
  except:
    - develop
    - master

deploy_test_manual:
  <<: *deploy_definition
  environment:
    name: test
  variables:
    DEPLOY_TYPE: test
    IMAGE_TAG: tst-$CI_BUILD_REF_SLUG-$CI_BUILD_REF
  when: manual
  except:
    - master

deploy_production_manual:
  <<: *deploy_definition
  environment:
    name: production
  variables:
    DEPLOY_TYPE: production
  when: manual
  only:
    - master
.gitlab-ci.yml
Used in production

Google, Red Hat, CoreOS

Monitoring, rolling updates, auto recovery
Community



Documentation

Helper tools

- Kubectl
- Kubernetes Dashboard
- Kops
- Helm
- Cluster-Autoscaler
Request
Hey there! How's the CPU
on those Pods doing?
Pod 1 is at 43%
and Pod 2 at 62%
So the average is 52.5%
Well then, you need to change your replica count from 2 to 3
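That exchange is exactly the Horizontal Pod Autoscaler's loop: desired replicas = ceil(current × average CPU ÷ target), i.e. ceil(2 × 52.5% ÷ 50%) = 3. A minimal sketch of wiring this up with kubectl (assuming the agendor-web Deployment and the 50% target that appears later in values.yaml):

$ kubectl --namespace=production autoscale deployment agendor-web \
    --min=2 --max=10 --cpu-percent=50
$ kubectl --namespace=production get hpa agendor-web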









Used in production

Multiple environments

Per-environment deploys
Hipster


Infrastructure as Code

dev environment = prod




Host mapping

Fast


Canary Release
frontends:
  # agendor-web
  web:
    rules:
      - "Host:web.agendor.com.br"
  web_canary:
    rules:
      - "Host:web.agendor.com.br"
      - "HeadersRegexp:Cookie,_AUSER_TYPE=canary"

backends:
  ## agendor-web
  web:
    - name: k8s
      url: "http://agendor-web.production.svc.cluster.local"
  web_canary:
    - name: k8s
      url: "http://agendor-web-canary.production.svc.cluster.local"
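A quick way to exercise the canary rule from a terminal — a sketch, assuming the public host serves HTTPS; the cookie comes from the HeadersRegexp rule above:

$ curl -H "Cookie: _AUSER_TYPE=canary" https://web.agendor.com.br/   # routed to web_canary
$ curl https://web.agendor.com.br/                                   # routed to web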
Challenges
#1 Pods Scaling
NewRelic FTW

Kubectl FTW
$ kubectl --namespace=production scale --replicas=8 deployment/agendor-web-canary
$ kubectl --namespace=production get pods -l=app=agendor-web-canary
NAME                                  READY     STATUS    RESTARTS   AGE
agendor-web-canary-1388324162-2qqgr   1/1       Running   0          5h
agendor-web-canary-1388324162-38xlb   1/1       Running   0          1m
agendor-web-canary-1388324162-3g815   1/1       Running   0          5h
agendor-web-canary-1388324162-89j6k   1/1       Running   0          5h
$ kubectl --namespace=production get pods -l=app=agendor-web-canary
NAME                                  READY     STATUS              RESTARTS   AGE
agendor-web-canary-1388324162-2qqgr   1/1       Running             0          5h
agendor-web-canary-1388324162-38xlb   1/1       Running             0          1m
agendor-web-canary-1388324162-3g815   1/1       Running             0          5h
agendor-web-canary-1388324162-89j6k   1/1       Running             0          5h
agendor-web-canary-1388324162-gjndp   0/1       ContainerCreating   0          4s
agendor-web-canary-1388324162-n57wc   0/1       ContainerCreating   0          4s
agendor-web-canary-1388324162-q0pcv   0/1       ContainerCreating   0          4s
agendor-web-canary-1388324162-t8v4s   0/1       ContainerCreating   0          4s
Limits tuning
# Default values for agendor-web.
# This is a YAML-formatted file.
# Declare variables to be passed into your templates.
name: agendor-web
namespace: default
app_env: staging

# autoscale configurations
minReplicas: 2
maxReplicas: 10
targetCPUUtilizationPercentage: 50
...
resources:
  limits:
    cpu: 100m
    memory: 256Mi
  requests:
    cpu: 100m
    memory: 256Mi
# Default values for agendor-web.
# This is a YAML-formatted file.
# Declare variables to be passed into your templates.
name: agendor-web
namespace: default
app_env: staging

# autoscale configurations
minReplicas: 2
maxReplicas: 10
targetCPUUtilizationPercentage: 30 # decreased
...
resources:
  limits:
    cpu: 200m # increased
    memory: 512Mi # increased
  requests:
    cpu: 100m
    memory: 256Mi
Add Monitor

# back-end
web_canary:
  - name: k8s
    url: "http://agendor-web-canary.production.svc.cluster.local"

# front-end
web_canary:
  rules:
    - "Host:web.agendor.com.br"
    - "HeadersRegexp:Cookie,_AUSER_TYPE=canary"

# back-end
web_canary:
  - name: k8s
    url: "http://agendor-web-canary.production.svc.cluster.local"
web_canary_host:
  - name: k8s
    url: "http://agendor-web-canary.production.svc.cluster.local"

# front-end
web_canary:
  rules:
    - "Host:web.agendor.com.br"
    - "HeadersRegexp:Cookie,_AUSER_TYPE=canary"
web_canary_host:
  rules:
    - "Host:web-canary.agendor.com.br"
#2 Pods Scaling + Deploy
Rolling Update Fail
Deployment vs HPA
kind: Deployment
apiVersion: extensions/v1beta1
metadata:
  name: {{ .Values.name }}
...
spec:
  replicas: {{ .Values.minReplicas }}
  revisionHistoryLimit: 10
  selector:
    matchLabels:
      app: {{ .Values.name }}
apiVersion: autoscaling/v1
kind: HorizontalPodAutoscaler
metadata:
  name: {{ .Values.name }}
...
spec:
  scaleTargetRef:
    apiVersion: extensions/v1beta1
    kind: Deployment
    name: {{ .Values.name }}
  minReplicas: {{ .Values.minReplicas }}
  maxReplicas: {{ .Values.maxReplicas }}
  targetCPUUtilizationPercentage: {{ .Values.targetCPUUtilizationPercentage }}
# values.yaml
minReplicas: 1
maxReplicas: 10
targetCPUUtilizationPercentage: 30

# deployment.yaml
strategy:
  rollingUpdate:
    maxSurge: 3
    maxUnavailable: 0
Rolling Update Win
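With maxUnavailable: 0, old pods are only removed once their replacements are Ready, so capacity never dips during a deploy. A sketch of watching a rollout and undoing it if the canary misbehaves (deployment names as in the earlier examples):

$ kubectl --namespace=production rollout status deployment/agendor-web-canary
$ kubectl --namespace=production rollout undo deployment/agendor-web-canary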
#3 Cluster Scaling
System slowness warning


Analyzing the slowness

Analyzing the slowness



SSH into Server
$ ssh -i ~/.ssh/kube_aws_rsa \
    admin@ec2-XX-XXX-XXX-XXX.sa-east-1.compute.amazonaws.com
Debugging Nodes
$ kubectl top nodes
NAME                                          CPU(cores)   CPU%   MEMORY(bytes)   MEMORY%
ip-172-20-44-176.sa-east-1.compute.internal   35m          15%    2132Mi          55%
ip-172-20-35-247.sa-east-1.compute.internal   82m          40%    2426Mi          62%
ip-172-20-41-137.sa-east-1.compute.internal   125m         68%    2406Mi          62%
ip-172-20-60-122.sa-east-1.compute.internal   130m         40%    2459Mi          63%
ip-172-20-116-96.sa-east-1.compute.internal   429m         21%    2271Mi          58%
ip-172-20-117-4.sa-east-1.compute.internal    140m         46%    2262Mi          58%
Debugging Nodes
$ kubectl describe nodes




Debugging auto scaler
I0622 18:08:59 scale_up.go:62] Upcoming 1 nodes
I0622 18:08:59 scale_up.go:81] Skipping node group nodes.k8s.agendor.com.br - max size reached
I0622 18:08:59 scale_up.go:132] No expansion options
$ kubectl logs -f --tail=100 --namespace=kube-system cluster-autoscaler-2462627368-c1hhl
# deployment.yaml
containers:
  - name: cluster-autoscaler
    image: "{{ .Values.image.repository }}:{{ .Values.image.tag }}"
    imagePullPolicy: "{{ .Values.image.pullPolicy }}"
    command:
      - ./cluster-autoscaler
      - --cloud-provider=aws
      {{- range .Values.autoscalingGroups }}
      - --nodes={{ .minSize }}:{{ .maxSize }}:{{ .name }}
      {{- end }}

# values.yaml
autoscalingGroups:
  - name: nodes.k8s.agendor.com.br
    minSize: 1
    maxSize: 15

Change instance groups
$ kops get ig
Using cluster from kubectl context: k8s.agendor.com.br
NAME                ROLE     MACHINETYPE   MIN   MAX   SUBNETS
master-sa-east-1a   Master   t2.medium     1     1     sa-east-1a
nodes               Node     t2.medium     2     6     sa-east-1a,sa-east-1c
apiVersion: kops/v1alpha2
kind: InstanceGroup
metadata:
  creationTimestamp: "2017-02-04T07:30:23Z"
  labels:
    kops.k8s.io/cluster: k8s.agendor.com.br
  name: nodes
spec:
  image: kope.io/k8s-1.6-debian-jessie-amd64-hvm-ebs-2017-05-02
  machineType: t2.medium
  maxSize: 20 # increased from 6
  minSize: 2
  role: Node
  rootVolumeSize: 50
  subnets:
    - sa-east-1a
    - sa-east-1c
$ kops edit ig nodes
apiVersion: kops/v1alpha2
kind: InstanceGroup
metadata:
  creationTimestamp: "2017-02-04T07:30:23Z"
  labels:
    kops.k8s.io/cluster: k8s.agendor.com.br
  name: nodes
spec:
  image: kope.io/k8s-1.6-debian-jessie-amd64-hvm-ebs-2017-05-02
  machineType: t2.medium
  maxSize: 6
  minSize: 2
  role: Node
  rootVolumeSize: 50
  subnets:
    - sa-east-1a
    - sa-east-1c
Change instance groups
$ kops update cluster
Using cluster from kubectl context: k8s.agendor.com.br
*********************************************************************************
A new kubernetes version is available: 1.6.6
Upgrading is recommended (try kops upgrade cluster)
More information: https://github.com/kubernetes/kops/blob/master/permalinks/upgrade_k8s.md#1.6.6
*********************************************************************************
I0623 10:13:07.893227 35173 executor.go:91] Tasks: 0 done / 60 total; 27 can run
I0623 10:13:10.226305 35173 executor.go:91] Tasks: 27 done / 60 total; 14 can run
I0623 10:13:10.719476 35173 executor.go:91] Tasks: 41 done / 60 total; 17 can run
I0623 10:13:11.824240 35173 executor.go:91] Tasks: 58 done / 60 total; 2 can run
I0623 10:13:11.933869 35173 executor.go:91] Tasks: 60 done / 60 total; 0 can run
Will modify resources:
  AutoscalingGroup/nodes.k8s.agendor.com.br
    MaxSize 6 -> 20
Must specify --yes to apply changes
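Then apply for real, as the output asks:

$ kops update cluster --yes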
#4 Cluster Scaling
pt. 2
Black Hole


$ kubectl logs -f --tail=200 --namespace=kube-system \
    $(kubectl get pods --namespace=kube-system \
      | grep cluster-autoscaler \
      | awk '{print $1}' \
      | head -1)
$ kubectl describe nodes





$ aws ec2 describe-route-tables \
--filters "Name=route-table-id, Values=rtb-7923f31d" \
| grep blackhole \
| awk '{print $2}' \
| xargs -n 1 \
aws ec2 delete-route --route-table-id rtb-7923f31d --destination-cidr-block
12:51:26 scale_up.go:62] Upcoming 0 nodes
12:51:26 scale_up.go:145] Best option to resize: nodes.k8s.agendor.com.br
12:51:26 scale_up.go:149] Estimated 1 nodes needed in nodes.k8s.agendor.com.br
12:51:26 scale_up.go:169] Scale-up: setting group nodes.k8s.agendor.com.br size to 7
12:51:26 aws_manager.go:124] Setting asg nodes.k8s.agendor.com.br size to 7
12:51:26 event.go:217] Event(v1.ObjectReference{Kind:"ConfigMap", Namespace:"kube-system", Name:"cluster-autoscaler-status", UID:"3d1fa43e-59e8-11e7-b0ba-02fccc543533", APIVersion:"v1", ResourceVersion:"22154083", FieldPath:""}): type: 'Normal' reason: 'ScaledUpGroup' Scale-up: group nodes.k8s.agendor.com.br size set to 7
12:51:26 event.go:217] Event(v1.ObjectReference{Kind:"Pod", Namespace:"production", Name:"agendor-web-canary-3076992955-95b3p", UID:"2542369b-5a6e-11e7-b0ba-02fccc543533", APIVersion:"v1", ResourceVersion:"22154097", FieldPath:""}): type: 'Normal' reason: 'TriggeredScaleUp' pod triggered scale-up, group: nodes.k8s.agendor.com.br, sizes (current/new): 6/7
...
12:51:37 scale_up.go:62] Upcoming 1 nodes
12:51:37 scale_up.go:124] No need for any nodes in nodes.k8s.agendor.com.br
12:51:37 scale_up.go:132] No expansion options
12:51:37 static_autoscaler.go:247] Scale down status: unneededOnly=true lastScaleUpTime=2017-06-26 12:51:26.777491768 +0000 UTC lastScaleDownFailedTrail=2017-06-25 20:52:45.172500628 +0000 UTC schedulablePodsPresent=false
12:51:37 static_autoscaler.go:250] Calculating unneeded nodes
Delete routes in "Black Hole" status
$ kubectl logs -f --tail=100 --namespace=kube-system \
    $(kubectl get pods --namespace=kube-system \
      | grep cluster-autoscaler | awk '{print $1}' | head -1)
#5 Canary Release
Canary Bug Tracking
Traefik routing requests at random (with weights 5:5:1, the canary gets 1 request in 11, ≈9%)
A customer reports a bug: is the problem in the application or in the new infra?

Lesson about random routing in a canary:
Good for testing scalability and performance
Bad if kept on for too long
Solution:
backends:
  web:
    - name: legacy_1
      weight: 5
      url: "http://ec2-54-94-XXX-XXX.sa-east-1.compute.amazonaws.com"
    - name: legacy_2
      weight: 5
      url: "http://ec2-54-233-XX-XX.sa-east-1.compute.amazonaws.com"
    - name: k8s
      weight: 1
      url: "http://agendor-api-v2-canary.production.svc.cluster.local"
Canary Bug Tracking
# aws.yml (traefik)
backends:
  web:
    - name: legacy_1
      weight: 5
      url: "http://ec2-54-94-XXX-XXX.sa-east-1.compute.amazonaws.com"
    - name: legacy_2
      weight: 5
      url: "http://ec2-54-233-XX-XX.sa-east-1.compute.amazonaws.com"
    - name: k8s
      weight: 1
      url: "http://agendor-api-v2-canary.production.svc.cluster.local"
  web_canary:
    - name: k8s
      url: "http://agendor-web-canary.production.svc.cluster.local"
# aws.yml (traefik)
backends:
  web:
    - name: legacy_1
      url: "http://ec2-54-94-XXX-XXX.sa-east-1.compute.amazonaws.com"
    - name: legacy_2
      url: "http://ec2-54-233-XX-XXX.sa-east-1.compute.amazonaws.com"
  web_canary:
    - name: k8s
      url: "http://agendor-web-canary.production.svc.cluster.local"
# PHP
public function addBetaTesterCookie($accountId = NULL) {
    if ($this->isCanaryBetaTester($accountId)) { // check on Redis
        // cookie matched by Traefik's "HeadersRegexp:Cookie,_AUSER_TYPE=canary" rule
        $this->setCookie(BetaController::$BETA_COOKIE_NAME,
                         BetaController::$BETA_COOKIE_VALUE);
    } else {
        $this->setCookie(BetaController::$BETA_COOKIE_NAME,
                         BetaController::$LEGACY_COOKIE_VALUE);
    }
}
Multiple versions
Multiple versions of the code running at the same time
Everyone must develop with the deploy/release in mind from the start (see the sketch after the tables below)
id | name | age
---|------|----
 1 | Jon  | 29
 2 | Arya | 16

Deploy 1 (add)

id | name | age | birthdate
---|------|-----|----------
 1 | Jon  | 29  | 19/07/88
 2 | Arya | 16  | 07/05/01

Deploy 2 (remove)

id | name | birthdate
---|------|----------
 1 | Jon  | 19/07/88
 2 | Arya | 07/05/01
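A minimal sketch of the two deploys as migrations (hypothetical people table and agendor database name, assuming MySQL):

# Deploy 1 (add): the new column is nullable, so the running old code keeps working
$ mysql agendor -e "ALTER TABLE people ADD COLUMN birthdate DATE NULL;"
# ...backfill birthdate, release code that reads birthdate instead of age...
# Deploy 2 (remove): only after no live version still reads the old column
$ mysql agendor -e "ALTER TABLE people DROP COLUMN age;"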
#6 Helm + recreate-pods
Helm
--recreate-pods to fix the problem of pods not being re-created
Impact: Rolling Update fail
env:
  - name: APP_NAME
    value: "{{ .Values.name }}"
  - name: APP_NAMESPACE
    value: "{{ .Values.namespace }}"
  - name: APP_ENV
    value: "{{ .Values.app_env }}"
  - name: APP_FULLNAME
    value: "{{ .Values.name }}-{{ .Values.app_env }}"
  # Force recreate pods
  - name: UPDATE_CONFIG
    value: {{ $.Release.Time }}
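With that UPDATE_CONFIG entry, every helm upgrade changes the pod template, so Kubernetes performs a normal rolling update and --recreate-pods (which deletes pods outright) is no longer needed. A sketch, with a hypothetical release name and chart path:

# before: pods are killed and re-created, bypassing the rolling update
$ helm upgrade --recreate-pods agendor-web ./charts/agendor-web
# after: the template change alone triggers a rolling update
$ helm upgrade agendor-web ./charts/agendor-web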
Thank you!
linkedin.com/in/tuliomonteazul
@tuliomonteazul
tulio@agendor.com.br

Special thanks to
Everton Ribeiro
Gullit Miranda
Marcus Gadbem
