Deploying Serverless Docker applications on AWS
Andrew Ang
Harvard IT Summit 2019
May 14, 2019
Harvard VPAL Research Group
data:image/s3,"s3://crabby-images/00645/00645e90f662c26f805bd0b51e26ee637b28e92d" alt=""
data:image/s3,"s3://crabby-images/56a1d/56a1d8c8988bf9fb0da68e391466adf47f65efc4" alt=""
data:image/s3,"s3://crabby-images/b1d6e/b1d6eded5c855fa60749b39628852975ab7d0617" alt=""
data:image/s3,"s3://crabby-images/316f4/316f4483e9927dafb3eff1be54e1e0c8bf025f91" alt=""
data:image/s3,"s3://crabby-images/b88a8/b88a89e7fa52d9c9e8b0178fe599880a63c8b103" alt=""
Learning about learning, at scale, using MOOCs ...
data:image/s3,"s3://crabby-images/78bb0/78bb0420b4f528d7c2f21d31f59ff9e93a62935b" alt=""
data:image/s3,"s3://crabby-images/0e2a6/0e2a6f7aa40978e490aaae7bfc535feff5c4e9a0" alt=""
data:image/s3,"s3://crabby-images/8d379/8d3792a53f51b55c54679fb98f30321e553c3b1e" alt=""
… and bringing research and technology innovations back to campus ...
data:image/s3,"s3://crabby-images/050a9/050a9e9b402767d0421e90597d9a2edee15d44b3" alt=""
data:image/s3,"s3://crabby-images/70b3f/70b3f6364bc55f363b55ab8df1d8ba70d46ab730" alt=""
… to create engaging learning experiences
data:image/s3,"s3://crabby-images/1f9c6/1f9c6642b66b7bc152b1bc5d02a3176cede36c53" alt=""
data:image/s3,"s3://crabby-images/1956e/1956e561ff087a3d2677a8e5a2143fdf96f82659" alt=""
data:image/s3,"s3://crabby-images/a198d/a198d5f0ac0ee0b2e3a1bab346f9404f67ee5b53" alt=""
Our use cases for dockerized applications in education research + technology
data:image/s3,"s3://crabby-images/1956e/1956e561ff087a3d2677a8e5a2143fdf96f82659" alt=""
data:image/s3,"s3://crabby-images/cc1c4/cc1c464c2955e37f20d69cad835fae44a7434dc6" alt=""
data:image/s3,"s3://crabby-images/987d4/987d4bf006e0b6039b8424f826c5509711160129" alt=""
Research and analytics with education data
Data pipelines
data:image/s3,"s3://crabby-images/f0de5/f0de55bb5fbdf7b2a980e603c5109eb2b696621b" alt=""
Data pipelines
data:image/s3,"s3://crabby-images/7a7e6/7a7e6c369e6e4fa302bad1552ec2fd765661b010" alt=""
API scraper, web crawler, data ETL
Dashboards - Django admin panel for data pipeline diagnostics
data:image/s3,"s3://crabby-images/8c024/8c0248c49f38bd1dbfe078e3847b4cb908d73894" alt=""
data:image/s3,"s3://crabby-images/1f9c6/1f9c6642b66b7bc152b1bc5d02a3176cede36c53" alt=""
data:image/s3,"s3://crabby-images/b4c19/b4c190c90e6625c1df456f4d5d394e4ece5dbf21" alt=""
Adaptivity - service architecture
data:image/s3,"s3://crabby-images/7fac1/7fac1a9895e40184224fcad3ddaa4b6104f6dedc" alt=""
Typical dockerized web application architecture
data:image/s3,"s3://crabby-images/1f84d/1f84d20ebdf19e89aee12de052519cd939e6615a" alt=""
Ideal service scaling
data:image/s3,"s3://crabby-images/fdfd7/fdfd73b246f05e0500268364f903f1099a4e3344" alt=""
Deployment solution #1:
EC2 + docker-compose
data:image/s3,"s3://crabby-images/3a93a/3a93a97384bd2d1e1f0f8ec38221f341ed6a2b1a" alt=""
Deployment solution #2a:
Elastic Beanstalk
data:image/s3,"s3://crabby-images/1fa8c/1fa8c025447763a177307c45b915f19e546c32cc" alt=""
Previous deployment solutions #2b:
Elastic Beanstalk
(Multicontainer Docker)
data:image/s3,"s3://crabby-images/41747/4174760d5289fc42a41db277eed3f8f9f3608c2b" alt=""
Previous deployment (attempt) #3:
ECS - Elastic Container Service
data:image/s3,"s3://crabby-images/4120c/4120c712795c05dc3049eb727492f64b330b0bd1" alt=""
Fargate introduced in Nov 2017
data:image/s3,"s3://crabby-images/73542/7354244f2f4f3405d69067e617f2bc468c463fdc" alt=""
“Fargate is like EC2 but instead of giving you a virtual machine you get a container.”
ECS launch types:
EC2 vs. Fargate
data:image/s3,"s3://crabby-images/cc872/cc872ea4c15b212185a2099a2304f6b917be3e64" alt=""
ECS launch types:
EC2 vs. Fargate
data:image/s3,"s3://crabby-images/5cf68/5cf68132b59e01e8b359389d970c8305a1b150b3" alt=""
Workflow
data:image/s3,"s3://crabby-images/60d9e/60d9e131ba801114488639fd8edf0817b3bc5808" alt=""
Workflow:
Build containers and upload to registry
data:image/s3,"s3://crabby-images/0ad91/0ad911fb8041cbbd2e6742cb65705365a14f1b12" alt=""
Workflow:
Pull containers from registry and run
data:image/s3,"s3://crabby-images/ea5ec/ea5ec821350d05cfc698e921588ca134eb1f2fe2" alt=""
Workflow:
Task definitions
data:image/s3,"s3://crabby-images/381b9/381b98931878eac1fc2b9b83d8ac96acf91956d7" alt=""
Task: scalable unit of ECS
data:image/s3,"s3://crabby-images/50f29/50f29c455610e7447615325e3ea39e405dbb2691" alt=""
{
"family": "web",
"taskRoleArn": "arn:...",
"networkMode": "awsvpc",
"containerDefinitions": [
{
"name": "web",
"image": "${web_image}",
"essential": true,
"memory": 256,
"portMappings": [
{
"containerPort": 8000
}
],
"command": [
"/usr/local/bin/gunicorn",
"config.wsgi:application",
"-w=2",
"-b=:8000",
"--log-file=-",
"--access-logfile=-"
],
"environment": [
{"name": "DJANGO_SETTINGS_MODULE", "value": "${DJANGO_SETTINGS_MODULE}"},
{"name": "ENV_LABEL", "value": "${env_label}"},
{"name": "HOST", "value": "${domain_name}"}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
},
{
"name": "nginx",
"image": "${nginx_image}",
"essential": false,
"memory": 256,
"portMappings": [
{
"containerPort": 80
}
],
...
}
]
}
Tasks can have one or more containers.
Task instance: instantiation of a task
data:image/s3,"s3://crabby-images/b7dc5/b7dc508bdd89e634b4e7000efd67f6ad5921629c" alt=""
Default settings from task definition such as command, memory allocation, etc. can be overridden when instantiating a task.
Each task instantiation from a task definition has the same set of containers.
Service: an "auto scaling group" for tasks
data:image/s3,"s3://crabby-images/c4591/c459125416e97376185383d5c656ce7fb9108601" alt=""
data:image/s3,"s3://crabby-images/352a0/352a0ad24f82524b8fcb21c9ee5d65862a22e60b" alt=""
Services are used for tasks that run indefinitely
(e.g. web service)
Cluster: a logical grouping of tasks/services
data:image/s3,"s3://crabby-images/82d93/82d93df184a33e100be5da101897554b18b76d1c" alt=""
ECS abstractions
ECS term | description / analog |
---|---|
container definition | docker-compose |
task definition | docker-compose + AWS config |
task instance | instantiation of a task definition |
service | auto-scaling group for task instances |
cluster | grouping of tasks/services |
Example application 1
web microservice
(adaptive learning recommendation engine)
Workflow
- build with docker-compose
- push image to ECR
- define ECS tasks and services
- setup additional infrastructure
data:image/s3,"s3://crabby-images/1f84d/1f84d20ebdf19e89aee12de052519cd939e6615a" alt=""
Build images
e.g. `docker-compose build`
# docker-compose.yml
version: '2'
services:
bridge:
container_name: BFA
build:
context: .
dockerfile: Dockerfile
image: bridge_adaptivity
command: bash -c "./prod_run.sh"
volumes:
- .:/bridge_adaptivity
- static:/www/static
ports:
- "8000:8000"
links:
- postgres
# Celery worker
worker:
image: bridge_adaptivity
environment:
DJANGO_SETTINGS_MODULE: config.settings.prod
command: bash -c "sleep 5 && celery -A config worker -l info"
volumes:
- .:/bridge_adaptivity
links:
- rabbit
- postgres
depends_on:
- bridge
rabbit:
container_name: rabbitmq
image: rabbitmq
env_file: ./envs/rabbit.env
nginx:
container_name: nginx_BFA
build: ./nginx
ports:
- "80:80"
- "443:443"
volumes_from:
- bridge
volumes:
- /etc/nginx/ssl/:/etc/nginx/ssl/
links:
- bridge
postgres:
container_name: postgresql_BFA
image: postgres
env_file: ./envs/pg.env
volumes:
- pgs:/var/lib/postgresql/data/
ports:
- "5432:5432"
data:image/s3,"s3://crabby-images/1f84d/1f84d20ebdf19e89aee12de052519cd939e6615a" alt=""
Push images
e.g. `docker-compose push`
data:image/s3,"s3://crabby-images/6f92b/6f92bf3f13a6d5302a9b94c8dd5799627025d17d" alt=""
Task definition: web
data:image/s3,"s3://crabby-images/50f29/50f29c455610e7447615325e3ea39e405dbb2691" alt=""
{
"family": "web",
"taskRoleArn": "arn:...",
"networkMode": "awsvpc",
"containerDefinitions": [
{
"name": "web",
"image": "123456789.dkr.ecr.us-east-1.amazonaws.com/namespace/app",
"essential": true,
"memory": 256,
"portMappings": [
{
"containerPort": 8000
}
],
"command": [
"/usr/local/bin/gunicorn",
"config.wsgi:application",
"-w=2",
"-b=:8000",
"--log-file=-",
"--access-logfile=-"
],
"environment": [
{"name": "DJANGO_SETTINGS_MODULE", "value": "${DJANGO_SETTINGS_MODULE}"},
{"name": "ENV_LABEL", "value": "${env_label}"},
{"name": "HOST", "value": "${domain_name}"}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
},
{
"name": "nginx",
"image": "${nginx_image}",
"essential": false,
"memory": 256,
"portMappings": [
{
"containerPort": 80
}
],
...
}
]
}
Task definition: queue
{
"family": "web",
"taskRoleArn": "arn:...",
"networkMode": "awsvpc",
"containerDefinitions": [
[
{
"name": "rabbit",
"image": "rabbitmq",
"essential": true,
"memory": 256,
"environment": [
{"name": "RABBITMQ_DEFAULT_PASS", "value": "${celery_password}"},
{"name": "RABBITMQ_DEFAULT_USER", "value": "${celery_user}"}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
}
}
]
]
}
data:image/s3,"s3://crabby-images/c080e/c080e9fa8542e7be0083417cfbe1a29ee74c393b" alt=""
Task definition: worker
{
"family": "web",
"taskRoleArn": "arn:...",
"networkMode": "awsvpc",
"containerDefinitions": [
{
"name": "worker",
"image": "123456789.dkr.ecr.us-east-1.amazonaws.com/namespace/app",
"essential": true,
"memory": 256,
"command": ["celery","-A","config","worker","-l","info"],
"environment": [
{"name": "DJANGO_SETTINGS_MODULE", "value": "${DJANGO_SETTINGS_MODULE}"}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
}
}
]
}
data:image/s3,"s3://crabby-images/356f6/356f6318ee612994528e58cea840e871b57c972b" alt=""
Define services
data:image/s3,"s3://crabby-images/157f3/157f3eddf14b5db3640f0daa7d302a79b1432673" alt=""
data:image/s3,"s3://crabby-images/c4591/c459125416e97376185383d5c656ce7fb9108601" alt=""
Additional infrastructure
data:image/s3,"s3://crabby-images/157f3/157f3eddf14b5db3640f0daa7d302a79b1432673" alt=""
-
database, associated security group
-
task IAM role
-
ECS service discovery
-
application load balancer
-
route 53 zone
data:image/s3,"s3://crabby-images/ee092/ee0921285fed0a282fac34863a4db5a2d7a4c3fe" alt=""
Example application #2:
Long-running (~ few days) web crawler that triggers on a scheduled basis, or in response to new records
data:image/s3,"s3://crabby-images/7a7e6/7a7e6c369e6e4fa302bad1552ec2fd765661b010" alt=""
Run scripts with finite execution time as tasks, instead of services
e.g. data processing jobs, api scraper, web crawler
Scheduling jobs with Airflow
data:image/s3,"s3://crabby-images/f3185/f31858cfa540ebdae72306aedd980408f57fd559" alt=""
Task dependencies expressed as a DAG (Directed Acyclic Graph)
Typical Airflow cluster setup
data:image/s3,"s3://crabby-images/1d592/1d592d3316c09b644beb529275a9908e689cff80" alt=""
Scheduling ECS tasks with Airflow
data:image/s3,"s3://crabby-images/e1b5f/e1b5fb634a3de2bdb6961367d61c5363327f3501" alt=""
DevOps considerations and streamlining deployment tasks
DevOps considerations
- Secrets management
- Access control
- Application configuration
- Versioning
- Build/Deploy Automation
- Cost
Secrets management:
Specify values from SSM Param Store in task definition to inject as environment variables (available Nov 2018)
{
"family": "web",
"taskRoleArn": "arn:...",
"networkMode": "awsvpc",
"containerDefinitions": [
{
"name": "web",
"image": "${web_image}",
...
"environment": [
{"name": "DJANGO_SETTINGS_MODULE", "value": "${DJANGO_SETTINGS_MODULE}"},
{"name": "HOST", "value": "${domain_name}"}
],
"secrets": [
{
"name": "SECRET_KEY",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/app/${env_label}/SECRET_KEY"
},
{
"name": "DATABASE_CONNECTION",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/app/${env_label}/DATABASE_CONNECTION"
}
]
},
...
]
}
Secrets management:
Populate values in SSM Param Store; access control can be controlled via IAM
data:image/s3,"s3://crabby-images/65b6f/65b6fd6a6e40d205cb8de37ff0b4d24e897d158a" alt=""
Access control:
Control access to AWS resources on startup with
Task Execution Role
data:image/s3,"s3://crabby-images/40c3f/40c3fc7e13b3f5d93f6af096ce6468d1ea4d4448" alt=""
data:image/s3,"s3://crabby-images/5c039/5c0390cae622898ac13659af871e8c94ae57b59d" alt=""
Useful for controlling access to secrets, ECR repos
Security groups:
Define security group for task to be associated with
data:image/s3,"s3://crabby-images/68658/6865863d347abee174eb40ba406eca2d4b11c33b" alt=""
Useful for giving access to
ip-restricted databases
Access control:
Control access to AWS resources at task runtime with Task Role
data:image/s3,"s3://crabby-images/40c3f/40c3fc7e13b3f5d93f6af096ce6468d1ea4d4448" alt=""
data:image/s3,"s3://crabby-images/91453/9145306a91bde8b5d12c613bd6d22b37df4ae1f9" alt=""
Useful for controlling access to data sources / destinations
Versioning:
Tag ECR images with version at build/upload
data:image/s3,"s3://crabby-images/4ca2b/4ca2ba35273724bbf4677528f45953e9fb2d2dc8" alt=""
Versioning:
Reference image tags in task definition
{
"family": "app",
"networkMode": "awsvpc",
"containerDefinitions": [
{
"name": "web",
"image": "123456789.dkr.ecr.us-east-1.amazonaws.com/namespace/app:3.1.2",
...
}
]
}
Build / deploy automation
- Build
- build versioned docker image
- add additional config if applicable
- Deploy
- Push version to image repo
- Create infrastructure
- load balancer, security groups and policies, ECS task definitions and services
- Apply infrastructure changes
- Deploy app (image) version
- Scale up/down
docker-compose file for deploy builds
Alternate docker-compose file for versioned/custom builds -
Uses APP_TAG env variable and builds from github source
version: '3'
services:
# base app image
app_base:
image: ${APP_IMAGE}:${APP_TAG}-base
build:
dockerfile: Dockerfile_opt
# context: app_base/src/bridge_adaptivity # if building from local version; ensure volume mount is configured in other docker-compose
# build from github, using reference APP_TAG and bridge_adaptivity subdirectory
context: https://github.com/harvard-vpal/bridge-adaptivity.git#${APP_TAG}:bridge_adaptivity
# copy custom settings into base app image (see Dockerfile)
app:
build:
context: app
args:
- APP_IMAGE=${APP_IMAGE}:${APP_TAG}-base
image: ${APP_IMAGE}:${APP_TAG}
environment:
- DJANGO_SETTINGS_MODULE=config.settings.custom
# custom nginx image build that collects static assets from app image and copies to nginx image
nginx:
build:
context: nginx
args:
- APP_IMAGE=${APP_IMAGE}:${APP_TAG}
image: ${NGINX_IMAGE}:${APP_TAG}
Extending an app image
Adding custom settings
# Dockerfile that derives from base app image and adds some custom settings
# Base app image:tag to use
ARG APP_IMAGE
FROM ${APP_IMAGE} as app
WORKDIR /bridge_adaptivity
# copy custom settings into desired location
COPY settings/custom.py config/settings/custom.py
COPY settings/collectstatic.py config/settings/collectstatic.py
# generate staticfiles.json even if app image is not serving static images directly
RUN python manage.py collectstatic -c --noinput --settings=config.settings.collectstatic
Custom image build
ECS-specific settings - a django example
# django custom settings (custom.py)
def get_ecs_task_ips():
"""
Retrieve the internal ip address(es) for task, if running with AWS ECS and awsvpc networking mode
"""
ip_addresses = []
try:
r = requests.get("http://169.254.170.2/v2/metadata", timeout=0.01)
except requests.exceptions.RequestException:
return []
if r.ok:
task_metadata = r.json()
for container in task_metadata['Containers']:
for network in container['Networks']:
if network['NetworkMode'] == 'awsvpc':
ip_addresses.extend(network['IPv4Addresses'])
return list(set(ip_addresses))
ecs_task_ips = get_ecs_task_ips()
if ecs_task_ips:
# ALLOWED_HOSTS comes from config.settings.base
ALLOWED_HOSTS.extend(ecs_task_ips)
Managing infrastructure state with Terraform
"Define infrastructure as code"
resource "aws_ecs_task_definition" "main" {
family = "${var.name}"
container_definitions = "${var.container_definitions}"
execution_role_arn = "${var.execution_role_arn}" # required for awslogs
task_role_arn = "${var.role_arn}"
network_mode = "awsvpc"
memory = "${var.memory}"
cpu = "${var.cpu}"
requires_compatibilities = ["FARGATE"]
}
terraform workspace select dev
terraform apply -var-file="dev.tfvars"
Manage multiple environments (dev/stage/prod) with respective config values
ECS resources in Terraform
resource "aws_ecs_task_definition" "main" {
family = "${var.name}"
container_definitions = "${var.container_definitions}"
execution_role_arn = "${var.execution_role_arn}" # required for awslogs
task_role_arn = "${var.role_arn}"
network_mode = "awsvpc"
memory = "${var.memory}"
cpu = "${var.cpu}"
requires_compatibilities = ["FARGATE"]
}
resource "aws_ecs_service" "main" {
name = "${var.name}"
cluster = "${var.cluster_name}"
task_definition = "${aws_ecs_task_definition.main.arn}"
desired_count = "${var.count}"
launch_type = "FARGATE"
load_balancer {
target_group_arn = "${var.target_group_arn}"
container_name = "${var.load_balancer_container_name}"
container_port = "${var.load_balancer_container_port}"
}
network_configuration {
subnets = ["${data.aws_subnet_ids.main.ids}"],
security_groups = ["${var.security_group_id}"]
assign_public_ip = true
}
}
Other AWS resources in Terraform
route53 record, load balancer, target groups, ...
resource "aws_alb" "main" {
name = "${var.project}-${var.env_label}"
subnets = ["${data.aws_subnet_ids.main.ids}"]
security_groups = ["${var.security_group_id}"]
}
resource "aws_alb_listener" "main" {
load_balancer_arn = "${aws_alb.main.id}"
port = "443"
protocol = "HTTPS"
ssl_policy = "ELBSecurityPolicy-2016-08"
certificate_arn = "${var.ssl_certificate_arn}"
default_action {
type = "fixed-response"
fixed_response {
content_type = "text/plain"
message_body = "Service Temporarily Unavailable (ALB Default Action)"
status_code = "503"
}
}
}
resource "aws_route53_record" "main" {
zone_id = "${data.aws_route53_zone.main.zone_id}"
name = "${var.domain_name}"
type = "A"
alias {
name = "${aws_alb.main.dns_name}"
zone_id = "${aws_alb.main.zone_id}"
evaluate_target_health = false
}
}
## Assumes only one service is being load balanced but may make sense to move these to service modules if not the case
resource "aws_alb_target_group" "main" {
name_prefix = "${var.short_project_label}" # using name_prefix instead of name used because of create_before_destroy option
port = "${var.container_port}"
protocol = "HTTP"
target_type = "ip" # required for use of awsvpc task networking mode
vpc_id = "${var.vpc_id}"
health_check {
path = "${var.health_check_path}"
}
# Resolves: (Error deleting Target Group: Target group is currently in use by a listener or a rule)
lifecycle {
create_before_destroy = true
}
# Resolves: The target group does not have an associated load balancer
depends_on = ["aws_alb.main"]
}
resource "aws_alb_listener_rule" "main" {
listener_arn = "${aws_alb_listener.main.arn}"
action {
target_group_arn = "${aws_alb_target_group.main.id}"
type = "forward"
}
condition {
field = "path-pattern"
values = ["*"]
}
}
Terraform modules in ecs-app-utils repo
# creates load balancer, security group, route 53 records, and target groups
module "network" {
source = "git::https://github.com/harvard-vpal/ecs-app-utils.git//terraform/network/public?ref=2.3.0"
vpc_id = "${var.vpc_id}"
ssl_certificate_arn = "${var.ssl_certificate_arn}"
hosted_zone = "${var.hosted_zone}"
domain_name = "${var.domain_name}"
env_label = "${var.env_label}"
project = "${var.project}"
short_project_label = "${var.short_project_label}"
}
Available terraform modules in ecs-app-utils
- execution role (IAM role)
-
network
- base
- public (base + open inbound security group)
-
service
- load balanced (e.g. web)
- discoverable (e.g. queue)
- generic (e.g. worker)
Container definitions are application-specific
Use of templating to pass in environment-specific variables or version tags
data "template_file" "container_definitions_web" {
template = "${file("./container_definitions_web.tpl")}"
vars {
web_image = "${var.app_image}:${var.app_tag}"
nginx_image = "${var.nginx_image}:${var.app_tag}"
project = "${var.project}"
env_label = "${var.env_label}"
log_group_name = "${aws_cloudwatch_log_group.main.name}"
DJANGO_SETTINGS_MODULE = "${var.DJANGO_SETTINGS_MODULE}"
domain_name = "${var.domain_name}"
}
}
module "web_service" {
source = "git::https://github.com/harvard-vpal/ecs-app-utils.git//terraform/services/load_balanced?ref=3.2.0"
vpc_id = "${var.vpc_id}"
cluster_name = "${var.cluster_name}"
role_arn = "${aws_iam_role.task.arn}"
execution_role_arn = "${module.execution_role.arn}"
security_group_id = "${aws_security_group.ecs_service.id}"
name = "${var.project}-${var.env_label}-web"
container_definitions = "${data.template_file.container_definitions_web.rendered}"
target_group_arn = "${module.network.target_group_arn}"
count = "${var.web_count}"
cpu = 512
memory = 1024
}
# container_definitions_web.tpl
[
{
"name": "web",
"image": "${web_image}",
"essential": true,
"portMappings": [
{
"containerPort": 8000
}
],
"command": [
"/usr/local/bin/gunicorn",
"itero.wsgi:application",
"-w=2",
"-b=:8000",
"--log-level=debug",
"--log-file=-",
"--access-logfile=-"
],
"environment": [
{"name": "DJANGO_SETTINGS_MODULE", "value": "${DJANGO_SETTINGS_MODULE}"},
{"name": "HOST", "value": "${domain_name}"}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
},
"secrets": [
{
"name": "SECRET_KEY",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/itero/${env_label}/SECRET_KEY"
},
{
"name": "DATABASE_CONNECTION",
"valueFrom": "arn:aws:ssm:us-east-1:361808764124:parameter/itero/${env_label}/DATABASE_CONNECTION"
},
{
"name": "CELERY_BROKER_URL",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/itero/${env_label}/CELERY_BROKER_URL"
}
{
"name": "GOOGLE_PICKER_CLIENT_ID",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/itero/${env_label}/GOOGLE_CLIENT_ID"
},
{
"name": "GOOGLE_PICKER_APP_ID",
"valueFrom": "arn:aws:ssm:us-east-1:123456789:parameter/itero/common/GOOGLE_APP_ID"
}
]
},
{
"name": "nginx",
"image": "${nginx_image}",
"essential": false,
"portMappings": [
{
"containerPort": 80
}
],
"logConfiguration": {
"logDriver": "awslogs",
"options": {
"awslogs-group": "${log_group_name}",
"awslogs-region": "us-east-1",
"awslogs-stream-prefix": "ecs"
}
}
}
]
ecs-utils
CLI for common ecs build/deploy tasks
Supports image versioning and multiple environments (dev/stage/prod)
# Checkout the app code with the specified version and tag image with that tag
deploy build --tag 1.0.0
# Push images with the specified tag to ECR repositories
deploy push --tag 1.0.0
# Run 'terraform apply' with specified image tag against 'dev' environment
deploy apply --tag 1.0.0 --env dev
# Build, push, and apply
deploy all --tag 1.0.0 --env dev
# Redeploy services (force restart of specific services, even if no config changes)
deploy redeploy --env dev web worker
Comparisions with other AWS services
Fargate vs EC2
Fargate | EC2 |
---|---|
Easy scalability, automatic, built-in failure recovery, no need to think about instance provisioning More expensive Compute/memory capacity flexible within a limited range (max 4vCPU / 30 GB memory) |
Build your own scaling, failure recovery, container orchestration (if using docker) If not using docker, dependency management/setup may be complex depending on library/system dependencies Cheaper More high-end options for compute/memory configurations |
Fargate pricing
data:image/s3,"s3://crabby-images/5d589/5d5896f827cde32d825605fef56b11286791a204" alt=""
Fargate vs EC2 cost
data:image/s3,"s3://crabby-images/25dfd/25dfd0481fdef5742ed45a33fea84fb02b098b78" alt=""
Fargate can be a good option for use cases that are memory-limited (e.g. data processing/transformation in memory) vs upgrading to next ec2 tier
Fargate vs Lambda
Fargate | Lambda |
---|---|
High level of control of library and system dependencies with docker Slower startup time (30 sec - 1 min) Can use for long running applications |
No docker support 50mb (zipped) deployment package size limit - barely enough for basic python data science stack (numpy / pandas / sklearn / statsmodel) 900 second execution time limit Fast startup time |
Fargate vs Kubernetes
Fargate | Kubernetes |
---|---|
No need to consider instance provisioning Tight integration with other AWS resources - (task IAM roles, SSM secrets, Cloudwatch logs) |
Abstractions are more complex (imo) EKS more expensive (need to run control plane - $0.2 / hour) Managing your own K8s cluster is complex Open-source; community plug-ins (e.g. canary deployments) |
Thanks!
Questions
Slides:
https://bit.ly/itsummit-fargate
ecs-utils
https://github.com/harvard-vpal/ecs-app-utils
andrew_ang@harvard.edu
Deploying Serverless Docker applications on AWS
By kunanit
Deploying Serverless Docker applications on AWS
Sharing our experience and DevOps considerations in using AWS Fargate to support a variety of containerized applications for education data science.
- 691