What Streamlit

Can and Can't

(or Shouldn't)

Do

Hi👋

Yuichiro Tachibana

@whitphx

Pythonista
OSS enthusiast
ML Developer Advocate at Hugging Face
Streamlit Creator

Agenda

What's Streamlit
- How to use it
What Streamlit can and can't (or shouldn't) do
- Is it suitable for XX?

Streamlit is...

Python framework
to create Web UI
only in Python

Its target users are...

Any Python programmers
who want Web UI
but don't want to write JavaScript
Data scientists
Machine learning engineers

https://streamlit.io/

Hello Streamlit

$ pip install streamlit
$ code app.py

Hello Streamlit

import streamlit as st

st.title("Hello PyConFR :rainbow[2024]!")

st.markdown("This is our **first** [Streamlit](https://streamlit.io/) app :balloon:")

$ streamlit run app.py

  You can now view your Streamlit app in your browser.

  Local URL: http://localhost:8501
  Network URL: http://192.168.1.1:8501

Pure-Python Web UI framework

Write only Python, get a web app

import streamlit as st

st.title("Hello PyConFR :rainbow[2024]!")

st.markdown("This is our **first** [Streamlit](https://streamlit.io/) app :balloon:")

Display contents as you write

import streamlit as st

st.title('Awesome Streamlit app')

st.header("Data Visualizations")
st.markdown("Streamlit is a great tool to create visualizations")

st.subheader("Plotting")

st.area_chart({'data': [1, 5, 2, 6, 2, 1], 'data2': [10, 15, 12, 16, 12, 11]})

st.subheader("Maps")

st.map({"lat": [37.7749295, 35.6895, 34.052235], "lon": [-122.4194155, -139.6917, -118.243683]})

st.header("Dataframes")

st.write('Here is a simple dataframe')
st.dataframe({'A': [random.randint(0, 100) for _ in range(10)], 'B': [random.randint(0, 100) for _ in range(10)], 'C': [random.randint(0, 100) for _ in range(10)]})

st.header("Images")

st.image(image, caption='Random image from picsum.photos', use_column_width=True)

Lots of built-in components

st.line_chart()

st.table()

st.title()

st.text()

st.scatter_chart()

st.plotly_chart()

st.button()

st.selectbox()

st.date_input()

st.camera_input()

st.column()

st.chat_input()

st.navigation()

st.secret()

st.dialog()

st.fragment()

st.connection()

st.bokeh_chart()

st.area_chart()

st.bar_chart()

st.alteir_chart()

st.markdown()

Interactive UI

Interactive apps

import streamlit as st

toggle_value = st.toggle("Click me!")

if toggle_value:
    st.markdown(":large_green_circle: ON")
else:
    st.markdown(":large_red_square: OFF")

Interactive apps

import streamlit as st

toggle_value = st.toggle("Click me!")

if toggle_value:
    st.markdown(":large_green_circle: ON")
else:
    st.markdown(":large_red_square: OFF")

$ streamlit run app.py

$ streamlit run app.py

📜

app.py

Web browser

Python runtime

Streamlit

Server

Script runner

Static files

Web server

React

SPA

React SPA

JavaScript runtime

$ streamlit run app.py

📜

app.py

Web browser

Python runtime

Streamlit

Server

Script runner

Static files

Web server

React

SPA

React SPA

JavaScript runtime

User input triggers re-run

Frontend

import streamlit as st

toggle_value = st.toggle("Click me!")

if toggle_value:
    st.markdown(":large_green_circle: ON")
else:
    st.markdown(":large_red_square: OFF")

Python code

# True

Trigger re-run

Update UI

User input triggers re-run

Frontend

import streamlit as st

toggle_value = st.toggle("Click me!")

if toggle_value:
    st.markdown(":large_green_circle: ON")
else:
    st.markdown(":large_red_square: OFF")

Python code

# False

Trigger re-run

Update UI

Top-to-Bottom execution

makes it possible to build interactive UI without callbacks.

import streamlit as st

toggle_value = st.toggle("Click me!")

if toggle_value:
    st.markdown("ON")
else:
    st.markdown("OFF")

import some_framework as sf

def callback(toggle_value):
    text_field = sf.get_element(id="value")
    if toggle_value:
        text_field.text = "ON"
    else:
        text_field.text = "OFF"

sf.toggle("Click me!", callback=callback)

sf.text(id="value", text="ON")

✅

🚨

Aside: is it like React?

Declarative UI
Colocated logic and view

import streamlit as st

toggle_state = st.toggle("Toggle me!")

if toggle_state:
    st.write("The toggle is ON!")
else:
    st.write("The toggle is OFF.")

import React, { useState } from "react";

function ToggleButton() {
  const [toggleState, setToggleState] = useState(false);

  const handleToggle = () => {
    setToggleState((prevState) => !prevState);
  };

  return (
    <div>
      <button onClick={handleToggle}>
        {toggleState ? "ON" : "OFF"}
      </button>
      <p>The toggle is {toggleState ? "ON!" : "OFF."}</p>
    </div>
  );
}

export default ToggleButton;

Imperative escape hatch

Next example: ToDo app

ToDo app code?

import streamlit as st

tasks = []

new_task_name = st.text_input("Task Name")
if new_task_name:
    tasks.append(new_task_name)

st.write("## Tasks")
for task in tasks:
    st.write(task)

🤔

Session State

import streamlit as st

# tasks = []   # ⚠️ Doesn't work.
if "tasks" not in st.session_state:
    st.session_state.tasks = []
tasks = st.session_state.tasks

new_task_name = st.text_input("Task Name")
if new_task_name:
    tasks.append(new_task_name)

st.write("## Tasks")
for task in tasks:
    st.write(task)

👍

ToDo app

State persisted over reruns

Frontend

import streamlit as st

if "tasks" not in st.session_state:
    st.session_state.tasks = []
tasks = st.session_state.tasks

new_task_name = st.text_input("Task Name")
if new_task_name:
    tasks.append(new_task_name)

st.write("## Tasks")
for task in tasks:
    st.write(task)

Python code

# []

# ["Yuichiro"]

# ["Yuichiro",
   "Alice"]

Session State

Use

for data that persists between multiple runs.

st.session_state[key]

Revisit: React

Streamlit shares some concepts with React (inspired by React?)

Unlike React,
components hide the state management.
Like React,
state management outside of the components is explicit.

Wrap-up: Top-to-Bottom exec

Subtitle

Pros ✅

Cons 🚨

Text

Layout/Design

Auto layout alignment

import streamlit as st

st.title('Awesome Streamlit app')

st.header("Data Visualizations")
st.markdown("Streamlit is a great tool to create visualizations")

st.subheader("Plotting")

st.area_chart({'data': [1, 5, 2, 6, 2, 1], 'data2': [10, 15, 12, 16, 12, 11]})

st.subheader("Maps")

st.map({"lat": [37.7749295, 35.6895, 34.052235], "lon": [-122.4194155, -139.6917, -118.243683]})

st.header("Dataframes")

st.write('Here is a simple dataframe')
st.dataframe({'A': [random.randint(0, 100) for _ in range(10)], 'B': [random.randint(0, 100) for _ in range(10)], 'C': [random.randint(0, 100) for _ in range(10)]})

st.header("Images")

st.image(image, caption='Random image from picsum.photos', use_column_width=True)

Theming

Sidebar

Main column

Unified Streamlit-ish look

Theming

Sidebar

Main column

Unified Streamlit-ish look

textColor, font

primaryColor

secondaryBackgroundColor

backgroundColor

Flexibility vs Simplicity

st.logo

👉https://docs.streamlit.io/develop/concepts/configuration/theming

👉https://docs.streamlit.io/develop/api-reference/media/st.logo

You just have these options. Streamlit does the rest.

https://x.com/Disney_UK/status/822495642287271936

Restrictions make things simpler

You can get a nice look 'n' feel interactive app like this:

...just by writing Python code

calling components,

and configuring the theme if needed.

st.*()

@st.cache_data
def get_UN_data():
    AWS_BUCKET_URL = "https://streamlit-demo-data.s3-us-west-2.amazonaws.com"
    df = pd.read_csv(AWS_BUCKET_URL + "/agri.csv.gz")
    return df.set_index("Region")

try:
    df = get_UN_data()
    countries = st.multiselect(
        "Choose countries", list(df.index), ["China", "United States of America"]
    )
    if not countries:
        st.error("Please select at least one country.")
    else:
        data = df.loc[countries]
        data /= 1000000.0
        st.write("### Gross Agricultural Production ($B)", data.sort_index())

        data = data.T.reset_index()
        data = pd.melt(data, id_vars=["index"]).rename(
            columns={"index": "year", "value": "Gross Agricultural Product ($B)"}
        )
        chart = (
            alt.Chart(data)
            .mark_area(opacity=0.3)
            .encode(
                x="year:T",
                y=alt.Y("Gross Agricultural Product ($B):Q", stack=None),
                color="Region:N",
            )
        )
        st.altair_chart(chart, use_container_width=True)
except URLError as e:
    st.error(
        """
        **This demo requires internet access.**
        Connection error: %s
    """
        % e.reason
    )

You just write Python. Streamlit does the rest.

Escape hatches for custom design

Custom CSS

https://ai-apps-readiness-jp.streamlit.app/

So why you should use Streamlit?

So why we should use Streamlit?

Fast and Easy development of
Interactive Web apps
Only in Python

Callback-less code with the top-to-bottom execution model
Pre-defined theming/layout

👉https://pythonframeworks.com/

Web UI frameworks

Server-side web frameworks

Among many frameworks...

Comparison

Streamlit

(Web UI Frameworks)

Serverside Web Frameworks

+ Frontend dev

All-in-one

Pick what you need

Python only

Tech stack

Languages

Python + JS

+ Data serializer (JSON/Protobuf/...)

Design

Low-flexibility/Easy

Full flexiblity

Logic/Modeling

Declarative

Up to you

👉https://pythonframeworks.com/

Declarative vs Imperative

Jupyter-embeddable vs not

Flexible layout vs Easy design

Web UI frameworks

Server-side web frameworks

Comparison

Streamlit

Other Web UI Frameworks

All-in-one

Python only

Tech stack

Languages

Python only

Design

Low-flexibility/Easy

Per-framework

Logic/Modeling

Declarative

Per-framework

Use cases

Revisit: Todo app

import streamlit as st

if "tasks" not in st.session_state:
    st.session_state.tasks = []
tasks = st.session_state.tasks

new_task_name = st.text_input("Task Name")
if new_task_name:
    tasks.append(new_task_name)

st.write("## Tasks")
for task in tasks:
    st.write(task)

Dynamic contents

relying on data/states

Dynamic number of fields
Multi-step wizards
Data visualization
Conditional UI

Budget Allocator

Data dependencies⚡️

Budget Allocator

import streamlit as st

st.title("Budget Allocator")

# Set total budget and number of categories
total_budget = st.number_input("Total Budget ($)", min_value=1000, value=5000, step=500)
num_categories = st.number_input("Number of Categories", min_value=1, value=3)

# Generate sliders for each category
allocations = [
    st.slider(f"Category {i+1} Allocation", min_value=0, max_value=total_budget, value=total_budget // num_categories)
    for i in range(num_categories)
]

# Calculate allocated and remaining budget
total_allocated = sum(allocations)
remaining_budget = total_budget - total_allocated

# Display budget summary
st.write("### Allocation Summary")
st.write(f"Total Allocated: ${total_allocated}")
st.write(f"Remaining Budget: ${remaining_budget}")

# Provide feedback based on budget status
if total_allocated > total_budget:
    st.error("You have exceeded the total budget!")
elif remaining_budget < total_budget * 0.1:
    st.warning("Warning: You are nearing your total budget.")
else:
    st.success("You are within the budget.")

import dash
from dash import dcc, html
from dash.dependencies import Input, Output, State, ALL

# Initialize the Dash app
app = dash.Dash(__name__, suppress_callback_exceptions=True)

# App layout
app.layout = html.Div([
    html.H1("Budget Allocator"),

    # Input for total budget
    html.Label("Total Budget ($):"),
    dcc.Input(id="total-budget", type="number", value=5000, min=1000, step=500, style={"margin-bottom": "20px"}),

    # Input for number of categories
    html.Label("Number of Categories:"),
    dcc.Input(id="num-categories", type="number", value=3, min=1, style={"margin-bottom": "20px"}),

    # Placeholder for dynamically generated sliders
    html.Div(id="sliders-container", style={"margin-bottom": "20px"}),

    # Summary of allocation
    html.H3("Allocation Summary"),
    html.Div(id="summary-output"),
])

# Callback to generate sliders dynamically based on the number of categories
@app.callback(
    Output("sliders-container", "children"),
    [Input("num-categories", "value"), Input("total-budget", "value")]
)
def create_sliders(num_categories, total_budget):
    sliders = []
    for i in range(num_categories):
        slider = html.Div([
            html.Label(f"Category {i + 1} Allocation"),
            dcc.Slider(
                id={'type': 'category-slider', 'index': i},
                min=0, max=total_budget, value=total_budget // num_categories,
                marks={0: "0", total_budget: str(total_budget)}
            )
        ], style={"margin-bottom": "20px"})
        sliders.append(slider)
    return sliders

# Callback to update allocation summary based on slider values
@app.callback(
    Output("summary-output", "children"),
    Input("total-budget", "value"),
    Input({'type': 'category-slider', 'index': ALL}, 'value')
)
def update_summary(total_budget, allocations):
    total_allocated = sum(allocations)
    remaining_budget = total_budget - total_allocated

    summary = [
        html.P(f"Total Allocated: ${total_allocated}"),
        html.P(f"Remaining Budget: ${remaining_budget}")
    ]

    if total_allocated > total_budget:
        summary.append(html.Div("You have exceeded the total budget!", style={"color": "red"}))
    elif remaining_budget < total_budget * 0.1:
        summary.append(html.Div("Warning: You are nearing your total budget.", style={"color": "orange"}))
    else:
        summary.append(html.Div("You are within the budget.", style={"color": "green"}))

    return summary

# Run the app
if __name__ == "__main__":
    app.run_server(debug=True)

Example case: Dash

✅ Dynamic UI relying on data/states
✅ Complex data dependencies

Data uploader and visualizer

import streamlit as st
import pandas as pd

st.title("Upload and Visualize CSV Data with Streamlit")

# File uploader for CSV
uploaded_file = st.file_uploader("Choose a CSV file", type="csv")

# Display the visualization if the file is uploaded
if uploaded_file:
    data = pd.read_csv(uploaded_file)
    st.write("Data Preview:")
    st.write(data.head())

    st.line_chart(data, x="Date", y="Stock Price")

Reactive: Update the chart reacting the input data
Conditional: show the chart only when the file has been uploaded

without callbacks/imperative code!

Example case: Dash

import dash
from dash import dcc, html
from dash.dependencies import Input, Output, State
import pandas as pd
import base64
import io

# Initialize the Dash app
app = dash.Dash(__name__)

app.layout = html.Div([
    html.H1("Upload and Visualize CSV Data with Dash"),

    # File upload component that accepts only CSV files
    dcc.Upload(
        id='upload-data',
        children=html.Button('Upload CSV'),
        accept='.csv',  # Accept only CSV files
        multiple=False
    ),

    # Placeholders for the data preview and chart
    html.Div(id='output-data-upload'),
    dcc.Graph(id='data-graph')
])

# Callback to parse the uploaded CSV and update the output
@app.callback(
    [Output('output-data-upload', 'children'), Output('data-graph', 'figure')],
    [Input('upload-data', 'contents')],
    [State('upload-data', 'filename')]
)
def update_output(contents, filename):
    if contents is None:
        return None, {}

    # Decode and parse the file
    content_type, content_string = contents.split(',')
    decoded = base64.b64decode(content_string)
    df = pd.read_csv(io.StringIO(decoded.decode('utf-8')))

    # Ensure the columns for Date and Stock Price exist
    if 'Date' not in df.columns or 'Stock Price' not in df.columns:
        return html.Div(["Error: CSV must contain 'Date' and 'Stock Price' columns."]), {}

    # Convert Date column to datetime for accurate plotting
    df['Date'] = pd.to_datetime(df['Date'])

    # Data preview
    preview = html.Div([
        html.H5(f"Uploaded File: {filename}"),
        html.P("Data Preview:"),
        html.Table([
            html.Tr([html.Th(col) for col in df.columns])] +
            [html.Tr([html.Td(df.iloc[i][col]) for col in df.columns]) for i in range(min(len(df), 5))])
    ])

    # Define the figure with specified columns
    fig = {
        'data': [{
            'x': df['Date'],
            'y': df['Stock Price'],
            'type': 'line',
            'name': 'Stock Price'
        }],
        'layout': {
            'title': 'Uploaded Data Visualization',
            'xaxis': {'title': 'Date'},
            'yaxis': {'title': 'Stock Price'}
        }
    }

    return preview, fig

# Run the app
if __name__ == '__main__':
    app.run_server(debug=True)

✅ Conditional UI
✅ Data visualization: No need to take care of
- Visualization framework
- Data transportation (JSON/Protobuf/...)

Dashboard

# Get data

# Call line_chart()

# Loop

import streamlit as st
import time
from collections import deque
from data_source import get_stock_data

st.title("Data Streaming Dashboard")

max_rows = 100
stock_data = deque(maxlen=max_rows)

chart = st.empty()

while True:
    timestamp, price = get_stock_data()

    stock_data.append((timestamp, price))

    timestamps, prices = zip(*stock_data)

    chart.line_chart({"timestamp": timestamps, "price": prices}, x="timestamp", y="price")

    time.sleep(0.01)

Large-scale dashboard

# For each chart,

# Get data and
# call line_chart()

# Loop

import streamlit as st
import time
from collections import deque
from data_source import get_stock_data
...
st.title(f"Data Streaming Dashboard ({NUM_ROWS}x{NUM_COLUMNS})")

tickers = [f'TICKER{i+1}' for i in range(MAX_TICKERS)]

stock_data = {ticker: deque(maxlen=MAX_ROWS) for ticker in tickers}

rows = [st.columns(NUM_COLUMNS) for _ in range(NUM_ROWS)]
charts = {}
for i, ticker in enumerate(tickers):
    row = i // NUM_COLUMNS
    col = i % NUM_COLUMNS
    charts[ticker] = rows[row][col].empty()

while True:
    for ticker in tickers:
        timestamp, price = get_stock_data(ticker)

        stock_data[ticker].append((timestamp, price))

        timestamps, prices = zip(*stock_data[ticker])

        charts[ticker].line_chart({"timestamp": timestamps, "price": prices}, x="timestamp", y="price")

    time.sleep(0.01)

Large-scale dashboard

Performance 🤔

And...

Polling vs Event-driven

✅Easy

🚨Inefficient, not precise

✅Efficient, precise

🚨Complex

while True:
    data = get_data()
    show_data(data)

def on_update(data):
    show_data(data)

data_source.subscribe(on_update)

Event-driven dashboards

Event listener👍

Queue()🤔

Can't get rid of the loop😭

import streamlit as st
from collections import deque
from queue import Queue
from event_dispatcher import data_source

st.title("Data Streaming Dashboard (Event-driven)")

max_rows = 100
stock_data = deque(maxlen=max_rows)

chart = st.empty()

q = Queue()

def on_update(timestamp, price):
    q.put((timestamp, price))

data_source.subscribe(on_update)

while True:
    timestamp, price = q.get()

    stock_data.append((timestamp, price))

    timestamps, prices = zip(*stock_data)

    chart.line_chart({"timestamp": timestamps, "price": prices}, x="timestamp", y="price")

Streamlit is not intuitive when event-driven

def on_update(new_data):
    ...
    st.line_chart(data_to_display)

data_source.subscribe(on_update)

Streamlit can't do this🚨

Revisit:

Top-to-Bottom execution

import streamlit as st
from event_dispatcher import data_source

st.title("Data Streaming Dashboard")

def on_update(new_data):
    data_to_display = ...
    st.line_chart(data_to_display)

data_source.subscribe(on_update)

The callback function is
called in another thread that is outside of Streamlit's control.

✅ Live-updated UI
🚨 High-frequency updates
🚨 Event-driven code

More examples:

Tailor-made Frontend

https://trello.com/

https://slack.com/

https://www.notion.so/

https://asana.com/

👉 https://docs.streamlit.io/develop/api-reference

Function-centric apps

e.g. AI/ML demos

def your_awesome_logic(arg1, arg2, ...):
    # your awesome logic here
    return res1, res2, ...

When you want the WebUI to use a function...

Function-centric apps

e.g. AI/ML demos

Streamlit fits perfectly,
but another might give a better abstraction.

import gradio as gr

# Core logic function
def process_data(number, text):
    transformed_number = number * 2
    uppercased_text = text.upper()
    return transformed_number, uppercased_text

# Create and launch Gradio Interface
demo = gr.Interface(
    fn=process_data,
    inputs=[
        gr.Number(label="Enter a number"),
        gr.Textbox(label="Enter some text"),
    ],
    outputs=[
        gr.Textbox(label="Transformed Number"),
        gr.Textbox(label="Uppercased Text")
    ],
    title="Data Processor",
)

demo.launch()

import streamlit as st

# Core logic function
def process_data(number, text):
    transformed_number = number * 2
    uppercased_text = text.upper()
    return transformed_number, uppercased_text

# Streamlit UI setup
st.title("Data Processor")

# Input fields
number = st.number_input("Enter a number:", value=1)
text = st.text_input("Enter some text:")

# Button to execute the function
if st.button("Process"):
    # Call the core function with user inputs
    transformed_number, uppercased_text = process_data(number, text)

    # Display the outputs
    st.write("Transformed Number:", transformed_number)
    st.write("Uppercased Text:", uppercased_text)

Function-centric apps

e.g. AI/ML demos

Streamlit fits perfectly,
but another might give a better abstraction.

Cross-user Interaction

Session

Session

Session

Isolated

e.g. multi-user chat

💥

Scaling up/out

Streamlit's server is difficult to scale up/out

Thread

Per-user WebSocket connection

→Difficult to load balancing/scale-out

No option to use multi cores

→Difficult to scale-up

Thread

Thread

High traffic apps

Subtitle

Wrap-up

Streamlit is a very good WebUI framework,
but it's not the silver bullet.

It simplifies the development of interactive web UI based on Python logic,
with its unique execution model and pre-built components and themes.
It may not be suitable in some cases;
large systems, event-driven data source, high traffic apps, etc.
Different technologies may be more appropriate for your case.
Choose the right one based on a clear understanding of these tech stacks.

Happy Streamlit-ing!

Bonus: LLM-friendliness

Streamlit

has stable API
has large amount of user code

What Streamlit can and can't (or shouldn't) do

By whitphx

What Streamlit can and can't (or shouldn't) do

5 months ago
496