🤖

💭

👨🏻‍💻

Vibe Coding

AI Augmented Development

How did we get here?

🤖 

💭

AI Augmented Development

How did we get here?

🤖 

💭

Context is  👑

AI had been around
for a while...

🤖 

💭

AI has existed in universities and
big tech organizations for many years.

A friend of mine studied neural networks at the Technion about 20 years ago… 

Deep Blue beat Gary Kasparov in 1996
IBM Watson, Siri & Alexa are old news...

Interactive timeline from 2015 - 2025

A famous interview

🤖 

💭

It was a futuristic, geeky gimmick.

Eric Elliott's famous interview with GPT3 was in 2020

It did not significantly impact our lives.

The machine is sentient!

🤖 

💭

In 2022, Blake Lemoine, a Google AI engineer, was fired from Google for violating its employee confidentiality and data security policies after he publicly claimed its LaMDA artificial intelligence was sentient.

Later that year, we had the GPT moment…

GPT moment!

🤖 

💭

In late Nov 2022, OpenAI made ChatGPT
available to the public for free!

100 million monthly users in 2 months!

It reached 1 million users
within just 5 days of its launch!

476 million December 2024

1 billion monthly users by October 2025

But Chat GPT was just a Front!

🤖 

💭

While ChatGPT was offered for free
to the general public...

The web exploded with AI services as a result.

OpenAI also offered its AI models
via a paid API platform

This directory lists over 40,000 AI-related
tools and services at the time of this writing.

Multi-Domain Use cases of AI models

🤖 

💭

Generative AI 

Text & Language Generation
     - Conversational AI (Chatbots, Assistants)
     - Content creation (emails, stories, summaries)
     - Code generation
     - Translation & grammar correction

 Image Generation
     - Creating & editing images with Midjourney, DALL.E, GPT...

 Video Generation
     - Video from text or images, Deepfakes & Avatars, Auto-edits, Animation

 Audio Generation
     - Text-to-speech (TTS), AI voice cloning, Music generation

Perception AI (Understanding the real world)

Vision (object recognition, tracking), Speech Recognition, Sensor Fusion (e.g. for robotics, drones)

Predictive / Analytical AI

Fraud detection, Forecasting, Diagnostics, Recommendations etc.

Tech Giants had to join in

🤖 

💭

Google was forced to change its strategy and integrate AI into its search results to stay relevant.

Google also made its own models available via API, along with other public tools like Gemini and NotebookLM.

Microsoft, Amazon, Meta, Apple,  and X have integrated their own AI models into their services and are offering them as cloud services as well.

New players like Anthropic and Mistral had emerged.

The Open Source Eco System

🤖 

💭

Open source experienced a significant growth as well.

HuggingFace offers over 1M smaller, customized models that are free for download or used via their API.

Tools like Ollama, LM Studio, and OpenRouter make it easy to run models on your infrastructure (on prem).

Chinese Models emerged like DeepSeek & Qwen 

NVIDIA launched AI Supercomputer DGX Spark  for your desk.

Running models Locally require a strong infrastructure.

NVIDIA also launched Jetson Orin Nano and Jetson Thor for autonomous physical AI and robotics.

Why can't I use GPT or Claude for everything?

🤖 

💭

Do you just want to use the model or include it in your product?

Is the LLM intended for general use?
Or will it need to be custom tailored for a specific use-case?

Can you use hosted LLMs over the network?
Or is "on-prem" a requirenment?

Some leading questions to help you choose a model that fit your needs

Does the model need to contain reasoning, or other traits?

Is the budget a consideration?
You may want to optimize for a faster customized model

Why do we need so many models?

Known issues when working with LLMs

🤖 

💭

Training Data Limitations

Hallucination and Accuracy

Context Window Constraints

Reasoning Limitations

Inconsistency

Domain-Specific Limitation

Some of the solutions

🤖 

💭

Web search tools, fact-checking tools

System prompts and Prompt engineering

Condense information before analysis

Break complex problems into smaller, sequential steps

Implement Retrieval Augmented Generation (RAG) systems

Implement human-in-the-loop workflows for critical decisions

Fine-tune models on domain-specific data when possible

Vibe Coding

By Yariv Gilad