That's not my name

BĂ¥rd Aase - Booster 2024

That's not my name

# Booster 2024 - BÃĨrd Aase

They call me BŒrd.
They call me BÃ¥rd.
They call me B�rd.


That's not my name.
That's not my name.
That's not my name.


They call me Ose.
They call me Asse.
They call me Aas.


That's not my name.
That's not my name.
That's not my name.

# Booster 2024 - Bård Aase

My name is Bård

Bård Aase / @elzapp

  • Platform Engineer @ Sbanken/DNB
# Booster 2024 - B├¸rd Aase

My name is Bård

Bård Aase / @elzapp

  • Platform Engineer @ Sbanken/DNB
  • Bergen Linux User Group
# Booster 2024 - Bård Aase

My name is Bård

Bård Aase / @elzapp

  • Platform Engineer @ Sbanken/DNB
  • Bergen Linux User Group
  • Family
# Booster 2024 - BΓ₯rd Aase

My name is Bård

Bård Aase / @elzapp

  • Platform Engineer @ Sbanken/DNB
  • Bergen Linux User Group
  • Family
  • Geek
         b        o        o        s        t        e        r
booster [01100010 01101111 01101111 01110011 01110100 01100101 01110010] ISO-8859-15
        [01100010 01101111 01101111 01110011 01110100 01100101 01110010] Windows Codepage 1252
        [01100010 01101111 01101111 01110011 01110100 01100101 01110010] Macintosh
        [01100010 01101111 01101111 01110011 01110100 01100101 01110010] UTF-8

Binary ASCII

# Booster 2024 - B├ąrd Aase
         b        å                 r        d        
   bård [01100010 ________          01110010 01100100] ASCII (brd) ❌
        [01100010 11100101          01110010 01100100] ISO-8859-15
        [01100010 11100101          01110010 01100100] Windows Codepage 1252
        [01100010 10001100          01110010 01100100] Macintosh
        [01100010 11000011 10100101 01110010 01100100] UTF-8

Binary "Western languages"

# Booster 2024 - B├Цrd Aase
         b  å     r  d        
   bård [62 __    72 64] ASCII (brd) ❌
        [62 E5    72 64] ISO-8859-15
        [62 E5    72 64] Windows Codepage 1252
        [62 8C    72 64] Macintosh
        [62 C3 A5 72 64] UTF-8

Hex Latin

# Booster 2024 - BĆ„rd Aase

Character encoding

# Booster 2024 - Bård Aase
65 A
66 B
67 C
68 D
69 E
70 F
71 G
72 H
73 I
74 J
75 K
76 L
77 M
78 N
79 O
80 P
81 Q
82 R
83 S
84 T
85 U
86 V
87 W
88 X
89 Y
90 Z
91 [
92 \
93 ]
94 ^
95 _
96 `

*decimal codepoints from UNICODE

Numbers and stuff

# Booster 2024 - BĆ„rd Aase
48 0
49 1
50 2
51 3
52 4
53 5
54 6
55 7
56 8
57 9
58 :
59 ;
60 <
61 =
62 >
63 ?

*decimal codepoints from UNICODE

ASCII

US-ASCII
1963

EBCDIC

Used in IBM mainframes
Incompatible with ASCII
and itself

1963

ISO-646

NS-4551-1
ISO-646
IEC-646
ECMA-6
1967

ISO-IR-197

Proposed revision to ISO-8859-1 to add support for Sámi

1996

ISO-8859-1

Western
IEC-8859-1
Windows-1252*
1987

ISO-8859-15

IEC-8859-15
Adds €

1999

# Booster 2024 - âCvÊÀ€ /ËÁ

NON-UNICODE

# Booster 2024 - BУЅrd Aase
æ &aelig; &#xE6; &#230;
ø &oslash; &#xF8; &#248;
å &aring; &#xE5; &#229;
& &amp; &#x26; &#38;

HTML Entities

Hello B&aring;rd!

# BĆ„rd Aase - Booster 2024

UNICODE v1

1991

UTF-8

1993

Netscape

1994
using Windows-1252

HTML5

2008
Standardizing on Windows-1252

UNICODE v2

1996

EMOJIs

Unicode v6
2011

# Booster 2024 - BĂĄrd Aase

UNICODE

# Booster 2024 - BĂĄrd Aase

259*

*https://www.iana.org/assignments/character-sets

Azure Devops

# Booster 2024 - BĆ„rd Aase

The Kebab receipt

# Booster 2024 - Bц╔rd Aase

Stored as UTF-8

Bård

BÃ¥rd

Read as if it was

ISO-8859-1

or CP1252

# Booster 2024 - BΓ¥rd Aase
# Booster 2024 - Bård Aase
# Booster 2024 - Bård Aase

Bård

Stored as UTF-8
66 C3 A5 72 64

Bård

Bård

Read as Mac Roman
66 C3 A5 72 64

66 E2 88 9A E2 80 A2 72 64

Bård

# Booster 2024 - BĂĽrd Aase

B†rd

# Booster 2024 - Bรฅrd Aase
# Booster 2024 - BĆ„rd Aase
7_ 8_ 9_
_0 p
_1 q
_2 r
_3 s ƒ
_4 t
_5 u
_6 v
_7 w
_8 x ˆ ˜
_9 y
_A z Š š
_B {
_C | Œ œ
_D }
_E ~ Ž ž
_F Ÿ

CP-1252
Windows-1252

7_ 8_ 9_
_0 p
_1 q
_2 r
_3 s
_4 t
_5 u
_6 v
_7 w
_8 x
_9 y
_A z
_B {
_C |
_D }
_E ~
_F

ISO-8859-1
 

B†rd
0x86 → †

7_ 8_ 9_
_0 p Ç É
_1 q ü æ
_2 r é Æ
_3 s â ô
_4 t ä ö
_5 u à ò
_6 v å û
_7 w ç ù
_8 x ê ÿ
_9 y ë Ö
_A z è Ü
_B { ï ø
_C | î £
_D } ì Ø
_E ~ Ä ×
_F Å ƒ

CP-850
DOS-850

# Booster 2024 - BĂ„rd Aase
# Booster 2024 - Bård Aase

Bård

Stored as UTF-8
66 C3 A5 72 64

Bård

Bård

Read as Mac Roman
66 C3 A5 72 64

Stored as UTF-8
66 E2 88 9A E2 80 A2 72 64

Bård

B√â¢rd

Read as CP-1252
66 E2 88 9A E2 80 A2 72 64

66 C3 A2 CB 86 C5 A1 C3 A2 E2 82 AC C2 A2 72 64

Bård

# Booster 2024 - Bård Aase

Bård

Bård

Bård

Bård

Bård

breaks

breaks

fixes?

fixes

# Booster 2024 - Bård Aase

Bård

Bård

BÃ¥rd

Bård

B�rd

keeps

breaks

fixes

breaks

# Booster 2024 - Bฤ�๏ฟฝrd Aase
# Booster 2024 - Bård Aase

Bård

Stored as UTF-8
66 C3 A5 72 64

Bård

BÃ¥rd

Read as CP-1252
66 C3 A5 72 64

Stored as UTF-8
66 C3 83 C2 A5 72 64

BÃ¥rd

BÃ¥rd

Read as CP-1252
66 C3 83 C2 A5 72 64

66 C3 83 C6 92 C3 82 C2 A5 72 64

BÃ¥rd

# Booster 2024 - B�rd Aase

Multibyte characters

# Booster 2024 - BФ�т��rd Aase

💩

# Booster 2024 - BĂ„rd Aase

Æ Ø Å
Ä Ö Å

# Booster 2024 - B�rd Aase

Å

# Booster 2024 - BĂ„rd Aase

Capital letter Å - 197
11000011 10000101
0xC3 0x85

A  +  ◌̊  =  Å

# Booster 2024 - Brd Aase

Capital letter A - 65
10000001
0x41

Combining ring above - 778
11001100 10001010
0xCC 0x8A

Å

Capital letter Å - 197
11000011 10000101
0xC3 0x85

Capital Letter A with Ring Above
10000001 11001100 10001010
0x41 0xCC 0x8A

# Booster 2024 - Brd Aase

Ångström

11100010 10000010 10101011
0xE2 0x84 0xAB

A  +  ◌̊  =  Å

Capital letter A - 65
10000001
0x41

Combining ring above - 778
11001100 10001010
0xCC 0x8A

Capital Letter A with Ring Above
10000001 11001100 10001010
0x41 0xCC 0x8A

Å

Capital letter Å - 197
11000011 10000101
0xC3 0x85

Plain text

# Booster 2024 - B�rd Aase

BOM

# Booster 2024 - Brd Aase
UTF-16 (UCS-2) Byte order mark
11111111 11111110 - 0xFFFE    Big endian
11111110 11111111 - 0xFEFF    Little endian

UTF-16 / UCS-2

▮the quick brown fox jumps over the lazy dog

ŷ

# Booster 2024 - BΨ£οΏ½rd Aase

ISO-8859-14

Resources

  • The Absolute Minimum Every Software Developer Absolutely, Positively Must Know About Unicode and Character Sets (No Excuses!)
    by Joel Spolsky
     
  • The Absolute Minimum Every Software Developer Must Know About Unicode in 2023 (Still No Excuses!)

    by Niki Tonsky

  • IANA List of known Character sets

  • And more

# Booster 2024 - BƒЖпњљƒАвАЮrd Aase

http://bit.ly/booster24-plaintext

# Booster 2024 - Bård Aase

Thanks!

Bård Aase

🐘 @elzapp@mastodon.cloud
📧 elzapp@elzapp.com

linkedin.com/in/bardaase/

http://bit.ly/booster24-plaintext

That's not my name

By Bård Aase

That's not my name

  • 16