8-Bit Unicode Transformation Format: Unterschied zwischen den Versionen

Aus Mikiwiki
Wechseln zu: Navigation, Suche
(Einige Spezialzeichen)
 
(21 dazwischenliegende Versionen desselben Benutzers werden nicht angezeigt)
Zeile 1: Zeile 1:
Das <b>8-bit Unicode Transformation Format / UTF-8</b> ist die am weitesten verbreitete Kodierung für [[Unicode]]-Zeichen. Dabei wird jedem Unicode-Zeichen eine besonders kodierte Bytekette von variabler Länge zugeordnet. UTF-8 unterstützt bis zu vier Byte, auf die sich wie bei allen UTF-Formaten alle Unicode-Zeichen abbilden lassen.
+
Das <b>8-Bit Unicode Transformation Format / UTF-8</b> ist die am weitesten verbreitete Kodierung für [[Unicode]]-Zeichen. Dabei wird jedem Unicode-Zeichen eine besonders kodierte Bytekette von variabler Länge zugeordnet. UTF-8 unterstützt bis zu vier Byte, auf die sich wie bei allen UTF-Formaten alle Unicode-Zeichen abbilden lassen.
  
UTF-8 wurde als Alternative zu [[UTF-16]] entwickelt und stellt jedes Zeichen durch 8 Bit bzw. ein Byte dar. Dabei werden die US-ASCII-Zeichen (7 Bit) wie bisher durch ein Byte dargestellt, deren oberstes Bit 0 ist. Alle anderen Unicode-Zeichen werden durch zwei bis vier Byte lange Byte-Ketten dargestellt. Damit gibt es keinen unmittelbaren Zusammenhang mehr zwischen der Anzahl Bytes und der Anzahl Zeichen einer Datei.
+
UTF-8 wurde als Alternative zu [[UTF-16]] entwickelt und stellt jedes Zeichen durch 8 Bit bzw. ein Byte dar. Dabei werden die US-ASCII-Zeichen (7 Bit) wie bisher durch ein Byte dargestellt, deren oberstes Bit 0 ist. Alle anderen Unicode-Zeichen werden durch zwei bis vier Byte lange Byte-Ketten dargestellt. Damit gibt es keinen unmittelbaren Zusammenhang mehr zwischen der Anzahl Byte und der Anzahl Zeichen einer Datei.
  
 
UTF-8 hat eine zentrale Bedeutung als globale Zeichenkodierung im [[Internet]] und ist inzwischen die meist genutzte Unicode-Anwendung. Die Internet Engineering Task Force verlangt von allen neuen Internetkommunikationsprotokollen, dass die Zeichenkodierung deklariert wird und dass UTF-8 eine der unterstützten Kodierungen ist. Das Internet Mail Consortium / IMC empfiehlt, dass alle E-Mail-Programme UTF-8 darstellen und senden können.
 
UTF-8 hat eine zentrale Bedeutung als globale Zeichenkodierung im [[Internet]] und ist inzwischen die meist genutzte Unicode-Anwendung. Die Internet Engineering Task Force verlangt von allen neuen Internetkommunikationsprotokollen, dass die Zeichenkodierung deklariert wird und dass UTF-8 eine der unterstützten Kodierungen ist. Das Internet Mail Consortium / IMC empfiehlt, dass alle E-Mail-Programme UTF-8 darstellen und senden können.
  
 
Auch bei dem in [[Webbrowser]]n verwendeten [[HTML]] setzt sich UTF-8 zur Darstellung von länder- und sprachspezifischen Zeichen zunehmend durch und ersetzt die vorher benutzten HTML-Sonderzeichen.
 
Auch bei dem in [[Webbrowser]]n verwendeten [[HTML]] setzt sich UTF-8 zur Darstellung von länder- und sprachspezifischen Zeichen zunehmend durch und ersetzt die vorher benutzten HTML-Sonderzeichen.
 +
 +
== Verwendung ==
 +
 +
{| class=wikismall
 +
! rowspan=2 | Zeichen
 +
! colspan=2 | ISO 8859-1 (Latin1)
 +
! colspan=3 | Unicode (UTF-8)
 +
! colspan=1 rowspan=2 | URL
 +
|-
 +
! Dezimal !! Oktal
 +
! Dezimal !! Oktal !! Hexadez.
 +
|-
 +
| Ä
 +
| 196 || 304 || || 303 204 || C3 84 || %C3%84
 +
|-
 +
| ä
 +
| 228 || 344 || || 303 244 || C3 A4 || %C3%A4
 +
|-
 +
| Ö
 +
| 214 || 326 || || 303 226 || C3 96 || %C3%96
 +
|-
 +
| ö
 +
| 246 || 366 || || 303 266 || C3 B6 || %C3%B6
 +
|-
 +
| Ü
 +
| 220 || 334 || || 303 234 || C3 9C || %C3%9C
 +
|-
 +
| ü
 +
| 252 || 374 || || 303 274 || C3 BC || %C3%BC
 +
|-
 +
| α
 +
| || || || 316 261 || CE B1 ||
 +
|-
 +
| (Zeilenumbruch)
 +
| || || || \n || 0A ||
 +
|-
 +
| (nichts)
 +
| || || || \0 || 00 ||
 +
|-
 +
! colspan=7 | Unklar, ob folgende stimmen:
 +
|-
 +
| (Leerzeichen)
 +
| || || || || 0000 || %00%00
 +
|-
 +
| /
 +
| || || || || 2215 ||  %22%15
 +
|-
 +
| (
 +
| || || || || 0028 || %00%28
 +
|-
 +
| )
 +
| || || || || 0029 || %00%29
 +
|-
 +
|
 +
| || || || || ||
 +
|}
 +
 +
Ausgabe der UTF-8-Kodierung des Zeichens "α" (Alpha aus dem [http://www.unicode.org/charts/PDF/U0370.pdf griechischen Alphabet]). Im Beispiel wird für das Zeichen "α" die (hexadezimale) UTF-8-Kodierung "ceb1" (also genau umgekehrt wie von "echo" ausgegeben; dezimal: 316 261, hier ist die Reihenfolge wie angezeigt). Danach folgt noch ein Zeilenumbruch ("0a") und nichts weiter ("00"). Zu beachten ist natürlich, dass das verwendete Terminal auch tatsächlich auf die Anzeige von UTF-8 eingestellt ist! Der entsprechende Code Point für die Darstellung als Zeichen in HTML lautet "03b1".
 +
 +
$ <b>echo α | od -xc</b>
 +
0000000 b1ce 000a
 +
        316 261  \n  \0
 +
0000003
 +
 +
Dieselbe Ausgabe, aber ohne von "echo" an die Ausgabe angehängten Zeilenumbruch.
 +
 +
$ <b>echo -n α | od -xc</b>
 +
0000000 b1ce 000a
 +
        316 261
 +
0000002
 +
 +
Anzeige von UTF-8 Zeichen in Codepoint hex-Darstellung.
 +
 +
$ <b>echo 'UTF-8 ist schön' | preconv -r</b>
 +
UTF-8 ist sch\[u00F6]n
 +
 +
Umwandlung des Wertes "FC" von Hexadezimal zu Dezimal (252) mit "bc".
 +
 +
$ <b>echo "ibase=16; FC" | bc</b>
 +
252
 +
 +
Umwandlung des Wertes "334" von Oktal zu Dezimal (220) mit "bc".
 +
 +
$ <b>echo "ibase=8; 334" | bc</b>
 +
220
 +
 +
Umwandlung einer Datei mit Zeichensatz Latin1 zu Zeichensatz UTF-8.
 +
 +
$ <b>iconv -f latin1 -t utf-8 latin1.txt > utf8.txt</b>
 +
 +
== Einige Spezialzeichen ==
 +
 +
<table style="BORDER-COLLAPSE: collapse" cellspacing="1" cellpadding="0">
 +
  <tr>
 +
  <td valign="top">
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">special signs 1</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9650;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9650;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9658;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9658;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9660;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9660;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9668;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9668;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9632;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9632;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9633;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9633;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9635;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9635;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9636;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9636;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9637;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9637;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9638;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9638;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9639;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9639;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9640;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9640;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9641;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9641;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9642;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9642;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9643;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9643;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&loz;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;loz;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9674;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9675;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9675;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9679;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9679;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9786;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9786;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9787;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9787;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9788;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9788;</td></tr>
 +
    </table>
 +
  </td>
 +
  <td valign=top>&nbsp;</td>
 +
  <td valign=top>
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">german umlaut</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&curren;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;auml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#228;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&bdquo;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;Auml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#196;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&para;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;ouml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#246;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&ndash;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;Ouml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#214;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&frac14;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;uuml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#252;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&oelig;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;Uuml;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#220;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&Yuml;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;szlig;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#223;</td></tr>
 +
    </table><br>
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">money signs</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&acirc;&sbquo;&not;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;euro;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8364;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&cent;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;cent;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#162;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&pound;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;pound;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#163;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&yen;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;yen;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#165;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&curren;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;curren;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#164;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8362;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8362;</td></tr>
 +
    </table>
 +
  </td>
 +
  <td valign="top">&nbsp;</td>
 +
  <td valign="top">
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">special signs 2</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9792;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9792;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9794;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9794;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&spades;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;spades;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9824;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9828;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9828;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&clubs;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;clubs;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9827;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9831;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9831;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&hearts;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;hearts;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9829;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9825;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9825;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&diams;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;diams;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9830;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9733;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9733;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9734;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9734;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8962;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8962;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8470;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8470;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9742;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9742;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9743;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9743;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9832;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9832;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9756;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9756;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9758;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9758;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9833;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9833;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9834;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9834;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9835;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9835;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9836;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9836;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#9837;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#9837;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&acirc;&euro;&nbsp;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8224;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&acirc;&euro;&iexcl;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8225;</td></tr>
 +
    </table><br>
 +
  </td>
 +
  <td valign=top>&nbsp;</td>
 +
  <td valign=top">
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">arrows</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&larr;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;larr;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8592;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&uarr;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;uarr;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8593;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&rarr;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;rarr;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8594;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&darr;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;darr;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8595;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&harr;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;harr;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8596;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8597;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8597;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8598;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8598;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8599;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8599;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8600;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8600;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8601;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8601;</td></tr>
 +
    </table><br>
 +
  <td valign=top>&nbsp;</td>
 +
  <td valign=top>
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">special signs 3</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&nbsp;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;nbsp;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#160;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&quot;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;quot;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#34;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&amp;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;amp;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#38;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&lt;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;lt;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#60;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&gt;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;gt;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#62;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&sect;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;sect;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#167;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&laquo;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;laquo;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#171;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&raquo;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;raquo;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#187;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&copy;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;copy;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#169;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&reg;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;reg;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#174;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&acirc;&bdquo;&cent;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;trade;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8482;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&plusmn;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;plusmn;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#177;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&iquest;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;iquest;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#191;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&iexcl;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;iexcl;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#161;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">@</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#64;</td></tr>
 +
    </table><br>
 +
  <td>
 +
  <td valign=top>&nbsp;</td>
 +
  <td valign=top>
 +
    <table onmouseover="changeto(event, 'FEF3CD')" style="BORDER-COLLAPSE: collapse" onmouseout="changeback(event, 'white')" bordercolor="#c0c0c0" cellspacing="0" cellpadding="3" border="1">
 +
      <tr>
 +
        <td style="BORDER-RIGHT: 0px solid; BORDER-TOP: 0px solid; BORDER-BOTTOM-WIDTH: 1px; BORDER-LEFT: 0px solid" align="middle" bgcolor="#000080" colspan="3"><b><font color="#ffffff" size="2">math signs</font></b></td></tr>
 +
      <tr>
 +
        <td align="middle" bgcolor="#ffd763">&nbsp;</td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Code</b></td>
 +
        <td align="middle" bgcolor="#ffd763"><b>Unicode</b></td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&mdash;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;times;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#215;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Atilde;&middot;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;divide;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#247;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">+</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#43;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">-</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#45;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8486;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8486;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&radic;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8730;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&frac14;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;frac14;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#188;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&frac12;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;frac12;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#189;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&frac34;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;frac34;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#190;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8531;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8531;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8532;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8532;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8539;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8539;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8540;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8540;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8541;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8541;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&#8542;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8542;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">%</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#37;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&acirc;&euro;&deg;</font></td>
 +
        <td nowrap="nowrap" align="middle"></td>
 +
        <td nowrap="nowrap" align="middle">&amp;#8240;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&sup1;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;sup1;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#185;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&sup2;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;sup2;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#178;</td></tr>
 +
      <tr>
 +
        <td nowrap="nowrap" align="middle"><font color="#f97c00" size="4">&Acirc;&sup3;</font></td>
 +
        <td nowrap="nowrap" align="middle">&amp;sup3;</td>
 +
        <td nowrap="nowrap" align="middle">&amp;#179;</td></tr>
 +
    </table>
 +
  </td>
 +
</tr></table>
  
 
== Weblinks ==
 
== Weblinks ==
  
{{dewi|UTF-8|UTF-8}}
+
{{Weblinks}}
 +
{{url_dewikipedia|UTF-8|UTF-8}}
 +
{{url|DE||ger|http://www.utf8-zeichentabelle.de/|UTF-8-Codetabelle mit Unicode-Zeichen|Anmerkung: der Unicode Codepoint ist gelb hinterlegt, der UTF-8-Code ist rot hinterlegt.}}
 +
{{Fuss}}
 +
 
 +
* [http://www.utf8-zeichentabelle.de/unicode-utf8-table.pl?utf8=oct UTF-8-Codetabelle mit Unicode-Zeichen (Oktal)]
 +
* [http://www.utf8-zeichentabelle.de/unicode-utf8-table.pl?utf8=hex UTF-8-Codetabelle mit Unicode-Zeichen (Hexadezimal)]
 +
 
 +
* [http://bueltge.de/wp-content/download/wk/utf-8_kodierungen.pdf UTF-8-Kodierungen Cheatsheet]
 +
 
 +
* https://www.utf8-chartable.de/unicode-utf8-table.pl?utf8=dec
 +
 
 +
* [https://unix.stackexchange.com/questions/6516/filtering-invalid-utf8 Filtering invalid UTF-8]
  
 +
Unicode-Zeichen mit Hexadezimalcode 00df (das Zeichen "ß" bzw. "LATIN SMALL LETTER SHARP S").
 +
* https://www.fileformat.info/info/unicode/char/00df/index.htm
  
 
{{cat|Unicode}}
 
{{cat|Unicode}}
 
{{cat|Zeichensatz}}
 
{{cat|Zeichensatz}}

Aktuelle Version vom 26. Juni 2021, 11:10 Uhr

Das 8-Bit Unicode Transformation Format / UTF-8 ist die am weitesten verbreitete Kodierung für Unicode-Zeichen. Dabei wird jedem Unicode-Zeichen eine besonders kodierte Bytekette von variabler Länge zugeordnet. UTF-8 unterstützt bis zu vier Byte, auf die sich wie bei allen UTF-Formaten alle Unicode-Zeichen abbilden lassen.

UTF-8 wurde als Alternative zu UTF-16 entwickelt und stellt jedes Zeichen durch 8 Bit bzw. ein Byte dar. Dabei werden die US-ASCII-Zeichen (7 Bit) wie bisher durch ein Byte dargestellt, deren oberstes Bit 0 ist. Alle anderen Unicode-Zeichen werden durch zwei bis vier Byte lange Byte-Ketten dargestellt. Damit gibt es keinen unmittelbaren Zusammenhang mehr zwischen der Anzahl Byte und der Anzahl Zeichen einer Datei.

UTF-8 hat eine zentrale Bedeutung als globale Zeichenkodierung im Internet und ist inzwischen die meist genutzte Unicode-Anwendung. Die Internet Engineering Task Force verlangt von allen neuen Internetkommunikationsprotokollen, dass die Zeichenkodierung deklariert wird und dass UTF-8 eine der unterstützten Kodierungen ist. Das Internet Mail Consortium / IMC empfiehlt, dass alle E-Mail-Programme UTF-8 darstellen und senden können.

Auch bei dem in Webbrowsern verwendeten HTML setzt sich UTF-8 zur Darstellung von länder- und sprachspezifischen Zeichen zunehmend durch und ersetzt die vorher benutzten HTML-Sonderzeichen.

Verwendung

Zeichen ISO 8859-1 (Latin1) Unicode (UTF-8) URL
Dezimal Oktal Dezimal Oktal Hexadez.
Ä 196 304 303 204 C3 84 %C3%84
ä 228 344 303 244 C3 A4 %C3%A4
Ö 214 326 303 226 C3 96 %C3%96
ö 246 366 303 266 C3 B6 %C3%B6
Ü 220 334 303 234 C3 9C %C3%9C
ü 252 374 303 274 C3 BC %C3%BC
α 316 261 CE B1
(Zeilenumbruch) \n 0A
(nichts) \0 00
Unklar, ob folgende stimmen:
(Leerzeichen) 0000 %00%00
/ 2215 %22%15
( 0028 %00%28
) 0029 %00%29

Ausgabe der UTF-8-Kodierung des Zeichens "α" (Alpha aus dem griechischen Alphabet). Im Beispiel wird für das Zeichen "α" die (hexadezimale) UTF-8-Kodierung "ceb1" (also genau umgekehrt wie von "echo" ausgegeben; dezimal: 316 261, hier ist die Reihenfolge wie angezeigt). Danach folgt noch ein Zeilenumbruch ("0a") und nichts weiter ("00"). Zu beachten ist natürlich, dass das verwendete Terminal auch tatsächlich auf die Anzeige von UTF-8 eingestellt ist! Der entsprechende Code Point für die Darstellung als Zeichen in HTML lautet "03b1".

$ echo α | od -xc
0000000 b1ce 000a
        316 261  \n  \0
0000003

Dieselbe Ausgabe, aber ohne von "echo" an die Ausgabe angehängten Zeilenumbruch.

$ echo -n α | od -xc
0000000 b1ce 000a
        316 261
0000002

Anzeige von UTF-8 Zeichen in Codepoint hex-Darstellung.

$ echo 'UTF-8 ist schön' | preconv -r
UTF-8 ist sch\[u00F6]n

Umwandlung des Wertes "FC" von Hexadezimal zu Dezimal (252) mit "bc".

$ echo "ibase=16; FC" | bc
252

Umwandlung des Wertes "334" von Oktal zu Dezimal (220) mit "bc".

$ echo "ibase=8; 334" | bc
220

Umwandlung einer Datei mit Zeichensatz Latin1 zu Zeichensatz UTF-8.

$ iconv -f latin1 -t utf-8 latin1.txt > utf8.txt

Einige Spezialzeichen

special signs 1
  Code Unicode
&#9650;
&#9658;
&#9660;
&#9668;
&#9632;
&#9633;
&#9635;
&#9636;
&#9637;
&#9638;
&#9639;
&#9640;
&#9641;
&#9642;
&#9643;
&loz; &#9674;
&#9675;
&#9679;
&#9786;
&#9787;
&#9788;
 
german umlaut
  Code Unicode
ä &auml; &#228;
Ä &Auml; &#196;
ö &ouml; &#246;
Ö &Ouml; &#214;
ü &uuml; &#252;
Ü &Uuml; &#220;
ß &szlig; &#223;

money signs
  Code Unicode
€ &euro; &#8364;
¢ &cent; &#162;
£ &pound; &#163;
Â¥ &yen; &#165;
¤ &curren; &#164;
&#8362;
 
special signs 2
  Code Unicode
&#9792;
&#9794;
&spades; &#9824;
&#9828;
&clubs; &#9827;
&#9831;
&hearts; &#9829;
&#9825;
&diams; &#9830;
&#9733;
&#9734;
&#8962;
&#8470;
&#9742;
&#9743;
&#9832;
&#9756;
&#9758;
&#9833;
&#9834;
&#9835;
&#9836;
&#9837;
† &#8224;
‡ &#8225;

 
arrows
  Code Unicode
&larr; &#8592;
&uarr; &#8593;
&rarr; &#8594;
&darr; &#8595;
&harr; &#8596;
&#8597;
&#8598;
&#8599;
&#8600;
&#8601;

 
special signs 3
  Code Unicode
  &nbsp; &#160;
" &quot; &#34;
& &amp; &#38;
< &lt; &#60;
> &gt; &#62;
§ &sect; &#167;
« &laquo; &#171;
» &raquo; &#187;
© &copy; &#169;
® &reg; &#174;
â„¢ &trade; &#8482;
± &plusmn; &#177;
¿ &iquest; &#191;
¡ &iexcl; &#161;
@ &#64;

 
math signs
  Code Unicode
× &times; &#215;
÷ &divide; &#247;
+ &#43;
- &#45;
&#8486;
&#8730;
¼ &frac14; &#188;
½ &frac12; &#189;
¾ &frac34; &#190;
&#8531;
&#8532;
&#8539;
&#8540;
&#8541;
&#8542;
% &#37;
‰ &#8240;
¹ &sup1; &#185;
² &sup2; &#178;
³ &sup3; &#179;

Weblinks

Herausgeber Sprache Webseitentitel Anmerkungen
country DE.gif Wikipedia ger UTF-8wbm Enzyklopädischer Artikel
country DE.gif ger UTF-8-Codetabelle mit Unicode-Zeichenwbm Anmerkung: der Unicode Codepoint ist gelb hinterlegt, der UTF-8-Code ist rot hinterlegt.

Unicode-Zeichen mit Hexadezimalcode 00df (das Zeichen "ß" bzw. "LATIN SMALL LETTER SHARP S").