cover image

Mojibake

Garbled text as a result of incorrect character encodings / From Wikipedia, the free encyclopedia

Dear Wikiwand AI, let's keep it short by simply answering these key questions:

Can you list the top facts and stats about Mojibake?

Summarize this article for a 10 year old

SHOW ALL QUESTIONS

Mojibake (Japanese: 文字化け; IPA: [mod͡ʑibake], "character transformation") is the garbled or gibberish text that is the result of text being decoded using an unintended character encoding.[1] The result is a systematic replacement of symbols with completely unrelated ones, often from a different writing system.

Mojibakevector.png
The UTF-8-encoded Japanese Wikipedia article for Mojibake displayed as if interpreted as Windows-1252
Mojibakevector2.png
The UTF-8-encoded Russian Wikipedia article on Church Slavonic displayed as if interpreted as KOI8-R

This display may include the generic replacement character ("�") in places where the binary representation is considered invalid. A replacement can also involve multiple consecutive symbols, as viewed in one encoding, when the same binary code constitutes one symbol in the other encoding. This is either because of differing constant length encoding (as in Asian 16-bit encodings vs European 8-bit encodings), or the use of variable length encodings (notably UTF-8 and UTF-16).

Failed rendering of glyphs due to either missing fonts or missing glyphs in a font is a different issue that is not to be confused with mojibake. Symptoms of this failed rendering include blocks with the code point displayed in hexadecimal or using the generic replacement character. Importantly, these replacements are valid and are the result of correct error handling by the software.

Oops something went wrong: