No utf-8 for me

| Web Design | 3 Comments

I never figured that the solution would be so easy. Those bothersome question marks within black diamonds that were appearing occasionally on some of my older blog entries were driving me nuts.

All that I had to do was change the following:

<meta http-equiv="Content-Type" content="text/html;
   charset=utf-8" />

into this:

<meta http-equiv="Content-Type" content="text/html;
   charset=iso-8859-1" />

See the difference? Perhaps this is not the most elegant way to fix this, not internationally compatible with every known character set on the planet, but it works.

3 Comments

UTF-8 is a standard way to store characters outside the ISO-8859-1 specification within 8 or 16 bit characters. Another way to do this is by using ISO-8859-2 for eastern europe, but since the ISO standard only stores characters inside 8 bits the ISO-8859-2 spec is not fully compatible with the ISO-8859-1 spec and you will lose characters like the copyright symbol.

The reason why some of the entries are not submitted correctly is because some browsers will follow the language setting of itself to submit fields inside a form. This will result in ISO-8859-1 submission to an UTF-8 site if there is no characterset specified inside the form itself.

Another, and better, way to solve the UTF-8 problem is by making all forms submit in UTF-8.

Thanks for the tip, Art. Now I figured an easy way to search and replace those characters. First I switch over to ISO-8859-1 so that the search field accepts the correct characters. Second, each entry containing such unwanted characters can be scanned and zapped accordingly. Finally when I am all done, I switch back to the more acceptable utf-8 and it's square one all over again. Piece of cake, really.

Oh yeah, about the copyright symbol. The limitiation you mention should not be a problem as long as you use the &copy thingie instead.

The copy thingie is not part of any characterset. It's a workaround introduced in html 3, just as well as some other characters and entities. If you go that way you can use &#number; as well for all your characters which are not in your current characterset.

Random entries

Here are some random entries that you might be interested in:

Recent Assets

  • tegen-par-2024-2nd-place.jpeg
  • stanford-reunion.png
  • kiff.png
  • hoid.png
  • Dad-in-front-of-log-cabin.png
  • mistborn-trilogy.png
  • 2024-03-Heren1-27h.png
  • three-body-problem.png
  • 10CC.png
  • minds-and-machines.jpeg
  • puglia.png
  • 2023-09-24-jong-tegen-oud-1.jpg

Recent Comments

  • No utf-8 for me: The copy thingie is not part of any characterset. ...
    - Art van Scheppingen
  • No utf-8 for me: Thanks for the tip, Art. Now I figured an easy way ...
    - Kiffin
  • No utf-8 for me: UTF-8 is a standard way to store characters outsid ...
    - Art van Scheppingen

Golf Handicap

Information

This personal weblog was started way back on July 21, 2001 which means that it is 7-21-2001 old.

So far this blog contains no less than 2518 entries and as many as 1877 comments.

Important events

Graduated from Stanford 6-5-1979 ago.

Kiffin Rockwell was shot down and killed 9-23-1916 ago.

Believe it or not but I am 10-11-1957 young.

First met Thea in Balestrand, Norway 6-14-1980 ago.

Began well-balanced and healthy life style 1-8-2013 ago.

My father passed away 10-20-2000 ago.

My mother passed away 3-27-2018 ago.

Started Gishtech 04-25-2016 ago.