Quoting

  • Thread starter Thread starter ßen
  • 26 comments
  • 1,543 views
Messages
913
This isn't so much a 'problem' as a 'query'.
I think the reason to this is the odd character that starts my name.
When someone quotes me, instead of my name being displayed I get an odd character instead.

What actually happens is that it displays a '?' instead of the actual character, β .

http://img60.imageshack.us/img60/3619/hmmmk8.jpg
Screenshot
 
βen;2402876
This isn't so much a 'problem' as a 'query'.
I think the reason to this is the odd character that starts my name.

WHat actually happens is that it displays a '?' instead of the actual character, β .

http://img60.imageshack.us/img60/3619/hmmmk8.jpg
Screenshot

It displays correctly in the preview window for me...

edit: And displays correctly when posted too...

edit2: I think we need some Beta testing...
 
Yep, it's happening to me with Safari 1.0.3.
 
βen;2402897
See! We have actual proof that it happens. Now we wait for someone cleverer that can give the cure.
Firefox 1.5.0.6 seems to work fine, even with Preview Post.
 
βen;2402897
See! We have actual proof that it happens. Now we wait for someone cleverer that can give the cure.
I believe it relates to the older software not being able to recognize the character. Thus it inserts a default when posting. It'll happen at circumstances where people haven't upgraded.

IE recognizes fine.
 
Nope, that’s not the problem. Safari has better Unicode support than IE by far, so it wouldn’t make sense that this problem is occurring in Safari and not IE…

Unless the character encoding of the page is specified wrong, which is exactly the problem. The character encoding of these pages is ISO-8859-1 (aka Latin 1), which does not support many Unicode characters, including greek characters (such as Ben’s beta). In fact, here’s a graphical representation of every character supported by Latin 1, which ain’t many.

So, if anything, the only reason IE can handle it is because it’s coded so sloppily that the programmers probably never made any distinction between Latin 1 and Unicode, even though there’s a huge difference.

I would suggest to Jordan to just change the encoding of every page to UTF-8, but it’s not as simple as just changing the meta tag – you have to actually make sure each page is re-encoded, which can be a big pain in the butt.
 
IE doesn’t fully support UTF-8, so I’d still lean towards sloppily. ;)
 
And it's a Microsoft product, which definitively points toward sloppy as well.
;)
 
I've quoted in French before, and haven't had any problems with the accents... I'm using IE (I don't have the version on the top of my head).

Are you saying that it's an issue with language packs? Because the sharfess(?) 'ss' German character is a standard inclusion in MS Word, just like a cedilla 'ç'; They are basic characters. Surely browsers can recognise this?!

FormulaGT
 
Does IE7 support it all the way? Or is it like IE6?
I have no clue – the developers have never mentioned it on the IE Blog, so I sorta-kinda assume that they haven’t done anything about it. They really have no incentive to fix it though, because then suddenly sites like this would be “broken” by an IE7 upgrade – remember, Microsoft is crazy about backwards-compatibility.

Are you saying that it's an issue with language packs? Because the sharfess(?) 'ss' German character is a standard inclusion in MS Word, just like a cedilla 'ç'; They are basic characters. Surely browsers can recognise this?!
No no, that’s not the issue at all.

GTP has a declared character set encoding of ISO-8859-1 (Latin 1). That tells the web browser that this webpage should only have the characters shown on this chart. As a parallel example, if I encoded a web page with a Japanese encoding, that would tell the web browser that there should only be Japanese characters on the web page.

So, these web pages are encoded for Latin only, yet they contain non-Latin characters. For instance, notice in my quote above that Duck’s name shows up as “Du?k” – that’s because the c-with-a-dot-above-it is not a Latin character (notice how it’s not on that chart I linked to). On the other hand, your cedilla shows up fine in my quote, because it is a Latin character.

One might observe that I obviously have no problem viewing somebody’s non-Latin username, and that the problem only surfaces when typing it in a text box. In a nutshell, this is what’s happening: I can see this

properly, because a browser is allowed to try to render a character outside of the page’s encoding. You could have a Japanese character on this page, and I’d be able to see it.

However, the issue is when the text gets processed. As I’m typing up this response, I can insert whatever character I want – for example, right now I’m going to type the ohm symbol –> ?. Here’s a screenshot of what it looked like when I typed it:

But what do you see now in this post? A question mark. That’s because Safari encoded it as Unicode (as it should) – when it reached the GTP server, the server went “Oops, doesn’t match the rest of the page!” (since it wasn’t encoded as Latin 1), and thus replaced it with a question mark.

On the other hand, if you were to type that into IE, it would probably go through just fine. That’s because IE would “pretend” that the ohm is Latin 1–encoded (which it clearly isn’t), thus the server would receive it as Latin 1, and it would have no problem spitting it back out onto the web page.

Everything is way more complicated than that (enough that I don’t understand most of it), but that’s the gist of the problem.
 
Quality post, Sage 👍

Nicely explained with good examples and straightforward language. Well, at least the problem has been diagnosed now (but still no-one got my Beta testing joke... :( ).
 
okay check this out i PMed Duke and this was the original message [this one wasn't really a quote]:

[not a real quote
]shouldn't Иismo be on the banned user log because he was a second account for ÄŁPĨЙΣ?

and i got this reply as a quote from Duke:

lthiele
shouldn't ?ismo be on the banned user log because he was a second account for Ä?P????
 
Back