[go: nahoru, domu]

Open Bug 31691 Opened 25 years ago Updated 2 years ago

Emoticons following Japanese/CJK text without space are not converted to graphics

Categories

(MailNews Core :: Internationalization, defect, P5)

Tracking

(Not tracked)

People

(Reporter: ji, Assigned: smontagu)

Details

(Keywords: helpwanted, intl)

There is one option allowing users to convert the emoticons to graphics. 
But if the emoticons directly follow JPN chars without space, those emoticons
won't be converted to graphics. 

Steps of reproduce:
1. Lauch messenger.
2. Select Edit | Preferences.
3. Select Mail and Newsgroups | Messages
4. Check on Convert emoticons to graphics.
5. Compose a new message with the following lines in the msg body:
   日本語:)
   日本語 :)
6. Send the mail to the testing account itself.
   After receiving, you'll see the first smiley is not converted to 
   graphical happy face while the 2nd one is.

It's common for Asian users to use emoticons following text without space.
cc to rhp
Status: NEW → ASSIGNED
Target Milestone: M16
Please also debug with more than one ASCII spaces between
the end of a JPN string and the beginning of an emoticon
sequence, e.g. JPN_Sring     :)

I have seen some cases earlier where spaces made a difference
whether or not emoticons appear.
QA assingn --> ji
QA Contact: momoi → ji
Looks like this is not Japanese specific.
If I send, Hi:), it's not converted to emoticon either. Need spaces before :).
It this a spec?

Rich, please reassign to the developer of this feature, thanks.
Assignee: nhotta → rhp
Status: ASSIGNED → NEW
Hi Ben,
Can you provide guidance on this one :-)

- rhp
Assignee: rhp → mozilla
The specs can be found at <http://www.bucksch.org/1/projects/mozilla/16507>.



This is failure by intention :). If we don't check for delimitere, we'll mess up

e.g. >I:-) (a woman with an interesting hat) and be bashed by many other customers.



Does somebody have an idea, how to address both interests? Otherwise, I can do

nothing else than mark this WONTFIX.

P5, because I'm not the maintainer of this class anymore.
Priority: P3 → P5
Changed the summary since it's not a Japanese specific problem.
Summary: Emoticons following JPN chars without space are not converted to graphics. → Emoticons following text without space are not converted to graphics.
I don't know how this is implemented, but it should properly parse word breaks
(simple white space parsing does not work for all languages), possibly use the
nsIWordBreaker class.
> nsIWordBreaker class



Cool! If I were aware of this class, I would have used it from the beginning (I

wasn't even aware of the issue).



Are there other similar classes (other than nsLineBreak), which I should be

aware of, e.g. something like FindSentence?



BTW: nsIWordBreak needs documentation. I don't even know, what all the params

are for. (I know TestLineBreak, but still...)



I would /really/ like to fix this bug, but the new maintainer is Netscape (the

long story is, that a Netscape employee messed the class up so much, that I

*can't* maintain it anymore), and Netscape has no interest in enhancing the

class. However, I'll ACCEPT this bug for now and see, how things will develop.





Status: NEW → ASSIGNED
OS: Windows 95 → All
Target Milestone: M16 → M20
BreakInBetween
If we have two pieces of text ( "Th" + "is is a book")
BreakInBetween('Th", 2, "is is a book". 12, &out);
out will be false since it is NOT a word break between "Th" and "is is a book".
This is useful for text which logically continue but physically seperate . For example, "<B>Th</B>is is a book" is logically "This is a book" but physically belong to two text node "Th" and "is is a book"

In other hand, either BreakInBetween('This", 4, " is a book". 10,, &out); or BreakInBetween('This ", 5, "is a book". 9, &out); will be true

Next is used to find the "Next" word boundary
For example, Next("This is a book", 14, 0, &out,&needmore);
out will be 4 since the break position is in 4
needmore will be false because we can decide that the word break in position 4.
In the other hand,Next("This is a book", 14, 10, &out,&needmore);
needmore will be true since from the context, it cannot decide the "book" end after 'k' because it could be the "book" of the "bookcase" and need caller to pass in more context 

You should be able to handle them by just using these two methods. 
ftang, tnx for clarification. Can somebody check this into the tree

(mozilla/intl/lwbrk/public/nsIWordBreaker.h), so others can take advantage of

it, too?

IM and mail shows the same behavior on 03-24-17a jab1 build on Win98-Ja
Keywords: helpwanted
-> helpwanted :-(
Assignee: mozilla → nobody
Status: ASSIGNED → NEW
Target Milestone: M20 → ---
Keywords: intl
Product: MailNews → Core
Filter on "Nobody_NScomTLD_20080620"
Assignee: nobody → smontagu
QA Contact: ji → mailnews.i18n
Product: Core → MailNews Core
still reproduce on Thunderbird 3.1
Summary: Emoticons following text without space are not converted to graphics. → Emoticons following Japanese/CJK text without space are not converted to graphics
Severity: normal → S3
You need to log in before you can comment on or make changes to this bug.