<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<!DOCTYPE bugzilla SYSTEM "https://bugs.webkit.org/page.cgi?id=bugzilla.dtd">

<bugzilla version="5.0.4.1"
          urlbase="https://bugs.webkit.org/"
          
          maintainer="admin@webkit.org"
>

    <bug>
          <bug_id>15630</bug_id>
          
          <creation_ts>2007-10-22 15:16:11 -0700</creation_ts>
          <short_desc>After U+3001, U+3002 (ideographic comma/full stop), lines cannot be broken</short_desc>
          <delta_ts>2008-02-25 14:28:08 -0800</delta_ts>
          <reporter_accessible>1</reporter_accessible>
          <cclist_accessible>1</cclist_accessible>
          <classification_id>1</classification_id>
          <classification>Unclassified</classification>
          <product>WebKit</product>
          <component>Layout and Rendering</component>
          <version>523.x (Safari 3)</version>
          <rep_platform>PC</rep_platform>
          <op_sys>Windows XP</op_sys>
          <bug_status>RESOLVED</bug_status>
          <resolution>DUPLICATE</resolution>
          <dup_id>17411</dup_id>
          
          <bug_file_loc>http://usstock.jrj.com.cn/xhmt</bug_file_loc>
          <status_whiteboard></status_whiteboard>
          <keywords></keywords>
          <priority>P2</priority>
          <bug_severity>Normal</bug_severity>
          <target_milestone>---</target_milestone>
          
          
          <everconfirmed>1</everconfirmed>
          <reporter name="Jungshik Shin">jshin</reporter>
          <assigned_to name="Nobody">webkit-unassigned</assigned_to>
          <cc>ap</cc>
    
    <cc>artension</cc>
    
    <cc>mitz</cc>
          

      

      

      

          <comment_sort_order>oldest_to_newest</comment_sort_order>  
          <long_desc isprivate="0" >
    <commentid>59198</commentid>
    <comment_count>0</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2007-10-22 15:16:11 -0700</bug_when>
    <thetext>Due to the problem described in the summary line, the layout at http://usstock.jrj.com.cn/xhmt is broken.  

WebKit uses ICU line breaking iteratoer, which in turn is based on UAX #14 (Unicode Line Breaking Algorithm). It has the following rule:

CL x (AL|NU)

where CL includes U+3001 and U+3002 (Ideographic Comma and Full Stop).  With the above rule, lines cannot be broken when U+3001 and U+3002 are followed by a Latin letter or a number.  As a result, the box at the url given above with the title</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59199</commentid>
    <comment_count>1</comment_count>
      <attachid>16810</attachid>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2007-10-22 15:19:09 -0700</bug_when>
    <thetext>Created attachment 16810
layout test case 

two columns should be rendered identically.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59200</commentid>
    <comment_count>2</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2007-10-22 15:24:09 -0700</bug_when>
    <thetext>Hmm my comment #0 got trimmed....

As a result, the box at the url given above with the title </thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59210</commentid>
    <comment_count>3</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2007-10-22 17:25:23 -0700</bug_when>
    <thetext>Try once more (this time with FF ;-)). 

The textbox whose title is &apos;美通社简介&apos; is a lot wider than its specified width breaking the layout of the page. 

A fix is very simple. We have to tailor UAX #14&apos;s line breaking property so that U+3001 and U+3002 followed by a Latin letter/number (or more broadly, any character belonging to AL/NU classes) are regarded as a line breaking opportunity. A way to do that is to move those characters from CL class to NS (non-starter) class in ICU&apos;s source/data/brkiter/line.txt. 

For WinSafari, it&apos;d be a simple change, but for Safari on Mac, this may be more involved because it may mean changing the build of ICU shipped with OS X. 

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59229</commentid>
    <comment_count>4</comment_count>
    <who name="David Kilzer (:ddkilzer)">ddkilzer</who>
    <bug_when>2007-10-22 22:56:34 -0700</bug_when>
    <thetext>(In reply to comment #3)
&gt; Try once more (this time with FF ;-)). 

It sounds like you&apos;re hitting Bug 14562 (or something similar) when entering text in a text area (which is truncated when sent to the server).

Could you please file a new bug on this, stating the version of Safari/WebKit you&apos;re using, and steps to reproduce.  Thanks!

</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59441</commentid>
    <comment_count>5</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2007-10-25 09:47:05 -0700</bug_when>
    <thetext>I can only reproduce this problem on Windows - Mac (Tiger) works as expected for me.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59545</commentid>
    <comment_count>6</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2007-10-26 09:18:33 -0700</bug_when>
    <thetext>Do you know if this has been reported to the Unicode consortium? This rule is new to Unicode 5.0, and doesn&apos;t look quite right, as you point out.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>59573</commentid>
    <comment_count>7</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2007-10-26 14:44:48 -0700</bug_when>
    <thetext>Yes, I&apos;ve been in contact with the author of UAX #14 (indirectly). I talked to the author of ICU break iterator and he agreed with me (actually, we sat together and he suggested changing the class of those two to NS).  </thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>71030</commentid>
    <comment_count>8</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2008-02-18 02:51:21 -0800</bug_when>
    <thetext>Bug 17411 has a patch for this.

I&apos;m still unsure whether the Unicode consortium is aware of this issue. ICU is one thing, but the proposed update to UAX #14 at &lt;http://www.unicode.org/reports/tr14/tr14-21.html&gt; seems to be unchanged.</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>71641</commentid>
    <comment_count>9</comment_count>
    <who name="Satoshi Nakagawa">artension</who>
    <bug_when>2008-02-23 14:33:52 -0800</bug_when>
    <thetext>(In reply to comment #8)
I agree.  They could be not aware of this issue.
I wrote a report about this problem, and sent it to the Unicode ML.

http://limechat.net/report/unicode-line-break-problem.html</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>71673</commentid>
    <comment_count>10</comment_count>
    <who name="Alexey Proskuryakov">ap</who>
    <bug_when>2008-02-23 22:08:00 -0800</bug_when>
    <thetext>Marking as a duplicate, as bug 17411 has an approved fix for this.

*** This bug has been marked as a duplicate of 17411 ***</thetext>
  </long_desc><long_desc isprivate="0" >
    <commentid>71909</commentid>
    <comment_count>11</comment_count>
    <who name="Jungshik Shin">jshin</who>
    <bug_when>2008-02-25 14:28:08 -0800</bug_when>
    <thetext>(In reply to comment #8)
&gt; Bug 17411 has a patch for this.
&gt; 
&gt; I&apos;m still unsure whether the Unicode consortium is aware of this issue. ICU is
&gt; one thing, but the proposed update to UAX #14 at
&gt; &lt;http://www.unicode.org/reports/tr14/tr14-21.html&gt; seems to be unchanged.

In the meantime, they nuked LB30 instead of changing the class for U+3001/3002.  A long-term solution is being worked on according to my source. 

Anyway, on Mac OS X, ICU will always lag behind, I agree that we should fix webkit code (as in bug 17411) .


</thetext>
  </long_desc>
      
          <attachment
              isobsolete="0"
              ispatch="0"
              isprivate="0"
          >
            <attachid>16810</attachid>
            <date>2007-10-22 15:19:09 -0700</date>
            <delta_ts>2007-10-22 15:19:09 -0700</delta_ts>
            <desc>layout test case </desc>
            <filename>ideo_punc.html</filename>
            <type>text/html</type>
            <size>710</size>
            <attacher name="Jungshik Shin">jshin</attacher>
            
              <data encoding="base64">PGh0bWw+DQogIDxoZWFkPg0KPG1ldGEgaHR0cC1lcXVpdj0iQ29udGVudC1UeXBlIiBjb250ZW50
PSJ0ZXh0L2h0bWw7IGNoYXJzZXQ9dXRmLTgiPg0KPHRpdGxlPnRlc3QgY2FzZSBmb3Igd2Via2l0
IGJ1ZyAxNTYzMDwvdGl0bGU+DQo8L2hlYWQ+DQo8Ym9keT4NCg0KICA8cD5UaGUgY29sdW1uIG9u
IHRoZSByaWdodCBoYXMgZXhwbGljaXQgc3BhY2VzIGFmdGVyIGVhY2ggb2YNCiAgVSszMDAxIChp
ZGVvZ3JhcGhpYyBjb21tYSkgYW5kIFUrMzAwMiAoaWRlb2dyYXBoaWMgZnVsbCBzdG9wKSB3aGls
ZQ0KICB0aGUgbGVmdCBjb2x1bW4gZG9lcyBub3QuIFRoZSBsZWZ0IGFuZCB0aGUgcmlnaHQgY29s
dW1ucyBzaG91bGQgYmUgDQogIHJlbmRlcmVkIGlkZW50aWNhbGx5LjwvcD4NCg0KPGRpdiBzdHls
ZT0id2lkdGg6MXB4OyBwb3NpdGlvbjphYnNvbHV0ZTsgbGVmdDozMCUiPg0KRXhwZXJ0c+OAgWVX
YXRjaChUTSnjgIExMjPjgIFhYmM0NTbjgIExMjNFeHBlcnRz44CCZVdhdGNoKFRNKeOAgjEyM+OA
gmFiYzQ1NuOAgjEyMw0KPC9kaXY+DQoNCjxkaXYgc3R5bGU9IndpZHRoOjFweDsgcG9zaXRpb246
YWJzb2x1dGU7IGxlZnQ6NjAlIj4NCkV4cGVydHPjgIEgZVdhdGNoKFRNKeOAgSAxMjPjgIEgYWJj
NDU244CBIDEyM0V4cGVydHPjgIIgZVdhdGNoKFRNKeOAgiAxMjPjgIIgYWJjNDU244CCIDEyMw0K
PC9kaXY+DQo8L2JvZHk+DQo8L2h0bWw+DQo=
</data>

          </attachment>
      

    </bug>

</bugzilla>