11231 – RegExp bug when handling newline characters

RESOLVED FIXED 11231

RegExp bug when handling newline characters

https://bugs.webkit.org/show_bug.cgi?id=11231

Summary RegExp bug when handling newline characters

Brad Choate

Reported 2006-10-09 11:38:45 PDT

Test case: var s1 = "\nAbc\n"; var s2 = s1.replace(/(\n)[^\n]+$/, '$1'); if (s1 == s2) alert("expected: s1 == s2"); else alert('s1 != s2: "' + s2.replace(/\n/g,"\\n") + '" != "\\nAbc\\n"'); In FireFox, IE and Perl, s1 == s2. In Safari (even in the latest nightly builds), s2 is stripped of the inner 'Abc' characters. Changing the regex to use * instead of + doesn't change the result either.

Attachments
Test case from Comment #0 (206 bytes, text/html) 2007-11-04 07:45 PST, David Kilzer (:ddkilzer)	no flags	Details
patch for this and a few other problems (76.72 KB, patch) 2007-11-12 16:26 PST, Darin Adler	ggaren: review+	Details Formatted Diff Diff
View All Add attachment proposed patch, testcase, etc.

David Kilzer (:ddkilzer)

Comment 1 2007-11-04 07:45:44 PST

Created attachment 17031 [details] Test case from Comment #0

Darin Adler

Comment 2 2007-11-12 00:48:26 PST

What's broken here is the $ operator. It's matching a newline at the end of the string. I have a fix.

Darin Adler

Comment 3 2007-11-12 16:26:28 PST

Created attachment 17216 [details] patch for this and a few other problems

Geoffrey Garen

Comment 4 2007-11-12 23:22:38 PST

Comment on attachment 17216 [details] patch for this and a few other problems r=me

Darin Adler

Comment 5 2007-11-13 09:25:57 PST

Committed revision 27752.

Eric Seidel (no email)

Comment 6 2007-11-13 10:12:45 PST

I take it this was no change for sunspider?

Darin Adler

Comment 7 2007-11-13 10:34:21 PST

(In reply to comment #6) > I take it this was no change for sunspider? I'm not sure. I'll test now.

Darin Adler

Comment 8 2007-11-13 10:44:29 PST

SunSpider claims this slowed things down almost 1%. But what is so crazy is that the effects were mostly on the tests that don't use regular expressions at all! For example, it says partial-sums is 2% slower. But there's not a single call into the regular expression for partial-sums. Similarly, it claims that string-base64 is 2% slower. Also doesn't use regular expressions at all. Claims that string-fasta is 2.6% slower. Also doesn't use regular expressions at all. It says regexp-dna is 0.6% faster. That one uses regular expressions. This patch is entirely about correctness, not performance, and it should make regular expressions slightly *faster*. I have no idea why SunSpider thinks it makes non-regular-expression tests slower. We may need to fix SunSpider.

Note You need to log in before you can comment on or make changes to this bug.

Status RESOLVED

Resolution FIXED

Priority P2

Severity Normal

Classification Unclassified

Version 420+

Hardware Mac

OS OS X 10.4

Product WebKit

Component JavaScriptCore

Assignee

Darin Adler

Reported

2006-10-09 11:38 PDT

Modified

2007-11-13 10:44 PST History

CC List

2 users Show

URL

Keywords

Depends on

Blocks