RESOLVED FIXED Bug 48625
[GTK] Optimize foldCase, toLower and toUpper methods in glib unicode backend
https://bugs.webkit.org/show_bug.cgi?id=48625
Summary [GTK] Optimize foldCase, toLower and toUpper methods in glib unicode backend
Carlos Garcia Campos
Reported 2010-10-29 05:34:58 PDT
We could use our owns methods to convert between utf8 and utf16 to avoid the last memcpy needed in every method.
Attachments
Patch (7.24 KB, patch)
2010-10-29 05:37 PDT, Carlos Garcia Campos
no flags
Fixed minor coding style issue in previous patch (7.24 KB, patch)
2010-10-29 05:45 PDT, Carlos Garcia Campos
mrobinson: review-
Updated patch according to review (7.30 KB, patch)
2010-11-12 00:19 PST, Carlos Garcia Campos
no flags
Carlos Garcia Campos
Comment 1 2010-10-29 05:37:41 PDT
Created attachment 72318 [details] Patch GLib methods use UTF-8 strings, so we have to convert from UTF-16 to UTF-8 to perform the case operations and then convert back the result to UTF-16. GLib conversion methods return a new allocated string, so we have to memcpy the result into the destination buffer too. Using our own methods to convert between UTF-8 and UTF-16 from wtf/unicode/UTF8.h we don't need such memcpy, since they take an already allocated buffer rather than returning a new one. There's another optimization for the case when the destination buffer is not large enough. In that case, methods should return the expected destination buffer size and are called again with a new buffer. We can avoid the conversion to UTF-16 by pre-calculating the required size for the destination buffer.
Carlos Garcia Campos
Comment 2 2010-10-29 05:45:08 PDT
Created attachment 72319 [details] Fixed minor coding style issue in previous patch
Martin Robinson
Comment 3 2010-11-11 11:12:28 PST
Comment on attachment 72319 [details] Fixed minor coding style issue in previous patch View in context: https://bugs.webkit.org/attachment.cgi?id=72319&action=review Looks good. It just needs a couple small cleanups. > JavaScriptCore/wtf/unicode/glib/UnicodeGLib.cpp:58 > + utf16Length += (character >= 0x10000) ? 2 : 1; There's a macro in TextBreakIterator.h for this. > JavaScriptCore/wtf/unicode/glib/UnicodeGLib.cpp:83 > + GOwnPtr<char> utf8Result; > + utf8Result.set(caseFunction(buffer.data(), buffer.size())); I think it makes more sense for this to be: GOwnPtr<char> utf8Result(caseFunction(buffer.data(), buffer.size());
Carlos Garcia Campos
Comment 4 2010-11-12 00:19:47 PST
Created attachment 73710 [details] Updated patch according to review
Xan Lopez
Comment 5 2010-11-24 04:29:56 PST
Comment on attachment 73710 [details] Updated patch according to review View in context: https://bugs.webkit.org/attachment.cgi?id=73710&action=review > JavaScriptCore/wtf/unicode/glib/UnicodeGLib.cpp:30 > + Perhaps this could be shared, but you can do that afterwards.
WebKit Commit Bot
Comment 6 2010-11-24 04:51:13 PST
Comment on attachment 73710 [details] Updated patch according to review Clearing flags on attachment: 73710 Committed r72662: <http://trac.webkit.org/changeset/72662>
WebKit Commit Bot
Comment 7 2010-11-24 04:51:18 PST
All reviewed patches have been landed. Closing bug.
Note You need to log in before you can comment on or make changes to this bug.