Bug 7241 - Utility to convert among web archive formats
Summary: Utility to convert among web archive formats
Status: ASSIGNED
Alias: None
Product: WebKit
Classification: Unclassified
Component: WebKit Misc. (show other bugs)
Version: 420+
Hardware: Mac OS X 10.4
: P2 Enhancement
Assignee: David Kilzer (:ddkilzer)
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-02-13 16:58 PST by David Kilzer (:ddkilzer)
Modified: 2006-12-24 18:10 PST (History)
1 user (show)

See Also:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description David Kilzer (:ddkilzer) 2006-02-13 16:58:02 PST
It would be nice to have a utility to convert between web archive formats (Safari's .webarchive plist; MSIE's .mht MIME HTML; Firefox's web page, complete) until such time as Bug 7168, Bug 7169 and Bug 7211 are fixed.
Comment 1 David Kilzer (:ddkilzer) 2006-02-13 17:05:31 PST
Taking my own bug.  I originally thought I could write this in a day, but there were more challenges to overcome than I initially thought.  :)

The current version uses Perl and will only work on Mac OS X since it uses (or will use) the PerlObjCBridge to create/extract .webarchive files (which are binary plists).  I plan to have the script read each of the formats into an internal format-neutral data structure so that it can target any of the formats on output.

Right now .webarchive to "web page, complete" is very close to working.

Lessons learned from this project will undoubtedly help to implement the bugs noted in the Description above.