Difference between revisions of "Roadmap"

From openZIM
Jump to navigation Jump to search
Line 1: Line 1:
See also current [[Status and next steps]].
See also current [[Status and next steps]].


;Until end of 2009
* Finalizing ZIM file format
** ZIM file header:
*** <s>add Pointer to UrlPointerList (IndexPointerList will be named "TitlePointerList")</s>
*** <s>add Pointer to MimeTypeList</s>
** ZIM file structure:
*** <s>add UrlPointerList (article list ordered by URL)</s>
*** <s>add MimeTypeList to store MimeTypes in a zero-terminated list</s>
*** <s>make new integer compression (UTF-8 compression / ZInt compression)</s>
*** <s>break version number into major / minor number</s>
** Directory Entry:
*** <s>drop QUnicode on article titles</s>
*** <s>add URL</s>
*** <s>add rev_id int compressed</s> (not int compressed)
** Index Namespace (X)
*** <s>switch to new int compression</s>
;Later
* Layout Namespace (A / B)
* Layout Namespace (A / B)
** A - HTML body
** A - HTML body
** B - HTML header template
** B - HTML header
** reader sets flag when loading library if it wants to get HTML body or full layout using header template
** two methods for reading either pure article text or combined pages
** MIME types used
*** html-body
*** html-layout


* Cluster compression
* switch to compression streaming (only keep in memory what is really needed)
** <s>add LZMA compression</s> [[LZMA compression]]
** switch to compression streaming (only keep in memory what is really needed)


* Packaging
* Packaging
Line 37: Line 14:
** Emmanuel adds Microsoft Visual Studio project file to SVN
** Emmanuel adds Microsoft Visual Studio project file to SVN


* Category Namespace (U / V)
* Category Namespace (U / V / W)
** U contains standard article text
** U contains standard article text
** V contains article pointers to articles within that category
** V contains article pointers to articles within that category
* [[Metadata]] Namespace (M) - these fields should be available as variables for layout templates
** W contains category pointers to categories of a given article
** "language" - ISO Code 639-3
** "creator"
** "date" - YYYYMMDD
** "description"
** "relation"
** "source" - URL


* ZIM export running on http://download.wikimedia.org/ '''This is not likely to happen until after the WMF fundraiser finishes which will be after 1/2010''' [[User:Tomasz|Tomasz]] 15:06, 22 November 2009 (UTC)
* ZIM export running on http://download.wikimedia.org/ '''This is not likely to happen until after the WMF fundraiser finishes which will be after 1/2010''' [[User:Tomasz|Tomasz]] 15:06, 22 November 2009 (UTC)
Line 55: Line 26:
** tool to merge two ZIM files
** tool to merge two ZIM files


;April 2009:
* [[LinuxTag 2011|May 11th - 14th LinuxTag 2011 in Berlin]]
* [[Developer Meetings/2010-1]]
* [[Developer Meetings/2011-1|August 2nd - 3rd 2011 Developers Meeting in Haifa, Israel]]

Revision as of 16:05, 17 October 2010

See also current Status and next steps.

  • Layout Namespace (A / B)
    • A - HTML body
    • B - HTML header
    • two methods for reading either pure article text or combined pages
  • switch to compression streaming (only keep in memory what is really needed)
  • Packaging
    • Debian maintainer, contact by Tommi
    • RPM? - maybe openSuSE buildservice
    • static binaries should be updated regularly
    • Emmanuel adds Microsoft Visual Studio project file to SVN
  • Category Namespace (U / V / W)
    • U contains standard article text
    • V contains article pointers to articles within that category
    • W contains category pointers to categories of a given article
  • ZIM export running on http://download.wikimedia.org/ This is not likely to happen until after the WMF fundraiser finishes which will be after 1/2010 Tomasz 15:06, 22 November 2009 (UTC)
    • work in MW API
    • dumper has to add license name and link to the HTML content
  • Updating
    • tool to merge two ZIM files