Wikisource:Unify transcription namespaces
The "Index" and "Page" namespaces are usually used together. This page will suggest methods of unifying them, giving benefits and costs, and will explore the disadvantages.
Benefits
edit- One namespace means less for new users to learn and understand transcription projects on Wikisource.
- If the djvu "Index" page and its subpages are in the same namespace, all subpages can be moved when the Index page is moved.
- Stop using the namespace "Index", which means catalogue in English when used in a library context.
Data migration
editDirect data manipulation
editThe database schema contains a 'namespace' for every page. With a single line of SQL, all "Index" and "Page" pages can be changed so that those pages become attached to a different namespace.
Bot
editThe pywikipedia bot framework has the ability to "rename" all pages to the desired namespace.
Proofread Page extension
editIn order for the Proofread Page extension to work with only a single unified namespace, the code will need to be changed.
Djvu files
editFor djvu files, only a few lines of code need to be changed.
Series of images
editIn order for this proposal to work for books comprised of separate images, a little more logic is required. On page creation, the extension will need follow this order:
- if there is an image with the same pagename, the side-by-side view should be shown.
- show the pretty "edit index" page.
Backwards compatibility
edit- Todo : What level of backwards compatibility would be desirable.
Proposed namespaces
editThese sections describe the methods which could be used to unify the two namespaces.
Image
editOne method of unifying the namespaces is to develop a more specialised handler for the "Image" namespace.
Migration
editA book that is a djvu would be changed like this:
Index:Example.djvu -> Image:Example.djvu Page:Example.djvu/1 -> Image:Example.djvu/1 Page:Example.djvu/2 -> Image:Example.djvu/2 Page:Example.djvu/3 -> Image:Example.djvu/3 ..etc
A book that is comprised of separate images would be changed like this:
Index:Example -> Image:Example Page:Example p1.png -> Image:Example p1.png Page:Example p2.png -> Image:Example p2.png Page:Example p3.png -> Image:Example p3.png ..etc
Implementation
editIn this proposal, the purpose of the Image namespace changes in the following ways:
Image:Example.djvu
editCurrently these pages display a thumbnail of the first pagescan, and we add {{Information}} on the Commons page, and that appears on Wikisource. In this idea, if there is no local page, the top of the page would be a thumbnail on the left, and <pagelist/> on the right, and underneath the Commons page would be displayed. However, we could edit this page, and change the presentation if we want to. Our "Index" pages usually have very similar information as the Commons image page.
Image:Example.djvu/1
editThis page isnt a legal image name. The upload form will prevent a file from being uploaded with this name, so there is no harm in putting a normal text page there. I propose that we put the side-by-side view of a djvu page at this name.
Image:Example
editThis is also not a legal image name. The upload form will prevent a file from being uploaded with this name, so there is no harm in putting an index of the pagescans at this name. It would look like:
<nowiki> {{Index | title=Book title | pubyear=1901 | author=Old dead person | pages=[[Image:Book title p1.png|p1]] [[Image:Book title p2.png|p2]] [[Image:Book title p3.png|p3]] | ... }} <nowiki>
Image:Example p1.png
editBy default, image pages would look like the normal Commons page with a thumbnail. When the user clicks create, it would show the side-by-side edit view, allowing transcription. When the editor clicks save, the side by side-by-side view would be shown.
Benefits
editThe most important benefit that is that we eliminate special namespaces. There are a few ways this helps:
- new wikis do not need to set up additional namespaces in order to implement Proofread Page, which means they can start using it immediately.
- After a djvu or image is uploaded, the user is shown the Image page. Users on other wikis are used to an Image page, and are familiar with how to edit them.
Page
editA "Page" namespace will require the least amount of work, and the indexes are fewer in number, and viewing the history of those is less important. The indexes in a "Page" namespace is not very strange.
Scanset
editA "Scanset" or "Transcription" namespace accurately describes the contents of the pages found in that namespace.
Migration
editA book that is a djvu would be changed like this:
Index:Example.djvu -> Scanset:Example.djvu Page:Example.djvu/1 -> Scanset:Example.djvu/1 Page:Example.djvu/2 -> Scanset:Example.djvu/2 Page:Example.djvu/3 -> Scanset:Example.djvu/3 ..etc
A book that is comprised of separate images would be changed like this:
Index:Example -> Scanset:Example Page:Example p1.png -> Scanset:Example p1.png Page:Example p2.png -> Scanset:Example p2.png Page:Example p3.png -> Scanset:Example p3.png ..etc