Archives

Categories

Who Should Edit Wikipedia and Where Should they Do It?

Cory Doctorow wrote an interesting little column for Make magazine about Wikipedia and the way that it “contains facts about facts” [1]. One of the issues with this is that you can’t (or at least shouldn’t) make corrections based on your own knowledge of a subject, you need to cite references. Cory gives a hypothetical example where a world renowned expert couldn’t simply fix an error in Wikipedia, but they could give an interview with the New York Times decrying the quality of Wikipedia and citing the correct data – then once the NYT article was published it could be referenced as an authoritative source and the page updated.

Mako wrote about the Wikireader (an offline reader for Wikipedia) and advocated a facility for writing changes while offline and uploading them [2]. From the perspective of improving the state of the art in computer science and making new features available for more users this sounds like a great idea. But in terms of fitting in with the way Wikipedia works it doesn’t seem viable to me.

Any time that you need to use an offline Wikipedia reader it seems that it will either be impossible or unreasonably difficult to gain the net access necessary to find the reference that Wikipedia requires. Sure you could fix spelling errors, reformat paragraphs, etc but not do anything serious. I wonder how many people actually put any serious effort into Wikipedia with the sole purpose of making it more readable. In terms of my own editing I have made a number of changes that improve readability, but all of them were due to reviewing pages for technical accuracy.

One of the comments on Mako’s post suggested that instead of having an option to directly edit pages there be an option to take notes on pages which could then be used for later editing. Maybe something like a two button combination to flag a particular paragraph as being of dubious accuracy so that the user could later do some Google searches for references.

Of course another problem with this is the delay in updating a Wikireader. For most reading of Wikipedia it doesn’t matter if the content is 6 months old, so we should expect that most users of the Wikireader don’t have the latest data. The Wikireader company offers an update service of posting a new memory card twice a year [3]. Given postage delays etc we should expect that the average age of the content is about 4 months for people who subscribe to the update service, and probably a lot greater for people who download the updates. The popular pages of Wikipedia change fast, in the space of 4 months a page can change significantly.

In terms of general Wikipedia operation I wonder if it would be beneficial to have a button to flag content as possibly not meeting the Wikipedia guidelines. It seems that the number of people who can recognise low quality pages is going to be significantly higher than the number of people who have the skill and time to fix them. I’m sure that there are many people who would just love to be able to choose from a list of Wikipedia pages that have received many negative votes. Such votes could be stored offline and uploaded later, but again that would rely on having recent content on the reader (it would be annoying for a recently fixed page to keep getting voted down).

One area where offline editing of a Wiki would work well is that of niche Wikis. For example if I downloaded a copy of the Debian Wiki (which would only take a tiny fraction of the space in a Wikireader) then I could easily update it every week and the incidence of other people editing a page in the mean time would often be low. I could write long updates that make significant changes to SE Linux related pages while offline without needing any references and in many cases without much risk of conflicts. Implementing a distributed version control system to manage such updates shouldn’t be difficult in principle, although the VCS might have to run on a laptop or desktop system after taking raw data from the Wikireader (tools like GIT seem memory hungry and might not fit on a small device).

Also Mako refers to the “already indefensibly large gap between the number of readers and editors on Wikipedia“. Like Mako I don’t have a TV watching personality. I agree that it is desirable to facilitate read-write access for everyone, but that doesn’t have to involve write access to Wikipedia. I am not convinced that there is a difference between the number of readers and editors of Wikipedia that indicates a problem here. As something becomes more popular the number of people who use it without being committed to it increases, and it also becomes more integrated into society. I don’t think it’s a problem that many people use their writing time for other tasks such as blogging rather than editing Wikipedia. Wikipedia makes a great reference for blog posts and blog posts drive readers (and thus potential editors) to Wikipedia.

Finally I think we should consider the fact that different people have different skill sets. Maybe the Wikipedia reader base gained a significant portion of the people who have the skills to make good editors a long time ago and now the majority of new users just don’t have the skills. If that is the case then having the ratio of readers to editors change in favor of readers while the number of readers expands would be a good thing. I don’t think that I am being elitist in acknowledging that different people have different skill sets, and that some portion of the population will lack the skills necessary to make good contributions to Wikipedia (good enough to outweigh the mistakes that they make). I think that valuing the contributions of all people in society does not require that we value all contributions to Wikipedia.

6 comments to Who Should Edit Wikipedia and Where Should they Do It?

  • Portland State Aerospace Society keeps the wiki in git, so that members who are out launching rockets can do offline updates while other people edit the web copy, then they do a merge and fix up any conflicts. (see ikiwiki.info for more on DVCS-backed wiki possibilities.) Might scale better if you had a prose-aware “diff” program.

  • James

    There are quite a few people who copy-edit Wikipedia articles, but I don’t think it’d be useful to do that offline, since articles can change a lot the corrections might be hard to integrate of could have been already made. I think the killer app for the WikiReader (and WikiReader offline editing) would be a Wikitravel dump – when you’re travelling you often don’t have net access, and it’s also the perfect time to add information.

    Wikipedia has a rating system – check out the talk page of many articles to see how it’s been assessed by the relevant WikiProject(s). This is done as part of Version 1.0 Editorial Team/Work via Wikiprojects, which helps both the individual WikiProjects (which are the real engine of Wikipedia contributions) assess their work and the overall project select articles for the CD-ROM versions.

    If you have a Wikipedia account, you can display the ratings on the article page enabling the “Display an assessment of an article’s quality as part of the page header for each article” gadget in your preferences.

  • Homer

    “One of the issues with this is that you can’t (or at least shouldn’t) make corrections based on your own knowledge of a subject, you need to cite references.”

    Shouldn’t you be able to remove the incorrect statements by pointing to them not having a reference? (or are they by chance pointing to references which are incorrect?)

  • etbe

    Homer: If there are references but they don’t cover the content of the article or if they contradict the article then again you can’t verify this. So it seems to me that statements that are not supported by the references can only be discovered offline if there is an entire lack of references.

    It would of course be possible to ship a copy of Wikipedia along with static copies of the pages it references, there may be some copyright issues in that and also it would take a huge amount of space – not as much as the full database but more than you could reasonably fit on current flash storage.

    James and Don: Thanks for that information.

  • “Any time that you need to use an offline Wikipedia reader it seems that it will either be impossible or unreasonably difficult to gain the net access necessary to find the reference that Wikipedia requires.”

    Not all references are online!

    (Example: I’m looking something up in X University’s law library, but don’t have access to the X.U. network.)

  • etbe

    Bruce: Fair point. However the majority of Wikipedia references appear to be online. Also for the majority of Wikipedia readers the only real benefit they receive from references is from online ones. I have never looked up an offline reference from Wikipedia, but I regularly read online references.

    However it does seem that some Wikipedia pages could be significantly improved by using more offline references. So a Wiki reader/editor that supported convenient offline use would do some good in that regard.