Support open data and defend Aaron Swartz

I fully support Aaron Swartz as he fights unjustified charges from the U.S. government, and hope that my readers will support him too. Aaron is a researcher who works with huge datasets and has worked on many open data projects. Aaron is being charged for having accessed JSTOR, a repository of academic journal articles, and downloading them.

JSTOR itself didn’t want to press charges and says it hasn’t suffered loss or damage. But the U.S. Government indicted Aaron because they feel like they “caught a hacker”.

Aaron Swartz
Aaron Swartz

I’m incredulous that they would pursue this case against a well known researcher and activist who allegedly was doing something quite benign — scraping data.

I worry that this case will have a chilling effect on open data projects. The government has gone to great lengths here to stop a respected activist’s work, siccing the Secret Service on him and wasting an incredible amount of resources to trump up this case. The FBI has already investigated Aaron at least once for downloading PACER data . It looks bad to me, like the government was basically waiting for any excuse to build some sort of charge against Aaron for his briliant open data activism.

Here’s Aaron’s background in open data and analyzing large data sets:

In conjunction with Shireen Barday, he downloaded and analyzed 441,170 law review articles to determine the source of their funding; the results were published in the Stanford Law Review. From 2010-11, he researched these topics as a Fellow at the Harvard Ethics Center Lab on Institutional Corruption.

He has also assisted many other researchers in collecting and analyzing large data sets with theinfo.org. His landmark analysis of Wikipedia, Who Writes Wikipedia?, has been widely cited. He helped develop standards and tutorials for Linked Open Data while serving on the W3C’s RDF Core Working Group and helped popularize them as Metadata Advisor to the nonprofit Creative Commons and coauthor of the RSS 1.0 specification.
In 2008, he created the nonprofit site watchdog.net, making it easier for people to find and access government data. He also served on the board of Change Congress, a good government nonprofit.
In 2007, he led the development of the nonprofit Open Library, an ambitious project to collect information about every book ever published.

I would also like to say that I think that libraries and academics should stop buying into the JSTOR model. JSTOR aggregates academic journal articles which it doesn’t even own, and sells limited access to those articles to large institutions for thousands of dollars. Libraries and universities should act to enable access to information, not to limit it.

ETA: Here is JSTOR’s official statement on the case.

liveblogging at the library

Since I read very quickly I’m done reading the poems for the Redwood City Youth Poetry Contest before the other judges. We’ve read, discussed, and judged K-1, 2-3, 4-5, and now are in the middle of reading poems from grades 6-8.

It’s so much fun! The poems, a good selection and range from English- and Spanish-speaking kids, are knocking my socks off. One of them made me cry. Well, when the contest results are announced and the poems are on the Redwood City Library web site, I’ll link to them and discuss them in detail.

The three of us judges have varying opinions about what make a poem good poetry. Trish likes complex thought and sentiments of beauty and I would say she values form highly. Leslie likes a social issue and a conscience, a poet who looks outside herself. I like to see daring, leaping, unusual juxtapositions, and an awareness of language and form whether that is free verse in its jazzy meter and flow, or regular meter and rhyme.