You are browsing the archive for Joris Pekel.

Talk at Re:Publica – Curating the Digital Commons

May 14, 2013 in Events/Workshops, Featured, Opinion

Last week, the thirteenth edition of the Re:Publica conference was organised in Berlin. With more than 5000 people attending, it is one of the biggest events around new media, journalism and activism. The OpenGLAM team was there to give a talk about the curation of the digital cultural commons.

Together with Daniel Dietrich, chairman of the Open Knowledge Foundation Germany, and member of the OpenGLAM working group, I prepared the talk which is largely inspired by the recent post on OpenGLAM about Small Data in GLAMs. At the moment we are able to get access to such vast amounts of data, that it not longer becomes comprehensible. We therefore need better infrastructure, access and tools to create the most value out of all this metadata and content.

We started the talk by explaining the notion of a commons: the cultural and natural resources accessible to all members of a society. The traditional notion of the environmental commons has been debated many times and are often referred to as ‘the tragedy of the commons’ as these natural resources are not as non-rivalrous and non-excludable as we used to think. However, a digital commons has the quality that when I make a copy of it, any other person is still able to make that exact same copy of the dataset, which will never deplete.

“Digital commons are defined as an information and knowledge resources that are collectively created and owned or shared between or among a community and that be (generally freely) available to third parties. Thus, they are oriented to favor use and reuse, rather than to exchange as a commodity.” - Mayo Fusster Morelli

The fact that these digital artefacts can be re-used by anybody is perhaps the greatest assest of the digital commons, everybody can curate, connect, annotate and remix these materials indefinitely.

After an explanations about the difference between metadata and content (and how difficult the distinction often is!) and an overview of some leading open culture projects such as Europeana and the Digital Public Library of America it became clear how much content we actually have access to at the moment. Just Europeana and the DPLA together provide 30.000.000 metadata records that all link to a digitised object. Wikimedia Commons and the Internet Archive give access to another 25.000.000 media objects. How can a user make sense of that?

For that reason we need to stop thinking about just adding more data and creating huge databases. The Commons need to be structured and made accessible in a way that the user can get meaningful results out of this content and data, and is able to collect the relevant data for his research. The institutions and the users should be able to easily create small data ‘packages’, for example collecting all of Van Gogh’s work. The internet is exceptionally well placed to bring together content in one place, something that would never be possible physically. At the same time we can provide relevant links between collections, artists, time-periods and so on, so the user can explore more related content. This also comes down to good quality metadata, something that is not always there at the moment, not surprising when combining data from thousands of cultural institutions.

Finally we need the relevant tools that allow us to re-use the digital commons. With them, we are able to curate, annotate, visualise, mashup, and much more. Combined, the user and the cultural institution can work together to create the most value out of this enormous amount of digitised content and data.

For a video recording of the talk, click here.

Big Data vs. Small Data: What about GLAMs?

May 2, 2013 in Featured, Opinion

Last week, co-founder of the Open Knowledge Foundation Rufus Pollock published the first blogpost in a series on small data. In his post ‘Forget Big Data, Small Data is the real revolution‘, Pollock writes:

Meanwhile we risk overlooking the much more important story here, the real revolution, which is the mass democratisation of the means of access, storage and processing of data. This story isn’t about large organisations running parallel software on tens of thousand of servers, but about more people than ever being able to collaborate effectively around a distributed ecosystem of information, an ecosystem of small data.

picture

[...] Size in itself doesn’t matter – what matters is having the data, of whatever size, that helps us solve a problem or address the question we have.

And when we want to scale up the way to do that is through componentized small data: by creating and integrating small data “packages” not building big data monoliths, by partitioning problems in a way that works across people and organizations, not through creating massive centralized silos.

This next decade belongs to distributed models not centralized ones, to collaboration not control, and to small data not big data.”

How does this relate to the cultural sector? Europeana now offers access to more than 27 million metadata records, Wikimedia Commons has 16 million media files available, Internet Archive 9 million objects and last week the Digital Public Library of America launched with 2.5 million metadata records, and are quickly expanding. This is a fantastic achievement, but this amount of material is incomprehensible for any person and it is still just a fraction of all the digitised material, which is only a fraction of what could be digitised. How to make sense of that?

As Pollock describes, it is not about the size of your database, the real revolution is the mass democratisation of the public institutions. It is possible to create packages with the complete works of Shakespeare, beautiful paintings by Van Gogh or a set of Medieval Maps. Packages that are ready for re-use which can be linked to other sets of content for further exploration.

One question that arises is: who should create these packages of data? Who decides what content should be put together? Should we leave this to the traditional ‘experts’, the curators and archivists, or do we need to let the community do this? The most logical answer to this question is: both, or better, together. The dialogue between the public institutions and the user has traditionally been very important and when users have access to such vast amounts of content and metadata, guidance and curation becomes perhaps even more needed. At the same time these experts get the chance to work with thousands of contributors who can give feedback, enrich their data, link it, and work with it in ways that could not be imagined by the institution.

For this reason – besides releasing content and data under an open license and providing a standardised technical open infrastructure as described in the OpenGLAM principles – the Open GLAM should be prepared to engage in the discussion and build value together with the community. Opening up data is not about dumping it online and never look at it again, it is about a dialogue where the public institutions tries as much as possible to send the user on his way, only to see him wander off and explore paths and directions never seen before.

We would love to hear your opinion on this topic. Please subscribe to the OpenGLAM mailing list to join the discussion.

Walters Art Museum Removes Non Commercial License

April 25, 2013 in Case Studies, Featured

In early January, we wrote about the Walters Art Museum as a case study in sharing. The museum is a pioneering open advocate and worked extensively with Wikimedia. They have donated over 18.000 images to Wikimedia Commons and hired a dedicated intern to enrich Wikipedia articles with openly licensed content from their collection.

The Walters has also set up a website with a dump of all their high quality scans of manuscripts and the corresponding metadata. The images can be downloaded in different file sizes, from a very small thumbnail, to the extremely high quality .tiff file of about 150 megabytes. Having images of this size available for re-use makes them a great resource for scholarly research and image annotation.

However, the readme page still mentioned at that point that commercial re-use of these images was not allowed. As mentioned previously on the OpenGLAM blog, this greatly reduces the possibilities for re-use. The images can for example not be used in Wikipedia articles and we were also not able to feature them on the Public Domain Review. For that reason we contacted the web manager and we are very happy to see that the Walters has now changed their licensing to the Creative Commons Attribution-Share Alike license (CC-BY-SA).

You are free to download and use the images and descriptions on this website under the licenses named above. You do not need to apply to the Walters prior to using the images. We ask only that you cite the source of the images as the Walters Art Museum.

The Walters also explicitly distances itself from the non-commercial restriction:

Note these terms mark a change from our previous license, which placed a noncommercial restriction on the use of these materials. The noncommercial restriction no longer applies, and this license supercedes the previously advertised license, and replaces that found in many of the archival TIFF image headers. This change follows the Walters Art Museum’s licensing policy. More information on the Walters’ intellectual property policy can be found on the Walters website: http://art.thewalters.org/license/.

It is great to see that the Walters has made a clear and explicit statement about the licensing of their images. Very often still we run into vague or non-existent statements that greatly reduce the possibilities for third parties to re-use the data and content. For that reason one of the five OpenGLAM principles is: “When publishing data make an explicit and robust statement of your wishes and expectations with respect to reuse and repurposing of the descriptions, the whole data collection, and subsets of the collection.” The statement of the Walters Art Museum can be seen as a good example how to do this.

For more beautiful digitised manuscripts see The Digital Walters webpage.

LODLAM Joins the OpenGLAM Network

April 22, 2013 in Featured, Linked Open Data

picture

We are very pleased to announce that the LODLAM Network has joined the OpenGLAM Network.

LODLAM is Linked Open Data in Libraries, Archives, and Museums. LODLAM.net is an informal, borderless network of enthusiasts, technicians, professionals and any number of other people who are interested in or working with Linked Open Data pertaining to galleries, libraries, archives, and museums.

Linked refers to Linked Data, or the concept of connecting data using W3C standards. Open refers to the use of open licenses, such as the Public Domain Mark, Creative Commons Zero, Creative Commons Attribution, and Creative Commons Attribution-Alike. Data can be raw data, metadata, descriptive data, bibliographic data, etc.

LODLAM.net is designed to be a central place for sharing resources and connecting and collaborating with other interested individuals. There is also a Google Group, and the #LODLAM hashtag is used extensively on Twitter for shared news, questions, and projects regarding LODLAM.

We’re looking forward to collaborating with the LODLAM Network around the development of our tools for digital scholarship and crowdsourced data enrichment as well as working with them to further explore the potentials of Linked Open Data in the cultural heritage sector.

Jon Voss, founder of LODLAM, said:

In the last few years we’ve seen a growing convergence of communities working toward usability and discovery of openly licensed cultural heritage assets and data. Increasingly, the institutions that have for so long provided stewardship of these materials and their accompanying data are embracing and investing in new ways of providing access to this information, opening a new world of possibilities for how we celebrate our shared global history. While many industries are litigating in the face of change, galleries, libraries, archives and museums are instead increasingly playing a leading role in innovating for the common good.

The interests of and people within the LODLAM and OpenGLAM communities have so much overlap that it’s a natural fit for the two to be aligned as part of the OpenGLAM network. It’s my hope that the shared knowledge and resources of the network will continue to strengthen the collaborative culture that makes a free and open World Wide Web possible.

For more on the history of LODLAM, see the paper Radically Open Cultural Heritage Data on the Web, presented at Museums and the Web, 2012.

The LODLAM Network joins a host of other organisations including the Internet Archive, Wikimedia, Creative Commons and Europeana who we are collaborating with to help our cultural institutions reach their potential in the digital age.

If you’re an organisation working with cultural institutions and helping them to open up their holdings online and would like to join out Network email openglam@okfn.org.

One year later: Linked Open Data in the German National Library

April 19, 2013 in Case Studies, Featured

A little more than a year ago, the German National Library (DNB) announced that it would release more data as linked data under an open license. It was decided that the metadata would be released with as little restrictions as possible by using the CC0 rights waiver. This means that anybody can use and reuse the data in any way possible, also for commercial purposes.

Now one year later, we talk with Lars G. Svensson, Advisor for Knowledge Networking at the DNB, about what this move has meant for the library.

Welcome Lars, thank you for taking the time.

Thank you for having me!

Frankfurt Lesesaal by Raimond Spekking – CC-BY-SA

Could you tell me why the library decided to open up the metadata?

In September 2011 the Conference of European National Librarians (CENL) decided to adopt CC0 licensing for their data. The DNB had started to publish authority data as linked data in spring 2010. We first used a home-grown license based partly on CC BY-SA but with the restriction that commercial entities needed to register before they can use the data. Since our Director General had been one of the supporters of the CENL decision it was natural for us to move in the same direction. One of the key points with linked data is that other people have to be able to reuse and connect the data with other sources. For that reason we decided last year to discontinue the license based on CC BY-SA and go for CC0 in order to have as few restrictions as possible for reuse. Currently, we publish two datasets: The first one is the authority data, which consists of data for names of persons, organizations, events, places, and works. Since January 2012 there is also bibliographic data available with title, publisher etc., which re-uses the authority data. The data is available under CC0 in many formats including RDF. The only exception is bibliographic data in library specific formats (MARC 21 and MARC XML) from the last two years but we expect that this restriction will disappear after 2015.

And have you seen interesting cases of reuse so far?

Yes definitely. One of my favourite projects is the Museum Digital. This is a German digital open platform where smaller institutions can put their content. The museums curate and manage their own database on the site and enter their own metadata. The site included our metadata to create more links from and to the content available on the platform. They also found out that we include a link to DBpedia in our data. This allowed them to import that data into the platform in various languages. This greatly enriches the information on the platform.

Not all libraries are in the position to release their own metadata because they make use of services and are therefore not the owners of the data. How does that work in the DNB?

We are in the fortunate position to be the national library, so it is basically our job to create this data in the first place. That allows us to freely distribute it in any way we want to. The authority data is curated together with the German library networks, so that is not really our data, but it was not a problem to agree on the open license.. As we are all public institutions, openness helps us to reach out to the public.

Does the German National Library also provide access to digitised books?

We are a relatively young library which was founded in 1913. For that reason we don’t have that much material that is in the public domain. So we do digitise our collection, but since we are not the owners of the rights we can only show the material to people in the reading rooms in the library. We try to make the books that are out of copyright as accessible as possible. We started for example with a collection with 100 classic books such as the works of Goethe and Schiller. These are freely accessible – also in Europeana – as they are in the public domain and we currently have large digitisiation projects also comprising out-of-copyright material. A further service we offer is digitization of tables of contents; Those are very popular among our users since they offer both more terminology we can index in our catalogue and more contextual information making it easier for our patrons to decide whether the publication they found suits their needs or not.

Great to hear, and what’s next for the library?

We are still in the transition phase so not all metadata is yet openly available in all formats, we expect that this will happen in the next few years and then our metadata will be completely open. We keep improving our linked datasets and work hard to also get to make more content available.

That’s great, thank you very much for taking the time for this interview!

My pleasure!

GLAM-Wiki London 2013 Highlights

April 17, 2013 in Events/Workshops, Featured

This weekend, around 150 people with a shared interest in open cultural data came together at the British Library in London for the GLAM-Wiki 2013 conference. During these two days, there were an incredible amount of inspiring keynotes, thought provoking discussions, and grounds for new collaborations. While impossible to put it all into one blogpost, we will look at some of the highlights, as well as the work the OpenGLAM team has done.

GLAMwiki London 2013 participants. Photograph by Mike Peel – licensed under the CC-BY-SA license

Keynote by Michael Edson – “Scope, Scale and Speed”

After a brief introduction by the British Library and the chair of Wikimedia UK, Michael Edson from the Smithsonian Institution in Washington came on stage. He gave a very inspiring talk about the potential of the web in terms of scope, scale and speed. The web enables cultural institutions to reach out to an audience unimaginable before. At the same time, digital tools make it easier to interact with them. Right now, going digital is something that institutions are not quite used to and slow steps are being made. The audience however, will be demanding for more online access and services and at some point having a well maintained, accessible and re-usable online collection will become just as basic as providing a proper wireless connection in the library. Edson shows in his presentation numerous examples of large scale projects that benefit from the possibilities of the internet. They all have one thing in common: they abandoned old fashioned copyright restrictions and found new ways to reach out to their audience, completely open.


Sam Leon – Curating the Digital Commons

Sam Leon talked in his presentation about the attributes of a digital cultural commons. Besides it being non-rivalrous and non-excludable, as the traditional notion of the commons, it allows unlimited re-use. But does free access alone facilitate this? Sam asked the question “if we just put it online, are we not trying to recreate the traditional GLAM?, is the glass not still there if we can not re-use the material?” The amount of available content from cultural institutions that can be found online at the moment is astonishing, but how do we make sense of these millions of metadatarecords and objects? The material being open is precondition, but we also need the easy to use tools that allow for re-use in order to create value and knowledge out of the data.


Keynote by Lizzy Jongsma – “We are Open”

Lizzy Jongsma works at the Rijksmuseum as data manager. She gave a passionate talk about the pioneering work the Rijksmuseum has been doing over the last few years and how this has worked out for them. They have released almost 125.000 high quality images to the public domain and build a beautiful website around it where everybody can enjoy, download and curate the masterpieces of the Rijksmuseum. The full talk has been recorded and can be found here.

Keynote by Mia Ridge – “A Brief History of Open Cultural Data”.

Mia Ridge presented on the Saturday morning session a brief overview of some the key moments in open cultural data. She also discussed the contradictory things GLAMs are told they must do. One the one hand they should give content away for the benefit of all but at the same time protect against loss of potential income, conserve collections in perpetuity and demonstrate return of investment on digitisation. “It’s important to understand some of the pressures they’re under. For example, GLAMs usually need to be able to track uses of their data and content to show the impact of digitising and publishing content, so they prefer attribution licences.”
In her presentation she referred to the paper that recently has been published by the Open Cultuur Data initiative where they have created a timeline of important moments in the open culture field. The full paper can be found here. During the weekend, we have used the Open Knowledge Foundation’s Timeliner tool to create an overview of these moments. The full presentation of Mia Ridge has been recorded and can be found here. She also published a full transcription on her blog

Panel discussion – “Striking the Balance”

On Saturday I was part of a panel discussing how an organisation strikes the balance between the moral imperative to open up collections, against the commercial drive to generate revenue. Nick Poole, CEO of the Collections Trust, has written his thoughts on this topic prior to the conference. The other panelists were Georgia Angelaki from National Documentation Center of Greece and Mike Peel, secretary of Wikimedia UK. We discussed together with the audience how two great pressures – openness and financial viability – set the context for how museums see their role, how they operate and how they will present themselves to their audiences, both online and off. Can you openly license your content and make money at the same time? Nick Poole will go into more detail about this session in a separate blogpost which we will post here as well anytime soon.

Beat Estermann – To what extent are GLAMs ready for Open Data and Crowdsourcing?

During the GLAMwiki London weekend we have seen incredibly insightful and thought provoking presentations. Beat Estermann from the Bern University of Applied Sciences took a totally different approach and presented what the cultural sector is most looking for: facts. Over the last couple of months, he conducted a survey where he asked the Swiss cultural heritage sector about their stance and needs towards open data and crowdsourcing. One of the interesting outcomes is that most institutions see the importance of open data. However, they are still reluctant to make it available for commercial re-use. Most institutions at the same time do not realise that this non-commercial restriction does not allow re-use on Wikipedia and by third-app developers. A lot more figures can be found in his slides.


GLAMwiki London was a great two days with an incredibly high quality of talks and discussions. For more photos, slides and documentation, please check the GLAMwiki page.

OpenGLAM Principles Launched

April 11, 2013 in Featured, News

Over the past weeks, we have been working together with members of the OpenGLAM working group to develop a set of principles. We think that it is important in a time where we hear more and more about ‘openness’, we should state what we mean explicitly when we use this term in order to avoid confusion and serve as a reference for institutions themselves.

With the OpenGLAM principles we try to clearly define the conditions for any GLAM to be ‘open’.

We aren’t finished yet. We really want to have your input on this (positive&negative) to create a set of principles that is supported by the community. The principles can be found here

Please send your comments to this list!

Looking forward to the discussion.

OpenGLAM at GLAM-Wiki 2013 conference

March 15, 2013 in Events/Workshops, Featured

picture

Just one month to go before cultural experts from all over the world will come to London for an international celebration of open access and culture. GLAM-Wiki 2013 is a global conference, organised by the Wikimedia UK chapter in association with Wikimedia Sweden and Europeana, and hosted by the British Library. It examines the possibilities, relationships and potential for galleries, libraries, archives and museums in working with Wikimedia projects such as Wikipedia. The conference will take place on the weekend of 12-14 April at the British Library in London.

This conference will have two main themes. On Friday, the work the Wikimedia movement has been doing with cultural organisations will be explained to showcase what partnerships and opportunities are available. You will hear first-hand how some institutions are already leveraging the connection between sharing a part of their own collections with Wikimedia and seeing some amazing benefits – such as a sharp increase in traffic to their websites and an increase in sales of merchandising. Attendees will be able to better understand the crossover of mutual interest that Wikimedians share with curators of cultural heritage to further each other’s goals. On Saturday, the conference will focus on the more practical sides of the Wikimedia-GLAM collaborations, with workshops and discussions to develop ideas.

During this two days conference, the entire OpenGLAM team team will be present to talk about open cultural data, open content and the public domain. Besides that, the organisation managed to attract an impressive group of international keynote speakers. The list includes Michael Edson of the Smithsonian Institution in Washington DC, Lizzy Jongma of the Rijksmuseum in Amsterdam and Nick Poole of the Collections Trust in London.

As well as the keynote speakers there will be plenty of other activities taking place throughout the conference covering a broad range of topics related to the conference theme. You can see the schedule here Tickets are available for the conference and are priced from £15 to £40, with some scholarships available.

We hope to see many of you there!

School of Open launches with OpenGLAM course

March 12, 2013 in Documentation, Featured, News

This week is Open Education Week and to celebrate the first courses on the School of Open are launched. The School offers courses on the meaning, application, and impact of “openness” in the digital age and its benefit to creative endeavors, education, research, and beyond. It is coordinated by P2PU and Creative Commons. The School offers all kinds of courses related to openness. You can learn about creative commons licenses, open data in science, copyright for educators, and now also how to open your cultural institution’s data.

Open Up your Institution’s Data course on the School of Open

The ‘Open Data for GLAMs‘ course is an adaptation of the masterclasses that the Open Culture Data initiative gave to several institutions. By releasing their course material, every memory institutions can now start to open up their cultural data as open culture data. Together with several partners from the OpenGLAM network we edited the course and it is now ready for anybody who wants to open up. The course will guide you through the different steps towards open data and provide you with extensive background information on how to handle copyright and other possible issues.

School of Open is offering free online courses on what “open” means and how it can help you.

The different steps will force you to think about different aspects of your data that could lead to a more efficient data infrastructure and a coherent data policy with great internal benefits for your institution.

We are very curious about your experiences and happy to help if you have any questions. We also would really like to sit down with you (in person or virtual) and go through the course together . So if you are interested, get in touch!

Dutch National Library gives full access to in copyright material

March 11, 2013 in Case Studies, Featured

The National Library of the Netherlands has made over the last years some great digitisation efforts. Amongst others, they have published their medieval manuscript collection and made their newspaper archive available under an open license. To make this material available they have to overcome many copyright issues. Their huge collection of material is created by many different authors. It can take years to track all the inheritors to ask for permission. For that reason they have experimented with an ‘opt-out’ model where they asked authors or inheritors to contact them when they did not want something to be published.

Page from the magazine “Op de Hoogte, een maandschrift voor de huiskamer” (Up to Date, a magazine for the living room), 1903.

In September 2012, the National Library of the Netherlands (Koninklijke Bibliotheek, KB) announced that they would publicise their digitised magazines collection from 1890-1939. Some of the articles or photos in the magazines are still under copyright because the material is only out of copyright 70 years after the death of the author. Because magazines are filled with content from many different authors, some parts of a magazine can be out of copyright, but others are not. They calculated that looking for all the inheritors of all the authors in the magazines would take them about 5 years and a lot of money, money that can be used a lot better to actually digitise material. For that reason they announced that they would make all available and requested authors to let them know if they had a problem with that. They exactly got one response from a family member of an author which loved the idea that his grandfather’s material would be made available again.

They also got a letter from two collective copyrights management organisations. They informed the KB they were representing some of the authors, and suggested to settle the copyrights. Because no complete and practicle inventory of rightholders and members of copyright organisations could be made, the KB has agreed on a collective license for all under copyright material. The Royal Library can show all the magazines and everybody is able to browse through them and use them for research. However, when somebody wants to reuse them commercially, they have to get in touch with the rights management organisations.

It is a great achievement that material of which parts of are potentially still under copyright can be made available without doing years of research first. While the commercial value of these magazines is very little, there are great opportunities for research as these magazines give a great insight in what was going on in the Dutch society during that period. Right now, 80 magazines can be found online with a total of 1,5 million digitised pages. In the coming months, more magazines will be added and a stunning total of 6,5 million pages will be made available.

However, because it was not posible to use one clear and open license, it remains rather unclear when a user has to ask the collective rights organisation for approval. As we have written before, it is very hard to define when a digitised work is being used commercially. It is for example not clear if we can feature these works on the Public Domain Review while this is clearly a not-for-profit effort. We hope that someday the material which is most likely completely out of copyright can be made available free to reuse without any restrictions, as all material in the public domain should be.

All the magazines can be found on their website.

Please create an account to get started.

Join our Network

No Yes

#openglam on Twitter