In the scholarly communications environment, the evolution of a journal article can be traced by the relationships it has with its preprints. Those preprint–journal article relationships are an important component of the research nexus. Some of those relationships are provided by Crossref members (including publishers, universities, research groups, funders, etc.) when they deposit metadata with Crossref, but we know that a significant number of them are missing. To fill this gap, we developed a new automated strategy for discovering relationships between preprints and journal articles and applied it to all the preprints in the Crossref database. We made the resulting dataset, containing both publisher-asserted and automatically discovered relationships, publicly available for anyone to analyse.
The second half of 2023 brought with itself a couple of big life changes for me: not only did I move to the Netherlands from India, I also started a new and exciting job at Crossref as the newest Community Engagement Manager. In this role, I am a part of the Community Engagement and Communications team, and my key responsibility is to engage with the global community of scholarly editors, publishers, and editorial organisations to develop sustained programs that help editors to leverage rich metadata.
STM, DataCite, and Crossref are pleased to announce an updated joint statement on research data.
In 2012, DataCite and STM drafted an initial joint statement on the linkability and citability of research data. With nearly 10 million data citations tracked, thousands of repositories adopting data citation best practices, thousands of journals adopting data policies, data availability statements and establishing persistent links between articles and datasets, and the introduction of data policies by an increasing number of funders, there has been significant progress since.
Have you attended any of our annual meeting sessions this year? Ah, yes – there were many in this conference-style event. I, as many of my colleagues, attended them all because it is so great to connect with our global community, and hear your thoughts on the developments at Crossref, and the stories you share.
Let me offer some highlights from the event and a reflection on some emergent themes of the day.
Our LIVE Annual Meeting is back in North America for the first time since 2015, and with just 10 days to go, there’s a lot going on in preparation. As you’d expect with a How good is your metadata? theme—the two-days will be entirely devoted to the subject of metadata—because it touches everything we do, and everything that publishers, hosting platforms, funders, researchers, and librarians do. Oh, and it’s actually super awesome too—and occasionally fun.
Metadata is what is used to describe the story of research: its origin, its contributors, its attention, and its relationships with other objects. The more machines start to do what humans cannot—parse millions of files through multiple views—the more we see what connections are missing, and the more we start to understand the opportunities that better metadata could offer.
We love metadata so much that we’re producing an 8-foot-high depiction of the ‘perfect’ record, in both XML and JSON, for people to gape at and annotate in person. Sneak preview:
The perfect metadata record is eight feet tall.
SchemaSchemer
Both days feature plenary-style talks, insights from ourselves and guests who will regale us with tales of metadata woes and wonders.
Lisa will be there at the end of Day 1 to update everyone on some recent and potential governance changes, and—the reason we started these gatherings—to reveal the results of our 2018 board election, the second contested election we’ve held, and already with twice the voters from 2017.
Our amazing guest speakers are too brilliant and too experienced to highlight in just one blog. But check out the LIVE18 schedule to see what they’ll be talking about:
Patricia Cruse, DataCite
Ravit David, University of Toronto
Clare Dean, Metadata 2020
Paul Dlug, American Physical Society
Kristen Fisher Ratan, CoKo Foundation
Stefanie Haustein, University of Ottawa
Bianca Kramer, Utrecht University
Graham Nott, Freelance developer (eLife/JATS)
Jodi Schneider, University of Urbana-Champaign
Shelley Stall, American Geophysical Union
We’ll be taking over the entire second floor of the Toronto Reference Library, whose three rooms will house a bunch of conversational sessions as well as some more formal talks:
Rally is the main room where we’ll have the plenary-style talks, a corner for Unscheduled Maintenance offering live support for your questions about billing or tech for Ryan, Shayn, Isaac, Jason, Chuck, & Mike. Running down the whole left side of this room is also the You-are-Crossref wall where the community will showcase their work with metadata through posters - feel free to bring one along and find Patricia to get the sticky tack.
The LIVE Lounge is where you can eat, drink, rest, and chat and where you’ll likely find Rosa as she laises between the caterers, the venue, AV, and all of us. The Lounge is also where we’ll gather for much-needed post-election refreshments at the end of Tuesday.
The Bigger Ambitions Room is where a lot of the Unplugged sessions will take place. This room will feature three separate stations:
Crossref Labs & Product where you can chat with Geoffrey, Esha, Jennifer L, Patrick, and Christine about your big ideas for us, and what we’re working on already.
Metadata discussions and annotations of the perfect record (previewed above) with Patricia, together with space to ideate around metadata principles.
Uses and users of metadata where Jennifer K will help us understand just how far Crossref metadata can reach, and who and what people are doing with it.
We cannot wait to show you what else we have planned :-)
For those of you not able to attend, recordings of the presentations will be made available on the event page directly soon after.