Open Access News
News from the open access movement
Publisher division deepening on Google book-scanning
VNU Staff, Publishers call for library digitisation boycott at Book Fair, Information World Review, March 10, 2006. Excerpt:
Google used the London Book Fair (LBF) as a platform to reach out to a wary book trade, as it revealed plans to expand its controversial Library Project to include European libraries. On the eve of the fair, Bloomsbury, which includes Who's Who publisher A&C Black, c.e.o. Nigel Newton called on the industry to boycott Google's search engine "until it desists from its present misguided mission in the world of books". He described Google as "a false prophet" engaged in "acts of 'kleptomania'". But Jens Redmer, director of Google Book Search in Europe, told IWR sister title The Bookseller that Google is talking to further library partners in all major European countries. Google is currently working with only four American libraries and the Bodleian in Oxford. Redmer added that in-copyright works would not be scanned in Europe, where copyright laws are "significantly different" to the US.
PS: Some publishers are acting on a faith-based fear of harm and some are acting on an evidence-based record of benefit.
Software for tagging eprints in Eprints archives
New Connotea software supports institutional repositories, a press release from the Nature Publishing Group, March 10, 2006. Excerpt:
Nature Publishing Group (NPG) has released new software which enables institutional repositories running EPrints to integrate with the social bookmarking services Connotea and del.icio.us. This latest innovation allows content within institutional repositories to be bookmarked, tagged, and linked to related content. The work behind this development was funded by the Joint Information Systems Committee (JISC) as part of their PALS Metadata and Interoperability Projects 2 program. Once installed in a repository, the software will enable users to bookmark documents in that repository using their Connotea or del.icio.us account, assigning their own tags and without leaving the web page. They can also see what tags have already been assigned to the document they are viewing in the repository and click on links to related content, either within the same repository or elsewhere on the web. If bookmarked in Connotea, the bibliographic metadata for the institutional repository item can be automatically imported. Connotea already does this for items bookmarked from several other sources, including Nature, PubMed, Science, Blackwell Synergy, Wiley Interscience and Amazon. Recognizing the importance of the content within institutional repositories, this new functionality will allow such content to be integrated and linked with the wider scientific literature.
OA legal scholarship at Lewis & Clark
Yesterday the Law Library at Lewis & Clark Law School launched a web site on Open Access Legal Scholarship at Lewis & Clark. From the site:
Today we introduce the latest addition to the law library web site - Open Access Legal Scholarship - a resource for those who are interested in learning more about open access scholarship and publishing, in both law and other fields. It has been created by Professor Joseph Miller in conjunction with the Lewis and Clark Law School 2006 Spring Symposium, Open Access Publishing and the Future of Legal Scholarship [PS: held on March 10]. Open Access has been briefly described as “the electronic publication of scholarly work that is available for free without copyright constraints other than attribution.” Paul George, Members’ Briefing: The Future Gate to Scholarly Legal Information, AALL Spectrum (April 2005). See the Open Access introduction for an expanded discussion. Our own law reviews have provided open access to their most recent issues, with Environmental Law publishing the full-text of Symposium: Ballot Measure 37: The Redrafting of Oregon’s Landscape (v.36, n.1 2006), and Lewis & Clark Law Review with Paper Symposium: Federalism After Gonzales v. Raich (v. 9, n.4 2006). Included in our new Open Access Legal Scholarship section are: [1] An Introduction to Open Access, [2] Blogs, [3] Core Documents, [4] Open Access Journals, [5] Projects and Gateways, [6] Self-Archive Repositories.
Google working with publishers on paid-access plan for scanned books
Kimberly Maul, Publishers to Control Paid-Access Books Available Through Google, The Book Standard, March 10, 2006. Excerpt:
In an attempt to work with publishers and others opposed to the Google Book Search project, Google today announced its first plan for publishers to provide --for a price-- the full text of books online. Through the agreement, publishers can decide to have the full text of books available through Google’s program while the publisher still has control over the price, which they can change whenever they want. Google will take a portion of the profit, similar to an ad-revenue share model. “Virtually every partner we have spoken to has been extremely enthusiastic,” said Google executive Jim Gerber, Publisher’s Marketplace reported.
Comment. This development isn't directly related to OA, so I won't be covering it in depth. But it may increase the number of publishers willing to let Google digitize their books and, therefore, enlarge the corpus of book literature indexed for free, full-text searching. It won't improve our access for reading, but it will improve our access for searching.
More on libraries as publishers
Sarah E. Thomas, Publishing Solutions for Contemporary Scholars: The Library as Innovator and Partner, a PPT presentation at the 8th International Bielefeld Conference, 2006. Self-archived March 9, 2006.
Abstract: What can an academic library contribute to scholarly publishing? The Cornell University Library has engaged in a number of activities in the publishing realm that aim at increasing affordable, effective, widespread, and durable access to research. Cornell's Center for Innovative Publishing operates the arXiv, an e-print service for physicists, computer scientists, mathematicians, and others; Project Euclid, a journal hosting service for over 40 titles in math and statistics; and is developing, with Pennsylvania State University, DPubS, an open source publications management software. Cornell's DCAPS, or Digital Consulting & Production Service, assists in the transition of print to electronic through its digitization, metadata production, and consulting service. Digital publications are preserved according to a well-developed policy for digital archiving, ensuring ongoing access to information across time. The Cornell University Library's Center for Innovative Publishing is one manifestation of publishing activity undertaken by academic libraries as part of a movement to increase access to scholarship in an affordable manner, to ensure the ongoing availability of scholarly information in a way that is consistent with the traditional library role of preserving the record of our civilization from generation to generation, and which seeks to apply innovative techniques in the management and delivery of information to scholars.
Update. A new version of this article was self-archived on March 23, 2007.
LIS journals and permission for self-archiving
Anita Coleman, Self-Archiving and the Copyright Transfer Agreements of ISI-Ranked Library and Information Science Journals, a preprint. Self-archived March 10, 2006.
Abstract: This paper has been accepted for publication in the Journal of the American Society for Information Science and Technology. A study of Thomson-Scientific ISI ranked Library and Information Science (LIS) journals (n=52) is reported. The study examined the stances of publishers as expressed in the Copyright Transfer Agreements (CTAs) of the journals, towards self-archiving, the practice of depositing digital copies of one's works in an OAI-compliant open access repository. 62 % (32) do not make their CTAs available on the open web; 38 % (20) do. Of the 38 % that do make CTAs available, two are open access journals. Of the 62 % that do not have a publicly available CTA, 40 % are silent about self-archiving. Even among the 20 journal CTAs publicly available there is a high level of ambiguity. Closer examination augmented by publisher policy documents on copyright, self-archiving, and instructions to authors, reveal that only five, 10% of the ISI-ranked LIS journals in the study, actually prohibit self-archiving by publisher rule. Copyright is a moving target but publishers appear to be acknowledging that copyright and open access can co-exist in scholarly journal publishing. The ambivalence of LIS journal publishers provides unique opportunities to members of the community. Authors can self-archive in open access archives. A society-led global scholarly communication consortium can engage in the strategic building of the LIS information commons. Aggregating OAI-compliant archives and developing disciplinary-specific library services for an LIS commons has the potential to increase the field's research impact and visibility. It may also ameliorate its own scholarly communication and publishing systems and serve as a model for others. PS: This is an updated version of an article archived in January, blogged here 1/24/06.
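The percentages in the abstract can be checked against the counts it reports. The following is only a minimal sketch: the counts (52 journals, 32 without a public CTA, 20 with one, 5 prohibiting self-archiving) come from the abstract, while the dictionary labels and the `share` helper are hypothetical names introduced for illustration. The "40% are silent" figure is left out because the abstract does not give its underlying count.

```python
# Counts as reported in the abstract (n = 52 ISI-ranked LIS journals).
TOTAL = 52
counts = {
    "CTA not on open web": 32,
    "CTA on open web": 20,
    "prohibit self-archiving": 5,
}


def share(count, total=TOTAL):
    """Return count as a percentage of total, rounded to a whole percent."""
    return round(100 * count / total)


# Hypothetical helper output: label -> rounded percentage of all 52 journals.
shares = {label: share(n) for label, n in counts.items()}

for label, pct in shares.items():
    print(f"{label}: {counts[label]}/{TOTAL} = {pct}%")
```

On these counts the rounded shares agree with the 62%, 38%, and 10% quoted in the abstract.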
India's Knowledge Commission launches its web site
India's National Knowledge Commission, launched in June 2005, now has its own web site. The site has separate pages for each of the NKC's five "focus areas": access to knowledge, knowledge concepts, knowledge creation, knowledge application, and knowledge services. The access to knowledge page doesn't mention OA, but comes close in these statements of commitment:
Information networks and a culture of information-sharing are required in sectors like education, health, agriculture, business, R&D, food distribution, disaster management, security, etc....National web-based portals need to be established as one-stop comprehensive sources of information on issues like water, sanitation, health, education, housing, nutrition, employment, etc. Technology and the Internet also have an important role in making the recently legislated Right to Information Act more effective in its implementation.
Comment. I have a few recommendations for the NKC suggestion box: (1) support a network of OA institutional repositories at India's universities and research centers, (2) require recipients of publicly-funded research grants to deposit their peer-reviewed manuscripts in these repositories, and (3) encourage these institutions to adopt their own local policies requiring researchers to deposit their research output in them, whether it is publicly-funded or not.
The case for using IRs for more than research eprints
Dorothea Salo, What's an IR for? Caveat Lector, March 8, 2006. Excerpt:
Arthur Sale’s risk assessment for institutional repositories is every bit as good as everyone says it is. Should be in every repository-rat’s documents drawer. In it, however, we find repeated the assertion that an IR should limit itself strictly to the peer-reviewed research literature of its target population. I still think that’s deeply wrong, but it’s up to me to defend my belief. The cited concern is cost. Further details are sketchy, but the general idea seems to be that doing “digital-library stuff,” whatever that is, requires a lot of technical jiggery-pokery that costs a lot of money, and loading that into an IR’s budget makes the IR look cost-ineffective, which creates the impression that OA is cost-ineffective....To put it briefly: if what you want is Greenstone, don’t use DSpace....Still, it does not follow that an IR is intrinsically poorly-suited to every conceivable digital-library need beyond archiving peer-reviewed research. To be a good fit with an IR, a project should consist of individual, self-sufficient pieces of work that don’t really need to be seen next to each other or manipulated during viewing by the patron....To me it seems absurd and arrogant to forbid a library that’s undertaken an IR project to use it for purposes that otherwise make sense but don’t consist of peer-reviewed literature....[T]he alternative --I speak frankly-- is an empty repository. It’s dead simple to set up an empty repository. A lot of people have. An empty repository strikes me as far more likely to be accused of misallocation of resources, fold, and threaten OA by folding, than a repository that has made itself useful in other ways besides holding on to peer-reviewed research....[T]he only way we get [to better times] is by enduring the current grim times long enough. Which means we can’t --absolutely cannot-- sit around with our IR doors barred to everything but peer-reviewed research while we wait for mandates that may never come.
Glyn Moody, The Dream of Open Data, Open..., March 9, 2006. Excerpt:
Today's Guardian has a fine piece by Charles Arthur and Michael Cross about making data paid for by the UK public freely accessible by them. [PS: see my blogged excerpt.] But it goes beyond merely detailing the problem, and represents the launch of a campaign called "Free Our Data". It's particularly good news that the unnecessary hoarding of data is being addressed by a high-profile title like the Guardian, since a few people in the UK Government might actually read it. It is rather ironic that at a time when nobody outside Redmond disputes the power of open source, and when open access is almost at the tipping point, open data remains something of a distant dream. Indeed, it is striking how advanced the genomics community is in this respect. As I discovered when I wrote Digital Code of Life, most scientists in this field have been routinely making their data freely available since 1996, when the Bermuda Principles were drawn up. The first of these stated: It was agreed that all human genomic sequence information, generated by centres funded for large-scale human sequencing, should be freely available and in the public domain in order to encourage research and development and to maximise its benefit to society.
Richard Poynder interview with Michael Hart
Richard Poynder has posted his interview with Michael Hart, founder of Project Gutenberg. This is the first installment of The Basement Interviews, Poynder's blog-based OA book of interviews with leaders of many related openness initiatives. Excerpt:
Immediately seeing the potential of the network as a revolutionary new medium for distributing information, Hart was soon typing in entire books, including the Bible, all of Shakespeare, and Alice in Wonderland. Thus was born Project Gutenberg — a project that rapidly turned into an ambitious scheme to make electronic copies of 10,000 out-of-copyright books freely available on the Internet. Hart's mission: "to break down the bars of ignorance and illiteracy." In retrospect Project Gutenberg was both prescient and revolutionary. In effect, Hart had become the first "information provider" twenty years before Tim Berners-Lee invented the Web, and at a time when there were, says Hart, just 100 people on the network....Since then the number of volunteers has grown from tens, to hundreds, to thousands, and today Project Gutenberg offers over 17,000 e-texts, all of which can be freely downloaded in a wide variety of formats. In addition, there are now national Project Gutenbergs in Australia, Germany, Portugal, Canada and the Philippines, and plans are under way to create local projects in Africa, Asia, and other regions too. New obstacles were to arise however: while copyright had always posed a challenge for Hart, the 1998 Sonny Bono Copyright Term Extension Act — extending US copyright by a further 20 years — removed one million potential eBooks from the public domain in one fell swoop. With copyright now averaging 95.5 years, and creators no longer needing to register their copyright, Hart began to fear that the public domain could disappear all together, undermining the raison d'être of what by then had become his life's mission....For Hart the stakes are high, since he views Project Gutenberg as more than just the first and largest distributor of public domain eBooks. 
In addition, he argues, it is a primitive example of a "replicator" (a reference to a Star Trek machine envisaged as being capable of copying any inanimate matter by rearranging subatomic particles), and so therefore also a "lever to the Neo-Industrial Revolution."
Does OA depend on findability or vice versa?
Dean Giustini, Open access is impossible without findability, OA Librarian, March 9, 2006. Excerpt:
Open access (OA) advocates like Peter Suber and my colleagues here at OA Librarian do a marvelous job of documenting the progress of the OA movement. In a post-OA world, however, what about findability? What about the search side of the equation? Without search engines like Google, for example, what happens to easy findability? The problem is likely to be exacerbated as the web scales in size, and complexity. Authority destabilizes in open access models. I am thinking in terms of authority files in catalogues but also with respect to authoritative information. I grew up in a small suburb of Calgary, Alberta where authority was never questioned, where the World Book Encyclopedia was "what was right". For all its limitations, at least a ten year old could find the World Book confidently at the local public library. Can that same ten year old trust Wikipedia? OA librarians need to spend time and intellectual energy thinking about OA advocacy beyond free information for all. Dismantling paid search, for example. Advocating for OpenSearch, as in PubMed, but not just in medicine. Finally, the future of open access models on the web must be flexible enough to accommodate new means of findability - i.e. algorithms, tagging, folksonomies, social software - but continue to build on the tried-and-true tenets of library science.
Comment. Thanks for the plug. I have a couple of nits to pick, however. (1) OA will enhance findability by making content open for indexing by all comers, from the established players to newcomers with innovative ideas. It's true that the adequacy of search is challenged by the rapid growth of the web, but it's also true that OA is a necessary condition for the adequacy of search in a rapidly growing web. OA does not depend on findability, if only because it always brings findability with it. It's more true to say that findability depends on OA. (2) Why does "authority destabilize in open access models"? 
I think Dean is mixing up OA and peer-review reform, which are independent projects. Authority may destabilize for OA resources that bypass peer review, or experiment with less rigorous or more fallible vetting models, like Wikipedia. But there's nothing intrinsic to OA that calls for abandoning or weakening peer review. On the contrary, all the major OA declarations agree on the importance of peer review. Because the rigor of peer review does not depend on the medium or price of a publication, an OA journal or encyclopedia can acquire the same kind and level of authority as the best non-OA resources. Good examples are PLoS Biology and the Stanford Encyclopedia of Philosophy. In short, Wikipedia is not the poster-child of OA! It mixes OA with a communal-review model that is not at all typical of the journal literature central to the OA movement.
The meeting today and tomorrow in Ann Arbor, Scholarship and Libraries in Transition: A Dialogue about the Impacts of Mass Digitization Projects, will be webcast for those who can't attend. You can also follow the proceedings through the conference blog.
Publishers who resist Google indexing shouldn't pretend to speak for authors
Tom Evslin, John Battelle’s The Search and Google Book Search, Fractals of Change, March 7, 2006. Evslin interviews John Battelle. (Thanks to Ray Corrigan.) Excerpt:
While I was writing a review (to appear soon) of John Battelle’s prescient book The Search, I noticed something on the copyright page. Here it is: The scanning, uploading, and distribution of this book via the Internet or via any other means without the permission of the publisher is illegal and punishable by law. Please purchase only authorized electronic editions and do not participate in or encourage electronic piracy of copyrighted materials. Your support of the author's rights is appreciated.
Comment. (1) Battelle's solution is the simplest and easiest. Let authors decide. (2) At least publishers who make this decision without consulting authors, and over the dissent of authors, should not pretend to speak for authors. As Evslin points out later in the interview, "the last sentence of Penguin’s prohibition – 'Your support of the author’s rights is appreciated.' – seem particularly hypocritical." (3) Penguin has let Lawrence Lessig provide open access to the entire text of Free Culture under a CC license. Why can't it take the much smaller step of letting John Battelle let Google make his book searchable and discoverable?
Open courseware comes to the Open University
Alexandra Smith, OU to bring all course content online, The Guardian, March 10, 2006. Excerpt:
The Open University will become the first institution in the UK to put all its course materials online later this year, giving all students and teachers free access to study notes and reading lists. The university will select educational resources from all levels from access to postgraduate study and from a full range of subject themes, including arts and history, business and management, languages and science and nature. The material will be free to teachers and students studying in the UK and abroad, with the project following a long partnership with the BBC, which broadcasts the university's television programs....The university's vice-chancellor, Brenda Gourley, said the project would not only benefit the students studying at the university, but also students in countries where they were unable to access text books or quality course material. Prof Gourley said: "The philosophy of open access and sharing knowledge is a wonderful fit with the founding principles of the Open University and with the university's very strong commitment to opening up educational access and widening participation....Prof Gourley said the Open University would be the first in the UK to offer open content material on the internet, following the lead of several US institutions...."[Open courseware] is definitely a movement that is really going to change universities," Prof Gourley said....The £5.65m project will be partly funded by a US$4.45m (£2.56m) grant from the William and Flora Hewlett Foundation in the US. The Open University has more than 210,000 students studying courses this year, with around 40,000 studying outside the UK. The online project will start in October. Update. The first edition of the Guardian story, quoted above, was in error to report that "all" OU's course materials would be part of the new project. The Guardian has since rewritten its story. Also see the OU press release. (Thanks to Marc Eisenstadt.)
More on strengthening the NIH policy
Rick Weiss, Government Health Researchers Pressed to Share Data at No Charge, Washington Post, March 10, 2006. Excerpt:
Political momentum is growing for a change in federal policy that would require government-funded health researchers to make the results of their work freely available on the Internet. Advocates say taxpayers should not have to pay hundreds of dollars for subscriptions to scientific journals to see the results of research they already have paid for. Many journals charge $35 or more just to see one article -- a cost that can snowball as patients seek the latest information about their illnesses. Publishers have successfully fought the "public access" movement for years, saying the approach threatens their subscription base and would undercut their roles as peer reviewers and archivists of scientific knowledge. But the battle lines shifted last month when a National Institutes of Health report revealed that a compromise policy enacted last spring -- in which NIH-funded scientists were encouraged but not required to post their findings on the Internet -- has been a flop. Less than 4 percent filled out the online form to make their results available for public viewing.
Update. Rick Weiss' story made it to the news blog of the Chronicle of Higher Education, where it will be seen by academics who missed it in the Post.
More on the PRC study of the NIH compliance rate
Susan Morrissey, NIH Public Access Policy Is Having Little Impact, Chemical & Engineering News, March 9, 2006. Excerpt:
Although about 85% of NIH-funded researchers say they have heard about NIH’s policy on public access to research articles, only 18% of them report knowing specific details, according to a study by the Publishing Research Consortium (PRC), an international group of publishers and scientific societies. The survey of 1,128 journal authors was conducted in January. It focuses on how well authors who publish in the life sciences and medical journals understand NIH’s public-access policy. That policy, issued in May 2005, asks NIH-funded researchers to voluntarily post their manuscripts on PubMed Central, the agency’s online database, within one year of publication. The survey results also indicate that a lack of understanding about the policy has resulted in low submission rates: 24% of the NIH-funded authors surveyed reported that they have submitted a full manuscript to PubMed Central. Another 43% said they intend to do so in the future. Only 3% said they don’t plan to post manuscripts on the database. “As publishers, we are committed to working with NIH in improving dissemination of and enhancing access to scientific and medical research,” said PRC Chairman Robert Campbell in a statement, adding that the publishing consortium will work with NIH to facilitate author compliance.
PS: See my blogged comment on this study from 3/2/06.
Calling for OA to publicly-funded geospatial data in the UK
Charles Arthur and Michael Cross, Give us back our crown jewels, The Guardian, March 9, 2006. (Thanks to Glyn Moody.) Excerpt:
Imagine you had bought this newspaper for a friend. Imagine you asked them to tell you what's in the TV listings - and they demanded cash before they would tell you. Outrageous? Certainly. Yet that is what a number of government agencies are doing with the data that we, as taxpayers, pay to have collected on our behalf. You have to pay to get a useful version of that data. Think of Ordnance Survey's (OS) mapping data: useful to any business that wanted to provide a service in the UK, yet out of reach of startup companies without deep pockets. This situation prevails across a number of government agencies. Its effects are all bad. It stifles innovation, enterprise and the creativity that should be the lifeblood of new business. And that is why Guardian Technology today launches a campaign - Free Our Data. The aim is simple: to persuade the government to abandon copyright on essential national data, making it freely available to anyone, while keeping the crucial task of collecting that data in the hands of taxpayer-funded agencies. One government makes the data it collects available free to all: the United States. It is no accident that it is also the country that has seen the rise of multiple mapping services (such as Google Maps, Microsoft's MapPoint and Yahoo Maps) and other services - "mashups" - that mesh government-generated data with information created by the companies. The US takes the attitude that data collected using taxpayers' money should be provided to taxpayers free. And a detailed study shows that the UK's closed attitude to its data means we lose out on commercial opportunities, and even hold back scientific research in fields such as climate change....
The case for OA, especially in South Africa
Allison Moller, The case for open access publishing, with special reference to open access journals and their prospects in South Africa, Masters thesis, Department of Library and Information Science, University of the Western Cape (South Africa), 2006. Abstract:
Another defense of Google Library
Victor Keegan, To scan or not to scan, The Guardian, March 8, 2006. Excerpt:
Jennifer Abramsohn, Europe's Quaero Project Aims to Challenge Google, Deutsche Welle, March 9, 2006. Excerpt:
Some Europeans are concerned about US hegemony in the worldwide information market. Now France -- and maybe Germany -- aims to develop a Eurocentric alternative to the dominant Internet search engine, Google....few people doing a search on Google or one of its competitors (Yahoo, MSN, and Alta Vista make up most of the remaining 10 percent of the market) give much thought to the fact that they are using the services of a for-profit US company....But for Wolfgang Sander-Beuermann, head of the search-engine research lab at the University of Hanover and part of a growing cadre of European Google-skeptics, the situation is downright dangerous. "Google is on the way to becoming the most global media power that ever existed on earth, and the potential for misusing it is so enormous it cannot be accepted," said Sander-Beuermann, who also founded the nonprofit Association for the Promotion of Search Engine Technology and Free Access to Knowledge (SuMa)....Another man on the case is French President Jacques Chirac, who is pushing an initiative for France, together with Germany and perhaps other EU nations, to develop a European search engine to compete with the American behemoth. "We must take up the challenge presented by American giants like Google and Yahoo. There is a threat that tomorrow, what is not available online will be invisible to the world," Chirac said in a presidential address to the nation at New Year's. His project, spurred by the French government and headed by an Internet information company called exalead.com, runs under the name Quaero (Latin for "I seek.") Germany's only link to Quaero at present is an agreement between a German meta-search engine, metager.de, and exalead. But several technology and information companies are considering the project. 
They include Siemens, Bertelsmann, and public broadcaster ARD (which Deutsche Welle and this Web site are part of)....Hendrik Speck is a lecturer in computer science at the University of Kaiserslautern, and is a strong Quaero activist. He seconds Sander-Beuermann’s concerns, and cites the example of searching for the keyword "Troy" on Google. "We all know that keyword has a rich cultural history. But at Google, most of the first results are a description of a second-class Hollywood production featuring a third-rate actor," Speck said, referring to the movie starring Brad Pitt....It is not yet clear how much European investors will ultimately be willing to spend, but their quest to build a rival to Google is likely to be very costly. Google spent $7.5 million (6.3 million euros) on one lab with 10 students in its development stages, "which is more than some of our universities have over here," Speck said. Meanwhile, the US search engine behemoth continues to spend $400 million annually on research and development.
Jan Velterop, What is an OA Journal? The Parachute, March 8, 2006. Excerpt:
"Currently, the ISI Web of Knowledge includes 298 Open Access journals", according to Thomson Scientific. We also have the Directory of Open Access Journals (DOAJ), reporting (March 8, 2006) that it includes 2089 OA journals. What, however, are 'Open Access Journals'? Do they exist? What's the definition? Journals that publish OA articles, or journals that publish only OA articles? Same question with regard to Open Access Publishers. Comment. Good point. BioMed Central's journals, for example, are unmistakably OA, but some of them include non-OA commissioned content, like review articles, alongside OA research articles. One property of OA journals is that they provide OA to their OA articles themselves and don't merely permit authors to do it through OA archiving. But that doesn't settle the question whether a certain portion of a journal's articles must be OA for the journal itself to be considered OA. It would be tempting to conclude that "full OA journals" and "hybrid OA journals" differ only in degree, not in kind. But that's not quite accurate either, since there's an important difference, in kind, between journals that let authors choose between OA and TA and journals that have already decided to make all their articles (of a certain kind) OA. Report on ERIC Users Group Meeting
The ERIC Users Information Exchange has posted a report on the ERIC Users Group Meeting at the ALA Midwinter 2006 meeting.
PubChem now contains bioassay data from the Southern Research Institute Molecular Libraries Screening Center (SRMLSC).
Increasing the diffusion rate of scientific knowledge
Walt Warnick, Global Discovery: Increasing the Pace of Knowledge Diffusion to Increase the Pace of Science, a talk at the AAAS annual meeting, February 16–20, 2006. Warnick is the Director of the US Department of Energy's Office of Scientific and Technical Information. Excerpt:
Science is all about the flow of knowledge....According to the National Science Foundation, there are over 2.5 million research workers worldwide, with more than 1.2 million in the U.S. alone. If we look at all the articles, reports, emails and conversations that pass between them, we could count billions of knowledge transactions every year. This incredible diffusion of knowledge is the very fabric of science. Given that the diffusion of knowledge is central to science, it behooves us to see if we can accelerate it. We note that diffusion takes time. Sometimes it takes a long time. Every diffusion process has a speed. Our thesis is that speeding up diffusion will accelerate the advancement of science....Currently it is difficult for researchers, who primarily track journals within their specific discipline, to hear about discoveries made in distant scientific communities. In fact, diffusion across distant communities can take years. In contrast, within an individual scientific community, internal communication systems are normally quicker. These include journals, conferences, email groups, and other outlets that ease communication. Many communities use related methods and concepts: mathematics, instrumentation, and computer applications. Thus there is significant potential for diffusion ACROSS communities, including very distant communities. We see this as an opportunity....Diffusion to distant communities takes a long time because it often proceeds sequentially, typically spreading from the community of origin (A) to a neighbor (B), then to community (C), a neighbor of B, and so on. This happens because neighboring communities are in fairly close contact. Science will progress faster if this diffusion lag time is diminished. The concept of global discovery is to transform this sequential diffusion process into a parallel process....We are particularly interested in recent work that applies models of disease dynamics to the spread of scientific ideas.
The spread of new ideas in science is mathematically similar to the spread of disease, even though one produces positive results, the other negative. Our goal is to foster epidemics of new knowledge....Looking at these models has led us to focus on a parameter called the contact rate. In the disease model, this is the rate at which people come into contact with a person who has the disease. Increasing the contact rate speeds up the spread of the disease....To [increase the contact rate for knowledge] we must reduce a huge gap in how the Internet works today....Analysts estimate that perhaps 99 percent of all the Web-accessible scientific documents are in deep Web databases. Because these documents are not accessible to search engines and robots, this creates a huge gap in knowledge searchability. The problem of accessing all this deep Web science mirrors the problem of diffusion across distant communities. This is because many of the deep Web databases are maintained within specific communities, including specialized journals, scientific societies, university departments, or with individual researchers. Within each community the deep Web document repositories are typically well known. But they are hard for a scientist in a distant community to find. Worse, once found, each repository must be searched sequentially, making widespread search prohibitively difficult....We have begun to close this gap and solve the sequential search problem. Conceptually the solution is simple. It is simultaneous deep Web search with integrated ranking of results. All it takes is virtual aggregation or federation of diverse deep Web databases. The federated databases are searched in parallel, not sequentially. This greatly increases the contact rate across distant communities, speeding up the diffusion of new knowledge. We call this result Global Discovery. It means making each original discovery globally available. 
Federated deep Web search transforms local discovery into global discovery. While the concept is simple, making it a reality is not. The current challenge of metasearch is that the number of databases that can be searched simultaneously is limited. That's a tough problem to solve, and one that we're working on....When trying to integrate information from diverse sources, it is important to avoid adding burdens to information owners. The history of information management has seen a number of instances where seemingly promising efforts to integrate information have been hampered because too few information owners signed on: Government Information Locator System (GILS), Open Archive Initiative (OAI), Institutional Repositories, and others. While DOE adopted the protocols advanced by these efforts, too often few other information owners did so. Our view is that these efforts stumbled because they placed demands on the information owners who did not enjoy the benefits. In contrast, we believe that those who seek to integrate information from diverse sources need to bear the burdens themselves.
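The parallel, federated search with integrated ranking that Warnick describes can be illustrated with a minimal Python sketch. This is not OSTI's implementation: the three in-memory databases, the names `DATABASES`, `search_one`, and `federated_search`, and the simple term-count scoring are all assumptions made for illustration only.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for deep Web databases kept by separate communities.
# Each maps a document title to its text. None of this data is from the talk.
DATABASES = {
    "physics_db": {
        "Lattice methods": "monte carlo methods for lattice simulation",
        "Detector notes": "calibration of detector instrumentation",
    },
    "biology_db": {
        "Epidemic spread": "monte carlo simulation of epidemic contact rate",
    },
    "chemistry_db": {
        "Reaction kinetics": "diffusion rate in reaction kinetics",
    },
}


def search_one(name, documents, query):
    """Search a single database; return (score, source, title) per match."""
    terms = query.lower().split()
    hits = []
    for title, text in documents.items():
        words = text.lower().split()
        # Assumed scoring: count how often the query terms occur in the text.
        score = sum(words.count(term) for term in terms)
        if score > 0:
            hits.append((score, name, title))
    return hits


def federated_search(query, databases=DATABASES):
    """Query every database in parallel, then rank all hits in one list."""
    with ThreadPoolExecutor(max_workers=max(1, len(databases))) as pool:
        futures = [
            pool.submit(search_one, name, docs, query)
            for name, docs in databases.items()
        ]
        merged = [hit for future in futures for hit in future.result()]
    # Integrated ranking: highest score first, ties broken by source and title.
    return sorted(merged, key=lambda hit: (-hit[0], hit[1], hit[2]))


if __name__ == "__main__":
    for score, source, title in federated_search("diffusion rate"):
        print(score, source, title)
```

In this sketch the searcher pays the cost of fan-out and merging, and the databases themselves are left unchanged, which matches the excerpt's point that the integrator rather than the information owner should bear the burden.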
Heather Joseph, The Scholarly Publishing and Academic Resources Coalition: An evolving agenda, C&RL News, February 2006. Heather Joseph is the executive director of SPARC. Excerpt:
SPARC is, first and foremost, a strategic organization, and its agenda and programs have therefore evolved over its lifetime. In this article, I will sketch SPARC’s past accomplishments, outline ways that the organization has evolved, and share a sense of the direction SPARC will move in during the coming year....SPARC was created by the Association of Research Libraries in 1998, to serve as a catalyst for action and to reduce barriers to the access and use of information. As an alliance of more than 200 academic and research libraries, SPARC’s mission is to correct imbalances in the scholarly publishing system that have driven up the cost of scholarly journals and diminished the community’s ability to access information. At the core of our mission is the belief that these imbalances inhibit the advancement of scholarship and are at odds with fundamental needs of scholars and the academic enterprise. Since 2002, SPARC’s highest priority and most visible activity has centered on advancing the goal of open access to scholarly literature, and this will continue to be our main focus.... Update. Also see Heather's presentation at the UBC Library / SLAIS Colloquium, University of British Columbia Library, SPARC Futures : an evolving agenda.
How well do search engines index the OA repositories?
Frank McCown and three co-authors, Search Engine Coverage of the OAI-PMH Corpus, IEEE Internet Computing, March/April 2006.
Abstract: The major search engines are competing to index as much of the Web as possible. Having indexed much of the surface Web, search engines are now using a variety of approaches to index the deep Web. At the same time, institutional repositories and digital libraries are adopting the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) to expose their holdings, some of which are indexed by search engines and some of which are not. To determine how much of the current OAI-PMH corpus search engines index, we harvested nearly 10M records from 776 OAI-PMH repositories. From these records we extracted 3.3M unique resource identifiers and then conducted searches on samples from this collection. Of this OAI-PMH corpus, Yahoo indexed 65%, followed by Google (44%) and MSN (7%). Twenty-one percent of the resources were not indexed by any of the three search engines.
U of Tennessee libraries launch all-OA academic press
Newfound Press is a new digital imprint from the University of Tennessee University Libraries. All its publications will be OA. From the site:
Today’s scholarly publishing environment presents a strategic opportunity for academic libraries to expand their role in the publications process. Universities are both creators and consumers in the information economy. A digital library press offers the potential for making scholarly and specialized resources widely available at a reasonable cost. The University of Tennessee Libraries is developing a framework to make scholarly and specialized works available worldwide. Newfound Press, the University Libraries digital imprint, advances the community of learning by experimenting with effective and open systems of scholarly communication. Drawing on the resources that the university has invested in digital library development, Newfound Press collaborates with authors and researchers to bring new forms of publication to an expanding scholarly universe. We consider manuscripts in all disciplines, encompassing scientific research, humanistic scholarship, and artistic creation.
It will publish OA journals as well as OA books and OA multimedia scholarship. It only asks for non-exclusive rights from authors and offers CC licenses as an option. It works in partnership with the University of Tennessee institutional repository. And it links to Open Access News from the front page --from the phrase, Open Access. What's in it for us?
PS: Kudos to the UT librarians! I wish this enterprise well and hope other universities consider testing the same waters.
Update. Also see Scott Teague's article about the launch in the Daily Beacon Online.
More on Sun's open-source education initiative
Darryl K. Taft, Sun to Open Source Education, ExtremeNano, March 7, 2006. Excerpt:
Sun Microsystems is taking a cue from its successes with open source to help shape the future of education and bridge the digital divide, according to the company's chief executive, Scott McNealy. In a speech at Sun's WWERC (Worldwide Education and Research Conference) here [in NY] on March 7, McNealy said Sun has spun out its GELC (Global Education and Learning Community) effort into a nonprofit organization aimed at delivering self-paced, Web-based, free and open content --including curriculum, resources and assessment-- for the K-12 segment. Or, as McNealy put it, GELC is "open-sourcing education." McNealy said, "[The] opportunity here is to apply all the community development to textbooks, curriculum and assessment for K-12. So with the help of some folks at Sun we created the GELC, with 2,700 members worldwide and 370-plus projects."
From a Sun press release (March 7, 2006): Sun broke new ground in free and open-source computing in the creation of this nonprofit which aims to meet the needs of students by sharing best practices globally. The group named an executive director at the conference, Dr. Barbara "Bobbi" Kurshan, formerly President of Educorp Consultants Corporation, and co-CEO of Core Learning Group, Private Equity Fund. The director will lead an advisory board with representatives from nearly every continent to extend the vision for this group. The GELC Executive Director directs all activities of the GELC, including managing the various working groups, monitoring technical developments, overseeing the education community process, managing the creation of GELC specifications and representing the GELC to external organizations.
Wiley on the RCUK policy and Commons debate
In the press release accompanying its third-quarter revenue report, John Wiley & Sons made a point of saying:
In December, the U.K. Parliament conducted a debate on the Science and Technology Select Committee's report on scientific publications, and reiterated its position that the government should not intervene in the market nor fund institutional repositories. Comment. Wiley must think this news is relevant to the value of its stock. But if so, then it should be careful about drawing attention to it and then misreporting it. For in fact, the U.K. Parliament did not oppose the funding of institutional repositories. Some members did and some members didn't; there was no vote or other resolution. See the transcript of the December debate. Moreover, funding institutional repositories is less important than mandating that publicly-funded researchers deposit their peer-reviewed manuscripts in them. That is still the policy proposed by the RCUK.
Web site aims to be research 'storehouse', eSchool News, March 7, 2006. An unsigned news story. Excerpt:
A new internet research tool called Digital Universe aspires to be a more authoritative version of Wikipedia. If successful, it could provide scholars and students with one more option for finding accurate, reliable information online. Skeptics, however, predict that Digital Universe is too ambitious for long-term success....It's a lofty ambition --the internet equivalent of the Public Broadcasting Service, its founders say, a user-supported resource that pays top academics to create authoritative maps, articles, and links to third-party content related to virtually any scholarly topic. But the vast scope of the project hasn't stopped former high-flying Silicon Valley entrepreneur Joe Firmage from building Digital Universe, a commercial-free internet research clearinghouse four years in the making....A pilot version that debuted in January includes 50 or so portals, or entry points, on topics such as technology, the Earth, and the solar system. Firmage says it will mushroom to at least 500 portals by next year and 10,000 by 2011. Clicking on the Earth portal, for example, presents the visitor with links, reportedly vetted by experts for accuracy, to related articles, images, lists of frequently asked questions, and other resources from sites such as MSNBC.com, NASA, and the University of Hawaii's department of geology and geophysics. The Earth portal is also a jumping-off point to sub-portals on topics such as the atmosphere and hydrosphere, which in turn provide links to vetted content and further sub-portals.
The approach is designed to give visitors a graphical means to find topics and understand how they are related to subjects in another category....Firmage and his backers say Digital Universe's biggest asset is the trust readers will feel knowing that every link, graphic, and article has been vetted by an army of academics....The site has been under construction since 2002 by Scotts Valley, Calif.-based ManyOne Networks, a 56-employee company that has received about $10 million in financing from Firmage and angel investors. ManyOne Networks has been recruiting professors to become "stewards" of each portal and building offerings such as eMail services to generate revenue. Digital Universe seeks to improve on the ground broken by Wikipedia, the online encyclopedia that allows anyone to contribute and edit articles. Wikipedia's volunteer model offers an impressive body of content, boasting 1 million articles in English on everything from art deco to nuclear physics. But Wikipedia's open system also has led to the publication of fraudulent articles, and authors sometimes have undisclosed conflicts of interest, critics have charged. Instead of relying on anonymous volunteers, Digital Universe will pay experts, mostly academics, to write encyclopedia articles and to round up outside video, audio, online chats, and other resources. Firmage has pledged that access to basic content on Digital Universe will remain free forever and that it will never include ads. To fund the venture, the site will sell monthly subscriptions that let visitors get additional content and features, many of them offered by for-profit third parties, such as film producers, game makers, map providers, and book publishers. "Imagine how many people would be interested in subscribing for $7.95 per month to get all those additional activities," Firmage said. He predicts the site will have at least 10 million paying subscribers within seven years. 
(At the end of February, it was reported, Digital Universe had more than 10,000 subscribers.)...Academics and others contributing content will get 25 percent of the proceeds, but the money isn't the only motivation for participating, said Peter Saundry, a physicist with the nonpartisan National Council for Science and the Environment. He heads the group responsible for Digital Universe's environmental portal. "At every scientific meeting you ever go to on any subject, one thing you hear is the general public doesn't understand what we're doing," Saundry said. "This now is a tool for the scientific community to [help inform the public]."
An OA guide to performing abortions
Alex Steffen, Open Access and Reproductive Rights, WorldChanging, March 7, 2006. Excerpt:
Yesterday, South Dakota banned abortion. Normally, we'd steer clear of a hot-button topic like abortion, but this law has also triggered a small firestorm around the blog of a woman named Molly, who last week put issues of open access to scientific knowledge in sharp relief by publishing a guide to setting up a cheap, safe, mobile abortion clinic for use in places where abortion has been criminalized....Like other principles of free expression, open access to scientific knowledge often seems absurdly removed from our lives. Molly shows just how tangible such knowledge can be. Science is, above all else, a moral commitment to openly and freely discussing the actual functioning of the universe (and of our bodies). People went to the stake to make science a going concern. Not all that long ago, the information Molly is sharing would have made her a criminal in many countries, just as sharing information on contraception, or evolution, or the fact that the Earth moves around the Sun all once made scientists criminals. What knowledge now being acquired will politicians take it on themselves to criminalize in the future?
New way to browse and search arXiv
Xstructure is a new way to browse and search arXiv. (Thanks to Richard Akerman.) From the site:
Among the features of this service are: [1] Automated generation of hierarchical classification scheme for the papers. The scheme results from classification of the papers from the arxiv database. The EqRank algorithm did the classification. The only input for the classification is the citation graph. The number of the levels in the hierarchy and the number of the clusters is determined by the algorithm. Generally, there are no external parameters (e.g., a preset list of clusters) in the algorithm. The algorithm creates the classification scheme, and indexes the papers by the created classification; [2] The classification is used to index the new papers. We plan to rebuild the classification scheme regularly. In this way, we will take into account that appearance of new papers may lead to emergence of new themes. Detection of new themes is one of our objectives; [3] A number of extra attributes (e.g. Theme name, Authority and Reference Articles, etc.) for the elements (themes) of the classification (see Help); [4] Accessibility of the classification in response to search requests via display options, e.g., display as Tree of Themes, and Reference (Citation) Tree. At the moment, the service is available only for the hep-th sector of arxiv.org. Hopefully, it will be extended to cover a number of other sectors.