Latest in cs.dl

Visualizing a Field of Research: A Methodology of Systematic Scientometric ReviewsJun 11 2019Systematic scientometric reviews, empowered by scientometric and visual analytic techniques, offer opportunities to improve the timeliness, accessibility, and reproducibility of conventional systematic reviews. While increasingly accessible science mapping ... More
Achieving competitive advantage in academia through early career coauthorship with top scientistsJun 11 2019We quantify the long term impact that the coauthorship with established top-cited scientists has on the career of junior researchers in four different scientific disciplines. Through matched pair analysis, we find that junior researchers who coauthor ... More
EXmatcher: Combining Features Based on Reference Strings and Segments to Enhance Citation MatchingJun 11 2019Citation matching is a challenging task due to different problems such as the variety of citation styles, mistakes in reference strings and the quality of identified reference segments. The classic citation matching configuration used in this paper is ... More
Qualifying threshold of take off stage for successfully disseminated creative ideasJun 10 2019The creative process is essentially Darwinian and only a small proportion of creative ideas are selected for further development. However, the threshold that identifies this small fraction of successfully disseminated creative ideas at their early stage ... More
What Do Citation Counts Measure? An Updated Review of Studies on Citations in Scientific Documents Published between 2006 and 2018Jun 10 2019The purpose of this paper is to update the review of Bornmann and Daniel (2008) presenting a narrative review of studies on citations in scientific documents. The current review covers 38 studies published between 2006 and 2018. Bornmann and Daniel (2008) ... More
Indicators of Open Access for universitiesJun 10 2019This paper presents a first attempt to analyse Open Access integration at the institutional level. For this, we combine information from Unpaywall and the Leiden Ranking to offer basic OA indicators for universities. We calculate the overall number of ... More
Do Authors Deposit on Time? Tracking Open Access Policy ComplianceJun 07 2019Recent years have seen fast growth in the number of policies mandating Open Access (OA) to research outputs. We conduct a large-scale analysis of over 800 thousand papers from repositories around the world published over a period of 5 years to investigate: ... More
Please, no more scientific journals! The strategy of the scientific publication systemJun 07 2019In the same way ecosystems tend to increase maturity by decreasing the flow of energy per unit biomass, we should move towards a more mature science by publishing fewer but higher-quality papers and getting away from joining large teams in small roles. That ... More
How a Single Paper Affects the Impact Factor: Implications for Scholarly PublishingJun 06 2019Because the Impact Factor (IF) is an average quantity and most journals are small, IFs are volatile. We study how a single paper affects the IF using data from 11639 journals in the 2017 Journal Citation Reports. We define as volatility the IF gain (or ... More
Large publishing consortia produce higher citation impact research but co-author contributions are hard to evaluateJun 05 2019This paper introduces a simple agglomerative clustering method to identify large publishing consortia with at least 20 authors and 80% shared authorship between articles. Based on Scopus journal articles 1996-2018, under these criteria, nearly all (88%) ... More
Tracing Antisemitic Language Through Diachronic Embedding Projections: France 1789-1914Jun 04 2019We investigate some aspects of the history of antisemitism in France, one of the cradles of modern antisemitism, using diachronic word embeddings. We constructed a large corpus of French books and periodicals issues that contain a keyword related to Jews ... More
A gender analysis of top scientists' collaboration behavior: evidence from ItalyJun 03 2019This work analyzes the differences in collaboration behavior between males and females among a particular type of scholars: top scientists, and as compared to non top scientists. The field of observation consists of the Italian academic system and the ... More
DFS: A Dataset File System for Data Discovering UsersMay 31 2019Many research questions can be answered quickly and efficiently using data already collected for previous research. This practice is called secondary data analysis (SDA), and has gained popularity due to lower costs and improved research efficiency. In ... More
Exploring the Effects of Data Set Choice on Measuring International Research Collaboration: an Example Using the ACM Digital Library and Microsoft Academic GraphMay 30 2019International research collaboration (IRC) measurement is important because countries can and want to benefit from international collaboration but performing the same measurement procedure on different data sets can lead to different results. This study ... More
Enriching Bibliographic Data by Combining String Matching and the Wikidata Knowledge Graph to Improve the Measurement of International Research CollaborationMay 30 2019Measuring international research collaboration is necessary when evaluating, for example, the efficacy of policy meant to increase cooperation between countries, but is currently very difficult as bibliographic records contain only affiliation data from ... More
MementoMap Framework for Flexible and Adaptive Web Archive ProfilingMay 29 2019In this work we propose MementoMap, a flexible and adaptive framework to efficiently summarize holdings of a web archive. We described a simple, yet extensible, file format suitable for MementoMap. We used the complete index of the Arquivo.pt comprising ... More
Archive Assisted Archival Fixity Verification FrameworkMay 29 2019The number of public and private web archives has increased, and we implicitly trust content delivered by these archives. Fixity is checked to ensure an archived resource has remained unaltered since the time it was captured. Some web archives do not ... More
Using Micro-collections in Social Media to Generate Seeds for Web Archive CollectionsMay 29 2019In a Web plagued by disappearing resources, Web archive collections provide a valuable means of preserving Web resources important to the study of past events ranging from elections to disease outbreaks. These archived collections start with seed URIs ... More
Paper Matching with Local Fairness ConstraintsMay 28 2019Automatically matching reviewers to papers is a crucial step of the peer review process for venues receiving thousands of submissions. Unfortunately, common paper matching algorithms often construct matchings suffering from two critical problems: (1) ... More
Social Cards Probably Provide For Better Understanding Of Web Archive CollectionsMay 27 2019May 30 2019Used by a variety of researchers, web archive collections have become invaluable sources of evidence. If a researcher is presented with a web archive collection that they did not create, how do they know what is inside so that they can use it for their ... More
Document Embeddings vs. Keyphrases vs. Terms: An Online Evaluation in Digital Library Recommender SystemsMay 27 2019Many recommendation algorithms are available to digital library recommender system operators. The effectiveness of algorithms is largely unreported by way of online evaluation. We compare a standard term-based recommendation approach to two promising ... More
Attributing and Referencing (Research) Software: Best Practices and Outlook from InriaMay 27 2019Software is a fundamental pillar of modern scientific research, not only in computer science, but actually across all fields and disciplines. However, there is a lack of adequate means to cite and reference software, for many reasons. An obvious first reason ... More
Statistical Significance Testing in Information Retrieval: An Empirical Analysis of Type I, Type II and Type III ErrorsMay 27 2019Jun 05 2019Statistical significance testing is widely accepted as a means to assess how well a difference in effectiveness reflects an actual difference between systems, as opposed to random noise because of the selection of topics. According to recent surveys on ... More
Does the $h_\alpha$ index reinforce the Matthew effect in science? Agent-based simulations using Stata and RMay 27 2019Recently, Hirsch (2019a) proposed a new variant of the h index called the $h_\alpha$ index. He formulated it as follows: "we define the $h_\alpha$ index of a scientist as the number of papers in the h-core of the scientist (i.e. the set of papers that contribute ... More
Shared Feelings: Understanding Facebook Reactions to Scholarly ArticlesMay 27 2019Research on social-media platforms has tended to rely on textual analysis to perform research tasks. While text-based approaches have significantly increased our understanding of online behavior and social dynamics, they overlook features on these platforms ... More
TrendNets: Mapping Research Trends From Dynamic Co-Word Networks via Sparse RepresentationMay 27 2019Mapping the knowledge structure from word co-occurrences in a collection of academic papers has been widely used to provide insight into the topic evolution in an arbitrary research field. In a traditional approach, the paper collection is first divided ... More
Citation concept analysis (CCA) - A new form of citation analysis revealing the usefulness of concepts for other researchers illustrated by two exemplary case studies including classic books by Thomas S. Kuhn and Karl R. PopperMay 23 2019In recent years, the full text of papers has become increasingly available electronically, which opens up the possibility of quantitatively investigating citation contexts in more detail. In this study, we introduce a new form of citation analysis, which we call ... More
Patent Citation Dynamics Modeling via Multi-Attention Recurrent NetworksMay 22 2019Modeling and forecasting forward citations to a patent is a central task for the discovery of emerging technologies and for measuring the pulse of inventive progress. Conventional methods for forecasting these forward citations cast the problem as analysis ... More
R package for producing beamplots as a preferred alternative to the h index when assessing single researchers (based on downloads from Web of Science)May 22 2019We propose the use of beamplots - which can be produced by using the R package BibPlots and WoS downloads - as a preferred alternative to h index values for assessing single researchers.
From heterogeneous data to heterogeneous public: thoughts on transmedia applications for digital heritage research and disseminationMay 22 2019In recent years, we have seen a tenfold increase in volume and complexity of digital data acquired for cultural heritage documentation. Meanwhile, open data and open science have become leading trends in digital humanities. The convergence of those two ... More
A Scalable Hybrid Research Paper Recommender System for Microsoft AcademicMay 21 2019We present the design and methodology for the large scale hybrid paper recommender system used by Microsoft Academic. The system provides recommendations for approximately 160 million English research papers and patents. Our approach handles incomplete ... More
Software Citation Implementation ChallengesMay 21 2019The main output of the FORCE11 Software Citation working group (https://www.force11.org/group/software-citation-working-group) was a paper on software citation principles (https://doi.org/10.7717/peerj-cs.86) published in September 2016. This paper laid ... More
Why Machines Cannot Learn Mathematics, YetMay 20 2019Nowadays, Machine Learning (ML) is seen as the universal solution to improve the effectiveness of information retrieval (IR) methods. However, while mathematics is a precise and accurate science, it is usually expressed by less accurate and imprecise ... More
Mining university rankings: Publication output and citation impact as their basisMay 17 2019World University rankings have become well-established tools that students, university managers and policy makers read and use. Each ranking claims to have a unique methodology capable of measuring the 'quality' of universities. The purpose of this paper ... More
The CEDAR Workbench: An Ontology-Assisted Environment for Authoring Metadata that Describe Scientific ExperimentsMay 16 2019The Center for Expanded Data Annotation and Retrieval (CEDAR) aims to revolutionize the way that metadata describing scientific experiments are authored. The software we have developed--the CEDAR Workbench--is a suite of Web-based tools and REST APIs ... More
Missing Movie Synergistic Completion across Multiple Isomeric Online Movie Knowledge LibrariesMay 15 2019Online knowledge libraries refer to the online data warehouses that systematically organize and categorize the knowledge-based information about different kinds of concepts and entities. In the era of big data, the setup of online knowledge libraries ... More
A human-inspired recognition system for premodern Japanese historical documentsMay 14 2019Recognition of historical documents is a challenging problem due to noisy, damaged characters and backgrounds. Japanese historical documents not only exhibit these problems; pre-modern Japanese characters were also written in cursive ... More
On the share of mathematics published by Elsevier and SpringerMay 13 2019For-profit editors such as Elsevier and Springer have been subject to sustained criticism from academics and university libraries, including calls to boycott, and discontinued subscriptions. Mathematicians have played a particularly active role in this ... More
Transfer Learning for Scientific Data Chain Extraction in Small Chemical Corpus with BERT-CRF ModelMay 13 2019Computational chemistry has developed fast in recent years due to the rapid growth and breakthroughs in AI. Thanks to the progress in natural language processing, researchers can extract more fine-grained knowledge from publications to stimulate the development ... More
Collecting 16K archived web pages from 17 public web archivesMay 09 2019We document the creation of a data set of 16,627 archived web pages, or mementos, of 3,698 unique live web URIs (Uniform Resource Identifiers) from 17 public web archives. We used four different methods to collect the dataset. First, we used the Los Alamos ... More
How to interpret algorithmically constructed topical structures of research specialties? A case study comparing an internal and an external mapping of the topical structure of invasion biologyMay 09 2019In our paper we seek to address a shortcoming in the scientometric literature, namely that, given the proliferation of algorithmic approaches to topic detection from bibliometric data, there is a relative lack of studies that validate and create a deeper ... More
Solo citations, duet citations, and prelude citations: New measures of the disruption of academic papersMay 09 2019May 10 2019It is important to measure the disruption of academic papers. According to the characteristics of three different kinds of citations, this paper borrows musical vocabulary and names them solo citations (SC), duet citations (DC), and prelude citations ... More
Forms of Plagiarism in Digital Mathematical LibrariesMay 08 2019We report on an exploratory analysis of the forms of plagiarism observable in mathematical publications, which we identified by investigating editorial notes from zbMATH. While most cases we encountered were simple copies of earlier work, we also identified ... More
Interdisciplinary Relationships Between Biological and Physical SciencesMay 08 2019Several interdisciplinary areas have appeared at the interface between biological and physical sciences. In this work, we suggest a complex network-based methodology for analyzing the interrelationships between some of these interdisciplinary areas, including ... More
Pilot evaluation of Collection API with PID Kernel InformationMay 08 2019Persistent Identifier (PID) is a widely used long-term unique reference to digital objects. Meanwhile, Handle, one of the main persistent identifier schemes in use, implements a central global registry to resolve PIDs. The value of Handle varies in sizes ... More
Authorship analysis of specialized vs diversified research outputMay 07 2019The present work investigates the relations between amplitude and type of collaboration (intramural, extramural domestic or international) and output of specialized versus diversified research. By specialized or diversified research, we mean within or ... More
Does Environmental Economics lead to patentable research?May 07 2019In this feasibility study, the impact of academic research from social sciences and humanities on technological innovation is explored through a study of citations patterns of journal articles in patents. Specifically we focus on citations of journals ... More
Crediting multi-authored papers to single authorsMay 06 2019A fair assignment of credit for multi-authored publications is a long-standing issue in scientometrics. In the calculation of the $h$-index, for instance, all co-authors receive equal credit for a given publication, independent of a given author's contribution ... More
Impact factors, Within-Journal Self-citations, and the Pinski-Narin Influence Weights: An analysis using both subfields and the complete Journal Citation Reports 2017May 05 2019The Journal Impact Factor (JIF) is linearly sensitive to self-citations because each self-citation adds to the numerator, whereas the denominator is not affected. Pinski & Narin (1976) derived the Influence Weight (IW) as an alternative to Garfield's ... More
The Price of Gold: Curiosity?May 02 2019Gold open access as characterised by the payment of an article processing charge (APC) has become one of the dominant models in open access publication. This paper examines an extreme hypothetical case in which the APC model is the only model and the ... More
The Literary Theme Ontology for Media Annotation and Information RetrievalMay 01 2019Literary theme identification and interpretation is a focal point of literary studies scholarship. Classical forms of literary scholarship, such as close reading, have flourished with scarcely any need for commonly defined literary themes. However, the ... More
Time-series Insights into the Process of Passing or Failing Online University Courses using Neural-Induced Interpretable Student StatesMay 01 2019This paper addresses a key challenge in Educational Data Mining, namely to model student behavioral trajectories in order to provide a means for identifying students most at-risk, with the goal of providing supportive interventions. While many forms of ... More
The State of Open Access in Germany: An Analysis of the Publication Output of German UniversitiesApr 30 2019May 03 2019Starting with the Berlin declaration in 2003, Open Access (OA) publishing has established a new era of scholarly communication due to the unrestricted electronic access to peer reviewed publications. OA offers a number of benefits like e.g. increased ... More
Female citation impact superiority 1996-2018 in six out of seven English-speaking nationsApr 29 2019Efforts to combat continuing gender inequalities in academia need to be informed by evidence about where differences occur. Citations are relevant as potential evidence in appointment and promotion decisions, but it is unclear whether there have been ... More
Author name disambiguation of bibliometric data: A comparison of several unsupervised approachesApr 29 2019Adequately disambiguating author names in bibliometric databases is a precondition for conducting reliable analyses at the author level. In the case of bibliometric studies that include many researchers, it is not possible to disambiguate each single ... More
Venue Analytics: A Simple Alternative to Citation-Based MetricsApr 29 2019We present a method for automatically organizing and evaluating the quality of different publishing venues in Computer Science. Since this method only requires paper publication data as its input, we can demonstrate our method on a large portion of the ... More
Community Detection and Growth Potential Prediction Using the Stochastic Block Model and the Long Short-term Memory from Patent Citation NetworksApr 23 2019Scoring patent documents is very useful for technology management. However, conventional methods are based on static models and, thus, do not reflect the growth potential of the technology cluster of the patent. Because even if the cluster of a patent ... More
Sleeping Beauties in Medical Research: Technological Relevance, High Scientific ImpactApr 16 2019We investigate Sleeping Beauties (SBs) in medical research with a special focus on SBs cited in patents. We find that the increasing trend of the relative number of SBs comes to an end around 1998. However, still a constant fraction of publications becomes ... More
Go Wide, Go Deep: Quantifying the Impact of Scientific Papers through Influence Dispersion TreesApr 16 2019Despite a long history of use of citation count as a measure to assess the impact or influence of a scientific paper, the evolution of follow-up work inspired by the paper and their interactions through citation links have rarely been explored to quantify ... More
Female scholars need to achieve more for equal public recognitionApr 12 2019Apr 16 2019Different kinds of "gender gap" have been reported in different walks of the scientific life, almost always favouring male scientists over females. In this work, for the first time, we present a large-scale empirical analysis to ask whether female scientists ... More
Explore with caution: mapping the evolution of scientific interest in PhysicsApr 12 2019In the book The Essential Tension Thomas Kuhn described the conflict between tradition and innovation in scientific research --i.e., the desire to explore new promising areas, counterposed to the need to capitalize on the work done in the past. While ... More
COCI, the OpenCitations Index of Crossref open DOI-to-DOI citationsApr 12 2019In this paper, we present COCI, the OpenCitations Index of Crossref open DOI-to-DOI citations (http://opencitations.net/index/coci). COCI is the first open citation index created by OpenCitations, in which we have applied the concept of citations as first-class ... More
Ontologies-based Architecture for Sociocultural Knowledge Co-Construction SystemsApr 11 2019Considering the evolution of the semantic wiki engine based platforms, two main approaches could be distinguished: Ontologies for Wikis (OfW) and Wikis for Ontologies (WfO). OfW vision requires existing ontologies to be imported. Most of them use the ... More
AMRec: An Intelligent System for Academic Method RecommendationApr 10 2019Finding new academic Methods for research problems is the key task in a researcher's research career. It is usually very difficult for new researchers to find good Methods for their research problems since they lack research experience. In order to ... More
Citation gaming induced by bibliometric evaluation: a country-level comparative analysisApr 09 2019It is several years since national research evaluation systems around the globe started making use of quantitative indicators to measure the performance of researchers. Nevertheless, the effects of these systems on the behavior of the evaluated researchers ... More
Providing Advanced Access to Historical War Memoirs Through the Identification of Events, Participants and RolesApr 08 2019The progressive digitization of historical archives provides new, often domain-specific, textual resources that report on facts and events that happened in the past; among them, memoirs are a very common type of primary source. In this paper, we present an ... More
Journal ranking should depend on the level of aggregationApr 08 2019Journal ranking, that is, placing journals within their respective fields, is an important research evaluation tool. Several indices have been suggested for this purpose, typically on the basis of citation graph. We follow an axiomatic approach and find ... More
Journal ranking should depend on the level of aggregationApr 08 2019Apr 25 2019Journal ranking, that is, placing journals within their respective fields, means an important research evaluation tool. Several indices have been suggested for this purpose, typically on the basis of a citation graph between the journals. We follow an ... More
Reproducible Research is more than Publishing Research Artefacts: A Systematic Analysis of Jupyter Notebooks from Research ArticlesApr 08 2019With the advent of Open Science, researchers have started to publish their research artefacts (i. e., data, software, and other products of the investigations) in order to allow others to reproduce their investigations. While this publication is beneficial ... More
Unsupervised Domain Adaptation of Contextualized Embeddings: A Case Study in Early Modern EnglishApr 04 2019Contextualized word embeddings such as ELMo and BERT provide a foundation for strong performance across a range of natural language processing tasks, in part by pretraining on a large and topically-diverse corpus. However, the applicability of this approach ... More
Software Tools for Big Data Resources in Family Names DictionariesApr 02 2019This paper describes the design and development of specific software tools used during the creation of Family Names in Britain and Ireland (FaNBI) research project, started by the University of the West of England in 2010 and finished successfully in ... More
Mapping social media attention in Microbiology: Identifying main topics and actorsApr 01 2019This paper aims to map and identify topics of interest within the field of Microbiology and identify the main sources driving such attention. We combine data from Web of Science and Altmetric.com, a platform which retrieves mentions to scientific literature ... More
Distribution of scientific journals impact factorMar 31 2019We consider the distribution of scientific journals' impact factors. Analysing the 9028 scientific journals with the largest impact factors, we found that their distribution is stable from year to year (at least for the analysed years 2011-2013), and it has the character ... More
Simulating Imperial Dynamics and Conflict in the Ancient WorldMar 27 2019Apr 01 2019The development of models to capture large-scale dynamics in human history is one of the core contributions of the cliodynamics field. Crucially and most often, these models are assessed by their predictive capability on some macro-scale and aggregated ... More
Highly cited references in PLOS ONE and their in-text usage over timeMar 27 2019In this article, we describe highly cited publications in a PLOS ONE full-text corpus. For these publications, we analyse the citation contexts concerning their position in the text and their age at the time of citing. By selecting the perspective of ... More
From closed to open access: A case study of flipped journalsMar 27 2019In recent years, increased stakeholder pressure to transition research to Open Access has led to many journals "flipping" from a toll access to an open access publishing model. Changing the publishing model can influence the decision of authors to submit ... More
Like-for-like bibliometric substitutes for peer review: advantages and limits of indicators calculated from the ep indexMar 26 2019The use of bibliometric indicators would simplify research assessments. The 2014 Research Excellence Framework (REF) is a peer review assessment of UK universities, whose results can be taken as benchmarks for bibliometric indicators. In this study we ... More
An analysis of the evolution of science-technology linkage in biomedicineMar 25 2019Demonstrating the practical value of public research has been an important subject in science policy. Here we present a detailed study on the evolution of the citation linkage between life science related patents and biomedical research over a 37-year ... More
Penobscot Dataset: Fostering Machine Learning Development for Seismic InterpretationMar 21 2019We have seen in the past years the flourishing of machine and deep learning algorithms in several applications such as image classification and segmentation, object detection and recognition, among many others. This was only possible, in part, because ... More
ReviewerNet: Visualizing Citation and Authorship Relations for Finding ReviewersMar 19 2019We propose ReviewerNet, an online, interactive visualization system aimed to improve the reviewer selection process in the academic domain. Given a paper submitted for publication, we assume that good candidate reviewers can be chosen among the authors ... More
Quantifying dynamics of failure across science, startups, and securityMar 18 2019Human achievements are often preceded by repeated attempts that initially fail, yet little is known about the mechanisms governing the dynamics of failure. Here, building on the rich literature on innovation, human dynamics and learning, we develop a ... More
Should Citations be Counted Separately from Each Originating SectionMar 18 2019Articles are cited for different purposes and differentiating between reasons when counting citations may therefore give finer-grained citation count information. Although identifying and aggregating the individual reasons for each citation may be impractical, ... More
Early-career setback and future career impactMar 16 2019Setbacks are an integral part of a scientific career, yet little is known about whether an early-career setback may augment or hamper an individual's future career impact. Here we examine junior scientists applying for U.S. National Institutes of Health ... More
Availability of Hyperlinked Resources in Astrophysics PapersMar 15 2019Mar 20 2019Astrophysics papers often rely on software which may or may not be available, and URLs are often used as proxy citations for software and data. We extracted all URLs from two journals' 2015 research articles, removed those from certain long-term reliable ... More
The $CI$-index: a new index to characterize the scientific output of researchersMar 15 2019May 15 2019We propose a simple new index, named the $CI$-index, based on the Choquet integral to characterize the scientific output of researchers. This index is an improvement of the $A$-index and $R$-index and has a notable feature that highly cited papers have ... More
Data objects and documenting scientific processes: An analysis of data events in biodiversity data papersMar 14 2019The data paper, an emerging scholarly genre, describes research datasets and is intended to bridge the gap between the publication of research data and scientific articles. Research examining how data papers report data events, such as data transactions ... More
The practice of self-citations: a longitudinal studyMar 14 2019In this work, we discuss the results of an experiment that tracks how authors use self-citations in their articles. In particular, we have analysed a subset of all the articles published between 1959 and 2016 in ScienceDirect written by the participants ... More
A Deep Patent Landscaping Model using Transformer and Graph Convolutional NetworkMar 14 2019Patent landscaping is a method that is employed for searching related patents during the process of a research and development (R&D) project. To avoid the risk of patent infringement and to follow the current trends of technology development, patent landscaping ... More
Science Quality and the Value of InventionsMar 12 2019Apr 03 2019Despite decades of research, the relationship between the quality of science and the value of inventions has remained unclear. We present the result of a large-scale matching exercise between 4.8 million patent families and 43 million publication records. ... More
The rhetorical structure of science? A multidisciplinary analysis of article headingsMar 11 2019An effective structure helps an article to convey its core message. The optimal structure depends on the information to be conveyed and the expectations of the audience. In the current increasingly interdisciplinary era, structural norms can be confusing ... More