Latest in cs.dl

Two Decades of Network Science as seen through the co-authorship network of network scientistsAug 22 2019Complex networks have attracted a great deal of research interest in the last two decades since Watts & Strogatz, Barabási & Albert and Girvan & Newman published their highly-cited seminal papers on small-world networks, on scale-free networks and on ... More
The citation advantage of foreign language references for Chinese social science papersAug 20 2019Contemporary scientific exchanges are international, yet language continues to be a persistent barrier to scientific communication, particularly for non-native English-speaking scholars. Since the ability to absorb knowledge has a strong impact on how ... More
Predicting publication productivity for researchers: a piecewise Poisson modelAug 20 2019Predicting the scientific productivity of researchers is a basic task for academic administrators and funding agencies. This study provided a model for the publication dynamics of researchers, inspired by the distribution feature of researchers' publications ... More
Reducing the Effort for Systematic Reviews in Software EngineeringAug 19 2019Context. Systematic Reviews (SRs) are means for collecting and synthesizing evidence from the identification and analysis of relevant studies from multiple sources. To this aim, they use a well-defined methodology meant to mitigate the risks of biases ... More
Delineating Knowledge Domains in the Scientific Literature Using Visual InformationAug 12 2019Figures are an important channel for scientific communication, used to express complex ideas, models and data in ways that words cannot. However, this visual information is mostly ignored in analyses of the scientific literature. In this paper, we demonstrate ... More
Assessing the Quality of Scientific PapersAug 12 2019A multitude of factors are responsible for the overall quality of scientific papers, including readability, linguistic quality, fluency, semantic complexity, and of course domain-specific technical factors. These factors vary from one field of study to ... More
Citations in Software Engineering -- Paper-related, Journal-related, and Author-related FactorsAug 12 2019Many factors could affect the number of citations to a paper. Citations have an important role in research policy and in measuring the excellence of research and researchers. This work is the first study in software engineering (SE) to assess multiple ... More
The Evolution of IJHCS and CHI: A Quantitative AnalysisAug 12 2019In this paper we focus on the International Journal of Human-Computer Studies (IJHCS) as a domain of analysis, to gain insights about its evolution in the past 50 years and what this evolution tells us about the research landscape associated with the ... More
The rise and rise of interdisciplinary research: Understanding the interaction dynamics of three major fields -- Physics, Mathematics & Computer ScienceAug 10 2019The distinction between sciences is becoming increasingly more artificial -- an approach from one area can be easily applied to the other. More exciting research nowadays is happening perhaps at the interfaces of disciplines like Physics, Mathematics ... More
Research Leadership Flow Determinants and the Role of Proximity in Research Collaborations NetworksAug 08 2019Characterizing the leadership in research is important to revealing the interaction pattern and organizational structure through research collaboration. This research defines the leadership role based on the corresponding author's affiliation, and presents, ... More
Making Recommendations from Web Archives for "Lost" Web PagesAug 07 2019When a user requests a web page from a web archive, the user will typically either get an HTTP 200 if the page is available, or an HTTP 404 if the web page has not been archived. This is because web archives are typically accessed by URI lookup, and the ... More
Criteria for assessing grant applications: A systematic reviewAug 04 2019Criteria are an essential component of any procedure for assessing merit. Yet, little is known about the criteria peers use in assessing grant applications. In this systematic review we therefore identify and synthesize studies that examine grant peer ... More
Economic Power, Population, and the Size of Astronomical CommunityAug 02 2019The number of astronomers for a country registered to the IAU is known to have a correlation with the GDP. However, the robustness of this relationship can be doubted, because the fraction of astronomers joining the IAU differs from country to country. ... More
Towards a Comprehensive Bibliography for SETIAug 01 2019In this work, we motivate, describe, and announce a living bibliography for academic papers and other works published in the Search for Extraterrestrial Intelligence (SETI). The bibliography makes use of bibliographic groups (bibgroups) in the NASA Astrophysics ... More
Plan S. Pardon impossible to executeAug 01 2019The Plan S initiative is expected to radically change the market of scholarly periodicals, resulting in the abandoning of the subscription model in favour of the open access model. This transition poses new challenges, as well as sets new tasks for researchers, ... More
A computational EXFOR databaseAug 01 2019The EXFOR library is a useful resource for many people in the field of nuclear physics. In particular, the experimental data in the EXFOR library serves as a starting point for nuclear data evaluations. There is an ongoing discussion about how to make ... More
The demography of the peripatetic researcher: Evidence on highly mobile scholars from the Web of ScienceJul 31 2019The policy debate around researchers' geographic mobility has been moving away from a theorized zero-sum game in which countries can be winners (brain gain) or losers (brain drain), and toward the concept of brain circulation, which implies that researchers ... More
Mapping the backbone of the Humanities through the eyes of WikipediaJul 30 2019The present study aims to establish a valid method by which to apply the theory of co-citations to Wikipedia article references and, subsequently, to map these relationships between scientific papers. This theory, originally applied to scientific literature, ... More
Coping with the delineation of emerging fields: Nanoscience and Nanotechnology as a case studyJul 27 2019Proper field delineation plays an important role in scientometric studies, although it is a tough task. Based on an emerging and interdisciplinary field, nanoscience and nanotechnology, this paper highlights the problem of field delineation. First, we ... More
Analyzing Linguistic Complexity and Scientific ImpactJul 27 2019The number of publications and the number of citations received have become the most common indicators of scholarly success. In this context, scientific writing increasingly plays an important role in scholars' scientific careers. To understand the relationship ... More
Investigating Correlations of Inter-coder Agreement and Machine Annotation Performance for Historical Video DataJul 24 2019Video indexing approaches such as visual concept classification and person recognition are essential to enable fine-grained semantic search in large-scale video archives such as the historical video collection of former German Democratic Republic (GDR) ... More
Overview and Results: CL-SciSumm Shared Task 2019Jul 23 2019The CL-SciSumm Shared Task is the first medium-scale shared task on scientific document summarization in the computational linguistics (CL) domain. In 2019, it comprised three tasks: (1A) identifying relationships between citing documents and the referred ... More
Might Europe one day again be a global scientific powerhouse? Analysis of ERC publications suggests it will not be possible without changes in research policyJul 21 2019Numerous EU documents praise the excellence of EU research without empirical evidence and against academic studies. We investigated research performance in two fields of high socioeconomic importance, advanced technology and basic medical research, in ... More
Evaluating the Reproducibility of Research in Obstetrics and GynecologyJul 16 2019Objective: Reproducibility is a core tenet of scientific research. A reproducible study is one where the results can be recreated by different investigators in different circumstances using the same methodology and materials. Unfortunately, reproducibility ... More
CupQ: A New Clinical Literature Search EngineJul 15 2019A new clinical literature search engine, called CupQ, is presented. It aims to help clinicians stay updated with medical knowledge. Although PubMed is currently one of the most widely used digital libraries for biomedical information, it frequently does ... More
Non-English language publications in Citation Indexes -- quantity and qualityJul 15 2019We analyzed publications data in WoS and Scopus to compare publications in native languages vs publications in English and find any distinctive patterns. We analyzed their distribution by research areas, languages, type of access and citation patterns. ... More
Robust Archives Maximize Scientific AccessibilityJul 14 2019We present a bibliographic analysis of Chandra, Hubble, and Spitzer publications. We find (a) archival data are used in >60% of the publication output and (b) archives for these missions enable a much broader set of institutions and countries to scientifically ... More
Geographical Distribution of Biomedical Research in the USA and ChinaJul 11 2019We analyze nearly 20 million geocoded PubMed articles with author affiliations. Using K-means clustering for the lower 48 US states and mainland China, we find that the average published paper is within a relatively short distance of a few centroids. ... More
Cui Prodest? Reciprocity of collaboration measured by Russian Index of Science CitationJul 11 2019Scientific collaboration is often not perfectly reciprocal. Scientifically strong countries/institutions/laboratories may help their less prominent partners with leading scholars, or finance, or other resources. What is interesting in such type of collaboration ... More
Historical comparison of gender inequality in scientific careers across countries and disciplinesJul 09 2019There is extensive, yet fragmented, evidence of gender differences in academia suggesting that women are under-represented in most scientific disciplines, publish fewer articles throughout a career, and their work acquires fewer citations. Here, we offer ... More
Publication modalities 'article in press' and 'open access' in relation to journal average citationJul 08 2019There has been a generalization in the use of two publication practices by scientific journals during the past decade: 1. 'article in press' or early view, which allows access to the accepted paper before its formal publication in an issue; 2. 'open access', ... More
Materials databases: the need for open, interoperable databases with standardized data and rich metadataJul 05 2019Driven by the recent rapid increase in the number of materials databases published (open and commercial), I discuss here some perspectives on the growing need for standardized, interoperable, open databases. The field of computational materials discovery ... More
Un Modelo Ontológico para el Gobierno ElectrónicoJul 04 2019Decision making often requires information that must be provided with the rich data format. Addressing these new requirements appropriately makes it necessary for government agencies to orchestrate large amounts of information from different sources and ... More
The citation advantage of linking publications to research dataJul 04 2019Efforts to make research results open and reproducible are increasingly reflected by journal policies encouraging or mandating authors to provide data availability statements. As a consequence of this, there has been a strong uptake of data availability ... More
Comparison of research productivity of Italian and Norwegian professors and universitiesJul 03 2019This is the first ever attempt of application in a country other than Italy of a research efficiency indicator (FSS), to assess and compare the performance of professors and universities, within and between countries. A special attention has been devoted ... More
Topic Modeling the Reading and Writing Behavior of Information ForagersJun 30 2019The general problem of "information foraging" in an environment about which agents have incomplete information has been explored in many fields, including cognitive psychology, neuroscience, economics, finance, ecology, and computer science. In all of ... More
Infrastructure-Agnostic HypertextJun 29 2019This paper presents a novel and formal interpretation of the original vision of hypertext: infrastructure-agnostic hypertext is independent from specific standards such as data formats and network protocols. Its model is illustrated with examples and ... More
Introducing MathQA -- A Math-Aware Question Answering SystemJun 28 2019We present an open source math-aware Question Answering System based on Ask Platypus. Our system returns a single mathematical formula for a natural language question in English or Hindi. These formulae originate from the knowledge-base Wikidata. We ... More
OpenCitationsJun 27 2019OpenCitations is a scholarly infrastructure organization dedicated to open scholarship and the publication of open bibliographic and citation data as Linked Open Data using Semantic Web technologies, to the development of software tools and services that ... More
Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and CitationsJun 27 2019Identifying academic plagiarism is a pressing task for educational and research institutions, publishers, and funding agencies. Current plagiarism detection systems reliably find instances of copied and moderately reworded text. However, reliably detecting ... More
Semantic Preserving Bijective Mappings for Expressions involving Special Functions in Computer Algebra Systems and Document Preparation SystemsJun 27 2019Purpose: Modern mathematicians and scientists of math-related disciplines often use Document Preparation Systems (DPS) to write and Computer Algebra Systems (CAS) to calculate mathematical expressions. Usually, they translate the expressions manually ... More
BioGen: Automated Biography GenerationJun 27 2019A biography of a person is the detailed description of several life events including his education, work, relationships, and death. Wikipedia, the free web-based encyclopedia, consists of millions of manually curated biographies of eminent politicians, ... More
The UN Security Council debates 1995-2017Jun 26 2019This paper presents a new dataset containing 65,393 speeches held in the public meetings of the UN Security Council (UNSC) between 1995 and 2017. The dataset is based on publicly available meeting transcripts with the S/PV document symbol and includes ... More
The BIDS Toolbox: A web service to manage brain imaging datasetsJun 24 2019Data sharing is a key factor for ensuring reproducibility and transparency of scientific experiments, and neuroimaging is no exception. The vast heterogeneity of data formats and imaging modalities utilised in the field makes it a very challenging problem. ... More
Characterisation of the $\chi$-index and the $rec$-indexJun 24 2019Axiomatic characterisation of a bibliometric index provides insight into the properties that the index satisfies and facilitates the comparison of different indices. A geometric generalisation of the $h$-index, called the $\chi$-index, has recently been ... More
Systematic improvement of user engagement with academic titles using computational linguisticsJun 23 2019This paper describes a novel approach to systematically improve information interactions based solely on its wording. Following an interdisciplinary literature review, we recognized three key attributes of words that drive user engagement: (1) Novelty ... More
A Retrospective Recount of Computer Architecture Research with a Data-Driven Study of Over Four Decades of ISCA PublicationsJun 22 2019This study began with a research project, called DISCvR, conducted at the IBM-ILLINOIS Center for Cognitive Computing Systems Research. The goal of DISCvR was to build a practical NLP based AI pipeline for document understanding which will help us better ... More
A bibliometric analysis of Bitcoin scientific productionJun 21 2019Blockchain technology, and more specifically Bitcoin (one of its foremost applications), have been receiving increasing attention in the scientific community. The first publications with Bitcoin as a topic, can be traced back to 2012. In spite of this ... More
Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big DatasetsJun 20 2019Automatically extracted metadata from scholarly documents in PDF formats is usually noisy and heterogeneous, often containing incomplete fields and erroneous values. One common way of cleaning metadata is to use a bibliographic reference dataset. The ... More
Patent citations to scientific papers as early signs for predicting delayed recognition of scientific discoveries: a comparative study with instant recognitionJun 19 2019In this study, we investigate the extent to which patent citations to papers can serve as early signs for predicting delayed recognition using a comparative study with a control group, i.e., instant recognition papers. We identify the two opposite groups ... More
Gender trends in computer science authorshipJun 19 2019A comprehensive and up-to-date analysis of Computer Science literature (2.87 million papers through 2018) reveals that, if current trends continue, parity between the number of male and female authors will not be reached in this century. Under our most ... More
Taxonomy-as-a-Service: How To Structure Your Related WorkJun 18 2019Structuring related work is a daunting task encompassing literature review, classification, comparison (primarily in the form of concepts), and gap analysis. Building taxonomies is a compelling way to structure concepts in the literature yielding reusable ... More
Impact of HTTP Cookie Violations in Web ArchivesJun 17 2019Certain HTTP Cookies on certain sites can be a source of content bias in archival crawls. Accommodating Cookies at crawl time, but not utilizing them at replay time may cause cookie violations, resulting in defaced composite mementos that never existed ... More
Supporting Web Archiving via Web PackagingJun 17 2019We describe challenges related to web archiving, replaying archived web resources, and verifying their authenticity. We show that Web Packaging has significant potential to help address these challenges and identify areas in which changes are needed in ... More
Accuracy of citation data in Web of Science and ScopusJun 17 2019We present a large-scale analysis of the accuracy of citation data in the Web of Science and Scopus databases. The analysis is based on citations given in publications in Elsevier journals. We reveal significant data quality problems for both databases. ... More
The Evolving Ecosystem of Predatory Journals: A Case Study in Indian PerspectiveJun 17 2019Digital advancement in scholarly repositories has led to the emergence of a large number of open access predatory publishers that charge high article processing fees from authors but fail to provide necessary editorial and publishing services. Identifying ... More
Predicting Research Trends with Semantic and Neural Networks with an application in Quantum PhysicsJun 17 2019The vast and growing number of publications in all disciplines of science cannot be comprehended by a single human researcher. As a consequence, researchers have to specialize in narrow sub-disciplines, which makes it challenging to uncover scientific ... More
A Strategy for Expert Recommendation From Open Data Available on the Lattes PlatformJun 14 2019With the increasing volume of data and users of curriculum systems, the difficulty of finding specialists is increasing. This work proposes an open data extraction methodology of the Lattes Platform curricula, a treatment for this data and investigates ... More
Software and their Dependencies in Research Citation GraphsJun 14 2019Software is essential for a lot of research, but it is not featured in citation graphs which have the potential to assign credit for software contributions. This is due to a traditionalistic focus on textual research products. In this paper, I propose ... More
SchenQL -- A Domain-Specific Query Language on Bibliographic MetadataJun 14 2019Information access needs to be uncomplicated, users rather use incorrect data which is easily received than correct information which is harder to obtain. Querying bibliographic metadata from digital libraries mainly supports simple textual queries. A ... More
Nine Million Books and Eleven Million Citations: A Study of Book-Based Scholarly Communication Using OpenCitationsJun 14 2019Books have been widely used to share information and contribute to human knowledge. However, the quantitative use of books as a method of scholarly communication is relatively unexamined compared to journal articles and conference papers. This study uses ... More
Visualizing a Field of Research: A Methodology of Systematic Scientometric ReviewsJun 11 2019Systematic scientometric reviews, empowered by scientometric and visual analytic techniques, offer opportunities to improve the timeliness, accessibility, and reproducibility of conventional systematic reviews. While increasingly accessible science mapping ... More
Achieving competitive advantage in academia through early career coauthorship with top scientistsJun 11 2019We quantify the long term impact that the coauthorship with established top-cited scientists has on the career of junior researchers in four different scientific disciplines. Through matched pair analysis, we find that junior researchers who coauthor ... More
EXmatcher: Combining Features Based on Reference Strings and Segments to Enhance Citation MatchingJun 11 2019Citation matching is a challenging task due to different problems such as the variety of citation styles, mistakes in reference strings and the quality of identified reference segments. The classic citation matching configuration used in this paper is ... More
Qualifying threshold of take off stage for successfully disseminated creative ideasJun 10 2019The creative process is essentially Darwinian and only a small proportion of creative ideas are selected for further development. However, the threshold that identifies this small fraction of successfully disseminated creative ideas at their early stage ... More
What Do Citation Counts Measure? An Updated Review of Studies on Citations in Scientific Documents Published between 2006 and 2018Jun 10 2019The purpose of this paper is to update the review of Bornmann and Daniel (2008) presenting a narrative review of studies on citations in scientific documents. The current review covers 38 studies published between 2006 and 2018. Bornmann and Daniel (2008) ... More
Indicators of Open Access for universitiesJun 10 2019This paper presents a first attempt to analyse Open Access integration at the institutional level. For this, we combine information from Unpaywall and the Leiden Ranking to offer basic OA indicators for universities. We calculate the overall number of ... More
Do Authors Deposit on Time? Tracking Open Access Policy ComplianceJun 07 2019Recent years have seen fast growth in the number of policies mandating Open Access (OA) to research outputs. We conduct a large-scale analysis of over 800 thousand papers from repositories around the world published over a period of 5 years to investigate: ... More
Predicting Patent Citations to measure Economic Impact of Scholarly ResearchJun 07 2019A crucial goal of funding research and development has always been to advance economic development. On this basis, a considerable body of research undertaken with the purpose of determining what exactly constitutes economic impact and how to accurately ... More
Please, no more scientific journals! The strategy of the scientific publication systemJun 07 2019In the same way ecosystems tend to increase maturity by decreasing the flow of energy per unit biomass, we should move towards a more mature science by publishing less but high-quality papers and getting away from joining large teams in small roles. That ... More
How a Single Paper Affects the Impact Factor: Implications for Scholarly PublishingJun 06 2019Because the Impact Factor (IF) is an average quantity and most journals are small, IFs are volatile. We study how a single paper affects the IF using data from 11639 journals in the 2017 Journal Citation Reports. We define as volatility the IF gain (or ... More
Token-Curated Registry with Citation GraphJun 05 2019In this study, we aim to incorporate the expertise of anonymous curators into a token-curated registry (TCR), a decentralized recommender system for collecting a list of high-quality content. This registry is important, because previous studies on TCRs ... More
Large publishing consortia produce higher citation impact research but co-author contributions are hard to evaluateJun 05 2019This paper introduces a simple agglomerative clustering method to identify large publishing consortia with at least 20 authors and 80% shared authorship between articles. Based on Scopus journal articles 1996-2018, under these criteria, nearly all (88%) ... More
Tracing Antisemitic Language Through Diachronic Embedding Projections: France 1789-1914Jun 04 2019We investigate some aspects of the history of antisemitism in France, one of the cradles of modern antisemitism, using diachronic word embeddings. We constructed a large corpus of French books and periodicals issues that contain a keyword related to Jews ... More
A gender analysis of top scientists' collaboration behavior: evidence from ItalyJun 03 2019This work analyzes the differences in collaboration behavior between males and females among a particular type of scholars: top scientists, and as compared to non top scientists. The field of observation consists of the Italian academic system and the ... More
DFS: A Dataset File System for Data Discovering UsersMay 31 2019Many research questions can be answered quickly and efficiently using data already collected for previous research. This practice is called secondary data analysis (SDA), and has gained popularity due to lower costs and improved research efficiency. In ... More
Exploring the Effects of Data Set Choice on Measuring International Research Collaboration: an Example Using the ACM Digital Library and Microsoft Academic GraphMay 30 2019International research collaboration (IRC) measurement is important because countries can and want to benefit from international collaboration but performing the same measurement procedure on different data sets can lead to different results. This study ... More
Enriching Bibliographic Data by Combining String Matching and the Wikidata Knowledge Graph to Improve the Measurement of International Research CollaborationMay 30 2019Measuring international research collaboration is necessary when evaluating, for example, the efficacy of policy meant to increase cooperation between countries, but is currently very difficult as bibliographic records contain only affiliation data from ... More
MementoMap Framework for Flexible and Adaptive Web Archive ProfilingMay 29 2019In this work we propose MementoMap, a flexible and adaptive framework to efficiently summarize holdings of a web archive. We described a simple, yet extensible, file format suitable for MementoMap. We used the complete index of the comprising ... More
Archive Assisted Archival Fixity Verification FrameworkMay 29 2019The number of public and private web archives has increased, and we implicitly trust content delivered by these archives. Fixity is checked to ensure an archived resource has remained unaltered since the time it was captured. Some web archives do not ... More
Using Micro-collections in Social Media to Generate Seeds for Web Archive CollectionsMay 29 2019In a Web plagued by disappearing resources, Web archive collections provide a valuable means of preserving Web resources important to the study of past events ranging from elections to disease outbreaks. These archived collections start with seed URIs ... More
Paper Matching with Local Fairness ConstraintsMay 28 2019Automatically matching reviewers to papers is a crucial step of the peer review process for venues receiving thousands of submissions. Unfortunately, common paper matching algorithms often construct matchings suffering from two critical problems: (1) ... More
Social Cards Probably Provide For Better Understanding Of Web Archive CollectionsMay 27 2019May 28 2019Used by a variety of researchers, web archive collections have become invaluable sources of evidence. If a researcher is presented with a web archive collection that they did not create, how do they know what is inside so that they can use it for their ... More
Document Embeddings vs. Keyphrases vs. Terms: An Online Evaluation in Digital Library Recommender SystemsMay 27 2019Many recommendation algorithms are available to digital library recommender system operators. The effectiveness of algorithms is largely unreported by way of online evaluation. We compare a standard term-based recommendation approach to two promising ... More
Attributing and Referencing (Research) Software: Best Practices and Outlook from InriaMay 27 2019Software is a fundamental pillar of modern scientific research, not only in computer science, but actually across all fields and disciplines. However, there is a lack of adequate means to cite and reference software, for many reasons. An obvious first reason ... More
Statistical Significance Testing in Information Retrieval: An Empirical Analysis of Type I, Type II and Type III ErrorsMay 27 2019Statistical significance testing is widely accepted as a means to assess how well a difference in effectiveness reflects an actual difference between systems, as opposed to random noise because of the selection of topics. According to recent surveys on ... More
Does the $h_\alpha$ index reinforce the Matthew effect in science? Agent-based simulations using Stata and RMay 27 2019Recently, Hirsch (2019a) proposed a new variant of the h index called the $h_\alpha$ index. He formulated as follows: "we define the $h_\alpha$ index of a scientist as the number of papers in the h-core of the scientist (i.e. the set of papers that contribute ... More
Shared Feelings: Understanding Facebook Reactions to Scholarly ArticlesMay 27 2019Research on social-media platforms has tended to rely on textual analysis to perform research tasks. While text-based approaches have significantly increased our understanding of online behavior and social dynamics, they overlook features on these platforms ... More
TrendNets: Mapping Research Trends From Dynamic Co-Word Networks via Sparse RepresentationMay 27 2019Mapping the knowledge structure from word co-occurrences in a collection of academic papers has been widely used to provide insight into the topic evolution in an arbitrary research field. In a traditional approach, the paper collection is first divided ... More
Citation concept analysis (CCA) - A new form of citation analysis revealing the usefulness of concepts for other researchers illustrated by two exemplary case studies including classic books by Thomas S. Kuhn and Karl R. PopperMay 23 2019In recent years, the full text of papers are increasingly available electronically which opens up the possibility of quantitatively investigating citation contexts in more detail. In this study, we introduce a new form of citation analysis, which we call ... More
Patent Citation Dynamics Modeling via Multi-Attention Recurrent NetworksMay 22 2019Modeling and forecasting forward citations to a patent is a central task for the discovery of emerging technologies and for measuring the pulse of inventive progress. Conventional methods for forecasting these forward citations cast the problem as analysis ... More
R package for producing beamplots as a preferred alternative to the h index when assessing single researchers (based on downloads from Web of Science)May 22 2019We propose the use of beamplots - which can be produced by using the R package BibPlots and WoS downloads - as a preferred alternative to h index values for assessing single researchers.
From heterogeneous data to heterogeneous public: thoughts on transmedia applications for digital heritage research and disseminationMay 22 2019In recent years, we have seen a tenfold increase in volume and complexity of digital data acquired for cultural heritage documentation. Meanwhile, open data and open science have become leading trends in digital humanities. The convergence of those two ... More
A Scalable Hybrid Research Paper Recommender System for Microsoft AcademicMay 21 2019We present the design and methodology for the large scale hybrid paper recommender system used by Microsoft Academic. The system provides recommendations for approximately 160 million English research papers and patents. Our approach handles incomplete ... More
Software Citation Implementation ChallengesMay 21 2019The main output of the FORCE11 Software Citation working group ( was a paper on software citation principles ( published in September 2016. This paper laid ... More