Academic Search Engines: A Quantitative Outlook

Ebook, 335 pages, 3 hours

Author: Jose Luis Ortega
ISBN: 9781780634722


About this ebook

Academic Search Engines surveys the current landscape of academic search engines through a quantitative approach that analyses the reliability and consistency of these services. The objective is to describe the main characteristics of these engines, to highlight their advantages and drawbacks, and to discuss the implications of these new products for the future of scientific communication and their impact on research measurement and evaluation. In short, Academic Search Engines presents a summary view of the new challenges that the Web poses to scientific activity through the most novel and innovative search services available on the Web.

This is the first attempt to analyze search engines aimed exclusively at the research community in an integrative handbook. The novelty, expectation and usefulness of many of these services justify their analysis.
This book is not merely a description of the web functionalities of these services; it is a scientific review of the most outstanding characteristics of each platform, discussing their significance for scholarly communication and research evaluation.
This book introduces an original methodology based on a quantitative analysis of the covered data through the extensive use of crawlers and harvesters, which allow an in-depth look at how these engines work. In addition, it presents a detailed descriptive review of their functionalities and a critical discussion of their use by the scientific community.

Language: English

Publisher: Elsevier Science

Release date: Oct 2, 2014

Author

Jose Luis Ortega

José Luis Ortega is a web researcher at the Spanish National Research Council (CSIC). He was awarded a fellowship at the Cybermetrics Lab of the CSIC, where he completed his doctoral studies. In 2005, he was hired by the Virtual Knowledge Studio of the Royal Netherlands Academy of Arts and Sciences, and in 2008 he was appointed to a full position in the Vice-presidency for Scientific and Technological Research at the CSIC, working in research evaluation. He collaborates with the Cybermetrics Lab in research areas such as webometrics, web usage mining, information visualization, social network analysis and web bibliometrics.

Related authors

Skip carousel

Related to Academic Search Engines

Related ebooks

Skip carousel

Learn Data Science Using SAS Studio: A Quick-Start Guide
Ebook
Learn Data Science Using SAS Studio: A Quick-Start Guide
byEngy Fouda
Rating: 0 out of 5 stars
0 ratings
Social Network Sites for Scientists: A Quantitative Survey
Ebook
Social Network Sites for Scientists: A Quantitative Survey
byJose Luis Ortega
Rating: 0 out of 5 stars
0 ratings
Indexing: From Thesauri to the Semantic Web
Ebook
Indexing: From Thesauri to the Semantic Web
byPiet de Keyser
Rating: 0 out of 5 stars
0 ratings
Web Semantics: Cutting Edge and Future Directions in Healthcare
Ebook
Web Semantics: Cutting Edge and Future Directions in Healthcare
bySarika Jain
Rating: 0 out of 5 stars
0 ratings
Mastering Clojure Data Analysis
Ebook
Mastering Clojure Data Analysis
byEric Rochester
Rating: 0 out of 5 stars
0 ratings
Learning Social Media Analytics with R
Ebook
Learning Social Media Analytics with R
byDipanjan Sarkar
Rating: 0 out of 5 stars
0 ratings
Practical Data Science with Python 3: Synthesizing Actionable Insights from Data
Ebook
Practical Data Science with Python 3: Synthesizing Actionable Insights from Data
byErvin Varga
Rating: 0 out of 5 stars
0 ratings
Cybermetric Techniques to Evaluate Organizations Using Web-Based Data
Ebook
Cybermetric Techniques to Evaluate Organizations Using Web-Based Data
byEnrique Orduna-Malea
Rating: 0 out of 5 stars
0 ratings
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
Ebook
Deploy Machine Learning Models to Production: With Flask, Streamlit, Docker, and Kubernetes on Google Cloud Platform
byPramod Singh
Rating: 0 out of 5 stars
0 ratings
The Art and Science of Analyzing Software Data
Ebook
The Art and Science of Analyzing Software Data
byChristian Bird
Rating: 0 out of 5 stars
0 ratings
Managing Multimedia and Unstructured Data in the Oracle Database
Ebook
Managing Multimedia and Unstructured Data in the Oracle Database
byMarcelle Kratochvil
Rating: 0 out of 5 stars
0 ratings
Data Visualization Tools A Complete Guide - 2021 Edition
Ebook
Data Visualization Tools A Complete Guide - 2021 Edition
byGerardus Blokdyk
Rating: 0 out of 5 stars
0 ratings
Human Recognition in Unconstrained Environments: Using Computer Vision, Pattern Recognition and Machine Learning Methods for Biometrics
Ebook
Human Recognition in Unconstrained Environments: Using Computer Vision, Pattern Recognition and Machine Learning Methods for Biometrics
byMaria De Marsico
Rating: 0 out of 5 stars
0 ratings
Extending jQuery
Ebook
Extending jQuery
byKeith Wood
Rating: 0 out of 5 stars
0 ratings
How to Design Optimization Algorithms by Applying Natural Behavioral Patterns
Ebook
How to Design Optimization Algorithms by Applying Natural Behavioral Patterns
byRohollah Omidvar
Rating: 0 out of 5 stars
0 ratings
Quantum Theory of Collective Phenomena
Ebook
Quantum Theory of Collective Phenomena
byG. L. Sewell
Rating: 0 out of 5 stars
0 ratings
Regression Graphics: Ideas for Studying Regressions Through Graphics
Ebook
Regression Graphics: Ideas for Studying Regressions Through Graphics
byR. Dennis Cook
Rating: 0 out of 5 stars
0 ratings
Multimodal Signal Processing: Theory and Applications for Human-Computer Interaction
Ebook
Multimodal Signal Processing: Theory and Applications for Human-Computer Interaction
byJean-Philippe Thiran
Rating: 0 out of 5 stars
0 ratings
iText in Action
Ebook
iText in Action
byBruno Lowagie
Rating: 0 out of 5 stars
0 ratings
Expert Data Visualization
Ebook
Expert Data Visualization
byJos Dirksen
Rating: 0 out of 5 stars
0 ratings
Python for SAS Users: A SAS-Oriented Introduction to Python
Ebook
Python for SAS Users: A SAS-Oriented Introduction to Python
byRandy Betancourt
Rating: 0 out of 5 stars
0 ratings
Natural Language Processing A Complete Guide - 2021 Edition
Ebook
Natural Language Processing A Complete Guide - 2021 Edition
byGerardus Blokdyk
Rating: 0 out of 5 stars
0 ratings
Design Discourse: Composing and Revising Programs in Professional and Technical Writing
Ebook
Design Discourse: Composing and Revising Programs in Professional and Technical Writing
byCSPtrade2
Rating: 0 out of 5 stars
0 ratings
Human-Centered Artificial Intelligence: Research and Applications
Ebook
Human-Centered Artificial Intelligence: Research and Applications
byChang S. Nam
Rating: 0 out of 5 stars
0 ratings
Implementing Effective Code Reviews: How to Build and Maintain Clean Code
Ebook
Implementing Effective Code Reviews: How to Build and Maintain Clean Code
byGiuliana Carullo
Rating: 0 out of 5 stars
0 ratings
Readings in Artificial Intelligence
Ebook
Readings in Artificial Intelligence
byBonnie Lynn Webber
Rating: 0 out of 5 stars
0 ratings
Pro Cryptography and Cryptanalysis: Creating Advanced Algorithms with C# and .NET
Ebook
Pro Cryptography and Cryptanalysis: Creating Advanced Algorithms with C# and .NET
byMarius Iulian Mihailescu
Rating: 0 out of 5 stars
0 ratings
Hybrid Computational Intelligence: Challenges and Applications
Ebook
Hybrid Computational Intelligence: Challenges and Applications
bySiddhartha Bhattacharyya
Rating: 0 out of 5 stars
0 ratings
Modeling with Data: Tools and Techniques for Scientific Computing
Ebook
Modeling with Data: Tools and Techniques for Scientific Computing
byBen Klemens
Rating: 3 out of 5 stars
3/5
Julia as a Second Language
Ebook
Julia as a Second Language
byErik Engheim
Rating: 0 out of 5 stars
0 ratings

Internet & Web For You

Skip carousel

No Place to Hide: Edward Snowden, the NSA, and the U.S. Surveillance State
Ebook
No Place to Hide: Edward Snowden, the NSA, and the U.S. Surveillance State
byGlenn Greenwald
Rating: 4 out of 5 stars
4/5
How to Be Invisible: Protect Your Home, Your Children, Your Assets, and Your Life
Ebook
How to Be Invisible: Protect Your Home, Your Children, Your Assets, and Your Life
byJ. J. Luna
Rating: 4 out of 5 stars
4/5
Social Engineering: The Science of Human Hacking
Ebook
Social Engineering: The Science of Human Hacking
byChristopher Hadnagy
Rating: 3 out of 5 stars
3/5
The Gothic Novel Collection
Ebook
The Gothic Novel Collection
byGaston Leroux
Rating: 5 out of 5 stars
5/5
Get Rich or Lie Trying: Ambition and Deceit in the New Influencer Economy
Ebook
Get Rich or Lie Trying: Ambition and Deceit in the New Influencer Economy
bySymeon Brown
Rating: 0 out of 5 stars
0 ratings
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
Ebook
The JavaScript Workshop: Learn to develop interactive web applications with clean and maintainable JavaScript code
byJoseph Labrecque
Rating: 5 out of 5 stars
5/5
Six Figure Blogging Blueprint
Ebook
Six Figure Blogging Blueprint
byRaza Imam
Rating: 5 out of 5 stars
5/5
Coding For Dummies
Ebook
Coding For Dummies
byNikhil Abraham
Rating: 5 out of 5 stars
5/5
How to Disappear and Live Off the Grid: A CIA Insider's Guide
Ebook
How to Disappear and Live Off the Grid: A CIA Insider's Guide
byJohn Kiriakou
Rating: 0 out of 5 stars
0 ratings
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
Ebook
Everybody Lies: Big Data, New Data, and What the Internet Can Tell Us About Who We Really Are
bySeth Stephens-Davidowitz
Rating: 4 out of 5 stars
4/5
Coding All-in-One For Dummies
Ebook
Coding All-in-One For Dummies
byNikhil Abraham
Rating: 4 out of 5 stars
4/5
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
Ebook
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
byKristen Meinzer
Rating: 3 out of 5 stars
3/5
Podcasting For Dummies
Ebook
Podcasting For Dummies
byTee Morris
Rating: 4 out of 5 stars
4/5
The Hacker Crackdown: Law and Disorder on the Electronic Frontier
Ebook
The Hacker Crackdown: Law and Disorder on the Electronic Frontier
byBruce Sterling
Rating: 4 out of 5 stars
4/5
Beginner's Guide To Starting An Etsy Print-On-Demand Shop
Ebook
Beginner's Guide To Starting An Etsy Print-On-Demand Shop
byAnn Eckhart
Rating: 0 out of 5 stars
0 ratings
Grokking Algorithms: An illustrated guide for programmers and other curious people
Ebook
Grokking Algorithms: An illustrated guide for programmers and other curious people
byAditya Bhargava
Rating: 4 out of 5 stars
4/5
The Beginner's Affiliate Marketing Blueprint
Ebook
The Beginner's Affiliate Marketing Blueprint
byAlex M
Rating: 4 out of 5 stars
4/5
The Digital Decluttering Workbook: How to Succeed with Digital Minimalism, Defeat Smartphone Addiction, Detox Social Media, and Organize Your Online Life: Declutter Workbook, #3
Ebook
The Digital Decluttering Workbook: How to Succeed with Digital Minimalism, Defeat Smartphone Addiction, Detox Social Media, and Organize Your Online Life: Declutter Workbook, #3
byAlex Wong
Rating: 3 out of 5 stars
3/5
200+ Ways to Protect Your Privacy: Simple Ways to Prevent Hacks and Protect Your Privacy--On and Offline
Ebook
200+ Ways to Protect Your Privacy: Simple Ways to Prevent Hacks and Protect Your Privacy--On and Offline
byJeni Rogers
Rating: 0 out of 5 stars
0 ratings
Hacking : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Ethical Hacking
Ebook
Hacking : The Ultimate Comprehensive Step-By-Step Guide to the Basics of Ethical Hacking
byKevin Clark
Rating: 5 out of 5 stars
5/5
The Internet Is Not What You Think It Is: A History, a Philosophy, a Warning
Ebook
The Internet Is Not What You Think It Is: A History, a Philosophy, a Warning
byJustin Smith-Ruiu
Rating: 4 out of 5 stars
4/5
Summary of Dotcom Secrets: by Russell Brunson - The Underground Playbook for Growing Your Company Online with Sales Funnels - A Comprehensive Summary
Ebook
Summary of Dotcom Secrets: by Russell Brunson - The Underground Playbook for Growing Your Company Online with Sales Funnels - A Comprehensive Summary
byAlexander Cooper
Rating: 5 out of 5 stars
5/5
The Logo Brainstorm Book: A Comprehensive Guide for Exploring Design Directions
Ebook
The Logo Brainstorm Book: A Comprehensive Guide for Exploring Design Directions
byJim Krause
Rating: 4 out of 5 stars
4/5
Remote/WebCam Notarization : Basic Understanding
Ebook
Remote/WebCam Notarization : Basic Understanding
byJeannie Eunice Franks
Rating: 3 out of 5 stars
3/5
How To Start A Podcast
Ebook
How To Start A Podcast
byP Teague
Rating: 4 out of 5 stars
4/5
Tor and the Dark Art of Anonymity
Ebook
Tor and the Dark Art of Anonymity
byLance Henderson
Rating: 5 out of 5 stars
5/5
More Porn - Faster!: 50 Tips & Tools for Faster and More Efficient Porn Browsing
Ebook
More Porn - Faster!: 50 Tips & Tools for Faster and More Efficient Porn Browsing
bySue Denim
Rating: 3 out of 5 stars
3/5
The Digital Marketing Handbook: A Step-By-Step Guide to Creating Websites That Sell
Ebook
The Digital Marketing Handbook: A Step-By-Step Guide to Creating Websites That Sell
byRobert W. Bly
Rating: 5 out of 5 stars
5/5
The Cyber Attack Survival Manual: Tools for Surviving Everything from Identity Theft to the Digital Apocalypse
Ebook
The Cyber Attack Survival Manual: Tools for Surviving Everything from Identity Theft to the Digital Apocalypse
byNick Selby
Rating: 0 out of 5 stars
0 ratings
Introduction to Internet Scams and Fraud: Credit Card Theft, Work-At-Home Scams and Lottery Scams
Ebook
Introduction to Internet Scams and Fraud: Credit Card Theft, Work-At-Home Scams and Lottery Scams
byDueep J. Singh
Rating: 4 out of 5 stars
4/5

Related podcast episodes

Skip carousel

764: The Top 10 Episodes of 2023: Data science futurists, bestselling authors, and …
Podcast episode
764: The Top 10 Episodes of 2023: Data science futurists, bestselling authors, and …
bySuper Data Science: ML & AI Podcast with Jon Krohn
0 ratings
0% found this document useful
114 - Project Orleans and the distributed database future with Dr. Philip Bernstein
Podcast episode
114 - Project Orleans and the distributed database future with Dr. Philip Bernstein
byMicrosoft Research Podcast
0 ratings
0% found this document useful
The Anthrozoology Podcast - Problematizing the Ethics Process: An Anthrozoological Perspective Part 2 #10
Podcast episode
The Anthrozoology Podcast - Problematizing the Ethics Process: An Anthrozoological Perspective Part 2 #10
byThe Anthrozoology Podcast
0 ratings
0% found this document useful
Nature's Take: what's next for the preprint revolution: Nature editors take on the big topics that matter in science.
Podcast episode
Nature's Take: what's next for the preprint revolution: Nature editors take on the big topics that matter in science.
byNature Podcast
0 ratings
0% found this document useful
The Anthrozoology Podcast - Problematizing the Ethics Process: An Anthrozoological Perspective Part 1 #9
Podcast episode
The Anthrozoology Podcast - Problematizing the Ethics Process: An Anthrozoological Perspective Part 1 #9
byThe Anthrozoology Podcast
0 ratings
0% found this document useful
Integrating Logic, Probability and Neuro-Symbolic Reasoning using Probabilistic Soft Logic: An overview of work on probabilistic soft logic (PSL), an SRL framework for large-scale collective, probabilistic reasoning in relational domains and a description of recent work which integrates neural and symbolic (NeSy) reasoning.
Podcast episode
Integrating Logic, Probability and Neuro-Symbolic Reasoning using Probabilistic Soft Logic: An overview of work on probabilistic soft logic (PSL), an SRL framework for large-scale collective, probabilistic reasoning in relational domains and a description of recent work which integrates neural and symbolic (NeSy) reasoning.
byComputer Science
0 ratings
0% found this document useful
Why Microservices Are Better Than Cloud Computing: This episode on Systems—one of the four Domains of Data Science UVA uses to define the field—explores the challenges of cloud computing within the framework of biomedical research. Phil Bourne, Dean of the UVA School of Data Science, speaks with computational biologist and associate professor Nathan Sheffield about a paper they co-wrote on systemic issues from cloud platforms that do not support FAIRness, including platform lock-in, poor integration across platforms, and duplicated efforts for users and developers. They suggest instead prioritizing microservices and access to modular data in smaller chunks or summarized form. Emphasizing modularity and interoperability would lead to a more powerful Unix-like ecosystem of web services for biomedical analysis and data retrieval. The two discuss how funders, developers, and researchers can support microservices as the next generation of cloud-based bioinformatics. From Cloud Computing to
Podcast episode
Why Microservices Are Better Than Cloud Computing: This episode on Systems—one of the four Domains of Data Science UVA uses to define the field—explores the challenges of cloud computing within the framework of biomedical research. Phil Bourne, Dean of the UVA School of Data Science, speaks with computational biologist and associate professor Nathan Sheffield about a paper they co-wrote on systemic issues from cloud platforms that do not support FAIRness, including platform lock-in, poor integration across platforms, and duplicated efforts for users and developers. They suggest instead prioritizing microservices and access to modular data in smaller chunks or summarized form. Emphasizing modularity and interoperability would lead to a more powerful Unix-like ecosystem of web services for biomedical analysis and data retrieval. The two discuss how funders, developers, and researchers can support microservices as the next generation of cloud-based bioinformatics. From Cloud Computing to
byUVA Data Points
0 ratings
0% found this document useful
Reproducible data science: How hard can it be?: The ability to reproduce the research that other scientists have done to see whether the same results are obtained (or the same conclusions are reached) is an integral part of the scientific process, but are we doing it right and how difficult is it to d...
Podcast episode
Reproducible data science: How hard can it be?: The ability to reproduce the research that other scientists have done to see whether the same results are obtained (or the same conclusions are reached) is an integral part of the scientific process, but are we doing it right and how difficult is it to d...
byThe Turing Podcast
0 ratings
0% found this document useful
Episode: 42 - Machine Learning Informatics for Antibody Discovery
Podcast episode
Episode: 42 - Machine Learning Informatics for Antibody Discovery
byThe Chain: Protein Engineering Podcast
0 ratings
0% found this document useful
EP 242: AI Tools to Supercharge Research
Podcast episode
EP 242: AI Tools to Supercharge Research
byEveryday AI Podcast – An AI and ChatGPT Podcast
0 ratings
0% found this document useful
Setting the Standard: Impact of Method Standardization in Chromatography
Podcast episode
Setting the Standard: Impact of Method Standardization in Chromatography
byThe Analytical Wavelength
0 ratings
0% found this document useful
Collaborators: Teachable AI with Cecily Morrison and Karolina Pakėnaitė
Podcast episode
Collaborators: Teachable AI with Cecily Morrison and Karolina Pakėnaitė
byMicrosoft Research Podcast
0 ratings
0% found this document useful
What Happened to Data Marts?
Podcast episode
What Happened to Data Marts?
byInsights Tomorrow
0 ratings
0% found this document useful
Developing an Enterprise Data Strategy the Right Way - with Dora Boussias of Stryker: In this week’s episode, we’re discussing a topic that applies across industries for AI projects: where is the data coming from? Our guest is Dora Boussias, Senior Director for Data Strategy and Architecture at Stryker. Stryker is a $17B per year...
Podcast episode
Developing an Enterprise Data Strategy the Right Way - with Dora Boussias of Stryker: In this week’s episode, we’re discussing a topic that applies across industries for AI projects: where is the data coming from? Our guest is Dora Boussias, Senior Director for Data Strategy and Architecture at Stryker. Stryker is a $17B per year...
byThe AI in Business Podcast
0 ratings
0% found this document useful
Intern Insights: Dr. Madeleine Daepp with Jennifer Scurrell and Alejandro Cuevas
Podcast episode
Intern Insights: Dr. Madeleine Daepp with Jennifer Scurrell and Alejandro Cuevas
byMicrosoft Research Podcast
0 ratings
0% found this document useful
002 Dr. Jim Warren and the Materials Genome Initiative: This episode focuses on the importance of materials data to the materials informatics process, as well an update on the . In this episode, Dr. Meredig and Dr. Warren discuss: How a shared improv background has surprisingly made a positive...
Podcast episode
002 Dr. Jim Warren and the Materials Genome Initiative: This episode focuses on the importance of materials data to the materials informatics process, as well an update on the . In this episode, Dr. Meredig and Dr. Warren discuss: How a shared improv background has surprisingly made a positive...
byDataLab: The Materials Informatics Podcast
0 ratings
0% found this document useful
What Is Your SOC's Single Search of Truth?: All links and images for this episode can be found on . Check out this post for the discussion that is the basis of our conversation on this week’s episode co-hosted by me, (), the producer of , and . Joining us is our sponsored guest, , CEO,...
Podcast episode
What Is Your SOC's Single Search of Truth?: All links and images for this episode can be found on . Check out this post for the discussion that is the basis of our conversation on this week’s episode co-hosted by me, (), the producer of , and . Joining us is our sponsored guest, , CEO,...
byDefense in Depth
0 ratings
0% found this document useful
Patient Reported Outcome Measures Special Issue: Operationalizing PROMs at the Musculoskeletal Practice and Policy Levels: Host Liana Tedesco, MD Guest interviewee David N. Bernstein MD, MBA, MEI discussing his article “Operationalizing PROMS at the Musculoskeletal Practice and Policy Levels” () Guest interviewee Prakash Jayakumar, MD, PhD discussing the October...
Podcast episode
Patient Reported Outcome Measures Special Issue: Operationalizing PROMs at the Musculoskeletal Practice and Policy Levels: Host Liana Tedesco, MD Guest interviewee David N. Bernstein MD, MBA, MEI discussing his article “Operationalizing PROMS at the Musculoskeletal Practice and Policy Levels” () Guest interviewee Prakash Jayakumar, MD, PhD discussing the October...
byJAAOS Unplugged
0 ratings
0% found this document useful
Is Relevance Irrelevant?: Should every information systems research paper be relevant to practice? Does research need to be relevant in the first place? And if relevance is relevant, how should we engage with practitioners about our research? We discuss relevance of...
Podcast episode
Is Relevance Irrelevant?: Should every information systems research paper be relevant to practice? Does research need to be relevant in the first place? And if relevance is relevant, how should we engage with practitioners about our research? We discuss relevance of...
bythis IS research
0 ratings
0% found this document useful
ASRM Today: ASRM Updates July 2023: On this episode we bring you up to date on things to know about ASRM for July 2023! MHPG Course: APP Survey: SPARK Program Survey: F&S On Air: Webinar Info: ASRM 2023:
Podcast episode
ASRM Today: ASRM Updates July 2023: On this episode we bring you up to date on things to know about ASRM for July 2023! MHPG Course: APP Survey: SPARK Program Survey: F&S On Air: Webinar Info: ASRM 2023:
byASRM Today Podcast
0 ratings
0% found this document useful
Elasticsearch with Philipp Krenn: Search is a common building block for applications. Whether we are searching Wikipedia or our log files, the behavior is similar: a query is entered and the most relevant documents are returned. The core data structure for search is an inverted index.
Podcast episode
Elasticsearch with Philipp Krenn: Search is a common building block for applications. Whether we are searching Wikipedia or our log files, the behavior is similar: a query is entered and the most relevant documents are returned. The core data structure for search is an inverted index.
byCloud Engineering Archives - Software Engineering Daily
0 ratings
0% found this document useful
Episode 41: 3D Printing Case Studies: examples of the power of additive manufacturing
Podcast episode
Episode 41: 3D Printing Case Studies: examples of the power of additive manufacturing
byMaterialism: A Materials Science Podcast
0 ratings
0% found this document useful
Episode 248 - Dr. Henry Swofford Interview: Eric and Glenn interview returning guest, and new…
Podcast episode
Episode 248 - Dr. Henry Swofford Interview: Eric and Glenn interview returning guest, and new…
byDouble Loop Podcast
0 ratings
0% found this document useful
Episode 7: Introducing Perception vs. Reality
Podcast episode
Episode 7: Introducing Perception vs. Reality
byStop Anamythics Podcast
0 ratings
0% found this document useful
What is the role of academia in modern AI research? With Stanford Professor Dr. Percy Liang: When AI research is evolving at warp speed and takes significant capital and compute power, what is the role of academia? Dr. Percy Liang – Stanford computer science professor and director of the Stanford Center for Research on Foundation Models talks about training costs, distributed infrastructure, model evaluation, alignment, and societal impact. Sarah Guo and Elad Gil join Percy at his office to discuss the evolution of research in NLP, why AI developers should aim for superhuman levels of performance, the goals of the Center for Research on Foundation Models, and Together, a decentralized cloud for artificial intelligence.
Podcast episode
What is the role of academia in modern AI research? With Stanford Professor Dr. Percy Liang: When AI research is evolving at warp speed and takes significant capital and compute power, what is the role of academia? Dr. Percy Liang – Stanford computer science professor and director of the Stanford Center for Research on Foundation Models talks about training costs, distributed infrastructure, model evaluation, alignment, and societal impact. Sarah Guo and Elad Gil join Percy at his office to discuss the evolution of research in NLP, why AI developers should aim for superhuman levels of performance, the goals of the Center for Research on Foundation Models, and Together, a decentralized cloud for artificial intelligence.
byNo Priors: Artificial Intelligence | Technology | Startups
0 ratings
0% found this document useful
Responsible and Explainable AI - Supreet Kaur
Podcast episode
Responsible and Explainable AI - Supreet Kaur
byDataTalks.Club
0 ratings
0% found this document useful
Analytics for a Better World - Parvathy Krishnan
Podcast episode
Analytics for a Better World - Parvathy Krishnan
byDataTalks.Club
0 ratings
0% found this document useful
Keeping ourselves honest when we work with observational healthcare data: The abundance of data in healthcare, and the valu…
Podcast episode
Keeping ourselves honest when we work with observational healthcare data: The abundance of data in healthcare, and the valu…
byLinear Digressions
0 ratings
0% found this document useful
Service Design: Practitioner Stories
Podcast episode
Service Design: Practitioner Stories
byUX Soup
0 ratings
0% found this document useful
[Bite] Data Science and the Scientific Method
Podcast episode
[Bite] Data Science and the Scientific Method
byDataCafé
0 ratings
0% found this document useful

Skip carousel

Showing Your Work
Family Tree
Article
Showing Your Work
Aug 17, 2021
6 min read
Opinion: Reinvent Scientific Publishing With Blockchain Technology
STAT
Article
Opinion: Reinvent Scientific Publishing With Blockchain Technology
Dec 21, 2018
Blockchain-based publishing can give researchers and research institutions — not for-profit publishers — full control over the life cycle of scholarly publications.
4 min read
Online Research
Writing Magazine
Article
Online Research
Aug 6, 2020
The internet has the potential to give you much of the information, if not all the information, you need for your research, provided you know how to use it effectively. There are different ways to approach online research, but following a process tha
3 min read
For Google, Everything Is a Popularity Contest
The Atlantic
Article
For Google, Everything Is a Popularity Contest
Jun 27, 2017
8 min read
Is Transparency Always A Good Thing? EPA Weighs Controversial New Rule.
The Christian Science Monitor
Article
Is Transparency Always A Good Thing? EPA Weighs Controversial New Rule.
Mar 12, 2020
The Environmental Protection Agency is mulling a proposal to give preference to scientific research whose datasets and models are publicly available.
3 min read
Imagine… One Single Family Tree In The World
Family Tree UK
Article
Imagine… One Single Family Tree In The World
Oct 13, 2023
7 min read
To Spur New AI Tools To Fight Coronavirus, Tech Leaders Launch Open Database Of Scientific Articles
STAT
Article
To Spur New AI Tools To Fight Coronavirus, Tech Leaders Launch Open Database Of Scientific Articles
Mar 16, 2020
The dataset is believed to be the most extensive collection concerning the #coronavirus, and crucially, it’s machine-readable, a format that can be easily processed by a computer.
1 min read
FSearch
Linux Format
Article
FSearch
Jan 11, 2022
2 min read

Academic Search Engines

A quantitative outlook

First Edition

José Luis Ortega

Cover image

Title page

Copyright page

Dedication

List of figures and tables

Figures

Tables

Preface

About the author

1: Introduction

Abstract

What is an academic search engine?

Challenges for an academic search engine

The evolution of academic search engines

Future perspectives

2: CiteSeerx: a scientific engine for scientists

Abstract

Autonomous citation indexing

A focus on computer science

A searchable digital library

Searching for authors and citations

Parsing mistakes

Other ‘Seers’: the CiteSeerx lab

A pioneer in citation indexing

3: Scirus: a multi-source searcher

Abstract

Web pages and authoritative sources

Crawling and data extraction

Source filtering

Ranking on links

A missed opportunity

4: AMiner: science networking as an information source

Abstract

A networked engine

A chaotic design

Based on bibliographic databases

Searching only in documents

Exhaustive author profiles

PatentMiner

A half-academic search engine

5: Microsoft Academic Search: the multi-object engine

Abstract

The object-level vertical search engine

Slow content updating speed

Filtering across the directory

A multidimensional ranking

A composition based on profiles

Visualization: graphs as research assessment tools

More a directory than an engine

6: Google Scholar: on the shoulders of a giant

Abstract

A specialization of Google

Feeding back its own sources

The opacity of results

Google Scholar’s additional services

The most exhaustive academic search engine

7: Other academic search engines

Abstract

BASE

Q-Sensei Scholar

WorldWideScience

8: A comparative analysis

Abstract

Functioning

Structure

Coverage

Searching

A heterogeneous sample

9: Final remarks

References

Index

Copyright

Chandos Publishing

Elsevier Limited

The Boulevard

Langford Lane

Kidlington

Oxford OX5 1GB

store.elsevier.com/Chandos-Publishing-/IMP_207/

Chandos Publishing is an imprint of Elsevier Limited

Tel: +44 (0) 1865 843000

Fax: +44 (0) 1865 843010

store.elsevier.com

First published in 2014

ISBN 978-1-84334-791-0 (print)

ISBN 978-1-78063-472-2 (online)

Chandos Information Professional Series ISSN: 2052-210X (print) and ISSN: 2052-2118 (online)

Library of Congress Control Number: 2014946174

British Library Cataloguing-in-Publication Data.

A catalogue record for this book is available from the British Library.

All rights reserved. No part of this publication may be reproduced, stored in or introduced into a retrieval system, or transmitted, in any form, or by any means (electronic, mechanical, photocopying, recording or otherwise) without the prior written permission of the publishers. This publication may not be lent, resold, hired out or otherwise disposed of by way of trade in any form of binding or cover other than that in which it is published without the prior consent of the publishers. Any person who does any unauthorised act in relation to this publication may be liable to criminal prosecution and civil claims for damages.

The publishers make no representation, express or implied, with regard to the accuracy of the information contained in this publication and cannot accept any legal responsibility or liability for any errors or omissions.

The material contained in this publication constitutes general guidelines only and does not represent to be advice on any particular matter. No reader or purchaser should act on the basis of material contained in this publication without first taking professional advice appropriate to their particular circumstances. All screenshots in the publication are the copyright of the website owner(s), unless indicated otherwise.

Typeset by Domex e-Data Pvt. Ltd., India

Dedication

To everyone who was interested in this book – family, friends, colleagues – and especially

my mother and my father, who were determined that this book should see the light of day

Preface

In 1989 the World Wide Web was born at CERN, a large international scientific facility. It was not by chance that this information publishing technology emerged from a scientific environment, because researchers have always needed access to high-quality information about new advances and about the state of the art in their disciplines. At the same time, the scientific community demands shared media that support the rapid dissemination and discussion of results. Today, the practice of science would be impossible without a strong information system that allows the diffusion and storage of all the necessary knowledge. In this context, the Internet and the web represent an authentic revolution, comparable to the birth of the printing press, in the way in which scientists communicate among themselves. This new web era is also bringing about an explosion of new types of academic output that reflect the diversity of scholarly activity. But perhaps the most important change is the questioning of the earlier way of thinking typified by print publishing. The new potential of the web, participative and open, is favouring the break-up of publishing barriers when it comes to disseminating results. Thus, the open access movement, with the proliferation of digital libraries and repositories as well as the consolidation of free electronic journals, fights for a total democratization of science, bypassing intermediaries and reducing the cost of scientific publishing.

In addition, Web 2.0 has made possible a Science 2.0 in which social networking sites have emerged that enhance the discussion and sharing of ideas, the comparison of curricula and the popularization of results. In this way, scientists are discovering a huge spectrum of utilities that expand the diffusion of results and improve their relationships with other partners. Alongside this flowering of new publishing forms, the web is also bringing about an important change in the evaluation of research. Webometrics – studying the use of hyperlinks as a reflection of citations, and of academic web pages as synonymous with research production – and, more recently, altmetrics – analysing the parallelism between citation impact and social visibility – are symptoms of the need for a new way to evaluate this changeable behaviour in research performance, breaking the borders of bibliometric impact and classical publication in print journals.

In this new scenario, specialized search engines for scientists have been emerging over the past few years as a means of gathering this new academic production, accessible via the web. Although academic search engines are indebted to classical citation indexes, this new publishing and communication panorama is changing the appearance of these search services, which now incorporate heterogeneous research results, add information sharing tools, and offer a new perspective on current scientific activity. In this way, these engines are springing up as web alternatives to the classical citation databases, mainly due to their free access, comprehensiveness and profiling applications.

In the face of such groundbreaking developments, this book aims to present a general review of the most important academic search engines on the market, with the intention of providing a brief guide to how these platforms operate, what and how many materials they index, and which functionalities they offer for a satisfactory search experience. This review takes a critical approach, discussing the suitability of these engines for the scientific community, their reliability for evaluation purposes, and the advantages and limitations of each service in meeting the specific and rigorous needs of researchers. Further, this examination has been carried out via a precise analysis of every section and function, in which each searching feature has been explored and the sources that feed each tool have been tracked.

As the book’s subtitle suggests, a quantitative approach is followed in this analytical review, with the aims of measuring the coverage and quantifying the weight of each source. Moreover, the data obtained leads me to suggest an innovative methodology based on the extensive use of crawlers and harvesting procedures, which draw a detailed picture of each search service. To some extent, it is an attempt to design a reverse engineering method to capture the essence of the data stored in those services – motivated by the fact that many search engines neither incorporate search interfaces that allow users to take a global view of the coverage of the service nor provide enough information about how they function. Therefore, in many cases automatic queries were launched to secure the most detailed coverage possible of the search engine. Another important function of the crawling process is that it makes possible the detection of duplicated records, misleading information, and missing data on authors and documents. Thus, this work tries to present an objective analysis of these search tools, principally based on the extracted data.

Throughout the analysis of these products, it became clear that they embody different conceptions of what an academic search engine is. My aim is to highlight the numerous forms that these services may take, signalling that there is no clear definition of which elements are essential in designating an academic search engine. The compilation of this list is driven by the popularity and the importance of these developments in shaping the concept of the academic search engine, although each one can differ markedly from the next. In this way, the intention is to present a colourful range of academic search solutions that illustrate different approaches to building data sets, designing search interfaces or structuring contents. At the same time, the selection of these services has also been shaped by the fact that they share basic elements that reflect the principal characteristics of an academic search engine, helping to flesh out the meaning of the concept.

Addressing a topic in book form presents numerous problems, and publishing a work about web services is inevitably accompanied by the obsolescence of the results. Data on the size of search engines in terms of the number of documents indexed, or on the appearance and disappearance of functionalities, can change dramatically in a short period of time, all the more so when a quantitative approach is taken, since the data sets are the most unstable elements. For example, although all the figures in this book were updated during February and March 2014, it is possible that during the publication process many aspects will disappear or change. Thinking positively, this should be interpreted as a sign of the dynamic and evolving nature of scholarly engines.

For the above reason, I do not expect to have built a static picture of each search engine, but I hope that this quantitative approximation makes it possible to describe the fundamental characteristics and functionalities of each service, as well as to suggest their advantages and disadvantages for scientific users. An example of the problem of obsolescence mentioned above can be seen in the case of Scirus, which was closed down during the preparation of this book. Despite this, the analysis carried out on that product has been included in the book – first, as a final tribute to its contribution; second, because this could be the last analysis carried out on the site; and, third, because it is a good opportunity to explain for future reference how that engine worked.

About the author

José Luis Ortega is a web researcher at the Spanish National Research Council (CSIC). He was awarded a fellowship in the Cybermetrics Lab of CSIC, where he completed his doctoral studies (2003–8). In 2005, he was employed by the Virtual Knowledge Studio of the Royal Netherlands Academy of Sciences and Arts, and in 2008 he took up a position as information scientist within CSIC. He now continues his collaboration with the Cybermetrics Lab in research areas such as webometrics, web usage mining, visualization of information, social network analysis and web bibliometrics.

Introduction

Abstract

This chapter first introduces the need for a clear definition of what an academic search engine is, and the colourful range of products that could be considered as such. The chapter goes on to summarize the main characteristics that these engines should have in order to serve the complexity of scholarly information and the thoroughness of the scientific community. A brief history of these engines is given, marking their principal milestones and contributions, such as the appearance of the first autonomous citation index or the first author profiling platform. Finally, the chapter looks at the challenges these services will face in the future as they try to establish themselves within the landscape of research evaluation and information retrieval.

Keywords

academic search engines

scientific users

autonomous citation indexing

profiling

research evaluation

Academic search engines could be considered to be the meeting point between two streams that started to diverge just when the web evolved: on the one hand the traditional specialized databases and on the other hand the new generalist web search engines. Before the appearance of the web, any search product was a specialized object addressed to a specific and sophisticated user (scientist, technician, lawyer, etc.) who needed to carry out complex queries to obtain the highest precision or recall. These databases contained records with a great number of fields which described structured and formal documents. This paradigm started to break up when the arrival of the web brought about the opening up of these search services to a broad and heterogeneous population with few skills in information searching and
