Perplexity-Inspired Metasearch-Based Alternatives to FAIR GPT: Open-source AI Consultants for Research Data Management

Thomas Schmidt; Renat Shigapov; Jan Kamlah; Irene Schumm

doi:10.11588/heibooks.1652

Zitationsvorschlag

Schmidt, Thomas et al.: Perplexity-Inspired Metasearch-Based Alternatives to FAIR GPT: Open-source AI Consultants for Research Data Management, in Heuveline, Vincent et al. (Hrsg.): E-Science-Tage 2025: Research Data Management: Challenges in a Changing World, Heidelberg: heiBOOKS, 2025, S. 402–408. https://doi.org/10.11588/heibooks.1652.c23938

Bibliografische Angaben herunterladen

Lizenz (Kapitel)

Dieses Werk steht unter der Lizenz Creative Commons Namensnennung - Weitergabe unter gleichen Bedingungen 4.0 International.

Identifier (Buch)

https://doi.org/10.11588/heibooks.1652

ISBN 978-3-911056-51-9 (PDF)

ISBN 978-3-911056-52-6 (Softcover)

Veröffentlicht

05.11.2025

Downloads

Kapitel herunterladen (PDF/261KB)

Statistik

Autor/innen

Thomas Schmidt , Renat Shigapov , Jan Kamlah , Irene Schumm

Perplexity-Inspired Metasearch-Based Alternatives to FAIR GPT: Open-source AI Consultants for Research Data Management

Abstract: Chatbots and virtual assistants are becoming increasingly popular for user questions and support. With FAIR GPT, the Mannheim University Library released a virtual assistant for research data management (RDM) in 2024, designed to help researchers and institutions in making their data FAIR (Findable, Accessible, Interoperable, Reusable). FAIR GPT provides various RDM services, e.g. metadata enhancement, repository selection and FAIR assessment. However, FAIR GPT has numerous disadvantages: As a ‘Custom GPT’ of OpenAI, it is proprietary software that only outputs sources for the generated answers if it uses its internal web search tool (which cannot be controlled by the user) and therefore lacks transparency. Reliance on external cloud-based services leads to privacy
concerns when dealing with sensitive (meta)data and the chatbot is still prone to hallucinations, thus reducing its trustworthiness. These issues led us to explore alternative open-source solutions. We searched for opensource
alternatives to Perplexity.ai, a system known for its ability to provide citations for the information it retrieves through web searches. We identified three candidates available on GitHub: Perplexica, sensei, and farfalle. These tools use local instances of the metasearch engine SearXNG to perform internet searches, using the results as input
for Large Language Models (LLMs). We modified these tools to focus specifically on RDM tasks, releasing the new versions on GitHub openly under the names FAIRplexica, FAIR-sensei and FAIR-farfalle.
Keywords: Research Data Management, LLMs, Chatbot, RDM Assistants, FAIR data