Titre : | Extraction and generation of metadata for scientific publications: Development of an automation tool for non-specialists |
Auteurs : | MEGHERBI Douha, Auteur ; Toufik Kalfali, Directeur de thèse |
Type de document : | Mémoire magistere |
Editeur : | Biskra [Algérie] : Faculté des Sciences Exactes et des Sciences de la Nature et de la Vie, Université Mohamed Khider, 2025 |
Format : | 1 vol. (77 p.) / ill., couv. ill. en coul / 30cm |
Langues: | Anglais |
Langues originales: | Anglais |
Résumé : | The main objectives of this master's thesis are: (i) review the state of the art of the computerization of the editorial process of scientific journals, (ii) Automate DOI (Digital Object Identifier) indexing through XML Export Plugin of a peer review management software, (iii) Design and develop a lightweight tool for beginner editors which allows to extract article metadata and generate XML files ready for indexing, (iv) Test and validate the developed tool by using recent published articles of an active journal. To achieve these objectives, a practical internship of more than three months was carried out at Alwaha Scientific Publishing Services SARL (ASPS), an independent international academic publisher. The metadata of an active scientific journal, titled ‘Energy and Thermofluids Engineering, ETE, ISSN 2716-8026’, was considered as a case study. Thus, this manuscript first deals with the state of the art of the digitalization of the scientific journal publishing process from submission to dissemination. Particular attention was drawn to the system used by the ASPS. It's about the Open Journal Systems (OJS), an open source and free software for the management of peer-reviewed academic journals. The second chapter represents a practical work showing how to automate DOI indexing through OJS by leveraging tools such as the Crossref XML Export Plugin and DOI Public Identifier. The methodology draws on firsthand experience gained through the participation in a Crossref-hosted webinar held on 13 March 2025. In this international event, advanced OJS workflows were implemented on ASPS platform to reduce processing time by 80% and improve metadata completeness scores from 65% to 98%. The third chapter presents the design and development of a lightweight XML generation tool tailored for beginner publishers who lack technical expertise or access to automation. The tool converts structured CSV inputs into Crossref-compliant XML files, eliminating the need for manual XML editing. Developed in C++ for efficiency and portability, it allows non-specialist editorial teams to register DOIs and deposit metadata with minimal effort. The last chapter presents successful stages of testing and validation of the tool developed. In summary, this work empowers publishers of all scales to meet global indexing standards. The proposed solutions can enhance the discoverability of Algerian research and provide a replicable framework for institutions in resource-constrained regions to participate equitably in open scholarly communication. |
Sommaire : |
ACKNOWLEDGEMENTS _____________________ III
DEDICATION ___________________________ IV ABSTRACT _ ملخص ___ VI GENERAL INTRODUCTION ____________________ 1 1 DIGITALIZATION OF JOURNAL PUBLISHING PROCESS: BACKGROUND _______________ 4 1.1 Introduction _______________ 4 1.2 Historical Evolution of Scholarly Publishing _____________ 4 1.3 Core Terminologies and Conceptual Framework _____________ 5 1.3.1 Publisher (Publishing House) __________________ 5 1.3.2 Indexing (in Academic Publishing) ______________________ 6 1.3.3 Open Journal Systems (OJS) ____________________ 6 1.3.4 Metadata _______________________ 6 1.3.5 Digital Object Identifier (DOI) ________________ 7 1.4 Functional Roles within Open Journal Systems (OJS) ____________ 7 1.4.1 Author ______________________ 8 1.4.2 Production or Managing Editor _____________ 8 1.4.3 Section Editor __________________ 8 1.4.4 Layout Editor (Designer)_________________ 8 1.4.5 Reviewer _________________________ 9 1.4.6 Copyeditor _________________________________________ 9 1.4.8 __________________________________________________________ 9 1.4.7 Proofreader ______Reader/Community __________________________________ 9 1.4.9 Indexer (or “Metadata Editor”) _________________________________ 10 1.5 Editorial Workflow: From Submission to Dissemination _________ 10 1.5.1 Scientific Evaluation Phase _________________________ 11 1.5.2 Manuscript Production Phase _____________________ 12 1.6 Comparative Analysis of Print and Digital Production ______________ 13 1.6.1 Production Workflows in Print Publishing _____________________ 14 1.6.2 Production Workflows in Digital Publishing __________________ 14 1.7 Technical Perspectives on Editorial Processes _____________________ 15 1.7.1 Scientific Phase: Adaptive and Dynamic Design Needs ____________ 15 1.7.2 Production Phase: Structured Data and Automation Potential _____________ 15 1.7.3 Towards Standardization and Semi-Automation ______________________ 16 1.8 Critical Role of the Indexer in Scholarly Communication ______________________ 16 1.9 Functions of the Indexer __________________________ 17 1.9.1 Types of Indexers _____________________________________ 17 1.10 .Automation Opportunities and Technical Limitation ____________________ 19 1.11 Conclusion _______________ 19 2 CASE STUDY OF AUTOMATED INDEXING USING THE ASPS OJS PLATFORM: THE CROSSREF XML EXPORT PLUGIN _____________________________ 21 2.1 Introduction ____________________________________ 21 2.2 Overview of Alwaha Scientific Publishing Services (ASPS) ______________ 21 2.2.1 Journal Portfolio and Scope ___________________________ 22 2.2.2 Open-Access Policy and Global Reach _________________ 23 2.2.3 ASPS’s Research and Publication Ethics_______________________ 23 2.2.4 ASPS’s Standards for Authors: Responsible Research Publication ______ 23 2.2.5 ASPS’s Privacy Statement ____________________ 24 2.3 Open Journal Systems (OJS) as ASPS’s Publishing Platform ________________ 24 2.3.1 Workflow Automation and Metadata Management __________ 24 2.3.2 Operational Advantages for ASPS _______________________ 25 2.4 Challenges and Strategic Directions for OJS Optimization ____________________ 25 2.4.1 Current Technical and Organizational Gaps ___________________ 25 2.4.2 Strategic Outlook _____________ 26 2.5 Crossref: Background and Strategic Relevance for ASPS _____________ 26 2.6 Capacity Building: ASPS Training and Crossref Webinar________________26 2.6.1 Scholarship Activities and Skill Development Outcomes ____________ 27 2.6.2 Assessing Metadata Quality with the Crossref Participation Reports Tool ________ 28 2.6.3 Crossref Webinar: Technical Training and Strategic Upskilling __________ 32 2.7 Automated Indexing Workflow __________________ 33 2.7.1 Manual Indexing ______________________ 33 2.7.2 Limitations of Manual Indexing at ASPS ______________ 33 2.7.3 Transition to Automation via OJS Plugins _____________ 33 2.7.4 Core OJS Components Configured for Automation ___________ 34 2.7.5 OJS Plugins for Indexing Automation ___________ 35 2.8 Implementing Automated Indexing in OJS ____________ 36 2.8.1 Plugin Configuration __________________ 37 2.8.2 Retro-Assigning DOIs to Previously Published Articles ____________ 43 2.10 Recommendations and Improvements __________________ 47 2.9 2.11 Conclusion _____________________ 47 3 DESIGN AND IMPLEMENTATION OF XML GENERATOR TOOL FOR METADATA INDEXERS: CASE STUDY OF ASPS JOURNALS __________________ 49 3.1 Introduction ____________________________ 49 3.2 XML and Metadata Standards _______________ 49 3.2.1 Evolution of Metadata Standards __________________ 50 3.2.2 Schema Analysis: Crossref, JATS, and Dublin Core __________ 51 3.3 Indexing Services: Requirements and Strategic Value _____________ 51 3.3.1 some indexing services_____________________ 51 3.3.2 Metadata Requirements _____________ 54 3.3.3 Unified Field Matrix _________________________ 54 3.3.4 Relevance to ASPS: Strategic Metadata Compliance ___________ 55 3.4 Crossref XML Generator Design _______________________ 56 3.4.1 Workflow of the Tool _____________________ 56 3.4.2 Input Data Models and XML Generation Results ____________ 58 3.4.3 Data Standardization ____________ 67 3.4.4 DOI Automatic Generation__________________ 68 3.4.5 Overcoming Technical Challenges in Metadata Extraction _________ 68 3.4.6 UML Diagrams _______________________________ 69 3.5 Implementation: Languages and tools for development __________70 3.5.1 3.5.1 Hardware tools ________________________ 71 3.5.2 3.5.2 Software tools _______________ 72 3.5.3 Tool Realization _______________________________ 73 3.6 Conclusion ____________________ 76 4 TESTING, VALIDATION, AND DEPLOYMENT ___________________ 78 4.1 Introduction __________________ 78 4.2 4.2 Real-World Validation ________________________ 78 4.2.1 4.2.1 Initial Submission and Schema Rejection _______________ 79 4.2.2 4.2.2 Schema Corrections and Revalidation ________________ 80 4.3 4.3 Validation Strategy _____________________ 80 4.4 4.4 Testing Scenarios and Methodology _________________ 81 4.4.1 4.4.1 Scenario 1: Creating a DOI for a New Journal volume (No Articles) __________ 81 4.4.2 4.4.2 Scenario 2: Depositing a DOI for a New Article in an Existing Journal _______ 88 4.4.3 4.4.3 Scenario 3: Batch Deposit for Multiple Articles in an Existing Journal ______ 90 4.5 4.5 Final Deployment within ASPS _______________________________ 92 4.6 4.6 Deployment Outcomes and Impact at ASPS _________________ 93 4.7 4.7 Recommendations and Future Work _________________ 94 4.8 4.8 Conclusion __________________________ 94 GENERAL CONCLUSION ___________ REFERENCES _____________________ 98 APPENDIX ___________________106 |
Type de document : | Mémoire master |
Disponibilité (1)
Cote | Support | Localisation | Statut |
---|---|---|---|
MINF/925 | Mémoire master | bibliothèque sciences exactes | Consultable |