Project detail

MERmaid

A multi-year, multimodal Music Emotion Recognition system with robustness and microservices architecture, involving multiple MSc and BSc works.

Applied research Multi-year projectMSc thesesBSc projectsResearch prototype 2022–Present On Hold

Demo

Overview

MERmaid is a multi-year research project developing a multimodal Music Emotion Recognition (MER) system. It focuses on robustness, modular design through microservices, and combining multiple modalities (audio, lyrics, metadata) for emotion prediction.

The project acts as an umbrella for several MSc theses and BSc works, each contributing specific components, evaluations or architectural improvements to the overall system.

Research context

Music Emotion Recognition is an active research area pursued by the MIR group, of the CISUC R&D center. Existing approaches are normally buried in academic publications and public prototypes often lack robustness, modularity and support for multiple data modalities. Most systems are monolithic and difficult to extend or evaluate systematically.

System design

A microservices architecture where each component (data input,feature extraction, model inference) is independently developed and deployed. Multiple MSc and BSc students contribute specific modules, so the system grows incrementally while keeping architectural coherence.

Multi-year evolution

Each student thesis addresses a specific aspect:

Ricardo António (2018–2019) — Proof-of-concept distributed MER system with microservices, message queues, YouTube as data source, and Essentia + SVM for classification.
Tiago António (2020–2021) — Replaced the dummy ML models with proper MER logic for audio and lyrics, bridging software development and data science on top of Ricardo’s foundation.
João Canoso (2020–2021) — Focused on orchestration: automated deployment, scaling, and management of the distributed system using containerisation and orchestration techniques.
Hélder Ribeiro (2023–2024) — Developed the MERmaid web app (v2): an Express.js API and React SPA with WebSockets, JWT authentication, rate limiting, and integration with the YouTube API and RabbitMQ broker. Currently live.
Luís Costa (2024–2025) — MERmaid v0.3 (Deep Flow): improving system resilience and orchestration, adding source separation, voice analysis, and deep learning models, and refining private cloud deployment.

External collaboration

The project aggregates CISUC researchers and students, particularly in audio analysis and machine learning, connecting applied engineering at DataLab (development of practical applications) with fundamental research at CISUC (the ML/MER logic).

Technologies

Outputs

António, R. (2019). Microsserviços para Reconhecimento de Emoção em Música [Master's thesis]. Instituto Politécnico de Tomar.

António, T. (2021). MER: Estudo e restruturação do sistema de reconhecimento emocional em música áudio usando o YouTube [Master's thesis]. Instituto Politécnico de Tomar.

Canoso, J. (2021). Orchestration of Music Emotion Recognition Services – Automating Deployment, Scaling and Management [Master's thesis]. Instituto Politécnico de Tomar.

Ribeiro, H. (2023). MERmaid Web App [BSc report]. Instituto Politécnico de Tomar.

Costa, L. (2025). MERmaid v0.3 – Deep Flow [Master's thesis]. Instituto Politécnico de Tomar.

Team

Renato Panda

Supervisor

Luís Costa

MSc Student · 2024–2025

MERmaid v0.3 (Deep Flow): system resilience and orchestration improvements, source separation, voice analysis, deep learning integration, private cloud deployment

João Canoso

MSc Student · 2020–2021

Orchestration of MER services: automated deployment, scaling, and management of the distributed system using containerisation and orchestration techniques

Tiago António

MSc Student · 2020–2021

Study and implementation of proper MER machine learning logic for audio and lyrics, replacing the proof-of-concept ML models in the distributed system

Ricardo António

MSc Student · 2018–2019

Initial distributed MER prototype: microservices architecture with message queues, YouTube as song source, Essentia + SVM for feature extraction and classification

Hélder Ribeiro

BSc Student · 2022–2023

Express.js API and React web app with WebSockets, JWT authentication, rate limiting, YouTube API and RabbitMQ broker integration; currently live while MER classification services are still being deployed

External collaborators

Pedro Louro

PhD Student · CISUC · 2023–Ongoing

PhD research on feature engineering and deep learning for audio-based MER; novel emotionally-relevant audio features and hybrid shallow/deep learning approaches whose outcomes feed into the MERmaid pipeline

Hugo Redinho

Research Student · CISUC · 2022–2025

Dataset preparation, ML pipelines and applied MER expertise, bridging CISUC research outputs with the MERmaid system