I’m a violinist, guitarist, and CS researcher.
I am currently a proud PhD candidate (3rd year) in the Sound and Music Computing Lab, School of Computing, National University of Singapore, advised by Prof. Ye Wang (王晔). My research centers on solving music information retrieval challenges using machine learning techniques. I am particularly interested in the automatic transcription and generation of music and lyrics. Prior to joining NUS, I earned my Bachelor's degree in Computer Science with honors from the Harbin Institute of Technology and completed my bachelor's thesis on piano music transcription at the Auditory Intelligence Research Center, advised by Prof. Jiqing Han (韩纪庆).
Moreover, I am a professional-level violinist and guitarist. Please visit the "As Musician" page to explore my music portfolio. I feel incredibly fortunate to have discovered my passion for music and to have the opportunity to work and conduct research in a music-related field. My love for music energizes and motivates me to continually grow and excel in this area.
Recent News
- [2024.8] I started my internship at Sony Computer Science Laboratories.
- [2024.5] I started my internship at YAMAHA at Hamamatsu, Shizuoka, Japan.
- [2024.3] The paper DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator is accepted by ISPASS 2024. Congratulations to Puru Sharma!
- [2024.3] The paper Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing is accepted by ACM TOMM. Congratulations to my colleague Xiangming!
- [2023.10] My short paper Singable and Controllable Neural Lyric Translation: a Late-Breaking Showcase is accepted by ISMIR 2023 Late Breaking Demo.
- [2023.6] One full paper was rejected by ISMIR 2023. Sadge!
- [2023.5] I passed the Qualification Exam. Now I am a PhD candidate!
- [2023.5] My paper Songs Across Borders: Singable and Controllable Neural Lyric Translation is accepted by ACL 2023.
- [2023.1] I receive Research Achievement Award (2022/2023) from School of Computing, NUS.
- [2022.12] I'm attending ISMIR 2023 at Bengaluru, India.
- [2022.11] Our ACM Multimedia paper receives the top paper award (2% of accepted full papers).
- [2022.10] I'm attending ACM Multimedia at Lisbon, Portugal.
- [2022.7] An extension work of our previous paper, Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription is acctepted by ISMIR 2023.
- [2022.7] My paper collaborated with Xiangming Gu, MM-ALT: A multimodal automatic lyric transcription system is accepted by ACM Multimedia 2022.
- [2022.5] I'm attending ICASSP 2022 at Singapore.
- [2022.1] My first paper, which achieves another SOTA on piano music transcription, is accepted by ICASSP 2022.
- [2022.1] I start my PhD journey in NUS SMCL, advised by Prof. Wang Ye.
- [2021.8] I join National University of Singapore as a student in Master of Computing program (AI track), start my research in Sound and Music Computing Lab .
Publication
-
DNA Storage Toolkit: A Modular End-to-End DNA Data Storage Codec and Simulator
Puru Sharma, Gary Goh, Bin Gao, Longshen Ou, Dehui Lin, Deepak Sharma, Djordje Jevdjic
2024 IEEE Int’l Symposium on Performance Analysis of Systems and Software (ISPASS 2024)
[slides] -
Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
Xiangming Gu, Longshen Ou, Wei Zeng, Jianan Zhang, Nicholas Wong, Ye Wang
ACM Transactions on Multimedia Computing, Communications and Applications (TOMM 2024)
[code | ArXiv] -
Songs Across Borders: Singable and Controllable Neural Lyric Translation
Longshen Ou, Xichu Ma, Min-Yen Kan, Ye Wang
The 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)
[demo | code] -
Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
Longshen Ou*, Xiangming Gu*, and Ye Wang
Proceedings of the 23rd International Society for Music Information Retrieval Conf. (ISMIR 2022)
[code] -
MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top Paper Award)
Xiangming Gu*, Longshen Ou*, Danielle Ong, and Ye Wang
Proceedings of the 30th ACM International Conference on Multimedia (ACM Multimedia 2022)
[demo | code | dataset | press] -
Exploring Transformer’s Potential on Automatic Piano Transcription
Longshen Ou, Ziyi Guo, Emmanouil Benetos, Jiqing Han, and Ye Wang
2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2022)
Preprint
-
Lead Instrument Detection from Multitrack Music
Longshen Ou, Yu Takahashi, and Ye Wang
2024.08.28 -
Unlocking Potential in Pre-Trained Music Language Models for Versatile Multi-Track Music Arrangement
Longshen Ou, Jingwei Zhao, Ziyu Wang, Gus Xia, and Ye Wang
arXiv:2408.15176 (2024) -
LOAF-M2L: Joint Learning of Wording and Formatting for Singable Melody-to-Lyric Generation
Longshen Ou, Xichu Ma, and Ye Wang
arXiv:2307.02146 (2023)
- Automatic Hyper-Parameter Optimization Based on Mapping Discovery from Data to Hyper-Parameters
Bozhou Chen, Kaixin Zhang, Longshen Ou, Chenmin Ba, Hongzhi Wang, and Chunnan Wang.
arXiv:2003.01751 (2020)
Demo & Workshop Papers
- Singable and Controllable Neural Lyric Translation: a Late-Breaking Showcase
Longshen Ou, Xichu Ma, and Ye Wang
Late Breaking Demo at the 24rd Int. Society for Music Information Retrieval Conf. (ISMIR 2023)
Other Projects
-
GNN-based Music Recommender
This project aims to tackle the music artist recommendation challenge using Graph Convolutional Networks (GCNs). By modeling artist and user identities through their interactive relationships, the network predicts affinity scores between users and previously unexplored artists to generate personalized recommendations. I implemented the original GCN as a baseline and proposed three enhancements: incorporating edge weight for aggregation, augmenting edge weight with attention mechanisms, and implementing data augmentation by introducing noise to edge values. -
DNA Storage Simulation
DNA-based storage systems present unique challenges, as reading and writing operations can sometimes result in alterations to the original information. To model the changes introduced by such storage systems in a wet lab environment, we designed a simulation system to emulate DNA behavioral changes. This system includes a rule-based method, a Multi-Layer Perceptron (MLP) method, and a sequence-to-sequence attention-based Recurrent Neural Network (RNN). The experiments based on the Microsoft Nanopore dataset shows the sequence-to-sequence method is highly effective. -
GuitarFret
With this guitar fretboard simulator on your laptop, never worry about composing without a guitar around you!
Honors and Awards
- Research Achievement Award (2022/2023), issued by School of Computing, NUS, 2023.5.
- Top Paper Award (2% of accepted full papers), issued by ACM Multimedia 2022, 2022.11.
- Honor Degree of Bachelor of Engineering, issued by Harbin Institute of Technology Honors School, 2021.6.
- People Scholarship (6%), issued by Harbin Institute of Technology, 2020.6.
- Third Prize, Sogou Innovative Practice Project for College Student, 2018.10.
Teaching
- Teaching Assistant, CS4347/5647 Sound and Music Computing (2022/2023 sem 1, 2023/2024 sem 1).
- Teaching Assistant, CS4248 Natural Language Processing (2022/2023 sem 2).
Academic Reviewers
- IEEE TAFFC (2024)
- EAI ArtsIT 2024
- ACM TOMM (2024)
- ACM Multimedia 2024
- ACL Rolling Review (2024)
- TASLP (2024)
- ISMIR 2023
- ACM Multimedia 2023
- ISMIR 2022