This is a tool to crawl song and lyrics from marumaru-x
pip install -r requirement.txt
playwright install
python song_list_crawler.py
python lyrics_crawler.py
The crawled data will be saved in JSON format.
[
{
"title": "", // song title
"image_url": "", // thumbnail
"song_link": "", // link to marumaru lyrics page
"duration": 100 // seconds
}
]
{
"title": "",
"image_url": "",
"song_link": "",
"duration": 100,
"youtube_link": "",
"composer": "", // may be empty
"artist": "", // may be empty
"arrange": "", // may be empty
"lyrics_list": [
{
"start_time": 0.0,
"end_time": 5.0,
"lyrics": "Lyrics line 1"
},
{
"start_time": 5.0,
"end_time": 10.0,
"lyrics": "Lyrics line 2"
}
],
"process_time": "1970-01-01 00:00:00"
}
Ken |
All contributions are the same, no matter how big or small.