-
Notifications
You must be signed in to change notification settings - Fork 53
Description
I just found this amazing resource and its associated paper a few days ago, but unfortunately, as is mentioned in another issue from 3 years ago, it seems like the bible.is website has changed, causing the download part of the script to not work anymore.
I tried to adapt the script to the new website, the URLs can be fixed by simply changing http://listen.bible.is/... to http://live.bible.is/bible/... which can be achieved by changing
datasets-CMU_Wilderness/bin/do_found
Line 1270 in a2102dc
| $this get_from_bibleis `cat $languageid/starturl` |
to
url_id=`cat $languageid/starturl | cut -c 24-`
$this get_from_bibleis `echo 'http://live.bible.is/bible/'$url_id`
The parsing of the webpage however to get the languageid, languagename and most importantly audioUrl is also broken and I am unable to fix it. When inspecting the website, there is still an audioplayer with a source url, which can be downloaded, but I don't know how to parse this and all of the other needed bits out of the html response.
It would be really unfortunate to let all the amazing work on the alignments go to waste, just because the download changed, is there any chance this gets updated?