Pages like https://www.bbc.co.uk/schedules/p00fzl6p
have SVGs that have their own <title> elements.
This means our grep for titles, grep -oiE '<title.*>.*</title>',
needs to change.
I propose we use this: <title[^>]*>([^<]+)<\/title>
. However, with -o this prints every matching title on its own line, so I then pipe into head -1
to get the first instance.
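
A minimal sketch of the proposed pipeline, run against made-up markup rather than the real BBC page. The sample HTML, the extract_title helper, and the page_title variable are all hypothetical names used only for illustration. It also assumes the page's own <title> appears before any SVG <title>, which is what makes head -1 pick the right one. The slash is written unescaped here because grep does not need it escaped.

```shell
# Hypothetical single-line markup: a page <title> followed by an SVG <title>.
html='<html><head><title>Schedules</title></head><body><svg><title>Search icon</title></svg></body></html>'

extract_title() {
  # [^>]* and [^<]+ stop each match at the nearest tag boundary, so every
  # <title> element is matched separately instead of as one greedy span.
  # -o prints each match on its own line; head -1 keeps only the first.
  grep -oiE '<title[^>]*>([^<]+)</title>' | head -1
}

page_title=$(printf '%s\n' "$html" | extract_title)
echo "$page_title"
```

Note that grep -o prints the whole match, including the surrounding tags, not just the captured group, so the tags still need stripping afterwards if only the text is wanted.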