xarantolus
94255fa0b1
[youtube] More general title extraction
...
Seems like this attribute is moved every few weeks, so we just extract both and use the one that is present.
2020-08-24 14:30:08 +02:00
xarantolus
7d743516b5
[youtube] Make `ytcfg.set` config extraction non-fatal
...
If the markup of the page changes in the future, _FEED_DATA might still work while the other regex does not. Since the latter is not necessary for the first page of videos, we make sure the program doesn't exit before extracting them.
TL;DR: Extract the first video page even if there are problems
2020-08-24 14:29:16 +02:00
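A minimal Python sketch of the non-fatal lookup described in 7d743516b5 above. The helper `search_regex`, the function `extract_feed`, and the `data-video-id` markup are hypothetical stand-ins for illustration, not the extractor's actual code:

```python
import re


def search_regex(pattern, text, name, fatal=True):
    """Return the first capture group; return None on a miss when fatal=False."""
    match = re.search(pattern, text)
    if match:
        return match.group(1)
    if fatal:
        raise ValueError('Unable to extract %s' % name)
    return None


def extract_feed(page):
    # The first page of videos only needs the ids embedded in the page itself.
    # (Hypothetical markup, used here only to have something to match.)
    video_ids = re.findall(r'data-video-id="([^"]+)"', page)
    # The config is only needed for further pages, so a miss must not abort.
    config = search_regex(
        r'ytcfg\.set\((\{.*?\})\);', page, 'ytcfg.set config', fatal=False)
    return video_ids, config
```

With this shape, a page whose config markup has changed still yields its first batch of ids; the caller can simply skip pagination when `config` is None.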
xarantolus
a5e386d9fe
Fix python2 compatibility and title extraction
2020-07-31 10:05:11 +02:00
xarantolus
4f37c60bf5
Run formatter
2020-07-30 16:38:56 +02:00
xarantolus
c449f70965
[youtube] Fix feed extraction
...
In order to extract videos from further pages, we need to get various variables that are in an argument to the `ytcfg.set` call in a script on the feed page.
2020-07-30 16:34:48 +02:00
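A rough Python sketch of what c449f70965 above describes: pulling the argument of the `ytcfg.set` call out of the page and parsing it. The regex, the function name `extract_ytcfg`, and the sample keys are assumptions made for illustration; the real page layout and the commit's actual pattern may differ:

```python
import json
import re

# Assumed shape of the call: ytcfg.set({...}); with a JSON object as argument.
YTCFG_RE = re.compile(r'ytcfg\.set\s*\(\s*(\{.+?\})\s*\)\s*;', re.DOTALL)


def extract_ytcfg(page):
    """Return the ytcfg.set argument as a dict, or None if it cannot be parsed."""
    match = YTCFG_RE.search(page)
    if match is None:
        return None
    try:
        return json.loads(match.group(1))
    except ValueError:
        # The lazy match can stop too early (e.g. "});" inside a string value).
        return None


# Made-up sample page; the keys are placeholders, not real configuration names.
SAMPLE_PAGE = '''
<script>
ytcfg.set({"EXAMPLE_API_KEY": "abc123", "EXAMPLE_CLIENT": {"name": "WEB", "version": "2.0"}});
</script>
'''

config = extract_ytcfg(SAMPLE_PAGE)
```

Returning None instead of raising keeps the lookup optional, so callers that only need the first page are unaffected by a failed match.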
Sergey M․
49004a6b59
[youtube] Fix sigfunc name extraction (closes #26134, closes #26135, closes #26136, closes #26137)
2020-07-28 15:34:32 +02:00
MRWITEK
54ffcbb8eb
[youtube] Improve description extraction (closes #25937) (#25980)
2020-07-28 15:34:32 +02:00
Sergey M․
9fa728f4e8
[wistia] Restrict embed regex (closes #25969)
2020-07-28 15:34:31 +02:00
Glenn Slayden
bb2c950b8e
[youtube] Prevent excess HTTP 301 (#25786)
2020-07-28 15:34:31 +02:00
Sergey M․
255f31b5cb
[youtube:playlists] Extend _VALID_URL (closes #25810)
2020-07-28 15:34:31 +02:00
Remita Amine
2bd94127a2
[bellmedia] add support for cp24.com clip URLs (closes #25764)
2020-07-28 15:34:30 +02:00
xarantolus
7fa0a67cc1
Remove unused variable
2020-07-10 11:50:50 +02:00
xarantolus
c37ca4732b
[youtube] Remove download limit
2020-07-10 11:47:13 +02:00
xarantolus
5cbe7563be
[youtube] Return to old feed extraction code as it *seems* like that change was reverted
...
The old code now works again, but it downloads without limit. For that reason a limit of 1000 videos is added. It can be overridden with the `--max-downloads` option, in which case only as many ids are extracted as videos are downloaded.
2020-06-23 11:27:02 +02:00
xarantolus
6a3cc89394
[youtube] Make search extraction less dependent on json schema.
...
If an object looks like a video (it has a `videoId` key), assume that it is.
2020-06-23 08:56:21 +02:00
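The heuristic in 6a3cc89394 above (anything with a `videoId` key is treated as a video) can be sketched in Python as a recursive walk over the parsed JSON. The function names and the sample structure are hypothetical; this is Python 3 for brevity and not the commit's actual code:

```python
def iter_video_ids(node):
    """Yield the videoId of every dict in the tree that carries one."""
    if isinstance(node, dict):
        video_id = node.get('videoId')
        if isinstance(video_id, str):
            yield video_id
        for value in node.values():
            yield from iter_video_ids(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_video_ids(item)


def collect_video_ids(data):
    """Return the ids in first-seen order, without duplicates."""
    seen = set()
    ordered = []
    for video_id in iter_video_ids(data):
        if video_id not in seen:
            seen.add(video_id)
            ordered.append(video_id)
    return ordered


# Made-up response fragment; the key names other than videoId are placeholders.
sample = {
    'contents': [
        {'videoRenderer': {'videoId': 'a1',
                           'nav': {'watchEndpoint': {'videoId': 'a1'}}}},
        {'shelf': {'items': [{'videoId': 'b2'}]}},
        {'channelRenderer': {'channelId': 'c'}},
    ]
}

ids = collect_video_ids(sample)
```

Because the walk ignores where in the tree a `videoId` appears, it keeps working when the surrounding schema is rearranged; the deduplication step covers the case where the same id is repeated in nested objects.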
xarantolus
b3fd4b155e
run flake8
2020-06-21 09:41:42 +02:00
xarantolus
57f72370c5
[youtube] Fix feed extraction
...
This moves feed extraction from using html content to json metadata. However, loading additional pages no longer works.
The _extract_video_info function also returns a continuation object that contains some metadata that - together with an API key that is in the page source - might be used to request the next page.
2020-06-21 09:31:04 +02:00
xarantolus
6dad89289c
[youtube] Move search URL extraction to appropriate extractor
2020-06-19 21:29:47 +02:00
xarantolus
7a74feda78
[youtube] Fix extraction of search urls (closes ytdl-org/youtube-dl#25696)
2020-06-19 14:57:57 +02:00
Sergey M․
2391941f28
[brightcove] Improve embed detection (closes #25674)
2020-06-16 17:38:25 +07:00
Sergey M․
ee0b726cd7
[youtube] Force old layout (closes #25682, closes #25683, closes #25680, closes #25686)
2020-06-16 06:17:53 +07:00
Sergey M․
dbeafce5d5
[youtube] Fix categories and improve tags extraction
2020-06-16 03:13:39 +07:00
Sergey M․
a6211d237b
[youtube] Fix uploader id and uploader URL extraction
2020-06-16 02:43:09 +07:00
Sergey M․
7b16239a49
[youtube] Improve view count extraction
2020-06-16 02:38:45 +07:00
Sergey M․
37357d21a9
[youtube] Fix upload date extraction
2020-06-16 02:37:19 +07:00
Sergey M․
b477fc1314
[youtube] Fix thumbnails extraction and remove uploader id extraction warning (closes #25676)
2020-06-16 02:29:04 +07:00
Sergey M․
d84b21b427
[youtube] Fix playlist and feed extraction (closes #25675)
2020-06-16 02:01:12 +07:00
Philipp Hagemeister
48bd042ce7
[facebook] Support single-video ID links
...
I stumbled upon this at https://www.facebook.com/bwfbadminton/posts/10157127020046316 . No idea how prevalent it is yet.
2020-06-14 13:17:51 +02:00
Sergey M․
84213ea8d4
[youtube] Extract chapters from JSON (closes #24819)
2020-06-06 04:22:10 +07:00
Sergey M․
562de77f41
[kaltura] Add support for multiple embeds on a webpage (closes #25523)
2020-06-06 02:14:35 +07:00
Sergey M․
a5b6102ea8
[tele5] Bypass geo restriction
2020-06-06 01:45:05 +07:00
Sergey M․
b77888228d
[jwplatform] Add support for bypass geo restriction
2020-06-06 01:44:36 +07:00
Sergey M․
0b1eaec3bc
[tele5] Prefer jwplatform over nexx (closes #25533)
2020-06-06 01:35:09 +07:00
Sergey M․
b37e47a3f9
[twitch:stream] Expect 400 and 410 HTTP errors from API
2020-06-06 00:57:40 +07:00
Sergey M․
ce3735df02
[twitch:stream] Fix extraction (closes #25528)
2020-06-06 00:55:29 +07:00
Sergey M․
a0455d0ffd
[twitch] Pass v5 accept header and fix thumbnails extraction (closes #25531)
2020-06-06 00:12:47 +07:00
Sergey M․
c8b232cc48
[brightcove] Sort imports
2020-06-05 23:35:57 +07:00
Sergey M․
b4eb0bc7bd
[brightcove] Fix subtitles extraction (closes #25540)
2020-06-05 23:33:14 +07:00
Matej Dujava
d5147b65ac
[malltv] Add support for sk.mall.tv (#25445)
2020-06-01 21:11:31 +07:00
Sergey M․
7b0b53ea69
[twitter:broadcast] Add untitled periscope broadcast test
2020-06-01 20:32:57 +07:00
Sergey M․
7016e24ebe
[periscope] Fix untitled broadcasts (#25482)
2020-06-01 20:31:51 +07:00
Sergey M․
bef4688c72
[jwplatform] Improve embeds extraction (closes #25467)
2020-05-31 11:10:31 +07:00
Sergey M․
fe515e5c75
[ard:beta] Extend _VALID_URL (closes #25405)
2020-05-29 02:01:51 +07:00
striker.sh
1db5ab6b34
[youtube] Add support for more invidious instances (#25417)
2020-05-27 01:26:45 +07:00
JordanWeatherby
8f841fafcd
[giantbomb] Extend _VALID_URL (#25222)
2020-05-21 04:30:50 +07:00
Michael Klein
a54c5f83c0
[ard] Improve _VALID_URL (closes #25134) (#25198)
2020-05-20 04:08:08 +07:00
Sergey M․
cd13343ad8
[redtube] Improve formats extraction and extract m3u8 formats (closes #25311, closes #25321)
2020-05-20 03:39:41 +07:00
tlsssl
9a269547f2
[indavideo] Switch to HTTPS for API request (#25191)
2020-05-20 02:13:06 +07:00
Dave Loyall
bf097a5077
[redtube] Improve title extraction (#25208)
2020-05-20 02:11:05 +07:00
Remita Amine
52c50a10af
[vimeo] improve format extraction and sorting (closes #25285)
2020-05-15 15:57:06 +01:00