release 2015.09.03

[ok] Sort formats
[ok] Add support for youtube embeds
2026-06-09 22:25:20 +00:00 · 2015-09-03 12:34:20 +02:00 · 2015-09-02 22:09:33 +06:00 · 2015-09-02 22:08:50 +06:00 · 2015-09-02 21:38:56 +06:00 · 2015-09-02 21:36:22 +06:00
40 changed files with 469 additions and 131 deletions
@@ -125,7 +125,7 @@ If you want to add support for a new site, you can follow this quick list (assum
    ```
 5. Add an import in [`youtube_dl/extractor/__init__.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/__init__.py).
 6. Run `python test/test_download.py TestDownload.test_YourExtractor`. This *should fail* at first, but you can continually re-run it until you're done. If you decide to add more than one test, then rename ``_TEST`` to ``_TESTS`` and make it into a list of dictionaries. The tests will be then be named `TestDownload.test_YourExtractor`, `TestDownload.test_YourExtractor_1`, `TestDownload.test_YourExtractor_2`, etc.
-7. Have a look at [`youtube_dl/common/extractor/common.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py) for possible helper methods and a [detailed description of what your extractor should and may return](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L62-L200). Add tests and code for as many as you want.
+7. Have a look at [`youtube_dl/extractor/common.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py) for possible helper methods and a [detailed description of what your extractor should and may return](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L62-L200). Add tests and code for as many as you want.
 8. If you can, check the code with [flake8](https://pypi.python.org/pypi/flake8).
 9. When the tests pass, [add](http://git-scm.com/docs/git-add) the new files and [commit](http://git-scm.com/docs/git-commit) them and [push](http://git-scm.com/docs/git-push) the result, like this:

@@ -34,7 +34,7 @@ You can also use pip:

    sudo pip install youtube-dl

-Alternatively, refer to the developer instructions below for how to check out and work with the git repository. For further options, including PGP signatures, see https://rg3.github.io/youtube-dl/download.html .
+Alternatively, refer to the [developer instructions](#developer-instructions) for how to check out and work with the git repository. For further options, including PGP signatures, see https://rg3.github.io/youtube-dl/download.html .

 # DESCRIPTION
 **youtube-dl** is a small command-line program to download videos from
@@ -207,7 +207,7 @@ which means you can modify it, redistribute it or use it however you like.
    -p, --password PASSWORD          Account password. If this option is left out, youtube-dl will ask interactively.
    -2, --twofactor TWOFACTOR        Two-factor auth code
    -n, --netrc                      Use .netrc authentication data
-    --video-password PASSWORD        Video password (vimeo, smotri)
+    --video-password PASSWORD        Video password (vimeo, smotri, youku)

 ## Post-processing Options:
    -x, --extract-audio              Convert video files to audio-only files (requires ffmpeg or avconv and ffprobe or avprobe)
@@ -552,7 +552,7 @@ If you want to add support for a new site, you can follow this quick list (assum
    ```
 5. Add an import in [`youtube_dl/extractor/__init__.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/__init__.py).
 6. Run `python test/test_download.py TestDownload.test_YourExtractor`. This *should fail* at first, but you can continually re-run it until you're done. If you decide to add more than one test, then rename ``_TEST`` to ``_TESTS`` and make it into a list of dictionaries. The tests will be then be named `TestDownload.test_YourExtractor`, `TestDownload.test_YourExtractor_1`, `TestDownload.test_YourExtractor_2`, etc.
-7. Have a look at [`youtube_dl/common/extractor/common.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py) for possible helper methods and a [detailed description of what your extractor should and may return](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L62-L200). Add tests and code for as many as you want.
+7. Have a look at [`youtube_dl/extractor/common.py`](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py) for possible helper methods and a [detailed description of what your extractor should and may return](https://github.com/rg3/youtube-dl/blob/master/youtube_dl/extractor/common.py#L62-L200). Add tests and code for as many as you want.
 8. If you can, check the code with [flake8](https://pypi.python.org/pypi/flake8).
 9. When the tests pass, [add](http://git-scm.com/docs/git-add) the new files and [commit](http://git-scm.com/docs/git-commit) them and [push](http://git-scm.com/docs/git-push) the result, like this:

@@ -166,7 +166,7 @@
 - **Folketinget**: Folketinget (ft.dk; Danish parliament)
 - **FootyRoom**
 - **Foxgay**
- - **FoxNews**
+ - **FoxNews**: Fox News and Fox Business Video
 - **FoxSports**
 - **france2.fr:generation-quoi**
 - **FranceCulture**
@@ -220,6 +220,7 @@
 - **imdb**: Internet Movie Database trailers
 - **imdb:list**: Internet Movie Database lists
 - **Imgur**
+ - **ImgurAlbum**
 - **Ina**
 - **Indavideo**
 - **IndavideoEmbed**
@@ -303,6 +304,7 @@
 - **MPORA**
 - **MSNBC**
 - **MTV**
+ - **mtv.de**
 - **mtviggy.com**
 - **mtvservices:embedded**
 - **MuenchenTV**: münchen.tv
@@ -465,7 +467,7 @@
 - **Sexu**
 - **SexyKarma**: Sexy Karma and Watch Indian Porn
 - **Shahid**
- - **Shared**
+ - **Shared**: shared.sx and vivo.sx
 - **ShareSix**
 - **Sina**
 - **Slideshare**
@@ -2009,7 +2009,7 @@ class YoutubeDL(object):
                               (info_dict['extractor'], info_dict['id'], thumb_display_id))
                try:
                    uf = self.urlopen(t['url'])
-                    with open(thumb_filename, 'wb') as thumbf:
+                    with open(encodeFilename(thumb_filename), 'wb') as thumbf:
                        shutil.copyfileobj(uf, thumbf)
                    self.to_screen('[%s] %s: Writing thumbnail %sto: %s' %
                                   (info_dict['extractor'], info_dict['id'], thumb_display_id, thumb_filename))
@@ -13,6 +13,8 @@ from ..compat import (
    compat_urllib_error,
 )
 from ..utils import (
+    encodeFilename,
+    sanitize_open,
    struct_pack,
    struct_unpack,
    xpath_text,
@@ -343,18 +345,19 @@ class F4mFD(FragmentFD):
                success = ctx['dl'].download(frag_filename, {'url': url})
                if not success:
                    return False
-                with open(frag_filename, 'rb') as down:
-                    down_data = down.read()
-                    reader = FlvReader(down_data)
-                    while True:
-                        _, box_type, box_data = reader.read_box_info()
-                        if box_type == b'mdat':
-                            dest_stream.write(box_data)
-                            break
+                (down, frag_sanitized) = sanitize_open(frag_filename, 'rb')
+                down_data = down.read()
+                down.close()
+                reader = FlvReader(down_data)
+                while True:
+                    _, box_type, box_data = reader.read_box_info()
+                    if box_type == b'mdat':
+                        dest_stream.write(box_data)
+                        break
                if live:
-                    os.remove(frag_filename)
+                    os.remove(encodeFilename(frag_sanitized))
                else:
-                    frags_filenames.append(frag_filename)
+                    frags_filenames.append(frag_sanitized)
            except (compat_urllib_error.HTTPError, ) as err:
                if live and (err.code == 404 or err.code == 410):
                    # We didn't keep up with the live window. Continue
@@ -375,6 +378,6 @@ class F4mFD(FragmentFD):
        self._finish_frag_download(ctx)

        for frag_file in frags_filenames:
-            os.remove(frag_file)
+            os.remove(encodeFilename(frag_file))

        return True
@@ -12,6 +12,7 @@ from ..postprocessor.ffmpeg import FFmpegPostProcessor
 from ..utils import (
    encodeArgument,
    encodeFilename,
+    sanitize_open,
 )


@@ -89,13 +90,13 @@ class NativeHlsFD(FragmentFD):
            success = ctx['dl'].download(frag_filename, {'url': frag_url})
            if not success:
                return False
-            with open(frag_filename, 'rb') as down:
-                ctx['dest_stream'].write(down.read())
-            frags_filenames.append(frag_filename)
+            down, frag_sanitized = sanitize_open(frag_filename, 'rb')
+            ctx['dest_stream'].write(down.read())
+            frags_filenames.append(frag_sanitized)

        self._finish_frag_download(ctx)

        for frag_file in frags_filenames:
-            os.remove(frag_file)
+            os.remove(encodeFilename(frag_file))

        return True
@@ -241,7 +241,10 @@ from .imdb import (
    ImdbIE,
    ImdbListIE
 )
-from .imgur import ImgurIE
+from .imgur import (
+    ImgurIE,
+    ImgurAlbumIE,
+)
 from .ina import InaIE
 from .indavideo import (
    IndavideoIE,
@@ -340,6 +343,7 @@ from .mtv import (
    MTVIE,
    MTVServicesEmbeddedIE,
    MTVIggyIE,
+    MTVDEIE,
 )
 from .muenchentv import MuenchenTVIE
 from .musicplayon import MusicPlayOnIE
@@ -1,16 +1,20 @@
 from __future__ import unicode_literals

 import re
-import json

 from .common import InfoExtractor
+from ..utils import (
+    ExtractorError,
+    js_to_json,
+    int_or_none,
+)


 class ABCIE(InfoExtractor):
    IE_NAME = 'abc.net.au'
    _VALID_URL = r'http://www\.abc\.net\.au/news/[^/]+/[^/]+/(?P<id>\d+)'

-    _TEST = {
+    _TESTS = [{
        'url': 'http://www.abc.net.au/news/2014-11-05/australia-to-staff-ebola-treatment-centre-in-sierra-leone/5868334',
        'md5': 'cb3dd03b18455a661071ee1e28344d9f',
        'info_dict': {
@@ -19,22 +23,47 @@ class ABCIE(InfoExtractor):
            'title': 'Australia to help staff Ebola treatment centre in Sierra Leone',
            'description': 'md5:809ad29c67a05f54eb41f2a105693a67',
        },
-    }
+    }, {
+        'url': 'http://www.abc.net.au/news/2015-08-17/warren-entsch-introduces-same-sex-marriage-bill/6702326',
+        'md5': 'db2a5369238b51f9811ad815b69dc086',
+        'info_dict': {
+            'id': 'NvqvPeNZsHU',
+            'ext': 'mp4',
+            'upload_date': '20150816',
+            'uploader': 'ABC News (Australia)',
+            'description': 'Government backbencher Warren Entsch introduces a cross-party sponsored bill to legalise same-sex marriage, saying the bill is designed to promote "an inclusive Australia, not a divided one.". Read more here: http://ab.co/1Mwc6ef',
+            'uploader_id': 'NewsOnABC',
+            'title': 'Marriage Equality: Warren Entsch introduces same sex marriage bill',
+        },
+        'add_ie': ['Youtube'],
+    }]

    def _real_extract(self, url):
        video_id = self._match_id(url)
        webpage = self._download_webpage(url, video_id)

-        urls_info_json = self._search_regex(
-            r'inlineVideoData\.push\((.*?)\);', webpage, 'video urls',
-            flags=re.DOTALL)
-        urls_info = json.loads(urls_info_json.replace('\'', '"'))
+        mobj = re.search(
+            r'inline(?P<type>Video|YouTube)Data\.push\((?P<json_data>[^)]+)\);',
+            webpage)
+        if mobj is None:
+            raise ExtractorError('Unable to extract video urls')
+
+        urls_info = self._parse_json(
+            mobj.group('json_data'), video_id, transform_source=js_to_json)
+
+        if not isinstance(urls_info, list):
+            urls_info = [urls_info]
+
+        if mobj.group('type') == 'YouTube':
+            return self.playlist_result([
+                self.url_result(url_info['url']) for url_info in urls_info])
+
        formats = [{
            'url': url_info['url'],
-            'width': int(url_info['width']),
-            'height': int(url_info['height']),
-            'tbr': int(url_info['bitrate']),
-            'filesize': int(url_info['filesize']),
+            'width': int_or_none(url_info.get('width')),
+            'height': int_or_none(url_info.get('height')),
+            'tbr': int_or_none(url_info.get('bitrate')),
+            'filesize': int_or_none(url_info.get('filesize')),
        } for url_info in urls_info]
        self._sort_formats(formats)

@@ -6,6 +6,7 @@ import re
 from .common import InfoExtractor
 from ..utils import ExtractorError
 from .bliptv import BlipTVIE
+from .screenwavemedia import ScreenwaveMediaIE


 class CinemassacreIE(InfoExtractor):
@@ -83,10 +84,10 @@ class CinemassacreIE(InfoExtractor):

        playerdata_url = self._search_regex(
            [
-                r'src="(http://(?:player2\.screenwavemedia\.com|player\.screenwavemedia\.com/play)/[a-zA-Z]+\.php\?[^"]*\bid=.+?)"',
-                r'<iframe[^>]+src="((?:https?:)?//(?:[^.]+\.)?youtube\.com/.+?)"',
+                ScreenwaveMediaIE.EMBED_PATTERN,
+                r'<iframe[^>]+src="(?P<url>(?:https?:)?//(?:[^.]+\.)?youtube\.com/.+?)"',
            ],
-            webpage, 'player data URL', default=None)
+            webpage, 'player data URL', default=None, group='url')
        if not playerdata_url:
            playerdata_url = BlipTVIE._extract_url(webpage)
        if not playerdata_url:
@@ -510,6 +510,12 @@ class InfoExtractor(object):
        """Report attempt to log in."""
        self.to_screen('Logging in')

+    @staticmethod
+    def raise_login_required(msg='This video is only available for registered users'):
+        raise ExtractorError(
+            '%s. Use --username and --password or --netrc to provide account credentials.' % msg,
+            expected=True)
+
    # Methods for following #608
    @staticmethod
    def url_result(url, ie=None, video_id=None, video_title=None):
@@ -237,7 +237,9 @@ Format: Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text
            webpage_url = 'http://www.' + mobj.group('url')

        webpage = self._download_webpage(webpage_url, video_id, 'Downloading webpage')
-        note_m = self._html_search_regex(r'<div class="showmedia-trailer-notice">(.+?)</div>', webpage, 'trailer-notice', default='')
+        note_m = self._html_search_regex(
+            r'<div class="showmedia-trailer-notice">(.+?)</div>',
+            webpage, 'trailer-notice', default='')
        if note_m:
            raise ExtractorError(note_m)

@@ -247,6 +249,9 @@ Format: Layer, Start, End, Style, Name, MarginL, MarginR, MarginV, Effect, Text
            if msg.get('type') == 'error':
                raise ExtractorError('crunchyroll returned error: %s' % msg['message_body'], expected=True)

+        if 'To view this, please log in to verify you are 18 or older.' in webpage:
+            self.raise_login_required()
+
        video_title = self._html_search_regex(r'<h1[^>]*>(.+?)</h1>', webpage, 'video_title', flags=re.DOTALL)
        video_title = re.sub(r' {2,}', ' ', video_title)
        video_description = self._html_search_regex(r'"description":"([^"]+)', webpage, 'video_description', default='')
@@ -9,8 +9,8 @@ from ..utils import qualities


 class DumpertIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:www\.)?dumpert\.nl/mediabase/(?P<id>[0-9]+/[0-9a-zA-Z]+)'
-    _TEST = {
+    _VALID_URL = r'https?://(?:www\.)?dumpert\.nl/(?:mediabase|embed)/(?P<id>[0-9]+/[0-9a-zA-Z]+)'
+    _TESTS = [{
        'url': 'http://www.dumpert.nl/mediabase/6646981/951bc60f/',
        'md5': '1b9318d7d5054e7dcb9dc7654f21d643',
        'info_dict': {
@@ -20,11 +20,15 @@ class DumpertIE(InfoExtractor):
            'description': 'Niet schrikken hoor',
            'thumbnail': 're:^https?://.*\.jpg$',
        }
-    }
+    }, {
+        'url': 'http://www.dumpert.nl/embed/6675421/dc440fe7/',
+        'only_matching': True,
+    }]

    def _real_extract(self, url):
        video_id = self._match_id(url)

+        url = 'https://www.dumpert.nl/mediabase/' + video_id
        req = compat_urllib_request.Request(url)
        req.add_header('Cookie', 'nsfw=1; cpc=10')
        webpage = self._download_webpage(req, video_id)
@@ -71,8 +71,7 @@ class EroProfileIE(InfoExtractor):

        m = re.search(r'You must be logged in to view this video\.', webpage)
        if m:
-            raise ExtractorError(
-                'This video requires login. Please specify a username and password and try again.', expected=True)
+            self.raise_login_required('This video requires login')

        video_id = self._search_regex(
            [r"glbUpdViews\s*\('\d*','(\d+)'", r'p/report/video/(\d+)'],
@@ -15,7 +15,7 @@ from ..utils import (


 class FC2IE(InfoExtractor):
-    _VALID_URL = r'^http://video\.fc2\.com/(?:[^/]+/)?content/(?P<id>[^/]+)'
+    _VALID_URL = r'^http://video\.fc2\.com/(?:[^/]+/)*content/(?P<id>[^/]+)'
    IE_NAME = 'fc2'
    _NETRC_MACHINE = 'fc2'
    _TESTS = [{
@@ -37,6 +37,9 @@ class FC2IE(InfoExtractor):
            'password': '(snip)',
            'skip': 'requires actual password'
        }
+    }, {
+        'url': 'http://video.fc2.com/en/a/content/20130926eZpARwsF',
+        'only_matching': True,
    }]

    def _login(self):
@@ -80,7 +83,7 @@ class FC2IE(InfoExtractor):

        title = self._og_search_title(webpage)
        thumbnail = self._og_search_thumbnail(webpage)
-        refer = url.replace('/content/', '/a/content/')
+        refer = url.replace('/content/', '/a/content/') if '/a/content/' not in url else url

        mimi = hashlib.md5((video_id + '_gGddgPfeaf_gzyr').encode('utf-8')).hexdigest()

@@ -1,5 +1,7 @@
 from __future__ import unicode_literals

+import re
+
 from .common import InfoExtractor
 from ..utils import (
    parse_iso8601,
@@ -8,7 +10,8 @@ from ..utils import (


 class FoxNewsIE(InfoExtractor):
-    _VALID_URL = r'https?://video\.foxnews\.com/v/(?:video-embed\.html\?video_id=)?(?P<id>\d+)'
+    IE_DESC = 'Fox News and Fox Business Video'
+    _VALID_URL = r'https?://(?P<host>video\.fox(?:news|business)\.com)/v/(?:video-embed\.html\?video_id=)?(?P<id>\d+)'
    _TESTS = [
        {
            'url': 'http://video.foxnews.com/v/3937480/frozen-in-time/#sp=show-clips',
@@ -42,13 +45,19 @@ class FoxNewsIE(InfoExtractor):
            'url': 'http://video.foxnews.com/v/video-embed.html?video_id=3937480&d=video.foxnews.com',
            'only_matching': True,
        },
+        {
+            'url': 'http://video.foxbusiness.com/v/4442309889001',
+            'only_matching': True,
+        },
    ]

    def _real_extract(self, url):
-        video_id = self._match_id(url)
+        mobj = re.match(self._VALID_URL, url)
+        video_id = mobj.group('id')
+        host = mobj.group('host')

        video = self._download_json(
-            'http://video.foxnews.com/v/feed/video/%s.js?template=fox' % video_id, video_id)
+            'http://%s/v/feed/video/%s.js?template=fox' % (host, video_id), video_id)

        item = video['channel']['item']
        title = item['title']
@@ -78,9 +78,14 @@ class FranceTVBaseInfoExtractor(InfoExtractor):
                })
        self._sort_formats(formats)

+        title = info['titre']
+        subtitle = info.get('sous_titre')
+        if subtitle:
+            title += ' - %s' % subtitle
+
        return {
            'id': video_id,
-            'title': info['titre'],
+            'title': title,
            'description': clean_html(info['synopsis']),
            'thumbnail': compat_urlparse.urljoin('http://pluzz.francetv.fr', info['image']),
            'duration': int_or_none(info.get('real_duration')) or parse_duration(info['duree']),
@@ -214,15 +219,15 @@ class FranceTVIE(FranceTVBaseInfoExtractor):
        },
        # france5
        {
-            'url': 'http://www.france5.fr/emissions/c-a-dire/videos/92837968',
-            'md5': '78f0f4064f9074438e660785bbf2c5d9',
+            'url': 'http://www.france5.fr/emissions/c-a-dire/videos/quels_sont_les_enjeux_de_cette_rentree_politique__31-08-2015_908948?onglet=tous&page=1',
+            'md5': 'f6c577df3806e26471b3d21631241fd0',
            'info_dict': {
-                'id': '108961659',
+                'id': '123327454',
                'ext': 'flv',
-                'title': 'C à dire ?!',
-                'description': 'md5:1a4aeab476eb657bf57c4ff122129f81',
-                'upload_date': '20140915',
-                'timestamp': 1410795000,
+                'title': 'C à dire ?! - Quels sont les enjeux de cette rentrée politique ?',
+                'description': 'md5:4a0d5cb5dce89d353522a84462bae5a4',
+                'upload_date': '20150831',
+                'timestamp': 1441035120,
            },
        },
        # franceo
@@ -48,6 +48,7 @@ from .vimeo import VimeoIE
 from .dailymotion import DailymotionCloudIE
 from .onionstudios import OnionStudiosIE
 from .snagfilms import SnagFilmsEmbedIE
+from .screenwavemedia import ScreenwaveMediaIE


 class GenericIE(InfoExtractor):
@@ -1001,6 +1002,16 @@ class GenericIE(InfoExtractor):
                'description': 'New experience with Acrobat DC',
                'duration': 248.667,
            },
+        },
+        # ScreenwaveMedia embed
+        {
+            'url': 'http://www.thecinemasnob.com/the-cinema-snob/a-nightmare-on-elm-street-2-freddys-revenge1',
+            'md5': '24ace5baba0d35d55c6810b51f34e9e0',
+            'info_dict': {
+                'id': 'cinemasnob-55d26273809dd',
+                'ext': 'mp4',
+                'title': 'cinemasnob',
+            },
        }
    ]

@@ -1718,6 +1729,11 @@ class GenericIE(InfoExtractor):
        if snagfilms_url:
            return self.url_result(snagfilms_url)

+        # Look for ScreenwaveMedia embeds
+        mobj = re.search(ScreenwaveMediaIE.EMBED_PATTERN, webpage)
+        if mobj is not None:
+            return self.url_result(unescapeHTML(mobj.group('url')), 'ScreenwaveMedia')
+
        # Look for AdobeTVVideo embeds
        mobj = re.search(
            r'<iframe[^>]+src=[\'"]((?:https?:)?//video\.tv\.adobe\.com/v/\d+[^"]+)[\'"]',
@@ -13,6 +13,7 @@ from ..compat import (
 from ..utils import (
    ExtractorError,
    float_or_none,
+    int_or_none,
 )


@@ -359,13 +360,8 @@ class GloboIE(InfoExtractor):
            self._API_URL_TEMPLATE % video_id, video_id)['videos'][0]

        title = video['title']
-        duration = float_or_none(video['duration'], 1000)
-        like_count = video['likes']
-        uploader = video['channel']
-        uploader_id = video['channel_id']

        formats = []
-
        for resource in video['resources']:
            resource_id = resource.get('_id')
            if not resource_id:
@@ -407,6 +403,11 @@ class GloboIE(InfoExtractor):

        self._sort_formats(formats)

+        duration = float_or_none(video.get('duration'), 1000)
+        like_count = int_or_none(video.get('likes'))
+        uploader = video.get('channel')
+        uploader_id = video.get('channel_id')
+
        return {
            'id': video_id,
            'title': title,
@@ -13,7 +13,7 @@ from ..utils import (


 class ImgurIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:i\.)?imgur\.com/(?P<id>[a-zA-Z0-9]+)'
+    _VALID_URL = r'https?://(?:i\.)?imgur\.com/(?!gallery)(?P<id>[a-zA-Z0-9]+)'

    _TESTS = [{
        'url': 'https://i.imgur.com/A61SaA1.gifv',
@@ -97,3 +97,28 @@ class ImgurIE(InfoExtractor):
            'description': self._og_search_description(webpage),
            'title': self._og_search_title(webpage),
        }
+
+
+class ImgurAlbumIE(InfoExtractor):
+    _VALID_URL = r'https?://(?:i\.)?imgur\.com/gallery/(?P<id>[a-zA-Z0-9]+)'
+
+    _TEST = {
+        'url': 'http://imgur.com/gallery/Q95ko',
+        'info_dict': {
+            'id': 'Q95ko',
+        },
+        'playlist_count': 25,
+    }
+
+    def _real_extract(self, url):
+        album_id = self._match_id(url)
+
+        album_images = self._download_json(
+            'http://imgur.com/gallery/%s/album_images/hit.json?all=true' % album_id,
+            album_id)['data']['images']
+
+        entries = [
+            self.url_result('http://imgur.com/%s' % image['hash'])
+            for image in album_images if image.get('hash')]
+
+        return self.playlist_result(entries, album_id)
@@ -13,12 +13,24 @@ from ..utils import (

 class KalturaIE(InfoExtractor):
    _VALID_URL = r'''(?x)
-    (?:kaltura:|
-       https?://(:?(?:www|cdnapisec)\.)?kaltura\.com/index\.php/kwidget/(?:[^/]+/)*?wid/_
-    )(?P<partner_id>\d+)
-    (?::|
-       /(?:[^/]+/)*?entry_id/
-    )(?P<id>[0-9a-z_]+)'''
+                (?:
+                    kaltura:(?P<partner_id_s>\d+):(?P<id_s>[0-9a-z_]+)|
+                    https?://
+                        (:?(?:www|cdnapisec)\.)?kaltura\.com/
+                        (?:
+                            (?:
+                                # flash player
+                                index\.php/kwidget/
+                                (?:[^/]+/)*?wid/_(?P<partner_id>\d+)/
+                                (?:[^/]+/)*?entry_id/(?P<id>[0-9a-z_]+)|
+                                # html5 player
+                                html5/html5lib/
+                                (?:[^/]+/)*?entry_id/(?P<id_html5>[0-9a-z_]+)
+                                .*\?.*\bwid=_(?P<partner_id_html5>\d+)
+                            )
+                        )
+                )
+                '''
    _API_BASE = 'http://cdnapi.kaltura.com/api_v3/index.php?'
    _TESTS = [
        {
@@ -43,6 +55,10 @@ class KalturaIE(InfoExtractor):
            'url': 'https://cdnapisec.kaltura.com/index.php/kwidget/wid/_557781/uiconf_id/22845202/entry_id/1_plr1syf3',
            'only_matching': True,
        },
+        {
+            'url': 'https://cdnapisec.kaltura.com/html5/html5lib/v2.30.2/mwEmbedFrame.php/p/1337/uiconf_id/20540612/entry_id/1_sf5ovm7u?wid=_243342',
+            'only_matching': True,
+        }
    ]

    def _kaltura_api_call(self, video_id, actions, *args, **kwargs):
@@ -105,9 +121,9 @@ class KalturaIE(InfoExtractor):
            video_id, actions, note='Downloading video info JSON')

    def _real_extract(self, url):
-        video_id = self._match_id(url)
        mobj = re.match(self._VALID_URL, url)
-        partner_id, entry_id = mobj.group('partner_id'), mobj.group('id')
+        partner_id = mobj.group('partner_id_s') or mobj.group('partner_id') or mobj.group('partner_id_html5')
+        entry_id = mobj.group('id_s') or mobj.group('id') or mobj.group('id_html5')

        info, source_data = self._get_video_info(entry_id, partner_id)

@@ -126,7 +142,7 @@ class KalturaIE(InfoExtractor):
        self._sort_formats(formats)

        return {
-            'id': video_id,
+            'id': entry_id,
            'title': info['name'],
            'formats': formats,
            'description': info.get('description'),
@@ -25,6 +25,9 @@ class KrasViewIE(InfoExtractor):
            'duration': 27,
            'thumbnail': 're:^https?://.*\.jpg',
        },
+        'params': {
+            'skip_download': 'Not accessible from Travis CI server',
+        },
    }

    def _real_extract(self, url):
@@ -118,9 +118,7 @@ class LyndaIE(LyndaBaseIE):
                'lynda returned error: %s' % video_json['Message'], expected=True)

        if video_json['HasAccess'] is False:
-            raise ExtractorError(
-                'Video %s is only available for members. '
-                % video_id + self._ACCOUNT_CREDENTIALS_HINT, expected=True)
+            self.raise_login_required('Video %s is only available for members' % video_id)

        video_id = compat_str(video_json['ID'])
        duration = video_json['DurationInSeconds']
@@ -25,6 +25,7 @@ class MailRuIE(InfoExtractor):
                'uploader_id': 'sonypicturesrus@mail.ru',
                'duration': 184,
            },
+            'skip': 'Not accessible from Travis CI server',
        },
        {
            'url': 'http://my.mail.ru/corp/hitech/video/news_hi-tech_mail_ru/1263.html',
@@ -39,6 +40,7 @@ class MailRuIE(InfoExtractor):
                'uploader_id': 'hitech@corp.mail.ru',
                'duration': 245,
            },
+            'skip': 'Not accessible from Travis CI server',
        },
    ]

@@ -67,7 +67,7 @@ class MTVServicesInfoExtractor(InfoExtractor):
        return [{'url': url, 'ext': 'mp4'}]

    def _extract_video_formats(self, mdoc, mtvn_id):
-        if re.match(r'.*/(error_country_block\.swf|geoblock\.mp4)$', mdoc.find('.//src').text) is not None:
+        if re.match(r'.*/(error_country_block\.swf|geoblock\.mp4|copyright_error\.flv(?:\?geo\b.+?)?)$', mdoc.find('.//src').text) is not None:
            if mtvn_id is not None and self._MOBILE_TEMPLATE is not None:
                self.to_screen('The normal version is not available from your '
                               'country, trying with the mobile version')
@@ -114,7 +114,8 @@ class MTVServicesInfoExtractor(InfoExtractor):
        # Remove the templates, like &device={device}
        mediagen_url = re.sub(r'&[^=]*?={.*?}(?=(&|$))', '', mediagen_url)
        if 'acceptMethods' not in mediagen_url:
-            mediagen_url += '&acceptMethods=fms'
+            mediagen_url += '&' if '?' in mediagen_url else '?'
+            mediagen_url += 'acceptMethods=fms'

        mediagen_doc = self._download_xml(mediagen_url, video_id,
                                          'Downloading video urls')
@@ -141,7 +142,7 @@ class MTVServicesInfoExtractor(InfoExtractor):
        if title_el is None:
            title_el = itemdoc.find('.//{http://search.yahoo.com/mrss/}title')
        if title_el is None:
-            title_el = itemdoc.find('.//title')
+            title_el = itemdoc.find('.//title') or itemdoc.find('./title')
            if title_el.text is None:
                title_el = None

@@ -174,8 +175,11 @@ class MTVServicesInfoExtractor(InfoExtractor):
        if self._LANG:
            info_url += 'lang=%s&' % self._LANG
        info_url += data
+        return self._get_videos_info_from_url(info_url, video_id)
+
+    def _get_videos_info_from_url(self, url, video_id):
        idoc = self._download_xml(
-            info_url, video_id,
+            url, video_id,
            'Downloading info', transform_source=fix_xml_ampersands)
        return self.playlist_result(
            [self._get_video_info(item) for item in idoc.findall('.//item')])
@@ -288,3 +292,65 @@ class MTVIggyIE(MTVServicesInfoExtractor):
        }
    }
    _FEED_URL = 'http://all.mtvworldverticals.com/feed-xml/'
+
+
+class MTVDEIE(MTVServicesInfoExtractor):
+    IE_NAME = 'mtv.de'
+    _VALID_URL = r'https?://(?:www\.)?mtv\.de/(?:artists|shows|news)/(?:[^/]+/)*(?P<id>\d+)-[^/#?]+/*(?:[#?].*)?$'
+    _TESTS = [{
+        'url': 'http://www.mtv.de/artists/10571-cro/videos/61131-traum',
+        'info_dict': {
+            'id': 'music_video-a50bc5f0b3aa4b3190aa',
+            'ext': 'mp4',
+            'title': 'MusicVideo_cro-traum',
+            'description': 'Cro - Traum',
+        },
+        'params': {
+            # rtmp download
+            'skip_download': True,
+        },
+    }, {
+        # mediagen URL without query (e.g. http://videos.mtvnn.com/mediagen/e865da714c166d18d6f80893195fcb97)
+        'url': 'http://www.mtv.de/shows/933-teen-mom-2/staffeln/5353/folgen/63565-enthullungen',
+        'info_dict': {
+            'id': 'local_playlist-f5ae778b9832cc837189',
+            'ext': 'mp4',
+            'title': 'Episode_teen-mom-2_shows_season-5_episode-1_full-episode_part1',
+        },
+        'params': {
+            # rtmp download
+            'skip_download': True,
+        },
+    }, {
+        # single video in pagePlaylist with different id
+        'url': 'http://www.mtv.de/news/77491-mtv-movies-spotlight-pixels-teil-3',
+        'info_dict': {
+            'id': 'local_playlist-4e760566473c4c8c5344',
+            'ext': 'mp4',
+            'title': 'Article_mtv-movies-spotlight-pixels-teil-3_short-clips_part1',
+            'description': 'MTV Movies Supercut',
+        },
+        'params': {
+            # rtmp download
+            'skip_download': True,
+        },
+    }]
+
+    def _real_extract(self, url):
+        video_id = self._match_id(url)
+
+        webpage = self._download_webpage(url, video_id)
+
+        playlist = self._parse_json(
+            self._search_regex(
+                r'window\.pagePlaylist\s*=\s*(\[.+?\]);\n', webpage, 'page playlist'),
+            video_id)
+
+        # news pages contain single video in playlist with different id
+        if len(playlist) == 1:
+            return self._get_videos_info_from_url(playlist[0]['mrss'], video_id)
+
+        for item in playlist:
+            item_id = item.get('id')
+            if item_id and compat_str(item_id) == video_id:
+                return self._get_videos_info_from_url(item['mrss'], video_id)
@@ -130,10 +130,16 @@ class NowTVIE(InfoExtractor):
    }, {
        'url': 'http://www.nowtv.at/rtl/bauer-sucht-frau/die-neuen-bauern-und-eine-hochzeit/preview?return=/rtl/bauer-sucht-frau/die-neuen-bauern-und-eine-hochzeit',
        'only_matching': True,
+    }, {
+        'url': 'http://www.nowtv.de/rtl2/echtzeit/list/aktuell/schnelles-geld-am-ende-der-welt/player',
+        'only_matching': True,
    }]

    def _real_extract(self, url):
        display_id = self._match_id(url)
+        display_id_split = display_id.split('/')
+        if len(display_id) > 2:
+            display_id = '/'.join((display_id_split[0], display_id_split[-1]))

        info = self._download_json(
            'https://api.nowtv.de/v3/movies/%s?fields=id,title,free,geoblocked,articleLong,articleShort,broadcastStartDate,seoUrl,duration,format,files' % display_id,
@@ -12,7 +12,7 @@ from ..utils import (


 class OdnoklassnikiIE(InfoExtractor):
-    _VALID_URL = r'https?://(?:odnoklassniki|ok)\.ru/(?:video|web-api/video/moviePlayer)/(?P<id>[\d-]+)'
+    _VALID_URL = r'https?://(?:www\.)?(?:odnoklassniki|ok)\.ru/(?:video|web-api/video/moviePlayer)/(?P<id>[\d-]+)'
    _TESTS = [{
        # metadata in JSON
        'url': 'http://ok.ru/video/20079905452',
@@ -43,9 +43,27 @@ class OdnoklassnikiIE(InfoExtractor):
            'like_count': int,
            'age_limit': 0,
        },
+    }, {
+        # YouTube embed (metadataUrl, provider == USER_YOUTUBE)
+        'url': 'http://ok.ru/video/64211978996595-1',
+        'md5': '5d7475d428845cd2e13bae6f1a992278',
+        'info_dict': {
+            'id': '64211978996595-1',
+            'ext': 'mp4',
+            'title': 'Космическая среда от 26 августа 2015',
+            'description': 'md5:848eb8b85e5e3471a3a803dae1343ed0',
+            'duration': 440,
+            'upload_date': '20150826',
+            'uploader_id': '750099571',
+            'uploader': 'Алина П',
+            'age_limit': 0,
+        },
    }, {
        'url': 'http://ok.ru/web-api/video/moviePlayer/20079905452',
        'only_matching': True,
+    }, {
+        'url': 'http://www.ok.ru/video/20648036891',
+        'only_matching': True,
    }]

    def _real_extract(self, url):
@@ -56,7 +74,8 @@ class OdnoklassnikiIE(InfoExtractor):

        player = self._parse_json(
            unescapeHTML(self._search_regex(
-                r'data-attributes="([^"]+)"', webpage, 'player')),
+                r'data-options=(?P<quote>["\'])(?P<player>{.+?%s.+?})(?P=quote)' % video_id,
+                webpage, 'player', group='player')),
            video_id)

        flashvars = player['flashvars']
@@ -89,16 +108,7 @@ class OdnoklassnikiIE(InfoExtractor):

        like_count = int_or_none(metadata.get('likeCount'))

-        quality = qualities(('mobile', 'lowest', 'low', 'sd', 'hd'))
-
-        formats = [{
-            'url': f['url'],
-            'ext': 'mp4',
-            'format_id': f['name'],
-            'quality': quality(f['name']),
-        } for f in metadata['videos']]
-
-        return {
+        info = {
            'id': video_id,
            'title': title,
            'thumbnail': thumbnail,
@@ -108,5 +118,24 @@ class OdnoklassnikiIE(InfoExtractor):
            'uploader_id': uploader_id,
            'like_count': like_count,
            'age_limit': age_limit,
-            'formats': formats,
        }
+
+        if metadata.get('provider') == 'USER_YOUTUBE':
+            info.update({
+                '_type': 'url_transparent',
+                'url': movie['contentId'],
+            })
+            return info
+
+        quality = qualities(('mobile', 'lowest', 'low', 'sd', 'hd'))
+
+        formats = [{
+            'url': f['url'],
+            'ext': 'mp4',
+            'format_id': f['name'],
+            'quality': quality(f['name']),
+        } for f in metadata['videos']]
+        self._sort_formats(formats)
+
+        info['formats'] = formats
+        return info
@@ -41,9 +41,7 @@ class PluralsightIE(InfoExtractor):
    def _login(self):
        (username, password) = self._get_login_info()
        if username is None:
-            raise ExtractorError(
-                'Pluralsight account is required, use --username and --password options to provide account credentials.',
-                expected=True)
+            self.raise_login_required('Pluralsight account is required')

        login_page = self._download_webpage(
            self._LOGIN_URL, None, 'Downloading login page')
@@ -6,7 +6,7 @@ import re
 import time

 from .common import InfoExtractor
-from ..compat import compat_urlparse
+from ..compat import compat_urllib_request, compat_urlparse
 from ..utils import (
    ExtractorError,
    float_or_none,
@@ -102,7 +102,9 @@ class RTVEALaCartaIE(InfoExtractor):
        if info['state'] == 'DESPU':
            raise ExtractorError('The video is no longer available', expected=True)
        png_url = 'http://www.rtve.es/ztnr/movil/thumbnail/%s/videos/%s.png' % (self._manager, video_id)
-        png = self._download_webpage(png_url, video_id, 'Downloading url information')
+        png_request = compat_urllib_request.Request(png_url)
+        png_request.add_header('Referer', url)
+        png = self._download_webpage(png_request, video_id, 'Downloading url information')
        video_url = _decrypt_url(png)
        if not video_url.endswith('.f4m'):
            auth_url = video_url.replace(
@@ -20,7 +20,6 @@ from ..utils import (
 class SafariBaseIE(InfoExtractor):
    _LOGIN_URL = 'https://www.safaribooksonline.com/accounts/login/'
    _SUCCESSFUL_LOGIN_REGEX = r'<a href="/accounts/logout/"[^>]*>Sign Out</a>'
-    _ACCOUNT_CREDENTIALS_HINT = 'Use --username and --password options to supply credentials for safaribooksonline.com'
    _NETRC_MACHINE = 'safari'

    _API_BASE = 'https://www.safaribooksonline.com/api/v1/book'
@@ -37,9 +36,7 @@ class SafariBaseIE(InfoExtractor):
    def _login(self):
        (username, password) = self._get_login_info()
        if username is None:
-            raise ExtractorError(
-                self._ACCOUNT_CREDENTIALS_HINT,
-                expected=True)
+            self.raise_login_required('safaribooksonline.com account is required')

        headers = std_headers
        if 'Referer' not in headers:
@@ -12,8 +12,8 @@ from ..utils import (


 class ScreenwaveMediaIE(InfoExtractor):
-    _VALID_URL = r'http://player\d?\.screenwavemedia\.com/(?:play/)?[a-zA-Z]+\.php\?[^"]*\bid=(?P<id>.+)'
-
+    _VALID_URL = r'https?://player\d?\.screenwavemedia\.com/(?:play/)?[a-zA-Z]+\.php\?.*\bid=(?P<id>[A-Za-z0-9-]+)'
+    EMBED_PATTERN = r'src=(["\'])(?P<url>(?:https?:)?//player\d?\.screenwavemedia\.com/(?:play/)?[a-zA-Z]+\.php\?.*\bid=.+?)\1'
    _TESTS = [{
        'url': 'http://player.screenwavemedia.com/play/play.php?playerdiv=videoarea&companiondiv=squareAd&id=Cinemassacre-19911',
        'only_matching': True,
@@ -33,7 +33,7 @@ class ScreenwaveMediaIE(InfoExtractor):
            'http://player.screenwavemedia.com/player.js',
            video_id, 'Downloading playerconfig webpage')

-        videoserver = self._search_regex(r"\[ipaddress\]\s*=>\s*([\d\.]+)", playerdata, 'videoserver')
+        videoserver = self._search_regex(r'SWMServer\s*=\s*"([\d\.]+)"', playerdata, 'videoserver')

        sources = self._parse_json(
            js_to_json(
@@ -56,6 +56,7 @@ class ScreenwaveMediaIE(InfoExtractor):

        # Fallback to hardcoded sources if JS changes again
        if not sources:
+            self.report_warning('Falling back to a hardcoded list of streams')
            sources = [{
                'file': 'http://%s/vod/%s_%s.mp4' % (videoserver, video_id, format_id),
                'type': 'mp4',
@@ -14,17 +14,28 @@ from ..utils import (


 class SharedIE(InfoExtractor):
-    _VALID_URL = r'http://shared\.sx/(?P<id>[\da-z]{10})'
+    IE_DESC = 'shared.sx and vivo.sx'
+    _VALID_URL = r'http://(?:shared|vivo)\.sx/(?P<id>[\da-z]{10})'

-    _TEST = {
+    _TESTS = [{
        'url': 'http://shared.sx/0060718775',
        'md5': '106fefed92a8a2adb8c98e6a0652f49b',
        'info_dict': {
            'id': '0060718775',
            'ext': 'mp4',
            'title': 'Bmp4',
+            'filesize': 1720110,
        },
-    }
+    }, {
+        'url': 'http://vivo.sx/d7ddda0e78',
+        'md5': '15b3af41be0b4fe01f4df075c2678b2c',
+        'info_dict': {
+            'id': 'd7ddda0e78',
+            'ext': 'mp4',
+            'title': 'Chicken',
+            'filesize': 528031,
+        },
+    }]

    def _real_extract(self, url):
        video_id = self._match_id(url)
@@ -330,10 +330,7 @@ class SmotriBroadcastIE(InfoExtractor):

            (username, password) = self._get_login_info()
            if username is None:
-                raise ExtractorError(
-                    'Erotic broadcasts allowed only for registered users, '
-                    'use --username and --password options to provide account credentials.',
-                    expected=True)
+                self.raise_login_required('Erotic broadcasts allowed only for registered users')

            login_form = {
                'login-hint53': '1',
@@ -60,9 +60,7 @@ class TubiTvIE(InfoExtractor):

        webpage = self._download_webpage(url, video_id)
        if re.search(r"<(?:DIV|div) class='login-required-screen'>", webpage):
-            raise ExtractorError(
-                'This video requires login, use --username and --password '
-                'options to provide account credentials.', expected=True)
+            self.raise_login_required('This video requires login')

        title = self._og_search_title(webpage)
        description = self._og_search_description(webpage)
@@ -70,9 +70,7 @@ class UdemyIE(InfoExtractor):
    def _login(self):
        (username, password) = self._get_login_info()
        if username is None:
-            raise ExtractorError(
-                'Udemy account is required, use --username and --password options to provide account credentials.',
-                expected=True)
+            self.raise_login_required('Udemy account is required')

        login_popup = self._download_webpage(
            self._LOGIN_URL, None, 'Downloading login popup')
@@ -5,7 +5,11 @@ import re
 import hashlib

 from .common import InfoExtractor
-from ..compat import compat_str
+from ..compat import (
+    compat_str,
+    compat_urllib_parse,
+    compat_urllib_request,
+)
 from ..utils import (
    int_or_none,
    float_or_none,
@@ -67,7 +71,7 @@ class YandexMusicPlaylistBaseIE(InfoExtractor):
        return [
            self.url_result(
                'http://music.yandex.ru/album/%s/track/%s' % (track['albums'][0]['id'], track['id']))
-            for track in tracks]
+            for track in tracks if track.get('albums') and isinstance(track.get('albums'), list)]


 class YandexMusicAlbumIE(YandexMusicPlaylistBaseIE):
@@ -106,7 +110,7 @@ class YandexMusicPlaylistIE(YandexMusicPlaylistBaseIE):
    IE_DESC = 'Яндекс.Музыка - Плейлист'
    _VALID_URL = r'https?://music\.yandex\.(?:ru|kz|ua|by)/users/[^/]+/playlists/(?P<id>\d+)'

-    _TEST = {
+    _TESTS = [{
        'url': 'http://music.yandex.ru/users/music.partners/playlists/1245',
        'info_dict': {
            'id': '1245',
@@ -114,19 +118,54 @@ class YandexMusicPlaylistIE(YandexMusicPlaylistBaseIE):
            'description': 'md5:3b9f27b0efbe53f2ee1e844d07155cc9',
        },
        'playlist_count': 6,
-    }
+    }, {
+        # playlist exceeding the limit of 150 tracks shipped with webpage (see
+        # https://github.com/rg3/youtube-dl/issues/6666)
+        'url': 'https://music.yandex.ru/users/ya.playlist/playlists/1036',
+        'info_dict': {
+            'id': '1036',
+            'title': 'Музыка 90-х',
+        },
+        'playlist_count': 310,
+    }]

    def _real_extract(self, url):
        playlist_id = self._match_id(url)

        webpage = self._download_webpage(url, playlist_id)

-        playlist = self._parse_json(
+        mu = self._parse_json(
            self._search_regex(
                r'var\s+Mu\s*=\s*({.+?});\s*</script>', webpage, 'player'),
-            playlist_id)['pageData']['playlist']
+            playlist_id)
+
+        playlist = mu['pageData']['playlist']
+        tracks, track_ids = playlist['tracks'], playlist['trackIds']
+
+        # tracks dictionary shipped with webpage is limited to 150 tracks,
+        # missing tracks should be retrieved manually.
+        if len(tracks) < len(track_ids):
+            present_track_ids = set([compat_str(track['id']) for track in tracks if track.get('id')])
+            missing_track_ids = set(map(compat_str, track_ids)) - set(present_track_ids)
+            request = compat_urllib_request.Request(
+                'https://music.yandex.ru/handlers/track-entries.jsx',
+                compat_urllib_parse.urlencode({
+                    'entries': ','.join(missing_track_ids),
+                    'lang': mu.get('settings', {}).get('lang', 'en'),
+                    'external-domain': 'music.yandex.ru',
+                    'overembed': 'false',
+                    'sign': mu.get('authData', {}).get('user', {}).get('sign'),
+                    'strict': 'true',
+                }).encode('utf-8'))
+            request.add_header('Referer', url)
+            request.add_header('X-Requested-With', 'XMLHttpRequest')
+
+            missing_tracks = self._download_json(
+                request, playlist_id, 'Downloading missing tracks JSON', fatal=False)
+            if missing_tracks:
+                tracks.extend(missing_tracks)

        return self.playlist_result(
-            self._build_playlist(playlist['tracks']),
+            self._build_playlist(tracks),
            compat_str(playlist_id),
            playlist['title'], playlist.get('description'))
@@ -49,6 +49,17 @@ class YoukuIE(InfoExtractor):
        },
        'playlist_count': 13,
        'skip': 'Available in China only',
+    }, {
+        'url': 'http://v.youku.com/v_show/id_XNjA1NzA2Njgw.html',
+        'note': 'Video protected with password',
+        'info_dict': {
+            'id': 'XNjA1NzA2Njgw',
+            'title': '邢義田复旦讲座之想象中的胡人—从“左衽孔子”说起',
+        },
+        'playlist_count': 19,
+        'params': {
+            'videopassword': '100600',
+        },
    }]

    def construct_video_urls(self, data1, data2):
@@ -185,9 +196,15 @@ class YoukuIE(InfoExtractor):
            raw_data = self._download_json(req, video_id, note=note)
            return raw_data['data'][0]

+        video_password = self._downloader.params.get('videopassword', None)
+
        # request basic data
+        basic_data_url = 'http://v.youku.com/player/getPlayList/VideoIDS/%s' % video_id
+        if video_password:
+            basic_data_url += '?password=%s' % video_password
+
        data1 = retrieve_data(
-            'http://v.youku.com/player/getPlayList/VideoIDS/%s' % video_id,
+            basic_data_url,
            'Downloading JSON metadata 1')
        data2 = retrieve_data(
            'http://v.youku.com/player/getPlayList/VideoIDS/%s/Pf/4/ctype/12/ev/1' % video_id,
@@ -660,7 +660,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):

    def _extract_signature_function(self, video_id, player_url, example_sig):
        id_m = re.match(
-            r'.*?-(?P<id>[a-zA-Z0-9_-]+)(?:/watch_as3|/html5player)?\.(?P<ext>[a-z]+)$',
+            r'.*?-(?P<id>[a-zA-Z0-9_-]+)(?:/watch_as3|/html5player(?:-new)?)?\.(?P<ext>[a-z]+)$',
            player_url)
        if not id_m:
            raise ExtractorError('Cannot identify player %r' % player_url)
@@ -1243,7 +1243,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
            encoded_url_map = video_info.get('url_encoded_fmt_stream_map', [''])[0] + ',' + video_info.get('adaptive_fmts', [''])[0]
            if 'rtmpe%3Dyes' in encoded_url_map:
                raise ExtractorError('rtmpe downloads are not supported, see https://github.com/rg3/youtube-dl/issues/343 for more information.', expected=True)
-            url_map = {}
+            formats = []
            for url_data_str in encoded_url_map.split(','):
                url_data = compat_parse_qs(url_data_str)
                if 'itag' not in url_data or 'url' not in url_data:
@@ -1289,7 +1289,7 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
                                player_desc = 'flash player %s' % player_version
                            else:
                                player_version = self._search_regex(
-                                    r'html5player-([^/]+?)(?:/html5player)?\.js',
+                                    r'html5player-([^/]+?)(?:/html5player(?:-new)?)?\.js',
                                    player_url,
                                    'html5 player', fatal=False)
                                player_desc = 'html5 player %s' % player_version
@@ -1303,8 +1303,50 @@ class YoutubeIE(YoutubeBaseInfoExtractor):
                    url += '&signature=' + signature
                if 'ratebypass' not in url:
                    url += '&ratebypass=yes'
-                url_map[format_id] = url
-            formats = _map_to_format_list(url_map)
+
+                # Some itags are not included in DASH manifest thus corresponding formats will
+                # lack metadata (see https://github.com/rg3/youtube-dl/pull/5993).
+                # Trying to extract metadata from url_encoded_fmt_stream_map entry.
+                mobj = re.search(r'^(?P<width>\d+)[xX](?P<height>\d+)$', url_data.get('size', [''])[0])
+                width, height = (int(mobj.group('width')), int(mobj.group('height'))) if mobj else (None, None)
+                dct = {
+                    'format_id': format_id,
+                    'url': url,
+                    'player_url': player_url,
+                    'filesize': int_or_none(url_data.get('clen', [None])[0]),
+                    'tbr': float_or_none(url_data.get('bitrate', [None])[0], 1000),
+                    'width': width,
+                    'height': height,
+                    'fps': int_or_none(url_data.get('fps', [None])[0]),
+                    'format_note': url_data.get('quality_label', [None])[0] or url_data.get('quality', [None])[0],
+                }
+                type_ = url_data.get('type', [None])[0]
+                if type_:
+                    type_split = type_.split(';')
+                    kind_ext = type_split[0].split('/')
+                    if len(kind_ext) == 2:
+                        kind, ext = kind_ext
+                        dct['ext'] = ext
+                        if kind in ('audio', 'video'):
+                            codecs = None
+                            for mobj in re.finditer(
+                                    r'(?P<key>[a-zA-Z_-]+)=(?P<quote>["\']?)(?P<val>.+?)(?P=quote)(?:;|$)', type_):
+                                if mobj.group('key') == 'codecs':
+                                    codecs = mobj.group('val')
+                                    break
+                            if codecs:
+                                codecs = codecs.split(',')
+                                if len(codecs) == 2:
+                                    acodec, vcodec = codecs[0], codecs[1]
+                                else:
+                                    acodec, vcodec = (codecs[0], 'none') if kind == 'audio' else ('none', codecs[0])
+                                dct.update({
+                                    'acodec': acodec,
+                                    'vcodec': vcodec,
+                                })
+                if format_id in self._formats:
+                    dct.update(self._formats[format_id])
+                formats.append(dct)
        elif video_info.get('hlsvp'):
            manifest_url = video_info['hlsvp'][0]
            url_map = self._extract_from_m3u8(manifest_url, video_id)
@@ -320,7 +320,7 @@ def parseOpts(overrideArguments=None):
    authentication.add_option(
        '--video-password',
        dest='videopassword', metavar='PASSWORD',
-        help='Video password (vimeo, smotri)')
+        help='Video password (vimeo, smotri, youku)')

    video_format = optparse.OptionGroup(parser, 'Video Format Options')
    video_format.add_option(
@@ -587,6 +587,11 @@ class ContentTooShortError(Exception):


 def _create_http_connection(ydl_handler, http_class, is_https, *args, **kwargs):
+    # Working around python 2 bug (see http://bugs.python.org/issue17849) by limiting
+    # expected HTTP responses to meet HTTP/1.0 or later (see also
+    # https://github.com/rg3/youtube-dl/issues/6727)
+    if sys.version_info < (3, 0):
+        kwargs['strict'] = True
    hc = http_class(*args, **kwargs)
    source_address = ydl_handler._params.get('source_address')
    if source_address is not None:
@@ -1,3 +1,3 @@
 from __future__ import unicode_literals

-__version__ = '2015.08.23'
+__version__ = '2015.09.03'
Author	SHA1	Message	Date
Philipp Hagemeister	bd6742137f	release 2015.09.03	2015-09-03 12:34:20 +02:00
Sergey M․	e8dcfa3d69	[ok] Sort formats	2015-09-02 22:09:33 +06:00
Sergey M․	88720ed09b	[ok] Add support for youtube embeds	2015-09-02 22:08:50 +06:00
Sergey M․	1e804244d0	[ok] Improve player regex	2015-09-02 21:38:56 +06:00
Sergey M․	198492bbf0	[ok] Extract correct data-options (Closes #6726 )	2015-09-02 21:36:22 +06:00
Sergey M.	8f9d522f62	[README.md] Remove superfluous word	2015-09-02 21:09:31 +06:00
Sergey M.	cbae233aba	Merge pull request #6736 from remitamine/readme [README] link to developer instructions and correct common.py path	2015-09-02 21:08:22 +06:00
Sergey M.	b17ca9c945	Merge pull request #6731 from dstftw/strict-http-response [utils] Strict HTTP responses (Closes #6727)	2015-09-02 20:22:59 +06:00
remitamine	ebf4ca39ba	link to developer instructions and correct common.py path	2015-09-02 01:24:59 +01:00
Sergey M․	e5e78797e6	[utils] Strict HTTP responses (Closes #6727 )	2015-09-02 02:16:04 +06:00
Jaime Marquínez Ferrándiz	080997b808	[rtve] Fix extraction (fixes #6723 ) Adding the 'Referer' header to the png request is enough.	2015-09-01 20:30:58 +02:00
Sergey M․	77306e8b97	[francetv] Update tests (2)	2015-09-01 22:42:43 +06:00
Sergey M․	6917d2a2f0	[francetv] Update tests	2015-09-01 22:39:26 +06:00
Sergey M․	36c15522c1	[francetv] Use subtitle when present (Closes #6715 )	2015-09-01 22:37:42 +06:00
Sergey M․	804c343a4f	[options] Clarify extractors supporting videopassword	2015-09-01 22:30:28 +06:00
Sergey M․	cd5d75427e	[youku] Fix videopassword test	2015-09-01 22:28:03 +06:00
Sergey M․	5ddc127da6	[youku] Simplify (Closes #6717 )	2015-09-01 22:26:17 +06:00
PSJay	f859695b49	Update README Add Youku in video password description.	2015-09-01 22:25:10 +06:00
PSJay	cb3d2eb9e9	Fix coding styple in YouKuIE.	2015-09-01 22:25:03 +06:00
PSJay	33eae08f04	Support video-password on Youku.	2015-09-01 22:24:57 +06:00
Jaime Marquínez Ferrándiz	aa3f98677d	[dumpert] Support 'embed/' urls (fixes #6712 )	2015-09-01 18:13:33 +02:00
Sergey M․	fffccaaf41	[globo] Fix extraction and make more robust (Closes #6728 )	2015-09-01 20:41:52 +06:00
Yen Chi Hsuan	cdc8d0c373	[odnoklassniki] Fix extraction and extend _VALID_URL (closes #6726 )	2015-09-01 16:34:05 +08:00
Sergey M․	39955b0451	[fc2] Fix refer	2015-08-31 21:12:21 +06:00
Sergey M․	52dfb7ffe2	[fc2] Add test for #3171	2015-08-31 21:05:37 +06:00
Sergey M․	93462856e1	[fc2] Relax _VALID_URL (Closes #3171 )	2015-08-31 21:05:23 +06:00
Sergey M․	fcd9e423ec	[hls] Properly encode fragment filename	2015-08-31 02:28:36 +06:00
Sergey M․	db8f2bfd99	[f4m] Add missing imports	2015-08-31 02:20:29 +06:00
Sergey M.	55801fc76e	Merge pull request #5588 from aajanki/encode_frag_filenames [f4m] Fix encode error by sanitizing fragment filenames	2015-08-31 02:18:15 +06:00
Sergey M․	d3d89c3256	[YoutubeDL] Encode thumbnail filename (Closes #6714 )	2015-08-31 02:01:13 +06:00
Sergey M․	8875b3d572	[imgut:album] Add extractor (Closes #6711 )	2015-08-30 19:03:21 +06:00
Sergey M․	aabc2be693	[youtube] Simplify and extract more metadata from url_encoded_fmt_stream_map (Closes #5993 )	2015-08-30 08:07:07 +06:00
Anssi Hannula	c9afb51cea	[youtube] Fix missing format details for 60fps DASH formats 60fps DASH formats do not appear in the DASH manifest, but the non-DASH video info page does contain additional parameters for DASH formats that we can parse. Use those when they exist. This fixes "bestvideo" not selecting 60fps formats over similar 30fps formats just because the file size is unknown.	2015-08-30 08:05:16 +06:00
Yen Chi Hsuan	c0a656876c	[abc] Use int_or_none	2015-08-30 01:11:56 +08:00
Yen Chi Hsuan	17a647630b	[abc] Support YouTube embeds (fixes #6595 )	2015-08-30 01:08:55 +08:00
Sergey M․	c88e118b3c	[nowtv] Fix display_id (Closes #6702 )	2015-08-29 22:41:49 +06:00
Sergey M․	ae6a802106	[kaltura] Fix _VALID_URL formatting	2015-08-29 20:26:51 +06:00
Sergey M․	b184f94413	[kaltura] Clarify html5 player comment	2015-08-29 20:23:06 +06:00
Sergey M․	ee3ec091f4	[kaltura] Strictify _VALID_URL	2015-08-29 20:21:59 +06:00
sourcerect	ef49b59053	[kaltura] add html5 player urls	2015-08-29 19:29:07 +06:00
Sergey M․	1f8125805e	[generic] Add test for screenwavemedia embed	2015-08-29 19:12:38 +06:00
Sergey M․	efd712c69b	[generic] Use screenwavemedia embed pattern	2015-08-29 19:07:31 +06:00
Sergey M․	109a4156e1	[cinemassacre] Use screenwavemedia embed pattern	2015-08-29 19:06:55 +06:00
Sergey M․	678d33295b	[screenwavemedia] Improve _VALID_URL	2015-08-29 19:05:11 +06:00
Sergey M․	5e58956d0a	[screenwavemedia] Add EMBED_PATTERN	2015-08-29 19:00:38 +06:00
felix	e276fd2cb3	[screenwavemedia] warn when falling back to a hardcoded list of streams	2015-08-29 18:38:41 +06:00
felix	9b22cb10c4	[screenwavemedia] use the true JS value instead of a comment	2015-08-29 18:38:34 +06:00
felix	8ca31a0e05	[generic] Look for ScreenwaveMedia embeds	2015-08-29 18:38:23 +06:00
Sergey M․	20149a5da1	[mtvservices] Fix title extraction under python 2	2015-08-28 22:10:49 +06:00
Sergey M․	054d43bb11	[mtvservices] Catch more georestriction errors	2015-08-28 21:45:29 +06:00
Sergey M․	65488b820c	[mtvde] Add support for news	2015-08-28 21:41:54 +06:00
Sergey M․	c3c9f87954	[mtvde] Add test for mediagen URL without query	2015-08-28 21:31:38 +06:00
Sergey M․	56f447be9f	[mtvservices] Fix mediagen URL generation	2015-08-28 21:27:36 +06:00
Sergey M․	79fa9db0da	[mtvde] Simplify (Closes #6673 )	2015-08-28 21:24:54 +06:00
Paul Hartmann	071c10137b	[MTV] move German mtv site to new class	2015-08-28 21:23:00 +06:00
Philipp Hagemeister	a4962b80d6	release 2015.08.28	2015-08-28 05:04:39 +02:00
Sergey M․	5307c33232	[foxnews] Clarify IE_DESC	2015-08-27 21:48:47 +06:00
Sergey M․	1b660cce12	[foxnews] Simplify (Closes #6694 )	2015-08-27 21:48:03 +06:00
nmrugg	8df8c278b6	Added matching test for FoxBusiness.	2015-08-27 21:41:16 +06:00
nmrugg	d7e8264517	Make FoxBusiness work.	2015-08-27 21:41:08 +06:00
Sergey M․	f11c316347	[shared] Add filesize to test	2015-08-26 22:06:10 +06:00
Sergey M․	f62e02c24f	[shared] Add test for vivo	2015-08-26 22:05:45 +06:00
Sergey M․	70113c38c9	[shared] Clarify IE_DESC	2015-08-26 22:04:39 +06:00
Sergey M․	3d8132f5e2	[shared] Extend _VALID_URL to support vivo.sx (Closes #6681 )	2015-08-26 22:03:58 +06:00
Sergey M․	39affb5aa4	[crunchyroll] Fix typo	2015-08-26 21:27:57 +06:00
Sergey M․	a882c5f474	[udemy] Use raise_login_required	2015-08-26 21:27:07 +06:00
Sergey M․	61a7ff1622	[tubitv] Use raise_login_required	2015-08-26 21:26:47 +06:00
Sergey M․	42e7373bd3	[smotri] Use raise_login_required	2015-08-26 21:26:35 +06:00
Sergey M․	e269d3ae7d	[safari] Use raise_login_required	2015-08-26 21:26:24 +06:00
Sergey M․	e7ddaef5bd	[pluralsight] Use raise_login_required	2015-08-26 21:26:09 +06:00
Sergey M․	62984e4584	[lynda] Use raise_login_required	2015-08-26 21:25:53 +06:00
Sergey M․	3c53455d15	[eroprofile] Use raise_login_required	2015-08-26 21:25:37 +06:00
Sergey M․	bbb43a39fd	[crunchyroll] Use raise_login_required	2015-08-26 21:25:04 +06:00
Sergey M․	43e7d3c945	[extractor/common] Add raise_login_required	2015-08-26 21:24:47 +06:00
Sergey M․	2f72e83bbd	[crunchyroll] Detect required login (#6677 )	2015-08-26 20:47:57 +06:00
Sergey M.	57179b4ca1	Merge pull request #6679 from jaimeMF/youtube-fix-player-version-regex [youtube] Adapt player version regex to handle urls ending in '/html5player-new.js'	2015-08-26 20:17:52 +06:00
Jaime Marquínez Ferrándiz	4bc8eec4eb	[youtube] Adapt player version regex to handle urls ending in '/html5player-new.js' It was always extracting 'new' as the version, breaking the cache system.	2015-08-26 15:21:55 +02:00
Sergey M․	baf510bf8c	[yandexmusic:playlist] Handle playlists with more than 150 tracks (Closes #6666 )	2015-08-26 00:11:15 +06:00
Sergey M․	6d53cdd6ce	[yandexmusic] Skip removed tracks (#6666 )	2015-08-25 23:29:02 +06:00
Sergey M․	ebbf078c7d	[krasview] Skip download for test	2015-08-25 21:19:21 +06:00
Sergey M․	95e431e9ec	[mailru] Skip tests	2015-08-25 21:08:38 +06:00
Antti Ajanki	233c1c0e76	[downloader/f4m] Fragment filenames must be sanitized because the fragment was written to a file with a sanitized name by http_dl.download()	2015-05-03 11:31:42 +03:00