mirror of
https://github.com/soimort/you-get.git
synced 2025-01-25 06:25:03 +03:00
458 lines
20 KiB
Markdown
458 lines
20 KiB
Markdown
# You-Get
|
|
|
|
[![PyPI version](https://img.shields.io/pypi/v/you-get.svg)](https://pypi.python.org/pypi/you-get/)
|
|
[![Build Status](https://travis-ci.org/soimort/you-get.svg)](https://travis-ci.org/soimort/you-get)
|
|
[![Gitter](https://badges.gitter.im/Join%20Chat.svg)](https://gitter.im/soimort/you-get?utm_source=badge&utm_medium=badge&utm_campaign=pr-badge&utm_content=badge)
|
|
|
|
[You-Get](https://you-get.org/) is a tiny command-line utility to download media contents (videos, audios, images) from the Web, in case there is no other handy way to do it.
|
|
|
|
Here's how you use `you-get` to download a video from [this web page](http://www.fsf.org/blogs/rms/20140407-geneva-tedx-talk-free-software-free-society):
|
|
|
|
```console
|
|
$ you-get http://www.fsf.org/blogs/rms/20140407-geneva-tedx-talk-free-software-free-society
|
|
Site: fsf.org
|
|
Title: TEDxGE2014_Stallman05_LQ
|
|
Type: WebM video (video/webm)
|
|
Size: 27.12 MiB (28435804 Bytes)
|
|
|
|
Downloading TEDxGE2014_Stallman05_LQ.webm ...
|
|
100.0% ( 27.1/27.1 MB) ├████████████████████████████████████████┤[1/1] 12 MB/s
|
|
```
|
|
|
|
And here's why you might want to use it:
|
|
|
|
* You enjoyed something on the Internet, and just want to download them for your own pleasure.
|
|
* You watch your favorite videos online from your computer, but you are prohibited from saving them. You feel that you have no control over your own computer. (And it's not how an open Web is supposed to work.)
|
|
* You want to get rid of any closed-source technology or proprietary JavaScript code, and disallow things like Flash running on your computer.
|
|
* You are an adherent of hacker culture and free software.
|
|
|
|
What `you-get` can do for you:
|
|
|
|
* Download videos / audios from popular websites such as YouTube, Youku, Niconico, and a bunch more. (See the [full list of supported sites](#supported-sites))
|
|
* Stream an online video in your media player. No web browser, no more ads.
|
|
* Download images (of interest) by scraping a web page.
|
|
* Download arbitrary non-HTML contents, i.e., binary files.
|
|
|
|
Interested? [Install it](#installation) now and [get started by examples](#getting-started).
|
|
|
|
Are you a Python programmer? Then check out [the source](https://github.com/soimort/you-get) and fork it!
|
|
|
|
![](https://i.imgur.com/GfthFAz.png)
|
|
|
|
## Installation
|
|
|
|
### Prerequisites
|
|
|
|
The following dependencies are required and must be installed separately, unless you are using a pre-built package or chocolatey on Windows:
|
|
|
|
* **[Python 3](https://www.python.org/downloads/)**
|
|
* **[FFmpeg](https://www.ffmpeg.org/)** (strongly recommended) or [Libav](https://libav.org/)
|
|
* (Optional) [RTMPDump](https://rtmpdump.mplayerhq.hu/)
|
|
|
|
### Option 1: Install via pip
|
|
|
|
The official release of `you-get` is distributed on [PyPI](https://pypi.python.org/pypi/you-get), and can be installed easily from a PyPI mirror via the [pip](https://en.wikipedia.org/wiki/Pip_\(package_manager\)) package manager. Note that you must use the Python 3 version of `pip`:
|
|
|
|
$ pip3 install you-get
|
|
|
|
### Option 2: Install via [Antigen](https://github.com/zsh-users/antigen)
|
|
|
|
Add the following line to your `.zshrc`:
|
|
|
|
antigen bundle soimort/you-get
|
|
|
|
### Option 3: Use a pre-built package (Windows only)
|
|
|
|
Download the `exe` (standalone) or `7z` (all dependencies included) from: <https://github.com/soimort/you-get/releases/latest>.
|
|
|
|
### Option 4: Download from GitHub
|
|
|
|
You may either download the [stable](https://github.com/soimort/you-get/archive/master.zip) (identical with the latest release on PyPI) or the [develop](https://github.com/soimort/you-get/archive/develop.zip) (more hotfixes, unstable features) branch of `you-get`. Unzip it, and put the directory containing the `you-get` script into your `PATH`.
|
|
|
|
Alternatively, run
|
|
|
|
```
|
|
$ [sudo] python3 setup.py install
|
|
```
|
|
|
|
Or
|
|
|
|
```
|
|
$ python3 setup.py install --user
|
|
```
|
|
|
|
to install `you-get` to a permanent path.
|
|
|
|
### Option 5: Git clone
|
|
|
|
This is the recommended way for all developers, even if you don't often code in Python.
|
|
|
|
```
|
|
$ git clone git://github.com/soimort/you-get.git
|
|
```
|
|
|
|
Then put the cloned directory into your `PATH`, or run `./setup.py install` to install `you-get` to a permanent path.
|
|
|
|
### Option 6: Using [Chocolatey](https://chocolatey.org/) (Windows only)
|
|
|
|
```
|
|
> choco install you-get
|
|
```
|
|
|
|
### Option 7: Homebrew (Mac only)
|
|
|
|
You can install `you-get` easily via:
|
|
|
|
```
|
|
$ brew install you-get
|
|
```
|
|
|
|
### Shell completion
|
|
|
|
Completion definitions for Bash, Fish and Zsh can be found in [`contrib/completion`](contrib/completion). Please consult your shell's manual for how to take advantage of them.
|
|
|
|
## Upgrading
|
|
|
|
Based on which option you chose to install `you-get`, you may upgrade it via:
|
|
|
|
```
|
|
$ pip3 install --upgrade you-get
|
|
```
|
|
|
|
or download the latest release via:
|
|
|
|
```
|
|
$ you-get https://github.com/soimort/you-get/archive/master.zip
|
|
```
|
|
|
|
or use [chocolatey package manager](https://chocolatey.org):
|
|
|
|
```
|
|
> choco upgrade you-get
|
|
```
|
|
|
|
In order to get the latest ```develop``` branch without messing up the PIP, you can try:
|
|
|
|
```
|
|
$ pip3 install --upgrade git+https://github.com/soimort/you-get@develop
|
|
```
|
|
|
|
## Getting Started
|
|
|
|
### Download a video
|
|
|
|
When you get a video of interest, you might want to use the `--info`/`-i` option to see all available quality and formats:
|
|
|
|
```
|
|
$ you-get -i 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
site: YouTube
|
|
title: Me at the zoo
|
|
streams: # Available quality and codecs
|
|
[ DEFAULT ] _________________________________
|
|
- itag: 43
|
|
container: webm
|
|
quality: medium
|
|
size: 0.5 MiB (564215 bytes)
|
|
# download-with: you-get --itag=43 [URL]
|
|
|
|
- itag: 18
|
|
container: mp4
|
|
quality: medium
|
|
# download-with: you-get --itag=18 [URL]
|
|
|
|
- itag: 5
|
|
container: flv
|
|
quality: small
|
|
# download-with: you-get --itag=5 [URL]
|
|
|
|
- itag: 36
|
|
container: 3gp
|
|
quality: small
|
|
# download-with: you-get --itag=36 [URL]
|
|
|
|
- itag: 17
|
|
container: 3gp
|
|
quality: small
|
|
# download-with: you-get --itag=17 [URL]
|
|
```
|
|
|
|
The format marked with `DEFAULT` is the one you will get by default. If that looks cool to you, download it:
|
|
|
|
```
|
|
$ you-get 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
site: YouTube
|
|
title: Me at the zoo
|
|
stream:
|
|
- itag: 43
|
|
container: webm
|
|
quality: medium
|
|
size: 0.5 MiB (564215 bytes)
|
|
# download-with: you-get --itag=43 [URL]
|
|
|
|
Downloading zoo.webm ...
|
|
100.0% ( 0.5/0.5 MB) ├████████████████████████████████████████┤[1/1] 7 MB/s
|
|
|
|
Saving Me at the zoo.en.srt ...Done.
|
|
```
|
|
|
|
(If a YouTube video has any closed captions, they will be downloaded together with the video file, in SubRip subtitle format.)
|
|
|
|
Or, if you prefer another format (mp4), just use whatever the option `you-get` shows to you:
|
|
|
|
```
|
|
$ you-get --itag=18 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
```
|
|
|
|
**Note:**
|
|
|
|
* At this point, format selection has not been generally implemented for most of our supported sites; in that case, the default format to download is the one with the highest quality.
|
|
* `ffmpeg` is a required dependency, for downloading and joining videos streamed in multiple parts (e.g. on some sites like Youku), and for YouTube videos of 1080p or high resolution.
|
|
* If you don't want `you-get` to join video parts after downloading them, use the `--no-merge`/`-n` option.
|
|
|
|
### Download anything else
|
|
|
|
If you already have the URL of the exact resource you want, you can download it directly with:
|
|
|
|
```
|
|
$ you-get https://stallman.org/rms.jpg
|
|
Site: stallman.org
|
|
Title: rms
|
|
Type: JPEG Image (image/jpeg)
|
|
Size: 0.06 MiB (66482 Bytes)
|
|
|
|
Downloading rms.jpg ...
|
|
100.0% ( 0.1/0.1 MB) ├████████████████████████████████████████┤[1/1] 127 kB/s
|
|
```
|
|
|
|
Otherwise, `you-get` will scrape the web page and try to figure out if there's anything interesting to you:
|
|
|
|
```
|
|
$ you-get http://kopasas.tumblr.com/post/69361932517
|
|
Site: Tumblr.com
|
|
Title: kopasas
|
|
Type: Unknown type (None)
|
|
Size: 0.51 MiB (536583 Bytes)
|
|
|
|
Site: Tumblr.com
|
|
Title: tumblr_mxhg13jx4n1sftq6do1_1280
|
|
Type: Portable Network Graphics (image/png)
|
|
Size: 0.51 MiB (536583 Bytes)
|
|
|
|
Downloading tumblr_mxhg13jx4n1sftq6do1_1280.png ...
|
|
100.0% ( 0.5/0.5 MB) ├████████████████████████████████████████┤[1/1] 22 MB/s
|
|
```
|
|
|
|
**Note:**
|
|
|
|
* This feature is an experimental one and far from perfect. It works best on scraping large-sized images from popular websites like Tumblr and Blogger, but there is really no universal pattern that can apply to any site on the Internet.
|
|
|
|
### Search on Google Videos and download
|
|
|
|
You can pass literally anything to `you-get`. If it isn't a valid URL, `you-get` will do a Google search and download the most relevant video for you. (It might not be exactly the thing you wish to see, but still very likely.)
|
|
|
|
```
|
|
$ you-get "Richard Stallman eats"
|
|
```
|
|
|
|
### Pause and resume a download
|
|
|
|
You may use <kbd>Ctrl</kbd>+<kbd>C</kbd> to interrupt a download.
|
|
|
|
A temporary `.download` file is kept in the output directory. Next time you run `you-get` with the same arguments, the download progress will resume from the last session. In case the file is completely downloaded (the temporary `.download` extension is gone), `you-get` will just skip the download.
|
|
|
|
To enforce re-downloading, use the `--force`/`-f` option. (**Warning:** doing so will overwrite any existing file or temporary file with the same name!)
|
|
|
|
### Set the path and name of downloaded file
|
|
|
|
Use the `--output-dir`/`-o` option to set the path, and `--output-filename`/`-O` to set the name of the downloaded file:
|
|
|
|
```
|
|
$ you-get -o ~/Videos -O zoo.webm 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
```
|
|
|
|
**Tips:**
|
|
|
|
* These options are helpful if you encounter problems with the default video titles, which may contain special characters that do not play well with your current shell / operating system / filesystem.
|
|
* These options are also helpful if you write a script to batch download files and put them into designated folders with designated names.
|
|
|
|
### Proxy settings
|
|
|
|
You may specify an HTTP proxy for `you-get` to use, via the `--http-proxy`/`-x` option:
|
|
|
|
```
|
|
$ you-get -x 127.0.0.1:8087 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
```
|
|
|
|
However, the system proxy setting (i.e. the environment variable `http_proxy`) is applied by default. To disable any proxy, use the `--no-proxy` option.
|
|
|
|
**Tips:**
|
|
|
|
* If you need to use proxies a lot (in case your network is blocking certain sites), you might want to use `you-get` with [proxychains](https://github.com/rofl0r/proxychains-ng) and set `alias you-get="proxychains -q you-get"` (in Bash).
|
|
* For some websites (e.g. Youku), if you need access to some videos that are only available in mainland China, there is an option of using a specific proxy to extract video information from the site: `--extractor-proxy`/`-y`.
|
|
|
|
### Watch a video
|
|
|
|
Use the `--player`/`-p` option to feed the video into your media player of choice, e.g. `mplayer` or `vlc`, instead of downloading it:
|
|
|
|
```
|
|
$ you-get -p vlc 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
```
|
|
|
|
Or, if you prefer to watch the video in a browser, just without ads or comment section:
|
|
|
|
```
|
|
$ you-get -p chromium 'https://www.youtube.com/watch?v=jNQXAC9IVRw'
|
|
```
|
|
|
|
**Tips:**
|
|
|
|
* It is possible to use the `-p` option to start another download manager, e.g., `you-get -p uget-gtk 'https://www.youtube.com/watch?v=jNQXAC9IVRw'`, though they may not play together very well.
|
|
|
|
### Load cookies
|
|
|
|
Not all videos are publicly available to anyone. If you need to log in your account to access something (e.g., a private video), it would be unavoidable to feed the browser cookies to `you-get` via the `--cookies`/`-c` option.
|
|
|
|
**Note:**
|
|
|
|
* As of now, we are supporting two formats of browser cookies: Mozilla `cookies.sqlite` and Netscape `cookies.txt`.
|
|
|
|
### Reuse extracted data
|
|
|
|
Use `--url`/`-u` to get a list of downloadable resource URLs extracted from the page. Use `--json` to get an abstract of extracted data in the JSON format.
|
|
|
|
**Warning:**
|
|
|
|
* For the time being, this feature has **NOT** been stabilized and the JSON schema may have breaking changes in the future.
|
|
|
|
## Supported Sites
|
|
|
|
| Site | URL | Videos? | Images? | Audios? |
|
|
| :--: | :-- | :-----: | :-----: | :-----: |
|
|
| **YouTube** | <https://www.youtube.com/> |✓| | |
|
|
| **Twitter** | <https://twitter.com/> |✓|✓| |
|
|
| VK | <http://vk.com/> |✓|✓| |
|
|
| Vine | <https://vine.co/> |✓| | |
|
|
| Vimeo | <https://vimeo.com/> |✓| | |
|
|
| Vidto | <http://vidto.me/> |✓| | |
|
|
| Videomega | <http://videomega.tv/> |✓| | |
|
|
| Veoh | <http://www.veoh.com/> |✓| | |
|
|
| **Tumblr** | <https://www.tumblr.com/> |✓|✓|✓|
|
|
| TED | <http://www.ted.com/> |✓| | |
|
|
| SoundCloud | <https://soundcloud.com/> | | |✓|
|
|
| SHOWROOM | <https://www.showroom-live.com/> |✓| | |
|
|
| Pinterest | <https://www.pinterest.com/> | |✓| |
|
|
| MusicPlayOn | <http://en.musicplayon.com/> |✓| | |
|
|
| MTV81 | <http://www.mtv81.com/> |✓| | |
|
|
| Mixcloud | <https://www.mixcloud.com/> | | |✓|
|
|
| Metacafe | <http://www.metacafe.com/> |✓| | |
|
|
| Magisto | <http://www.magisto.com/> |✓| | |
|
|
| Khan Academy | <https://www.khanacademy.org/> |✓| | |
|
|
| JPopsuki TV | <http://www.jpopsuki.tv/> |✓| | |
|
|
| Internet Archive | <https://archive.org/> |✓| | |
|
|
| **Instagram** | <https://instagram.com/> |✓|✓| |
|
|
| InfoQ | <http://www.infoq.com/presentations/> |✓| | |
|
|
| Imgur | <http://imgur.com/> | |✓| |
|
|
| Heavy Music Archive | <http://www.heavy-music.ru/> | | |✓|
|
|
| **Google+** | <https://plus.google.com/> |✓|✓| |
|
|
| Freesound | <http://www.freesound.org/> | | |✓|
|
|
| Flickr | <https://www.flickr.com/> |✓|✓| |
|
|
| FC2 Video | <http://video.fc2.com/> |✓| | |
|
|
| Facebook | <https://www.facebook.com/> |✓| | |
|
|
| eHow | <http://www.ehow.com/> |✓| | |
|
|
| Dailymotion | <http://www.dailymotion.com/> |✓| | |
|
|
| CBS | <http://www.cbs.com/> |✓| | |
|
|
| Bandcamp | <http://bandcamp.com/> | | |✓|
|
|
| AliveThai | <http://alive.in.th/> |✓| | |
|
|
| interest.me | <http://ch.interest.me/tvn> |✓| | |
|
|
| **755<br/>ナナゴーゴー** | <http://7gogo.jp/> |✓|✓| |
|
|
| **niconico<br/>ニコニコ動画** | <http://www.nicovideo.jp/> |✓| | |
|
|
| **163<br/>网易视频<br/>网易云音乐** | <http://v.163.com/><br/><http://music.163.com/> |✓| |✓|
|
|
| 56网 | <http://www.56.com/> |✓| | |
|
|
| **AcFun** | <http://www.acfun.tv/> |✓| | |
|
|
| **Baidu<br/>百度贴吧** | <http://tieba.baidu.com/> |✓|✓| |
|
|
| 爆米花网 | <http://www.baomihua.com/> |✓| | |
|
|
| **bilibili<br/>哔哩哔哩** | <http://www.bilibili.com/> |✓| | |
|
|
| Dilidili | <http://www.dilidili.com/> |✓| | |
|
|
| 豆瓣 | <http://www.douban.com/> |✓| |✓|
|
|
| 斗鱼 | <http://www.douyutv.com/> |✓| | |
|
|
| Panda<br/>熊猫 | <http://www.panda.tv/> |✓| | |
|
|
| 凤凰视频 | <http://v.ifeng.com/> |✓| | |
|
|
| 风行网 | <http://www.fun.tv/> |✓| | |
|
|
| iQIYI<br/>爱奇艺 | <http://www.iqiyi.com/> |✓| | |
|
|
| 激动网 | <http://www.joy.cn/> |✓| | |
|
|
| 酷6网 | <http://www.ku6.com/> |✓| | |
|
|
| 酷狗音乐 | <http://www.kugou.com/> | | |✓|
|
|
| 酷我音乐 | <http://www.kuwo.cn/> | | |✓|
|
|
| 乐视网 | <http://www.le.com/> |✓| | |
|
|
| 荔枝FM | <http://www.lizhi.fm/> | | |✓|
|
|
| 秒拍 | <http://www.miaopai.com/> |✓| | |
|
|
| MioMio弹幕网 | <http://www.miomio.tv/> |✓| | |
|
|
| 痞客邦 | <https://www.pixnet.net/> |✓| | |
|
|
| PPTV聚力 | <http://www.pptv.com/> |✓| | |
|
|
| 齐鲁网 | <http://v.iqilu.com/> |✓| | |
|
|
| QQ<br/>腾讯视频 | <http://v.qq.com/> |✓| | |
|
|
| 企鹅直播 | <http://live.qq.com/> |✓| | |
|
|
| 阡陌视频 | <http://qianmo.com/> |✓| | |
|
|
| THVideo | <http://thvideo.tv/> |✓| | |
|
|
| Sina<br/>新浪视频<br/>微博秒拍视频 | <http://video.sina.com.cn/><br/><http://video.weibo.com/> |✓| | |
|
|
| Sohu<br/>搜狐视频 | <http://tv.sohu.com/> |✓| | |
|
|
| 天天动听 | <http://www.dongting.com/> | | |✓|
|
|
| **Tudou<br/>土豆** | <http://www.tudou.com/> |✓| | |
|
|
| 虾米 | <http://www.xiami.com/> | | |✓|
|
|
| 阳光卫视 | <http://www.isuntv.com/> |✓| | |
|
|
| **音悦Tai** | <http://www.yinyuetai.com/> |✓| | |
|
|
| **Youku<br/>优酷** | <http://www.youku.com/> |✓| | |
|
|
| 战旗TV | <http://www.zhanqi.tv/lives> |✓| | |
|
|
| 央视网 | <http://www.cntv.cn/> |✓| | |
|
|
| 花瓣 | <http://huaban.com/> | |✓| |
|
|
| Naver<br/>네이버 | <http://tvcast.naver.com/> |✓| | |
|
|
| 芒果TV | <http://www.mgtv.com/> |✓| | |
|
|
| 火猫TV | <http://www.huomao.com/> |✓| | |
|
|
| 全民Tv | <http://www.quanmin.tv/> |✓| | |
|
|
|
|
For all other sites not on the list, the universal extractor will take care of finding and downloading interesting resources from the page.
|
|
|
|
### Known bugs
|
|
|
|
If something is broken and `you-get` can't get you things you want, don't panic. (Yes, this happens all the time!)
|
|
|
|
Check if it's already a known problem on <https://github.com/soimort/you-get/wiki/Known-Bugs>, and search on the [list of open issues](https://github.com/soimort/you-get/issues). If it has not been reported yet, open a new issue, with detailed command-line output attached.
|
|
|
|
## Getting Involved
|
|
|
|
You can reach us on the Gitter channel [#soimort/you-get](https://gitter.im/soimort/you-get) (here's how you [set up your IRC client](http://irc.gitter.im) for Gitter). If you have a quick question regarding `you-get`, ask it there.
|
|
|
|
All kinds of pull requests are welcome. However, there are a few guidelines to follow:
|
|
|
|
* The [`develop`](https://github.com/soimort/you-get/tree/develop) branch is where your pull request should go.
|
|
* Remember to rebase.
|
|
* Document your PR clearly, and if applicable, provide some sample links for reviewers to test with.
|
|
* Write well-formatted, easy-to-understand commit messages. If you don't know how, look at existing ones.
|
|
* We will not ask you to sign a CLA, but you must assure that your code can be legally redistributed (under the terms of the MIT license).
|
|
|
|
## Legal Issues
|
|
|
|
This software is distributed under the [MIT license](https://raw.github.com/soimort/you-get/master/LICENSE.txt).
|
|
|
|
In particular, please be aware that
|
|
|
|
> THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
|
|
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
|
|
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
|
|
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
|
|
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
|
|
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
|
|
SOFTWARE.
|
|
|
|
Translated to human words:
|
|
|
|
*In case your use of the software forms the basis of copyright infringement, or you use the software for any other illegal purposes, the authors cannot take any responsibility for you.*
|
|
|
|
We only ship the code here, and how you are going to use it is left to your own discretion.
|
|
|
|
## Authors
|
|
|
|
Made by [@soimort](https://github.com/soimort), who is in turn powered by :coffee:, :pizza: and :ramen:.
|
|
|
|
You can find the [list of all contributors](https://github.com/soimort/you-get/graphs/contributors) here.
|