896 lines
45 KiB
ReStructuredText
896 lines
45 KiB
ReStructuredText
============================
|
|
libtorrent API Documentation
|
|
============================
|
|
|
|
:Author: Arvid Norberg, arvid@libtorrent.org
|
|
:Version: 1.1.0
|
|
|
|
.. contents:: Table of contents
|
|
:depth: 1
|
|
:backlinks: none
|
|
|
|
overview
|
|
========
|
|
|
|
The interface of libtorrent consists of a few classes. The main class is
|
|
the ``session``, it contains the main loop that serves all torrents.
|
|
|
|
The basic usage is as follows:
|
|
|
|
* construct a session
|
|
* load session state from settings file (see load_state())
|
|
* start extensions (see add_extension()).
|
|
* start DHT, LSD, UPnP, NAT-PMP etc (see start_dht(), start_lsd(), start_upnp()
|
|
and start_natpmp()).
|
|
* parse .torrent-files and add them to the session (see torrent_info,
|
|
async_add_torrent() and add_torrent())
|
|
* main loop (see session)
|
|
|
|
* poll for alerts (see wait_for_alert(), pop_alerts())
|
|
* handle updates to torrents, (see state_update_alert).
|
|
* handle other alerts, (see alert).
|
|
* query the session for information (see session::status()).
|
|
* add and remove torrents from the session (remove_torrent())
|
|
|
|
* save resume data for all torrent_handles (optional, see
|
|
save_resume_data())
|
|
* save session state (see save_state())
|
|
* destruct session object
|
|
|
|
Each class and function is described in this manual.
|
|
|
|
For a description on how to create torrent files, see create_torrent.
|
|
|
|
.. _make_torrent: make_torrent.html
|
|
|
|
things to keep in mind
|
|
======================
|
|
|
|
A common problem developers are facing is torrents stopping without explanation.
|
|
Here is a description on which conditions libtorrent will stop your torrents,
|
|
how to find out about it and what to do about it.
|
|
|
|
Make sure to keep track of the paused state, the error state and the upload
|
|
mode of your torrents. By default, torrents are auto-managed, which means
|
|
libtorrent will pause them, unpause them, scrape them and take them out
|
|
of upload-mode automatically.
|
|
|
|
Whenever a torrent encounters a fatal error, it will be stopped, and the
|
|
``torrent_status::error`` will describe the error that caused it. If a torrent
|
|
is auto managed, it is scraped periodically and paused or resumed based on
|
|
the number of downloaders per seed. This will effectively seed torrents that
|
|
are in the greatest need of seeds.
|
|
|
|
If a torrent hits a disk write error, it will be put into upload mode. This
|
|
means it will not download anything, but only upload. The assumption is that
|
|
the write error is caused by a full disk or write permission errors. If the
|
|
torrent is auto-managed, it will periodically be taken out of the upload
|
|
mode, trying to write things to the disk again. This means torrent will recover
|
|
from certain disk errors if the problem is resolved. If the torrent is not
|
|
auto managed, you have to call set_upload_mode() to turn
|
|
downloading back on again.
|
|
|
|
network primitives
|
|
==================
|
|
|
|
There are a few typedefs in the ``libtorrent`` namespace which pulls
|
|
in network types from the ``asio`` namespace. These are::
|
|
|
|
typedef asio::ip::address address;
|
|
typedef asio::ip::address_v4 address_v4;
|
|
typedef asio::ip::address_v6 address_v6;
|
|
using asio::ip::tcp;
|
|
using asio::ip::udp;
|
|
|
|
These are declared in the ``<libtorrent/socket.hpp>`` header.
|
|
|
|
The ``using`` statements will give easy access to::
|
|
|
|
tcp::endpoint
|
|
udp::endpoint
|
|
|
|
Which are the endpoint types used in libtorrent. An endpoint is an address
|
|
with an associated port.
|
|
|
|
For documentation on these types, please refer to the `asio documentation`_.
|
|
|
|
.. _`asio documentation`: http://asio.sourceforge.net/asio-0.3.8/doc/asio/reference.html
|
|
|
|
exceptions
|
|
==========
|
|
|
|
Many functions in libtorrent have two versions, one that throws exceptions on
|
|
errors and one that takes an ``error_code`` reference which is filled with the
|
|
error code on errors.
|
|
|
|
There is one exception class that is used for errors in libtorrent, it is based
|
|
on boost.system's ``error_code`` class to carry the error code.
|
|
|
|
For more information, see libtorrent_exception and error_code_enum.
|
|
|
|
translating error codes
|
|
-----------------------
|
|
|
|
The error_code::message() function will typically return a localized error string,
|
|
for system errors. That is, errors that belong to the generic or system category.
|
|
|
|
Errors that belong to the libtorrent error category are not localized however, they
|
|
are only available in english. In order to translate libtorrent errors, compare the
|
|
error category of the ``error_code`` object against ``libtorrent::get_libtorrent_category()``,
|
|
and if matches, you know the error code refers to the list above. You can provide
|
|
your own mapping from error code to string, which is localized. In this case, you
|
|
cannot rely on ``error_code::message()`` to generate your strings.
|
|
|
|
The numeric values of the errors are part of the API and will stay the same, although
|
|
new error codes may be appended at the end.
|
|
|
|
Here's a simple example of how to translate error codes:
|
|
|
|
.. code:: c++
|
|
|
|
std::string error_code_to_string(boost::system::error_code const& ec)
|
|
{
|
|
if (ec.category() != libtorrent::get_libtorrent_category())
|
|
{
|
|
return ec.message();
|
|
}
|
|
// the error is a libtorrent error
|
|
|
|
int code = ec.value();
|
|
static const char const* swedish[] =
|
|
{
|
|
"inget fel",
|
|
"en fil i torrenten kolliderar med en fil fran en annan torrent",
|
|
"hash check misslyckades",
|
|
"torrentfilen ar inte en dictionary",
|
|
"'info'-nyckeln saknas eller ar korrupt i torrentfilen",
|
|
"'info'-faltet ar inte en dictionary",
|
|
"'piece length' faltet saknas eller ar korrupt i torrentfilen",
|
|
"torrentfilen saknar namnfaltet",
|
|
"ogiltigt namn i torrentfilen (kan vara en attack)",
|
|
// ... more strings here
|
|
};
|
|
|
|
// use the default error string in case we don't have it
|
|
// in our translated list
|
|
if (code < 0 || code >= sizeof(swedish)/sizeof(swedish[0]))
|
|
return ec.message();
|
|
|
|
return swedish[code];
|
|
}
|
|
|
|
magnet links
|
|
============
|
|
|
|
Magnet links are URIs that includes an info-hash, a display name and optionally
|
|
a tracker url. The idea behind magnet links is that an end user can click on a
|
|
link in a browser and have it handled by a bittorrent application, to start a
|
|
download, without any .torrent file.
|
|
|
|
The format of the magnet URI is:
|
|
|
|
**magnet:?xt=urn:btih:** *Base16 encoded info-hash* [ **&dn=** *name of download* ] [ **&tr=** *tracker URL* ]*
|
|
|
|
queuing
|
|
=======
|
|
|
|
libtorrent supports *queuing*. Which means it makes sure that a limited number of
|
|
torrents are being downloaded at any given time, and once a torrent is completely
|
|
downloaded, the next in line is started.
|
|
|
|
Torrents that are *auto managed* are subject to the queuing and the active
|
|
torrents limits. To make a torrent auto managed, set add_torrent_params::flag_auto_managed
|
|
when adding the torrent (see async_add_torrent() and add_torrent()).
|
|
|
|
The limits of the number of downloading and seeding torrents are controlled via
|
|
settings_pack::active_downloads, settings_pack::active_seeds and settings_pack::active_limit in
|
|
settings_pack. These limits takes non auto managed torrents into account as
|
|
well. If there are more non-auto managed torrents being downloaded than the
|
|
settings_pack::active_downloads setting, any auto managed torrents will be queued until
|
|
torrents are removed so that the number drops below the limit.
|
|
|
|
At a regular interval, torrents are checked if there needs to be any
|
|
re-ordering of which torrents are active and which are queued. This interval
|
|
can be controlled via settings_pack::auto_manage_interval.
|
|
|
|
For queuing to work, resume data needs to be saved and restored for all
|
|
torrents. See save_resume_data().
|
|
|
|
downloading
|
|
-----------
|
|
|
|
Torrents that are currently being downloaded or incomplete (with bytes still to
|
|
download) are queued. The torrents in the front of the queue are started to be
|
|
actively downloaded and the rest are ordered with regards to their queue
|
|
position. Any newly added torrent is placed at the end of the queue. Once a
|
|
torrent is removed or turns into a seed, its queue position is -1 and all
|
|
torrents that used to be after it in the queue, decreases their position in
|
|
order to fill the gap.
|
|
|
|
The queue positions are always in a sequence without any gaps.
|
|
|
|
Lower queue position means closer to the front of the queue, and will be
|
|
started sooner than torrents with higher queue positions.
|
|
|
|
To query a torrent for its position in the queue, or change its position, see:
|
|
queue_position(), queue_position_up(), queue_position_down(),
|
|
queue_position_top() and queue_position_bottom().
|
|
|
|
seeding
|
|
-------
|
|
|
|
Auto managed seeding torrents are rotated, so that all of them are allocated a
|
|
fair amount of seeding. Torrents with fewer completed *seed cycles* are
|
|
prioritized for seeding. A seed cycle is completed when a torrent meets either
|
|
the share ratio limit (uploaded bytes / downloaded bytes), the share time ratio
|
|
(time seeding / time downloaing) or seed time limit (time seeded).
|
|
|
|
The relevant settings to control these limits are
|
|
settings_pack::share_ratio_limit, settings_pack::seed_time_ratio_limit and
|
|
settings_pack::seed_time_limit.
|
|
|
|
fast resume
|
|
===========
|
|
|
|
The fast resume mechanism is a way to remember which pieces are downloaded
|
|
and where they are put between sessions. You can generate fast resume data by
|
|
calling save_resume_data() on torrent_handle. You can
|
|
then save this data to disk and use it when resuming the torrent. libtorrent
|
|
will not check the piece hashes then, and rely on the information given in the
|
|
fast-resume data. The fast-resume data also contains information about which
|
|
blocks, in the unfinished pieces, were downloaded, so it will not have to
|
|
start from scratch on the partially downloaded pieces.
|
|
|
|
To use the fast-resume data you simply give it to async_add_torrent() and
|
|
add_torrent(), and it will skip the time consuming checks. It may have to do
|
|
the checking anyway, if the fast-resume data is corrupt or doesn't fit the
|
|
storage for that torrent, then it will not trust the fast-resume data and just
|
|
do the checking.
|
|
|
|
file format
|
|
-----------
|
|
|
|
The file format is a bencoded dictionary containing the following fields:
|
|
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``file-format`` | string: "libtorrent resume file" |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``file-version`` | integer: 1 |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``info-hash`` | string, the info hash of the torrent this data is saved for. |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``blocks per piece`` | integer, the number of blocks per piece. Must be: piece_size |
|
|
| | / (16 * 1024). Clamped to be within the range [1, 256]. It |
|
|
| | is the number of blocks per (normal sized) piece. Usually |
|
|
| | each block is 16 * 1024 bytes in size. But if piece size is |
|
|
| | greater than 4 megabytes, the block size will increase. |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``pieces`` | A string with piece flags, one character per piece. |
|
|
| | Bit 1 means we have that piece. |
|
|
| | Bit 2 means we have verified that this piece is correct. |
|
|
| | This only applies when the torrent is in seed_mode. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``slots`` | list of integers. The list maps slots to piece indices. It |
|
|
| | tells which piece is on which slot. If piece index is -2 it |
|
|
| | means it is free, that there's no piece there. If it is -1, |
|
|
| | means the slot isn't allocated on disk yet. The pieces have |
|
|
| | to meet the following requirement: |
|
|
| | |
|
|
| | If there's a slot at the position of the piece index, |
|
|
| | the piece must be located in that slot. |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``total_uploaded`` | integer. The number of bytes that have been uploaded in |
|
|
| | total for this torrent. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``total_downloaded`` | integer. The number of bytes that have been downloaded in |
|
|
| | total for this torrent. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``active_time`` | integer. The number of seconds this torrent has been active. |
|
|
| | i.e. not paused. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``seeding_time`` | integer. The number of seconds this torrent has been active |
|
|
| | and seeding. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``num_seeds`` | integer. An estimate of the number of seeds on this torrent |
|
|
| | when the resume data was saved. This is scrape data or based |
|
|
| | on the peer list if scrape data is unavailable. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``num_downloaders`` | integer. An estimate of the number of downloaders on this |
|
|
| | torrent when the resume data was last saved. This is used as |
|
|
| | an initial estimate until we acquire up-to-date scrape info. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``upload_rate_limit`` | integer. In case this torrent has a per-torrent upload rate |
|
|
| | limit, this is that limit. In bytes per second. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``download_rate_limit`` | integer. The download rate limit for this torrent in case |
|
|
| | one is set, in bytes per second. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``max_connections`` | integer. The max number of peer connections this torrent |
|
|
| | may have, if a limit is set. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``max_uploads`` | integer. The max number of unchoked peers this torrent may |
|
|
| | have, if a limit is set. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``seed_mode`` | integer. 1 if the torrent is in seed mode, 0 otherwise. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``file_priority`` | list of integers. One entry per file in the torrent. Each |
|
|
| | entry is the priority of the file with the same index. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``piece_priority`` | string of bytes. Each byte is interpreted as an integer and |
|
|
| | is the priority of that piece. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``auto_managed`` | integer. 1 if the torrent is auto managed, otherwise 0. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``sequential_download`` | integer. 1 if the torrent is in sequential download mode, |
|
|
| | 0 otherwise. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``paused`` | integer. 1 if the torrent is paused, 0 otherwise. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``trackers`` | list of lists of strings. The top level list lists all |
|
|
| | tracker tiers. Each second level list is one tier of |
|
|
| | trackers. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``mapped_files`` | list of strings. If any file in the torrent has been |
|
|
| | renamed, this entry contains a list of all the filenames. |
|
|
| | In the same order as in the torrent file. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``url-list`` | list of strings. List of url-seed URLs used by this torrent. |
|
|
| | The urls are expected to be properly encoded and not contain |
|
|
| | any illegal url characters. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``httpseeds`` | list of strings. List of httpseed URLs used by this torrent. |
|
|
| | The urls are expected to be properly encoded and not contain |
|
|
| | any illegal url characters. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``merkle tree`` | string. In case this torrent is a merkle torrent, this is a |
|
|
| | string containing the entire merkle tree, all nodes, |
|
|
| | including the root and all leaves. The tree is not |
|
|
| | necessarily complete, but complete enough to be able to send |
|
|
| | any piece that we have, indicated by the have bitmask. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``save_path`` | string. The save path where this torrent was saved. This is |
|
|
| | especially useful when moving torrents with move_storage() |
|
|
| | since this will be updated. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``peers`` | list of dictionaries. Each dictionary has the following |
|
|
| | layout: |
|
|
| | |
|
|
| | +----------+-----------------------------------------------+ |
|
|
| | | ``ip`` | string, the ip address of the peer. This is | |
|
|
| | | | not a binary representation of the ip | |
|
|
| | | | address, but the string representation. It | |
|
|
| | | | may be an IPv6 string or an IPv4 string. | |
|
|
| | +----------+-----------------------------------------------+ |
|
|
| | | ``port`` | integer, the listen port of the peer | |
|
|
| | +----------+-----------------------------------------------+ |
|
|
| | |
|
|
| | These are the local peers we were connected to when this |
|
|
| | fast-resume data was saved. |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``unfinished`` | list of dictionaries. Each dictionary represents an |
|
|
| | piece, and has the following layout: |
|
|
| | |
|
|
| | +-------------+--------------------------------------------+ |
|
|
| | | ``piece`` | integer, the index of the piece this entry | |
|
|
| | | | refers to. | |
|
|
| | +-------------+--------------------------------------------+ |
|
|
| | | ``bitmask`` | string, a binary bitmask representing the | |
|
|
| | | | blocks that have been downloaded in this | |
|
|
| | | | piece. | |
|
|
| | +-------------+--------------------------------------------+ |
|
|
| | | ``adler32`` | The adler32 checksum of the data in the | |
|
|
| | | | blocks specified by ``bitmask``. | |
|
|
| | | | | |
|
|
| | +-------------+--------------------------------------------+ |
|
|
| | |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``file sizes`` | list where each entry corresponds to a file in the file list |
|
|
| | in the metadata. Each entry has a list of two values, the |
|
|
| | first value is the size of the file in bytes, the second |
|
|
| | is the time stamp when the last time someone wrote to it. |
|
|
| | This information is used to compare with the files on disk. |
|
|
| | All the files must match exactly this information in order |
|
|
| | to consider the resume data as current. Otherwise a full |
|
|
| | re-check is issued. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
| ``allocation`` | The allocation mode for the storage. Can be either ``full`` |
|
|
| | or ``compact``. If this is full, the file sizes and |
|
|
| | timestamps are disregarded. Pieces are assumed not to have |
|
|
| | moved around even if the files have been modified after the |
|
|
| | last resume data checkpoint. |
|
|
+--------------------------+--------------------------------------------------------------+
|
|
|
|
storage allocation
|
|
==================
|
|
|
|
There are two modes in which storage (files on disk) are allocated in libtorrent.
|
|
|
|
1. The traditional *full allocation* mode, where the entire files are filled up
|
|
with zeros before anything is downloaded. Files are allocated on demand, the
|
|
first time anything is written to them. The main benefit of this mode is that
|
|
it avoids creating heavily fragmented files.
|
|
|
|
2. The *sparse allocation*, sparse files are used, and pieces are downloaded
|
|
directly to where they belong. This is the recommended (and default) mode.
|
|
|
|
In previous versions of libtorrent, a 3rd mode was supported, *compact
|
|
allocation*. Support for this is deprecated and will be removed in future
|
|
versions of libtorrent. It's still described in here for completeness.
|
|
|
|
The allocation mode is selected when a torrent is started. It is passed as an
|
|
argument to session::add_torrent() or session::async_add_torrent().
|
|
|
|
The decision to use full allocation or compact allocation typically depends on
|
|
whether any files have priority 0 and if the filesystem supports sparse files.
|
|
|
|
sparse allocation
|
|
-----------------
|
|
|
|
On filesystems that supports sparse files, this allocation mode will only use
|
|
as much space as has been downloaded.
|
|
|
|
The main drawback of this mode is that it may create heavily fragmented files.
|
|
|
|
* It does not require an allocation pass on startup.
|
|
|
|
full allocation
|
|
---------------
|
|
|
|
When a torrent is started in full allocation mode, the disk-io thread
|
|
will make sure that the entire storage is allocated, and fill any gaps with zeros.
|
|
It will of course still check for existing pieces and fast resume data. The main
|
|
drawbacks of this mode are:
|
|
|
|
* It may take longer to start the torrent, since it will need to fill the files
|
|
with zeroes. This delay is linear to the size of the download.
|
|
|
|
* The download may occupy unnecessary disk space between download sessions.
|
|
|
|
* Disk caches usually perform poorly with random access to large files
|
|
and may slow down the download some.
|
|
|
|
The benefits of this mode are:
|
|
|
|
* Downloaded pieces are written directly to their final place in the files and
|
|
the total number of disk operations will be fewer and may also play nicer to
|
|
filesystems' file allocation, and reduce fragmentation.
|
|
|
|
* No risk of a download failing because of a full disk during download, once
|
|
all files have been created.
|
|
|
|
compact allocation
|
|
------------------
|
|
|
|
.. note::
|
|
Support for compact allocation has been removed from libttorrent
|
|
|
|
The compact allocation will only allocate as much storage as it needs to keep
|
|
the pieces downloaded so far. This means that pieces will be moved around to be
|
|
placed at their final position in the files while downloading (to make sure the
|
|
completed download has all its pieces in the correct place). So, the main
|
|
drawbacks are:
|
|
|
|
* More disk operations while downloading since pieces are moved around.
|
|
|
|
* Potentially more fragmentation in the filesystem.
|
|
|
|
* Cannot be used while having files with priority 0.
|
|
|
|
The benefits though, are:
|
|
|
|
* No startup delay, since the files don't need allocating.
|
|
|
|
* The download will not use unnecessary disk space.
|
|
|
|
* Disk caches perform much better than in full allocation and raises the
|
|
download speed limit imposed by the disk.
|
|
|
|
* Works well on filesystems that don't support sparse files.
|
|
|
|
The algorithm that is used when allocating pieces and slots isn't very
|
|
complicated. For the interested, a description follows.
|
|
|
|
storing a piece:
|
|
|
|
1. let **A** be a newly downloaded piece, with index **n**.
|
|
2. let **s** be the number of slots allocated in the file we're
|
|
downloading to. (the number of pieces it has room for).
|
|
3. if **n** >= **s** then allocate a new slot and put the piece there.
|
|
4. if **n** < **s** then allocate a new slot, move the data at
|
|
slot **n** to the new slot and put **A** in slot **n**.
|
|
|
|
allocating a new slot:
|
|
|
|
1. if there's an unassigned slot (a slot that doesn't
|
|
contain any piece), return that slot index.
|
|
2. append the new slot at the end of the file (or find an unused slot).
|
|
3. let **i** be the index of newly allocated slot
|
|
4. if we have downloaded piece index **i** already (to slot **j**) then
|
|
|
|
1. move the data at slot **j** to slot **i**.
|
|
2. return slot index **j** as the newly allocated free slot.
|
|
|
|
5. return **i** as the newly allocated slot.
|
|
|
|
|
|
HTTP seeding
|
|
============
|
|
|
|
There are two kinds of HTTP seeding. One with that assumes a smart (and polite)
|
|
client and one that assumes a smart server. These are specified in `BEP 19`_
|
|
and `BEP 17`_ respectively.
|
|
|
|
libtorrent supports both. In the libtorrent source code and API, BEP 19 urls
|
|
are typically referred to as *url seeds* and BEP 17 urls are typically referred
|
|
to as *HTTP seeds*.
|
|
|
|
The libtorrent implementation of `BEP 19`_ assumes that, if the URL ends with a
|
|
slash ('/'), the filename should be appended to it in order to request pieces
|
|
from that file. The way this works is that if the torrent is a single-file
|
|
torrent, only that filename is appended. If the torrent is a multi-file
|
|
torrent, the torrent's name '/' the file name is appended. This is the same
|
|
directory structure that libtorrent will download torrents into.
|
|
|
|
.. _`BEP 17`: http://bittorrent.org/beps/bep_0017.html
|
|
.. _`BEP 19`: http://bittorrent.org/beps/bep_0019.html
|
|
|
|
dynamic loading of torrent files
|
|
================================
|
|
|
|
libtorrent has a feature that can unload idle torrents from memory. The purpose
|
|
of this is to support being active on many more torrents than the RAM permits.
|
|
This is useful for both embedded devices that have limited RAM and servers
|
|
seeding tens of thousands of torrents.
|
|
|
|
The most significant parts of loaded torrents that use RAM are the piece
|
|
hashes (20 bytes per piece) and the file list. The entire info-dictionary
|
|
of the .torrent file is kept in RAM.
|
|
|
|
In order to activate the dynamic loading of torrent files, set the load
|
|
function on the session. See set_load_function().
|
|
|
|
When a load function is set on the session, the dynamic load/unload
|
|
feature is enabled. Torrents are kept in an LRU. Every time an operation
|
|
is performed, on a torrent or from a peer, that requires the metadata of
|
|
the torrent to be loaded, the torrent is bumped up in the LRU. When a torrent
|
|
is paused or queued, it is demoted to the least recently used torrent in
|
|
the LRU, since it's a good candidate for eviction.
|
|
|
|
To configure how many torrents are allowed to be loaded at the same time,
|
|
set settings_pack::active_loaded_limit on the session.
|
|
|
|
Torrents can be exempt from being unloaded by being *pinned*. Pinned torrents
|
|
still count against the limit, but are never considered for eviction.
|
|
You can either pin a torrent when adding it, in ``add_torrent_params``
|
|
(see async_add_torrent() and add_torrent()), or after ading it with the
|
|
set_pinned() function on torrent_handle.
|
|
|
|
Torrents that start out without metadata (e.g. magnet links or http downloads)
|
|
are automatically pinned. This is important in order to give the client a
|
|
chance to save the metadata to disk once it's received (see metadata_received_alert).
|
|
|
|
Once the metadata is saved to disk, it might make sense to unpin the torrent.
|
|
|
|
piece picker
|
|
============
|
|
|
|
The piece picker in libtorrent has the following features:
|
|
|
|
* rarest first
|
|
* sequential download
|
|
* random pick
|
|
* reverse order picking
|
|
* parole mode
|
|
* prioritize partial pieces
|
|
* prefer whole pieces
|
|
* piece affinity by speed category
|
|
* piece priorities
|
|
|
|
internal representation
|
|
-----------------------
|
|
|
|
It is optimized by, at all times, keeping a list of pieces ordered by rarity,
|
|
randomly shuffled within each rarity class. This list is organized as a single
|
|
vector of contigous memory in RAM, for optimal memory locality and to eliminate
|
|
heap allocations and frees when updating rarity of pieces.
|
|
|
|
Expensive events, like a peer joining or leaving, are evaluated lazily, since
|
|
it's cheaper to rebuild the whole list rather than updating every single piece
|
|
in it. This means as long as no blocks are picked, peers joining and leaving is
|
|
no more costly than a single peer joining or leaving. Of course the special
|
|
cases of peers that have all or no pieces are optimized to not require
|
|
rebuilding the list.
|
|
|
|
picker strategy
|
|
---------------
|
|
|
|
The normal mode of the picker is of course *rarest first*, meaning pieces that
|
|
few peers have are preferred to be downloaded over pieces that more peers have.
|
|
This is a fundamental algorithm that is the basis of the performance of
|
|
bittorrent. However, the user may set the piece picker into sequential download
|
|
mode. This mode simply picks pieces sequentially, always preferring lower piece
|
|
indices.
|
|
|
|
When a torrent starts out, picking the rarest pieces means increased risk that
|
|
pieces won't be completed early (since there are only a few peers they can be
|
|
downloaded from), leading to a delay of having any piece to offer to other
|
|
peers. This lack of pieces to trade, delays the client from getting started
|
|
into the normal tit-for-tat mode of bittorrent, and will result in a long
|
|
ramp-up time. The heuristic to mitigate this problem is to, for the first few
|
|
pieces, pick random pieces rather than rare pieces. The threshold for when to
|
|
leave this initial picker mode is determined by
|
|
settings_pack::initial_picker_threshold.
|
|
|
|
reverse order
|
|
-------------
|
|
|
|
An orthogonal setting is *reverse order*, which is used for *snubbed* peers.
|
|
Snubbed peers are peers that appear very slow, and might have timed out a piece
|
|
request. The idea behind this is to make all snubbed peers more likely to be
|
|
able to do download blocks from the same piece, concentrating slow peers on as
|
|
few pieces as possible. The reverse order means that the most common pieces are
|
|
picked, instead of the rarest pieces (or in the case of sequential download,
|
|
the last pieces, intead of the first).
|
|
|
|
parole mode
|
|
-----------
|
|
|
|
Peers that have participated in a piece that failed the hash check, may be put
|
|
in *parole mode*. This means we prefer downloading a full piece from this
|
|
peer, in order to distinguish which peer is sending corrupt data. Whether to do
|
|
this is or not is controlled by settings_pack::use_parole_mode.
|
|
|
|
In parole mode, the piece picker prefers picking one whole piece at a time for
|
|
a given peer, avoiding picking any blocks from a piece any other peer has
|
|
contributed to (since that would defeat the purpose of parole mode).
|
|
|
|
prioritize partial pieces
|
|
-------------------------
|
|
|
|
This setting determines if partially downloaded or requested pieces should
|
|
always be preferred over other pieces. The benefit of doing this is that the
|
|
number of partial pieces is minimized (and hence the turn-around time for
|
|
downloading a block until it can be uploaded to others is minimized). It also
|
|
puts less stress on the disk cache, since fewer partial pieces need to be kept
|
|
in the cache. Whether or not to enable this is controlled by
|
|
setting_pack::prioritize_partial_pieces.
|
|
|
|
The main benefit of not prioritizing partial pieces is that the rarest first
|
|
algorithm gets to have more influence on which pieces are picked. The picker is
|
|
more likely to truly pick the rarest piece, and hence improving the performance
|
|
of the swarm.
|
|
|
|
This setting is turned on automatically whenever the number of partial pieces
|
|
in the piece picker exceeds the number of peers we're connected to times 1.5.
|
|
This is in order to keep the waste of partial pieces to a minimum, but still
|
|
prefer rarest pieces.
|
|
|
|
prefer whole pieces
|
|
-------------------
|
|
|
|
The *prefer whole pieces* setting makes the piece picker prefer picking entire
|
|
pieces at a time. This is used by web connections (both http seeding
|
|
standards), in order to be able to coalesce the small bittorrent requests to
|
|
larger HTTP requests. This significantly improves performance when downloading
|
|
over HTTP.
|
|
|
|
It is also used by peers that are downloading faster than a certain threshold.
|
|
The main advantage is that these peers will better utilize the other peer's
|
|
disk cache, by requesting all blocks in a single piece, from the same peer.
|
|
|
|
This threshold is controlled by the settings_pack::whole_pieces_threshold
|
|
setting.
|
|
|
|
*TODO: piece priorities*
|
|
|
|
predictive piece announce
|
|
=========================
|
|
|
|
In order to improve performance, libtorrent supports a feature called
|
|
``predictive piece announce``. When enabled, it will make libtorrent announce
|
|
that we have pieces to peers, before we truly have them. The most important
|
|
case is to announce a piece as soon as it has been downloaded and passed the
|
|
hash check, but not yet been written to disk. In this case, there is a risk the
|
|
piece will fail to be written to disk, in which case we won't have the piece
|
|
anymore, even though we announced it to peers.
|
|
|
|
The other case is when we're very close to completing the download of a piece
|
|
and assume it will pass the hash check, we can announce it to peers to make it
|
|
available one round-trip sooner than otherwise. This lets libtorrent start
|
|
uploading the piece to interested peers immediately when the piece complete,
|
|
instead of waiting one round-trip for the peers to request it.
|
|
|
|
This makes for the implementation slightly more complicated, since piece will
|
|
have more states and more complicated transitions. For instance, a piece could
|
|
be:
|
|
|
|
1. hashed but not fully written to disk
|
|
2. fully written to disk but not hashed
|
|
3. not fully downloaded
|
|
4. downloaded and hash checked
|
|
|
|
Once a piece is fully downloaded, the hash check could complete before any of
|
|
the write operations or it could complete after all write operations are
|
|
complete.
|
|
|
|
peer classes
|
|
============
|
|
|
|
The peer classes feature in libtorrent allows a client to define custom groups
|
|
of peers and rate limit them individually. Each such group is called a *peer
|
|
class*. There are a few default peer classes that are always created:
|
|
|
|
* global - all peers belong to this class, except peers on the local network
|
|
* local peers - all peers on the local network belongs to this class TCP peers
|
|
* - all peers connected over TCP belong to this class
|
|
|
|
The TCP peers class is used by the uTP/TCP balancing logic, if it's enabled, to
|
|
throttle TCP peers. The global and local classes are used to adjust the global
|
|
rate limits.
|
|
|
|
When the rate limits are adjusted for a specific torrent, a class is created
|
|
implicitly for that torrent.
|
|
|
|
The default peer class IDs are defined as enums in the ``session`` class::
|
|
|
|
enum {
|
|
global_peer_class_id,
|
|
tcp_peer_class_id,
|
|
local_peer_class_id
|
|
};
|
|
|
|
A peer class can be considered a more general form of *lables* that some
|
|
clients have. Peer classes however are not just applied to torrents, but
|
|
ultimately the peers.
|
|
|
|
Peer classes can be created with the create_peer_class() call (on the session
|
|
object), and deleted with the delete_peer_class() call.
|
|
|
|
Peer classes are configured with the set_peer_class() get_peer_class() calls.
|
|
|
|
Custom peer classes can be assigned to torrents, with the ??? call, in which
|
|
case all its peers will belong to the class. They can also be assigned based on
|
|
the peer's IP address. See set_peer_class_filter() for more information.
|
|
|
|
SSL torrents
|
|
============
|
|
|
|
Torrents may have an SSL root (CA) certificate embedded in them. Such torrents
|
|
are called *SSL torrents*. An SSL torrent talks to all bittorrent peers over
|
|
SSL. The protocols are layered like this:
|
|
|
|
.. image:: utp_stack.png
|
|
|
|
During the SSL handshake, both peers need to authenticate by providing a
|
|
certificate that is signed by the CA certificate found in the .torrent file.
|
|
These peer certificates are expected to be privided to peers through some other
|
|
means than bittorrent. Typically by a peer generating a certificate request
|
|
which is sent to the publisher of the torrent, and the publisher returning a
|
|
signed certificate.
|
|
|
|
In libtorrent, set_ssl_certificate() in torrent_handle is used to tell
|
|
libtorrent where to find the peer certificate and the private key for it. When
|
|
an SSL torrent is loaded, the torrent_need_cert_alert is posted to remind the
|
|
user to provide a certificate.
|
|
|
|
A peer connecting to an SSL torrent MUST provide the *SNI* TLS extension
|
|
(server name indication). The server name is the hex encoded info-hash of the
|
|
torrent to connect to. This is required for the client accepting the connection
|
|
to know which certificate to present.
|
|
|
|
SSL connections are accepted on a separate socket from normal bittorrent
|
|
connections. To pick which port the SSL socket should bind to, set
|
|
settings_pack::ssl_listen to a different port. It defaults to port 4433.
|
|
This setting is only taken into account when the normal listen socket is opened
|
|
(i.e. just changing this setting won't necessarily close and re-open the SSL
|
|
socket). To not listen on an SSL socket at all, set ``ssl_listen`` to 0.
|
|
|
|
This feature is only available if libtorrent is build with openssl support
|
|
(``TORRENT_USE_OPENSSL``) and requires at least openSSL version 1.0, since it
|
|
needs SNI support.
|
|
|
|
Peer certificates must have at least one *SubjectAltName* field of type
|
|
dNSName. At least one of the fields must *exactly* match the name of the
|
|
torrent. This is a byte-by-byte comparison, the UTF-8 encoding must be
|
|
identical (i.e. there's no unicode normalization going on). This is the
|
|
recommended way of verifying certificates for HTTPS servers according to `RFC
|
|
2818`_. Note the difference that for torrents only *dNSName* fields are taken
|
|
into account (not IP address fields). The most specific (i.e. last) *Common
|
|
Name* field is also taken into account if no *SubjectAltName* did not match.
|
|
|
|
If any of these fields contain a single asterisk ("*"), the certificate is
|
|
considered covering any torrent, allowing it to be reused for any torrent.
|
|
|
|
The purpose of matching the torrent name with the fields in the peer
|
|
certificate is to allow a publisher to have a single root certificate for all
|
|
torrents it distributes, and issue separate peer certificates for each torrent.
|
|
A peer receiving a certificate will not necessarily be able to access all
|
|
torrents published by this root certificate (only if it has a "star cert").
|
|
|
|
.. _`RFC 2818`: http://www.ietf.org/rfc/rfc2818.txt
|
|
|
|
testing
|
|
-------
|
|
|
|
To test incoming SSL connections to an SSL torrent, one can use the following
|
|
*openssl* command::
|
|
|
|
openssl s_client -cert <peer-certificate>.pem -key <peer-private-key>.pem -CAfile \
|
|
<torrent-cert>.pem -debug -connect 127.0.0.1:4433 -tls1 -servername <info-hash>
|
|
|
|
To create a root certificate, the Distinguished Name (*DN*) is not taken into
|
|
account by bittorrent peers. You still need to specify something, but from
|
|
libtorrent's point of view, it doesn't matter what it is. libtorrent only makes
|
|
sure the peer certificates are signed by the correct root certificate.
|
|
|
|
One way to create the certificates is to use the ``CA.sh`` script that comes
|
|
with openssl, like thisi (don't forget to enter a common Name for the
|
|
certificate)::
|
|
|
|
CA.sh -newca
|
|
CA.sh -newreq
|
|
CA.sh -sign
|
|
|
|
The torrent certificate is located in ``./demoCA/private/demoCA/cacert.pem``,
|
|
this is the pem file to include in the .torrent file.
|
|
|
|
The peer's certificate is located in ``./newcert.pem`` and the certificate's
|
|
private key in ``./newkey.pem``.
|
|
|
|
session statistics
|
|
==================
|
|
|
|
libtorrent provides a mechanism to query performance and statistics counters
|
|
from its internals. This is primarily useful for troubleshooting of production
|
|
systems and performance tuning.
|
|
|
|
The statistics consists of two fundamental types. *counters* and *gauges*. A
|
|
counter is a monotonically increasing value, incremented every time some event
|
|
occurs. For example, every time the network thread wakes up because a socket
|
|
became readable will increment a counter. Another example is every time a
|
|
socket receives *n* bytes, a counter is incremented by *n*.
|
|
|
|
*Counters* are the most flexible of metrics. It allows the program to sample
|
|
the counter at any interval, and calculate average rates of increments to the
|
|
counter. Some events may be rare and need to be sampled over a longer period in
|
|
order to get userful rates, where other events may be more frequent and evenly
|
|
distributed that sampling it frequently yields useful values. Counters also
|
|
provides accurate overall counts. For example, converting samples of a download
|
|
rate into a total transfer count is not accurate and takes more samples.
|
|
Converting an increasing counter into a rate is easy and flexible.
|
|
|
|
*Gauges* measure the instantaneous state of some kind. This is used for metrics
|
|
that are not counting events or flows, but states that can fluctuate. For
|
|
example, the number of torrents that are currenly being downloaded.
|
|
|
|
It's important to know whether a value is a counter or a gauge in order to
|
|
interpret it correctly. In order to query libtorrent for which counters and
|
|
gauges are available, call session_stats_metrics(). This will return metadata
|
|
about the values available for inspection in libtorrent. It will include
|
|
whether a value is a counter or a gauge. The key information it includes is the
|
|
index used to extract the actual measurements for a specific counter or gauge.
|
|
|
|
In order to take a sample, call post_session_stats() in the session object.
|
|
This will result in a session_stats_alert being posted. In this alert object,
|
|
there is an array of values, these values make up the sample. The value index
|
|
in the stats metric indicates which index the metric's value is stored in.
|
|
|
|
The mapping between metric and value is not stable across versions of
|
|
libtorrent. Always query the metrics first, to find out the index at which the
|
|
value is stored, before interpreting the values array in the
|
|
session_stats_alert. The mapping will *not* change during the runtime of your
|
|
process though, it's tied to a specific libtorrent version. You only have to
|
|
query the mapping once on startup (or every time ``libtorrent.so`` is loaded,
|
|
if it's done dynamically).
|
|
|
|
The available stats metrics are:
|
|
|
|
.. include:: stats_counters.rst
|
|
|