A downloader for the ISIS service of TU Berlin

Related tags

Downloaderisis_dl
Overview

isis_dl

Tests

A downloading utility for the ISIS tool of TU-Berlin.

Version 0.4

Features

  • Downloads all Material from all courses of your ISIS page.
  • Efficient and dynamic checksum computing for a very good file recognition.
  • You can whitelist / blacklist courses with a given course ID.
  • Multithreaded: A fixed number of threads can be selected at start time.
  • Compatibility: This library will run with any python interpreter that is >= 3.8.

Binary functionality:

  • Building of checksums from existing files.
  • Automatic unpacking of archives.

TL;DR

  1. Use this library instead of isia-tub. It provides a superset of features while having improved performance.
  2. Install using pip install isis_dl. For a manual installation clone this repository and pip install .
  3. For a detailed explanation of Command-Line flags please run isisdl -h.
  4. The first time you run the program you will be prompted if you want to save your password. Look at section encryption for more details.

Installation

You will need a working python3.8 interpreter or above. The script will fail for python3.7 as some new python3.8 features are used.

The recommended installation is via pip - a package manager for python. If pip is not yet installed with the python interpreter run

python3 -m ensurepip

to bootstrap pip.

pip (PyPi)

I have uploaded the repository to the Python Package index (PyPi) where one can download it with the command

pip install isis_dl

Note that you are executing arbitrary code as user. Do this at your own risk!

To run the downloader simply type

isisdl

into your favorite shell.

Please note that, if the virtual environment feature is not used, the ~/.local/bin directory must be in the PATH, otherwise the executable isisdl will not be found.

Manual

Please note that you have to be in a virtual environment in order for this to work as the installation fails otherwise.

Steps:

  • Clone this repository
  • cd isis_dl
  • pip install -e .

Developing

If you want to actively contribute to this repository you will want to install the package in editable mode along with the development requirements:

pip install -r requirements_dev.txt

This creates a symlink to the source code in the pip package location. It will be treated as if it was installed there directly.

There is no method of installation without pip - as the source code expects the module isis_dl to be installed as a package.

Benchmarks

For a comparison between isia-tub and isis_dl please see the Benchmark.md file.

Documentation on features

I am planning on moving this part of the documentation to a dedicated doc site.

File recognition

The file recognition is handled in src/isis_dl/backend/checksums.py.

The main idea is to download a small portion of the file and calculate a hash based on that.

As the requests library provides a file stream, one can only download the first n Bytes and calculate the hash. The problem with this idea is that some files have a header, which is permanently changing.

Unfortunately I don't have an idea why this is the case. In order to circumvent this problem the first portion of the file is skipped based on the file type. The lookup table is located in src/isis_dl/share/settings.py - with the variable being checksum_num_bytes.

The format is <extension>: (<#bytes to ignore>, <#bytes to read>).

This means that one can also set the number of bytes to be read for each file type. For files which store a big header ( I'm looking at you .pdf) the number of bytes to be read is quite high. For others e.g. .mp4 it is not.

Note: If the file extension is not found the default entry None is consulted.

Advantages

  • Only download 512 Bytes of every file.
  • Can verify independently of directory structure / filenames.
  • Lookup is O(1) as a HashSet is used as a datastructure.
  • Up to 255 ** 512 unique files can be saved per course using this method.

Disadvantages

  • For every file in every course x Bytes have to be downloaded.
  • Files are bound to a course.

Note that a default value of 64 suffices to

Can store your password securely

The entire encryption is handled by the src/isis_dl/backend/crypt.py.

The encryption is handled via Fernet

Fernet guarantees that a message encrypted using it cannot be manipulated or read without the key. Fernet is an implementation of symmetric (also known as “secret key”) authenticated cryptography.

The key is generated based on a password you enter and then stored securely.

TODO: This is currently untested. Please enter your password manually for the moment.

Hash Settings

Beware: If you change these settings you will not be able to recover an encrypted file without restoring the settings. I would not recommend changing them.

You may select any hashing algorithm which is supported. This is any hashes.HashAlgorithm. You may also change the number of iterations, which will increase / decrease the time it takes to encrypt / decrypt respectively.

A customizable settings file

The file is located at src/isis_dl/share/settings.py. For the most part you will want to keep the default settings, but if they don't fit your needs, you may easily change them.

Download Directory

The default download directory is ~/isis_dl_downloads. As the intended installation is via pip, there is no good "current working directory", so one cannot use that.

What can be done, however, is migrating this directory to e.g. the Desktop/ or Documents/.

Acknowledgements

isia-tub

Consider checking out the gitlab

This was the original inspiration for this library. At the time isia did not offer the functionality of uri-encoding the password which lead me to create this library. I have recently implemented this functionality into isia in order to benchmark and test both solutions.

Comparison

Downloading my entire isis directory took 22m8s with isia. This is in contrast to the 11m16s it took with isis_dl

mCoding

The structure of this project is heavily inspired by the GitHub of mCoding. Consider giving their video about automated testing a shot.

You might also like...
MMDL (Mega Music Downloader) - A tool to easily download music.
MMDL (Mega Music Downloader) - A tool to easily download music.

mmdl - Mega Music Downloader What is mmdl ❓ TLDR: MMDL is a cli app which allows you to quickly and efficiently download one or multiple songs from Yo

apkizer is a mass downloader for android applications for all available versions.

apkizer apkizer collects all available versions of an Android application from apkpure.com Purpose Sometimes mobile applications can be useful to dig

Pantheon - The fastest YouTube downloader.
Pantheon - The fastest YouTube downloader.

A Youtube downloader written in Python3, using HTTP requests and an API.

Terminal based YouTube player and downloader
Terminal based YouTube player and downloader

termitube NOTE: THIS REPOSITORY IS A FORK OF mps-youtube as mps-youtube has been unmaintained for almost a year now. Features Search and play audio/vi

Youtube playlist downloader with full metadata support
Youtube playlist downloader with full metadata support

ytrake GUI tool to embed metadata for albums on Youtube with youtube-dl. Requires youtube-dl v2021.06.06. Post-processing Album metadata: Usage ytrake

Using Youtube downloader is the fast and easy way to download and save any YouTube video.
Using Youtube downloader is the fast and easy way to download and save any YouTube video.

Youtube video downloader using Django Using Django as a backend along with pytube module to create Youtbue Video Downloader. https://yt-videos-downloa

Advance Image Downloader/Extractor (Job) is a Python-Flask web-based app, which will help the user download the any kind of Images at any date and time over the internet. These images will get downloaded as a job and then let user know that the images have been downloaded by sending them a link over an email. A prometheus exporter for torrent downloader like qbittorrent/transmission/deluge
A prometheus exporter for torrent downloader like qbittorrent/transmission/deluge

downloader-exporter A prometheus exporter for qBitorrent/Transmission/Deluge. Get metrics from multiple servers and offers them in a prometheus format

bing image downloader app used to download bulk images for a specific search term created using streamlit and bing_image_downloader python packages
bing image downloader app used to download bulk images for a specific search term created using streamlit and bing_image_downloader python packages

bing image downloader app bing image downloader app is used to download bulk images for a specific search term. bing image downloader app gets the sea

Releases(untagged-44c24ae2f0ed9de798c2)
  • untagged-44c24ae2f0ed9de798c2(Oct 23, 2021)

    Version0.2

    Version 0.2 is out! yay

    Changelog:

    • Changed download mechanism from 1 Executor per course which downloads with args.num_threads to 1 Executor which downloads everything.
    • Faster instantiation of Files
    • All files get randomly shuffled for a better utilization
    • When interrupted, will finish all remaining downloads and then exit. If prompted again will instantly exit.
    • A better status indicator
    • Moved unzipping from auto → command line argument
    • More tests!
    Source code(tar.gz)
    Source code(zip)
Get the latest updates around you as they happen

Adherent We all are different, experience various things happening around us but we stick together. We are all a part of a greater community. As human

Shreyas Daniel 1 Nov 10, 2022
mescrappy - Python + Selenium Youtube scraper

mescrappy - Python + Selenium Youtube scraper Youtube Sraping With Python (Selenium) Table of Contents About The Project Built With Getting Started In

Merdan Chariyarov 12 Nov 28, 2021
Terminal based YouTube player and downloader

termitube NOTE: THIS REPOSITORY IS A FORK OF mps-youtube as mps-youtube has been unmaintained for almost a year now. Features Search and play audio/vi

Otis/Jacob Root 27 Dec 23, 2022
A Simple YouTube Video Downloader With Python

Simple YouTube Video Downloader Simple YouTube Video Downloader is an open source project with a very simple UI that tries to speed up the process of

Brian Han 2 Jan 03, 2022
lo2: Simple youtube-dl web frontend

Simple youtube-dl web frontend

Denis Volk 22 Jun 03, 2022
一个在新番更新后第一时间在dmhy等BT下载站自动下载的小工具.

Anime Track 一个在新番更新后第一时间自动下载的小工具. 可以根据自定义的关键字在dmhy等BT下载站在搜索结果更新时将磁力链发送至aria2实现自动下载. 基本功能包含: 将BT下载站的某个关键字的搜索结果的所有磁力链添加至ARIA2 自动更新aria2 trackers 将已添加的磁力

Sunky 24 Oct 12, 2022
Downloader Middleware to support Playwright in Scrapy & Gerapy

Gerapy Playwright This is a package for supporting Playwright in Scrapy, also this package is a module in Gerapy. Installation pip3 install gerapy-pla

Gerapy 85 Dec 31, 2022
Youtube-music - Youtube music with python

youtube-music fzf on https://github.com/junegunn/fzf python3 ytb.py [no/yes] yes

direskyfer 0 Feb 03, 2022
Programmers-quest - Programmer's Quest! An open source MMO built on top of the Panda3D game engine and Astron server

Programmer's Quest! Programmer's Quest! The open source Python 3 2D MMORPG showc

Jordan Maxwell 5 Oct 07, 2022
Python script designed to search and fetch direct download links from nxbrew.com

SwitchGamesDownloader Only for windows nxbrew.com is a website, accessible only using a proxy, where the majority of games for the Nintendo Switch are

Backend 91 Dec 28, 2022
Python utility to download jobs at seek.com.au

Job Seeker job_seeker is an utility to download data of a job search from seek.com.au into a csv file for data analysis and exploration Install using

PyBites 3 May 14, 2022
Download images where login is required using har python and js

이미지 다운로드(har, python, js 사용) 로그인이 필요한 사이트에서 DevTools로 이미지를 다운받는 방법은 조금 까다로웠다. 가장 쉽게 할 수 있는 방법을 찾아보았다. 사용법 F12를 눌러 DevTools를 실행 Network 탭으로 이동 페이지 새로고침

0 Jul 22, 2022
Download every approved Obsidian.md community Plugin and Theme

obsidian-repos-downloader Contents What? Why? Setup Requirements Download Run Getting Started Usage - all the arguments Output Directories Flatter Str

Clare Macrae 16 Dec 13, 2022
Simple Python script to download images and videos from public subreddits without using Reddit's API 😎

Subreddit Media Downloader Download images and videos from any public subreddit without using Reddit's API Made with ❤ by Nico 💬 About: This script a

Nico 106 Jan 07, 2023
A youtube downloader, built with flask yt-dlp

Built With Python Flask - The Python micro framework for building web applications. yt-dlp - A youtube-dl fork with additional features and fixes

Abhijith N T 13 Dec 17, 2022
TikTok downloader video without watermark from Telegram bot

⬇️ How to download video from Tik Tok via telegram bot? Send a link to the video from tik tok to our telegram bot and it will send you a video without

1 Mar 04, 2022
Python based YouTube video Downloader GUI Application.

Youtube video Downloader Python based Youtube video Downloader GUI Application. Installation Python Dependencies Import pytube pip install pytube Im

Naem Azam 1 Jan 03, 2022
The PornHub Downloader is a powerfull script used to download and manage both videos and pictures

The PornHub Downloader is a powerfull script used to download and manage both videos and pictures

16 Aug 31, 2022
This project is helps to download contents from Streamtape by utilizing the API

It scrapes Streamtape api and download contents from the site.

Debiprasad Das 5 Dec 28, 2022
An automatic beatmapset downloader via txt file, suitable for tourney mappools.

Pooler Pooler is a bulk osu! mapset downloader, perfect for use with osu! Tournament Mappools. Prerequisites Python 3.10 Requests (pip install request

Thomas 0 Feb 11, 2022