A script for backing up Tumblr posts and likes →

backup_tumblr

This is a set of scripts for downloading your posts and likes from Tumblr.

The scripts try to download as much as possible, including:

Every post and like All the metadata about a post that's available through the Tumblr API Any media files attached to a post (e.g. photos, videos)

I've had these for private use for a while, and in the wake of Tumblr going on a deletion spree, I'm trying to make them usable by other people.

A script for backing up Tumblr posts and likes →

Pictured: a group of Tumblr users fleeing the new content moderation policies. Image credit: Wellcome Collection , CC BY.

Getting started

Install python 3.6 or later. Instructions on the Python website .

Check you have pip installed by running the following command at a command prompt:

$ pip3 --version pip 18.1 (python 3.6)

If you don't have it installed or the command errors, follow the pip installation instructions

Clone this repository:

$ git clone git@github.com:alexwlchan/backup_tumblr.git $ cd backup_tumblr

Install the Python dependencies:

$ pip3 install -r requirements.txt

Get yourself a Tumblr API key by registering an app at https://www.tumblr.com/oauth/apps .

You need the OAuth Consumer Key from this screen:

Usage

There are three scripts in this repo:

save_posts_metadata.py save_likes_metadata.py save_media_files.py

They're split into separate scripts because saving metadata is much faster than media files.

You should run (1) and/or (2), then run (3). Something like:

$ python3 save_posts_metadata.py $ python3 save_likes_metadata.py $ python3 save_media_files.py

If you know what command-line flags are: you can pass arguments (e.g. API key) as flags. Use --help to see the available flags.

If that sentence meant nothing: don't worry, the scripts will ask you for any information they need.

Unanswered questions and notes

I have no idea how Tumblr's content blocks interact with the API, or if blocked posts are visible through the API.

I've seen mixed reports saying that ordering in the dashboard has been broken for the last few days. Again, no idea how this interacts with the API.

Media files can get big. I have ~12k likes which are taking ~9GB of disk space. The scripts will merrily fill up your disk, so make sure you have plenty of space before you start!

These scripts are provided "as is". File an issue if you have a problem, but I don't have much time for maintenance right now.

Sometimes the Tumblr API claims to have more posts than it actually returns, and the effect is that the script appears to stop early, e.g. at 96%.

I'm reading the total_posts parameter from the API responses, and paginating through it as expected -- I have no idea what causes the discrepancy.

Acknowledgements

Hat tip to @cesy for nudging me to post it, and providing useful feedback on the initial version.

Licence

MIT.

A script for backing up Tumblr posts and likes →

Trending Articles

瓶男消失十天，又出现了 (豆瓣我爱我恨水瓶男小组)

PCBETA Milestone要多久可以升级啊

关门一家亲：习远平、张澜澜、徐才厚

新年礼6[晨曦制作][魔动王 Granzot][BDrip][1080P][HEVC Ma10p FLAC MKV]

mp3DirectCut 2.39 免安裝中文版 - MP3切割軟體音樂剪輯軟體

【3.8.X】请教一个关于多节点同步动画的问题

[閒聊] 新竹湖口N2優質網咖

大佬们app端文件分片报错“ReferenceError: nativeFileManager is not defined”

狂賀，校安盃足球賽，西屯國小U12組冠軍

【台積電IT卓越新戰略5】台積IT組織5年三次大調整，要靠平臺工程讓DevOps創新再加速

曾智希写真集12.1预购首次挑战全裸「浴照」

【梦奇字幕组】★古畑任三郎★ Season 1 Episode 04 杀人传真 [720P][MKV]

《沈冰自述——我和周永康的故事》全本

搞笑麻将漫画「3年B组一八先生」被网友吐槽“杀人麻将”？！

中软国际中期业绩喜人，归属于母公司净利同比大增69%

免费翻墙节点大全

台南火車站周邊店面地坪價約130~170萬元

出售: 中村製作所 - NSIT-3500 Pro 隔離牛

想看迪斯科与核战争

具身智能创企“维他动力”完成天使轮融资