# Weebly Blog Post Scraper

> How to get yourself out of Weebly hosting without losing your content?

## Why?

Weebly lets you "back up" a site, but the backup does not include the actual posts or content published on it. This is predatory bullshit designed to prey on clients who lack technical skills and to lock them into the service.

This is a very simple script that scrapes a Weebly site's blog posts into Markdown files, which can be used with tools like Hugo or Jekyll, or simply read by hand. To import Markdown files into WordPress, see [this link](https://tyler.io/importing-jekyll-posts-into-wordpress/).
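
As a rough illustration of what a scraped post could look like on disk, the sketch below renders one post as a Markdown document with YAML front matter, which both Hugo and Jekyll read, and builds a Jekyll-style file name for it. The function names (`slugify`, `post_filename`, `render_post`), the choice of front matter fields, and the sample post are assumptions made for this example; they are not taken from the script itself.

```python
import datetime
import json
import re


def slugify(title):
    """Turn a post title into a lowercase, hyphen-separated file name stem."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower())
    return slug.strip("-") or "untitled"


def post_filename(title, date):
    """Build a Jekyll-style file name: YYYY-MM-DD-title.md."""
    return f"{date.isoformat()}-{slugify(title)}.md"


def render_post(title, date, body):
    """Return the Markdown document: YAML front matter, then the post body."""
    # json.dumps yields a double-quoted string, which is also valid YAML,
    # so titles containing colons or quotes stay safe.
    front_matter = [
        "---",
        f"title: {json.dumps(title, ensure_ascii=False)}",
        f"date: {date.isoformat()}",
        "---",
    ]
    return "\n".join(front_matter) + "\n\n" + body.strip() + "\n"


if __name__ == "__main__":
    # Hypothetical post, for illustration only.
    date = datetime.date(2017, 3, 14)
    print(post_filename("Hello, World!", date))
    print(render_post("Hello, World!", date, "First post body."))
```

Files named this way can be dropped into a Jekyll `_posts/` directory; Hugo does not require the date prefix but reads the same front matter.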

## Install it

Start by installing these requirements:

* [Python 3](https://wiki.python.org/moin/BeginnersGuide/Download)
* [Beautiful Soup](https://www.crummy.com/software/BeautifulSoup/bs4/doc/#installing-beautiful-soup)

Then clone the repository:

```shell
git clone https://labs.pixelswap.fr/HorlogeSkynet/weebly-scraper.git
```

## Use it

To run the script from the command line, specify the target Weebly website and the number of pages you want to attempt to scrape (if you don't know, try a large number):

```shell
cd weebly-scraper/
python3 weebly-scrapy.py http://example.weebly.com/ 20
```
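
The advice to "try a large number" works as long as the scraper tolerates page numbers that do not exist. The sketch below shows one way to do that: it walks pages 1 to N and stops at the first page that yields no posts. `scrape_pages`, `fetch_posts`, and the fake three-page site are hypothetical names used for illustration; whether the script itself stops early in this way is an assumption.

```python
def scrape_pages(fetch_posts, max_pages):
    """Collect posts from pages 1..max_pages, stopping at the first empty page.

    fetch_posts is any callable that takes a page number and returns a list
    of posts (an empty list when the page does not exist).
    """
    posts = []
    for page in range(1, max_pages + 1):
        page_posts = fetch_posts(page)
        if not page_posts:
            # Past the last real page: an oversized max_pages costs one
            # extra request instead of max_pages of them.
            break
        posts.extend(page_posts)
    return posts


if __name__ == "__main__":
    # Stand-in for a real site with three pages of posts.
    fake_site = {1: ["post-a", "post-b"], 2: ["post-c"], 3: ["post-d"]}
    found = scrape_pages(lambda page: fake_site.get(page, []), 20)
    print(len(found), "posts found")
```

Passing the fetcher in as a parameter keeps the loop independent of how pages are downloaded and parsed, so it can be exercised without network access.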

## Acknowledgment

This script was originally written by [William GILLIS](https://github.com/rechelon) and later improved by [Samuel FORESTIER](https://github.com/HorlogeSkynet).