Commons:Batch uploading/Geograph Worldwide
Geograph Worldwide
Based on the discussion section from Commons talk:Batch uploading/Geograph#Uploading from Geograph sister sites
Sources to upload from
Geograph Channel Islands – https://www.geograph.org.gg/ (2,500+ files)
Geograph Deutschland – https://geo-en.hlipp.de/ (260,000+ files)
Geograph Ireland – https://www.geograph.ie/ (mirror site for files relating to Ireland from the Geograph Britain and Ireland database)
License
I believe all media from Geograph websites are licenced under Attribution-ShareAlike 2.0 Generic (CC BY-SA 2.0)
Description
- Do the media URLs follow a pattern? – Yes, most links to media follow this pattern: "domain name/photo/sequential number".
- Does the site have an API? – I believe so.
- What else could ease uploading? – I don't know.
- Did you contact the site owner? – No, but the licence information for the media is clearly stated, and I believe the Geograph team knows about the project to archive the media contain on the website(s) to Wikimedia Commons (at least for Geograph Britain and Ireland).
- Is there a template that could be used on the file description pages, or should one be created? – There seems to be a standard template that is used to upload media files from Geograph Britain and Ireland; that could be modified for the other sites since all of them are quite similar. (See: Template:Geograph from structured data)
@GeographBot, and person/people running it, have done a great job over the years uploading nearly all the media files from Geograph Britain and Ireland, and the bot is basically up to date now. The site stills sees a lot of uploads daily, meaning there is a consistent backlog of about ±20,000 files. I'm sure the bot and the person/people who run it are busy with that, so I'm requesting that another bot is created to handle the backlog from the other Geograph sites.
Since they are similarly designed, I think the upload code for GeographBot could be used here. Other than Geograph Britain and Ireland, the other sites do not receive a lot of uploads (if any) daily. After the initial batch upload, a weekly or monthly sweep of the databases could bring the file numbers on Wikimedia Commons up to date, and likely in less than two hours. I know that the overall history of GeographBot batch uploading to Commons has not been smooth for one reason or another, but the other Geograph databases should be less intensive to upload. Though, I apologize in advance if I'm bringing up a contentious and/or settled discussion here.
Have a nice day!
Opinions
| Assigned to | Progress | Bot name | Category |
|---|---|---|---|