You cannot select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

109 lines
7.3 KiB
Markdown

<p align="center">
<img height=200 src="https://user-images.githubusercontent.com/7591552/100047500-b6011c00-2e66-11eb-84b2-5c75d60d4d2a.png">
</p>
# CoastSnap Automation - Batch download, registration and tagging
## About this repo
This repo aims to automate the workflow to generate timelapses and do shoreline mapping with CoastSnap (Spotteron) images. It contains a couple of python/photoshop scripts that were written for the intended workflow: Download -> Register -> Tag -> Manually remove bad images in 'Registered'. See `workflow.pptx`, for a more informative schematic of the intended workflow. Batch downloading code is an extension on the Leaman CoastSnap Toolbox (http://git.wrl.unsw.edu.au:3000/chrisl/coastsnap.git)
This repo can:
* Download images stored on the Spotteron app service `batch_spotteron_download.bat`
* Register the downloaded images in photoshop `batch_photoshop_registration.bat`
* Tag registered images with the image date, time, tide and contributor `batch_tag_registered.bat`
## Getting started
### Prerequisites
This package has only been tested on a Windows system.
Using the [Anaconda distribution](https://www.anaconda.com/products/individual) of Python is recommended for easiest installation of the toolbox requirements.
### Installation
1. Clone this repo onto your PC: `git clone http://git.wrl.unsw.edu.au:3000/jonoc/CoastsnapAuto.git`
2. Change your working directory into the repo: `cd CoastsnapAuto`
3. Create a new conda environment called "coastsnap" with the required dependencies. This is the same environment as for the Leaman CoastSnap Toolbox, so you may already have it. If you're not sure, run: `conda info --envs` and look for "coastsnap". Otherwise, create the environment by runnning:
`conda env create -f environment.yml`
### Demo (optional)
The following is a demo to make sure all of the code is working and for the user to get a sense of the intended workflow. It'll take approx. 10 mins (assuming everything works). The code will run on a demo CoastSnap directory (`CoastSnap_DEMO`) with 3 sites.
1. Open up `coastsnap_sites.csv`. Change the path name in cell E2, to the
path of the CoastsnapAuto/CoastSnap_DEMO directory.
2. Open the CoastsnapAuto/coastsnap directory
3. Run `batch_spotteron_download.bat` [30s]
- This downloads 5 images each for 3 sites and stores them in CoastSnap_DEMO/Images/Site_Name/Processed)
4. Run `batch_photoshop_registration.bat` [2 mins]
- This registers the 15 images in photoshop and stores them in CoastSnap_DEMO/Images/Site_Name/Photoshop
5. Run `batch_tag_registered.bat` [3 mins]
- This tags the photoshop registered images and stores them in CoastSnap_DEMO/Images/Site_Name/Registered. Note: Loading .mat tide data in Python takes a while.
6. Delete any poorly registered/tagged images in CoastSnap_DEMO/Images/Site_Name/Registered
7. Run `generate_images_snapshot.bat`
- This generates a snapshot of the CoastSnap images directory at `images_snapshot.csv`
### Update file paths
To run this code on the shared oneDrive CoastSnap directory:
1. Delete `coastsnap_sites.csv`
2. Rename `coastsnap_sites_ALL.csv` to `coastsnap_sites.csv`
3. Open `coastsnap_sites.csv` and change the path name in cell E2, to the oneDrive CoastSnap directory.
NOTE: `coastsnap_sites.csv` is called by all of the scripts in this repo. DON'T CHANGE THE STRUCTURE OF THE CSV, as it'll cause the registration.jsx scripts to die. Feel free to add more sites (rows) though.
### Updating repo with latest code
1. Open your terminal and make sure you're in the directory you cloned the coastsnap repo into.
2. Ensure you have the coastsnap conda environment active: `activate coastsnap`
3. Pull the latest changes: `git pull`
4. Update the environment: `conda env update --name coastsnap --file environment.yml`
## Usage
### Batch download images from Spotteron
Run `batch_spotteron_download.bat`
Script Logic: For every site in `coastsnap_sites.csv`, download the latest Spotteron images and save to Processed.
* Starts downloading from the most recent image and stops downloading for the site when an image has already been downloaded, to avoid overwriting images.
### Batch register images
Run `batch_photoshop_registration.bat`
Script Logic: For every site in `coastsnap_sites.csv`, iterate through the sites' Processed images (Images/Site/Processed). For any image that hasn't already been photoshop registered (in Images/Site/Photoshop), register it.
* To get rid of the popup error when running this script (which you'll want to do if running this as a scheduled task):
1. Create a text file
2. Add this to the file
```sh
# Force VM Buffering
VMForceBuffering 1
WarnRunningScripts 0
```
3. Call it PSUserConfig.txt and save as C:\Users\z5079346\AppData\Roaming\Adobe\Adobe Photoshop 2022\Adobe Photoshop 2022 Settings\PSUserConfig.txt
(changing the zID and Photoshop version if required)
* This script is based on the manual photoshop registration process, but additionally resizes images to the same width or height of the Target image prior to registration: https://unsw.sharepoint.com/:w:/r/sites/Coastsnap/_layouts/15/doc2.aspx?action=edit&sourcedoc=%7B49c9d377-f450-40bf-93e4-6ff0f289e889%7D&wdOrigin=TEAMS-ELECTRON.teams.chiclet&wdExp=TEAMS-CONTROL
* Output images are all 1280px wide and will have the same aspect ratio as the sites' Target.jpg image
* Batch Size: Images are registered with the target and seed images, in batches of 15. This is set on line 13 of `photoshop_registration_all_sites.jsx` and can be changed. Images may be registered in smaller batch sizes if there are less than 15 images in the Site/Processed/Year folder.
### Batch tag images
Run `batch_tag_registered.bat`
Script Logic: For every site in oneDrive CoastSnap directory, iterate through the sites' registered images (Images/Site/Photoshop). Tag the images with their Date, Time, Contributor and Tide (if tide data for site exists), suffix the filename with "_registered" and save at Images/Site/Registered.
* Starts tagging from the most recent image and stops for the site when an image has already been tagged. This way, the user can manually remove bad registered/tagged images, and they will not be automatically replaced.
* Retrieves tide data for the site from the .mat file specified in Database/CoastSnapDB.xlsx
### Images Snapshot
Run `generate_images_snapshot.bat`
Generates `images_snapshot.csv` which contains information about the CoastSnap oneDrive Images directory. Columns include:
site | # processed | # photoshop | # registered | stability | most recently deleted
* stability (%) = # registered / # processed. This formula is based on the assumption that someone will manually remove poorly registered images in `Images/Registered`. Thus stability represents the percentage of images that had good registration.
* most recently deleted. This is the image date of the most recently deleted in `Images/Registered`. Could be useful to know for manual refining, so the user doesn't have to check every image each time.
## Future Improvements
### Image metadata
Currently (22/6/22) it would appear that images downloaded from Spotteron do not retain the images' metadata. This is based on looking in windows file explorer image->properties, as well as using the exif python package. Note: The metadata presented in file explorer is IPTC data. There is a python package to interact with this data, but I had issues with it.