640_project
This is where the images are stored. /Imgaes/PreRequest contains the images before the edits. /Images/PostRequest are results (targets). These are the input images. Following the link in the Images.csv will display this image for each request. These were the images fed to both stable diffusion and Dall-e 2 to generate the images in those folders. These are the human photoshopped images. We selected them from the comments of the reddit posts. We tried to select the image that we deemed the most accurate to the request. These are the images resulting from passing the prompt and the PreRequest image into stable diffusion. These images are the PreRequest images that were passed through Dalle-2. These images requeired a user generated mask. We did these using our best judgement based on the image and the request text. We selected the image that was the most representative of the Dall-e 2 results. If the results were consistently good, we chose a good image. If there was a salient error throughout the Dall-e 2 results for a given input image and request, we tried to select the image best portrayed the behavior of the model. This file contains data on each image with the following headers:id: unique identifier
pre_request_file_name: File name for the pre-request (input) image.
post_request_file_name: Human generated photoshopped image.
sd_file_name: Stable diffusion result.
dalle_2_name: Dall-e 2 result.
request_text: The request provided from reddit titles. We did some cleaning to make it more concise.
reverse_request: This is a data augmentation opportunity that we will look into in later steps. Our thinking is that we can swap the photoshopped and the input images to generate the opposite request.
link: Link the the reddit image source.
watermark: If a watermark is present.
difficulty_: A column for each group member to asses the request difficulty, as well as our average. Scores between 1 to 5.
Images and requests displayed.