Moksh Jain, 16IT221
Nishanth Hebbar, 16IT234
Suyash Ghuge, 16IT114
Abhishek Kamal, 16IT202
Code at: github.com/MJ10/Unix-Project
Machine learning is growing in popularity.
Machine learning models require large amounts of data.
Categorized datasets are hard to find.
Creating new datasets is hard for beginners.
Can be used by beginners to create toy datasets to test machine learning models.
Scrapes Google Search results for a particular category name, and downloads the specified number of images.
Scraped images can be resized to the desired dimensions and saved.
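The URL-extraction part of the scraping step can be sketched as a small pipeline. This is a minimal sketch under stated assumptions, not the project's actual script: the function name `extract_image_urls`, the regular expression, and the sample HTML are all made up for illustration, and real search result pages use different markup that changes over time.

```shell
#!/usr/bin/env bash

# Hypothetical helper: read HTML on stdin and print at most $1 image URLs,
# one per line.
extract_image_urls() {
    local limit="$1"
    # Assumed, simplified URL shape; a real result page needs its own pattern.
    local pattern="https?://[^\"' <>]+\.(jpg|jpeg|png)"
    # -E enables extended regular expressions (what egrep provides);
    # -o prints only the matched text, one match per line.
    grep -Eo "$pattern" | head -n "$limit"
}

# Made-up sample page, used here instead of a live search request.
sample_html='<img src="https://example.com/a.jpg"><img src="https://example.com/b.png"><img src="https://example.com/c.jpg">'

# Print the first two image URLs found in the sample.
printf '%s\n' "$sample_html" | extract_image_urls 2
```

In a full run the HTML would presumably come from `curl` and each printed URL would be handed to `wget`; both are left out here so the sketch runs without network access.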
awk
convert
curl
cat
egrep
mkdir
rm
rmdir
wget
which
...
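Because the tool relies on the external commands listed above, a script like this would typically check that they are installed before doing any work. The following is a minimal sketch of such a check using `which`; the function name `missing_commands` and the particular commands checked are assumptions for illustration, not taken from the project.

```shell
#!/usr/bin/env bash

# Hypothetical helper: print the name of every argument that is not
# found on PATH, one per line.
missing_commands() {
    local cmd
    for cmd in "$@"; do
        # `which` exits with a non-zero status when it cannot find the command.
        if ! which "$cmd" >/dev/null 2>&1; then
            printf '%s\n' "$cmd"
        fi
    done
}

# Example check for a few of the commands the project uses.
missing="$(missing_commands awk curl wget convert)"
if [ -n "$missing" ]; then
    # Left unquoted on purpose so the names print on a single line.
    echo "Missing commands:" $missing
else
    echo "All required commands found."
fi
```

The output depends on what is installed on the machine; `convert`, for example, is only present when ImageMagick is installed.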
awk -F <delimiter> <action> <file>
convert "$input_file" -resize "${RESIZE_WIDTH}x${RESIZE_HEIGHT}!" "$output_file" &
which [filename/command]
curl [options] [URL...]
wget [options] [url]
egrep [options] pattern [file...]
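As an illustration of the `awk -F <delimiter> <action>` form above, the sketch below splits a URL on `/` and prints the last field, which is one way a file name could be derived from a download URL. The helper name `url_basename` and the example URL are assumptions for illustration; the project may use `awk` differently.

```shell
#!/usr/bin/env bash

# Hypothetical helper: for each URL read on stdin, print the text after
# the last '/'.
url_basename() {
    # -F '/' sets the field separator; $NF is the last field on the line.
    awk -F '/' '{ print $NF }'
}

printf '%s\n' "https://example.com/images/cat_01.jpg" | url_basename
# prints: cat_01.jpg
```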
https://www.gnu.org/software/wget/manual/
https://curl.haxx.se/docs/manpage.html
https://www.imagemagick.org/script/convert.php
https://linux.die.net/man/1/egrep
https://www.gnu.org/software/gawk/manual/gawk.html
https://images.google.com