From problem to application

how to automate tasks

Agenda

About Me

Problem

Minimum Viable Solution

Improvements

Interface

Extensibility

Cleaning up

Summary

About Me

Test Automation Engineer

Python Developer
Scrum Master
Project Manager

Problem
System
Automation

piszczala.gitlab.io

Problem

Music

Gathering

1763 items

Manual Flow

Hikaru Utada & Skrillex - Face My Fears [Official Video]

Hikaru Utada & Skrillex - Face My Fears ~~[Official Video]~~

Genres: EDM; dubstep; electro; trap;

Solution!

LLameDL

Minimum Viable Solution

Working Script

import webbot

def wait_for_and_click(browser, text):
    while not browser.exists(text=text):
        pass
    browser.click(text=text)

w = webbot.Browser(showWindow=False)
w.go_to("https://portalpacjenta.luxmed.pl/PatientPortal/Account/LogOn")
w.type('name.surname', id="Login")
w.type('pass', id="TempPassword")
w.type('pass', id="Password")
w.click(tag='div', classname='submit')
wait_for_and_click(w, text="Zmień termin")
wait_for_and_click(w, text="Szukaj")
while not w.find_elements(xpath=".//div/text", tag='text'):
    pass
available = w.find_elements(xpath=".//div/text", tag='text').pop(0).text
print(available)

w.click(text="Wyloguj się")
w.go_to("https://portalpacjenta.luxmed.pl/PatientPortal/Account/LogOut")
w.close_current_tab()

import requests
from lxml import html
import xml.etree.cElementTree as ET
from collections import namedtuple

FeedTuple = namedtuple('FeedTuple', ['title', 'url', 'feed'])

feed_postfix_list = ['/rss.xml', '/atom.xml', '/feed/atom', '/feed.atom', '/feed', '/feed.xml',
                     '/feed.rss', '/atom/entries/', '/feed_rss.xml', '/rss', '/posts/rss.xml',
                     'feeds/all_rss.xml']

def get_feed(url):
    content = requests.get(url)
    print('-' * 64, content)
    if content.status_code != 200:
        print('RSS is not available for {url}'.format(url=url))
        return FeedTuple("", url, "")

    tree = html.fromstring(content.content)
    title = tree.xpath("//title").pop().text.strip()

    for feed_postfix in feed_postfix_list:
        rc = requests.get(url + feed_postfix).status_code
        # print(feed_postfix, rc)
        if rc == 200:
            return FeedTuple(title, url, url + feed_postfix)

    rss = tree.xpath("//*[@type='application/rss+xml']")
    for i in rss:
        rss_url = i.attrib.get('href', None)
        if '/comments/' not in rss_url and requests.get(rss_url).status_code == 200:
            return FeedTuple(title, url, rss_url)

    print('RSS is not available for {url}'.format(url=url))
    return FeedTuple(title, url, "")

def create_template():
    root = ET.Element("opml", version="1.0")
    doc = ET.SubElement(root, "body")
    return root, doc

def add_subelement(doc, feed):
    ET.SubElement(doc, "outline", type="rss", text=feed.title, title=feed.title, xmlUrl=feed.feed, htmlUrl=feed.url)
    return doc

def save(root):
    tree = ET.ElementTree(root)
    tree.write("opml.xml")

if __name__ == '__main__':
    RSS_URLS_list = [
        'http://switchweekly.com/',
    ]
    root, doc = create_template()
    for rss in RSS_URLS_list:
        try:
            y = get_feed(rss)
            add_subelement(doc, y)
        except Exception as e:
            print(e)
            print(rss)
    save(root)

"""
Get URLs from evernote note
xml is starting with <en-note id="en-note" class="editable" g_editable="true" contenteditable="true">
"""
import re
import xml.etree.ElementTree as ET

tree = ET.parse('evernote.xml')
root = tree.getroot()
for child in root:
    if child.findall('.//span'):
        text = "# " + child.findall('.//span')[0].text
        print(text)
        print("")
    elif child.findall('.//a'):
        text = child.findall('.//a')[0].text
        print(re.sub('(\\n| +)', ' ', text))
        print("<{}>".format(child.findall('.//a')[0].attrib['href']))
        print("")
    else:
        print(child.findall('.//'))

"""
Change filenames for Xiaomi 3S files to format 'yyyy-mm-dd hh-mm-ss'
"""
import re
import os


def get_new_filename(filename):
    result = re.search("\w*_(\d{4})(\d{2})(\d{2})_(\d{2})(\d{2})(\d{2})(?:_\w*?)\.(\w*)", filename)
    if result:
        year, month, day, hour, minute, second, f = result.groups()
        return "-".join([year, month, day]) + ' ' + ".".join([hour, minute, second, f])
    else:
        return ""


if __name__ == '__main__':
    directory = "/media/jarek/Work [HDD]/290118/New folder/DCIM/Camera"
    dir_files = os.listdir(directory)
    for file in dir_files:
        new_file = get_new_filename(file)
        if new_file:
            os.rename(os.path.join(directory, file), os.path.join(directory, new_file))

Readme

Improvements

Existing libraries

last.FM scraper musicbrainzngs

Selenium WebBOT

Unit Tests

"Test Pyramid"

Refactoring

https://refactoring.guru/

Static Code Analysis

Logical
Stylistic

Analytical

Logical

Static types, detect various errors, security issues

MyPy, Bandit, PyFlakes, Pyre

Stylistic

Style convention, docstrings

pycodestyle, pydocstyle

Analytical

McCabe complexity, lines of code...

Mccabe, Radon

Continuous Integration

Interface

CLI

Argparse

Python Fire

Click

Docopt

.

entry_points

Command-Line Options

http://www.catb.org/esr/writings/taoup/html/ch10s05.html

"The Art of Unix Programming" Chapter 10. Configuration

GUI

Web frameworks

llamedl --bookmark_name music_to_download --directory_path /home/jarek/Music

Extensibility

Design Patterns

Creational

Structural

Behavioral

https://refactoring.guru/

Python Patterns

https://python-patterns.guide

Composition over inheritance

Inheritance is when you design your types around what they are, and composition is when you design types around what they do.

├── downloaders
│   ├── basedownloader.py
│   └── youtubedownloader.py
├── tagsproviders
│   ├── basetags.py
│   ├── filetags.py
│   ├── lastfmtags.py
│   ├── musicbrainzgstags.py
├── urlproviders
│   ├── basebrowser.py
│   ├── baseurl.py
│   ├── chromeurls.py
│   └── userurl.py

│  
│  
│  
│  
├── iyoutube.py
│   
├── tagger.py
│   
├── ichrome
│  
│  
│  
│

Cleaning up

Documentation

Tutorials

How to guides

References

Explanations

Refactoring

Autoformating

autopep8
yapf

What's next?

Docker
Support for other browsers
Support for other sources

Summary

Problem
System
Automation

Problem

Define a problem

Create "Definition of Done"

System

Steps list

Flowchart

Decision tree

Pipeline

Automation

Automate elements of the system

Solution
Interface
Extensibility

Solution

creating a working code

Interface

increasing coverage of use

adaptation to other needs

Extensibility

easy testing

easy to expand

From problem to application

Agenda

About Me

Test Automation Engineer

Problem System Automation

Problem

Music

Gathering

1763 items

Manual Flow

Solution!

LLameDL

Minimum Viable Solution

Working Script

Readme

Improvements

Existing libraries

last.FM scraper musicbrainzngs

Selenium WebBOT

Unit Tests

Refactoring

https://refactoring.guru/

Static Code Analysis

Logical Stylistic

Analytical

Logical

Stylistic

Analytical

Continuous Integration

Interface

CLI

Argparse

Python Fire

Click

Docopt

.

.

.

Command-Line Options

GUI

Web frameworks

Extensibility

Design Patterns

Creational

Structural

Behavioral

https://refactoring.guru/

Python Patterns

https://python-patterns.guide

Composition over inheritance

Cleaning up

Documentation

Refactoring

Autoformating

autopep8 yapf

What's next?

Summary

Problem System Automation

Problem

System

Automation

Solution Interface Extensibility

Solution

Interface

Extensibility

Thanks for your time

Questions?

Od problemu do aplikacji

More from Jarek Pi

Problem
System
Automation

Logical
Stylistic

autopep8
yapf

Problem
System
Automation

Solution
Interface
Extensibility