Transformation:

XPath and XSLT

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

Tiziana Mancinelli

University of Bologna

 

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

Aims of today's laboratory:

 

 

  • Investigate the structure of TEI/XML documents
  • Transform and visualise our TEI/XML
  • Therefore - we learn basic syntax of XSLT and XPATH
  • create different outputs with XSLT

 

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

Transformation

 

 

<?xml version="1.0" encoding="UTF-8"?>
    <xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform" version="2.0">

    <xsl:output method="xhtml"
        doctype-system="http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"/>

    <xsl:template match="/">
        <html>
            <head>
                <title><xsl:value-of select="//title"/> 
                written by <xsl:value-of select="//author"/></title>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>
    <xsl:template match="title">
        <h1>
            <xsl:apply-templates/>
        </h1>
    </xsl:template>
    <xsl:template match="author">
        <p><i>
            <xsl:apply-templates/>
        </i></p>
    </xsl:template>
    <xsl:template match="stanza">
        <xsl:apply-templates/>
        <br/>
    </xsl:template>
    <xsl:template match="line">
        <div>
            <xsl:apply-templates/>
        </div>
    </xsl:template>
</xsl:stylesheet>

An example:

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

XSL (eXtensible Stylesheet Language)

is a styling language for XML.

 

 

XSLT stands for XSL Transformations.

FROM XML to DIFFERENT OUTPUTS

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

XSL (eXtensible Stylesheet Language)

is a styling language for XML.

 

 

 Transformations

 to DIFFERENT OUTPUTS

HTML

PDF

TXT

XML

ePub

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

Transformation

 

 

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>
  • Xpath interrogates and navigates XML documents

  • XSLT depends on it

 

 

 

 

 

What You Should Already Know

 

Before you continue you should have a basic understanding of the following:

 

  • HTML
  • CSS
  • XML

Recommend editors:

 

 

 

Check down for further instructions

about how to use those two tools

 

 

First exercise

Let's make our first transformation with default XSLT within oXygen!

  1. Open your XML file
  2. Let's create our first SCENARIO 

HOW makes "scenarios"

  1. click on the tool icon

HOW makes "scenarios"

2. Click on the format you would like to choose (HTML/PDF)

HOW makes "scenarios"

3.

How to use XSLTransform plugin in Atom

Download the plugin

How to use XSLTransform plugin in Atom

Download the plugin

How to use XSLTransform plugin in Atom

Download the plugin

How to use XSLTransform plugin in Atom

Apply XSLT with Atom

what to do to navigate an XML documenti and transform ITS elements in something ELSE?

We are going to transform our XML into HTML

Navigate around the tree, selecting nodes by a variety of criteria

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

 

XPath is a major element in the XSLT standard.

 

XSLT uses XPath to find information in an XML document.

 

XPath can be used to navigate through elements and attributes in an XML document.

 

A language to describe how to locate a part of an XML document


Used in many XML-based technologies and tools

 

What is XPath?

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

 

How XPath works

In XPath, there are seven kinds of nodes: element, attribute, text, namespace, processing-instruction, comment, and document nodes.

 

XML documents are treated as trees of nodes. The topmost element of the tree is called the root element.

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>
Expression Description
nodename Selects all nodes with the name "nodename"
/ Selects from the root node
// Selects nodes in the document from the current node that match the selection no matter where they are
. Selects the current node
.. Selects the parent of the current node
@ Selects attributes
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>
Wildcard Description
* matches any element node
@* matches any attribute node
node() matches any node
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

A bare-bones path expression is similar to filesystem addressing: if the path starts with a solidus (/ aka forward slash), then it represents a path from the root; if it does not start with a solidus then it represents a path from here

/TEI/teiHeader/fileDesc/titleStmt/title
<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

body

div

div

head

head

lg

lg

lg

lg

lg

lg

TEI/XML structure

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

body

div

div

head

head

lg

lg

lg

lg

lg

lg

TEI/XML structure

/body/div/

Toolbar in oXygen

Let's open the file:

  pilot-proemio.xml

  

some xpath

/TEI/teiHeader/fileDesc/titleStmt/respStmt/name

Ensure the box is labelled XPath 2.0 (or XPath 3.0). Then type in

/TEI/teiHeader/fileDesc/titleStmt/respStmt/name

some xpath

//p

// Selects nodes in the document from the current node that match the selection no matter where they are

some xpath

//del/@hand

@ Selects attributes

Some Xpath

//del[@hand='overwritten']

[ ] square brackets are used to create conditions

Take all the 'del' elements with an attribute 'hand' with value 'overwritten'

count(//del[@type='overwritten'])

xpath has function

What is a function?

In programming, a named section of a program that performs a specific task

Some Xpath

ancestor
ancestor-or-self
attribute
child
descendant
descendant-or-self
following
following-sibling
namespace
parent
preceding
preceding-sibling
self

axes

An axis (plural axes) is a set of nodes relative to a given node; X::Y means “choose Y from the X axis”

self:: is the set of current nodes (not too useful) self::node() is the current node

child:: is the default

/child::X is the same as /X

parent:: is the parent of the current node

ancestor:: is all ancestors of the current node, up to and including the root

descendant:: is all descendants of the current node (Note: never contains attribute or namespace nodes)

preceding:: is everything before the current node in the entire XML document

following:: is everything after the current node in the entire XML document

Axes

examples

example:
//pb[@facs="#p21r"]/ancestor::div

Equality tests

  •   = means “equal to” (Notice it’s not ==)
  •  != means “not equal to”
    • But it’s not that simple!
  • value = node-set will be true if the node-set contains any node with a value that matches value
  • value != node-set will be true if the node-set contains any node with a value that does not match value
    • Hence,
  • value = node-set and value != node-set may both be true at the same time!

Other boolean operators

  • and (infix operator)
  • or (infix operator) 
    • Example: count = 0 or count = 1
  • not() (function)
    • The following are used for numerical comparisons only:
  • < “less than” Some places may require &lt;
  • <= “less than Some places may require &lt;= or equal to”
  • > “greater than” Some places may require &gt;
  • >= “greater than Some places may require &gt;= or equal to”
 

Some XPath functions

  • XPath contains a number of functions on node sets, numbers, and strings;
    • here are a few of them:
  • count(elem) counts the number of selected elements
    • Example: //chapter[count(section)=1] selects chapters with exactly two section children
  •  name() returns the name of the element
    • Example: //*[name()='section'] is the same as //section
  •  starts-with(arg1, arg2) tests if arg1 starts with arg2
    • Example: //*[starts-with(name(), 'sec']
  • contains(arg1, arg2) tests if arg1 contains arg2
    • Example: //*[contains(name(), 'ect']
 

XSLT

(eXtensible Stylesheet Language Transformations)

<xsl:stylesheet xmlns:xsl="http://www.w3.org/1999/XSL/Transform"
    xmlns:tei="http://www.tei-c.org/ns/1.0"
    xmlns:xs="http://www.w3.org/2001/XMLSchema"
    exclude-result-prefixes="xs"
    version="2.0">
    
    <xsl:output method="html"/>
    <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
    </xsl:template>

Transformation

 

 

 

 

<?xml version="1.0" encoding="utf-8"?>
<poem>
     <head>Chapter 7th</head>
     <author></author>
    <div type="poem">
     <l>It was on a dreary night of Novembe</l>
</poem>
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
   <meta/>
 </head>
<body>
   <h1>Untitled</h1>
   <p>It was on a dreary night of Novembe</p>
   </body>
</html>

XML

HTML

XML HTML
<TEI>
      ...
       <body>
          <text>
            <body>
             
            </body>
         </text>
       </body>
</TEI>
<html>
     <head>
      <head>
      <body>
         
      </body>
</html>
 

xslt

XSL (eXtensible Stylesheet Language) is a styling language for XML. XSLT stands for XSL Transformations.

Make a new file: File > New..

XML declaration

 

Namespace

 

  • XML Namespaces provide a method to avoid element name conflicts.
  • XML standard way to use two or more XML vocabularies
  • In XSLT there are at least two vocabularies (can be more):     XSLT and HTML

Namespaces

TEI Namespace

xmlns:tei="http://www.tei-c.org/ns/1.0"

<xsl:output/>

<xsl:output method=" HTML "> 

 

The <xsl:output> element defines the format of the output document.

<xsl:template>

The <xsl:template> element is used to build templates.

 

A template (definition from Wiktionary) is

a generic model or pattern from which other objects are based or derived

<xsl:template>

The <xsl:template> element is used to build templates.

 

The match attribute is used to associate a template with an XML element. The match attribute can also be used to define a template for the entire XML document. The value of the match attribute is an XPath expression (i.e. match="/" defines the whole document).

 

<xsl:template match="node">
[materials to include before node's content]
    <xsl:apply-templates/>
[materials to include after node's content]
</xsl:template> 

<xsl:template>

<xsl:apply-templates>

  • Indicates where to put the node matched by the template
  • In process the node an all its content (other nodes included!)

Our first instruction:

Transform the element root <TEI> into <html>

Let's learn by doing it

Add to your xslt the template that matches with your root element:

 <xsl:template match="tei:TEI">
        <html>
            <head>
                <meta charset="UTF-8"/>
            </head>
            <body>
                <xsl:apply-templates/>
            </body>
        </html>
 </xsl:template>

Add to your xslt the template that matches with the first element:

Text

 <xsl:template match="/">

   <xsl:apply-templates select="/tei:TEI/tei:teiHeader" />

 </xsl:template>

This to create a HTML that matches with the tei:Header

Text

 <xsl:template match="/">

   <xsl:apply-templates select="/tei:TEI/tei:teiHeader" />

 </xsl:template>

Add the <title> element to the template

Text

<head>
     <title><xsl:value-of select=".//tei:title" /></title>
     <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
 </head>

Text

<xsl:template match="//tei:head">
        <h2>
           <ins><xsl:apply-templates/></ins>
        </h2>
</xsl:template>
    

HTML tags

The <h1> to <h6> tags are used to define HTML headings.

<ins>...</ins> underline the text:

https://www.w3schools.com/html/html_formatting.asp

Add some more instructions:

 

The element XML <head> into the element HTML <h2>

Add some more instructions:

 

The element XML <p> into the element HTML <p>

    <xsl:template match="//tei:p">
        <p><xsl:apply-templates/></p>
    </xsl:template>
    

HTML

The <p> in TEI is the same in HTML <p> > paragraph

 

You can also use <br/> stands for line break in HTML

Use your XSLT!

Go down and check how to do it in command line, oXygen and Atom

Use your XSLT!

Command line

Install (sudo) npm install -g xslt3

 

(RUN)

 xslt3 -xsl:[name.xsl] -s:[name].xml > output2.html

Create a scenario with your XSLT!

Create a scenario with your XSLT!

Create a scenario with your XSLT!

Create a scenario with your XSLT!

Create a scenario with your XSLT!

file XML

file XSLT

SAXON VERSION 9

(software)

Create a scenario with your XSLT!

Create a scenario with your XSLT!

HOW TO USE ATOM

CTRL + SHIFT + P

HOW TO USE ATOM

HOW TO USE ATOM

<xsl:value-of select="node"/>

The <xsl:value-of> element can be used to extract the value of an XML element and add it to the output stream of the transformation:

<img width="600" height="600">
    <xsl:attribute name="src">
      <xsl:value-of select="//tei:facsimile/tei:graphic/@url"/>
    </xsl:attribute>
</img>

The <xsl:if> element contains a test attribute and a template. If the test evaluates to true, the template is processed. In this it is similar to an if statement in other languages. To achieve the functionality of an if-then-else statement, however, use the <xsl:choose> element with one <xsl:when> and one <xsl:otherwise> children.


<xsl:if test="expression">
  ...some output if the expression is true...
</xsl:if> 

<xsl:if>

<xsl:template match="//tei:hi">
    <xsl:choose>
        <xsl:when test="@rend='u'">
            <u>
                <xsl:apply-templates/>
            </u>
        </xsl:when>
        <xsl:otherwise>
            <sup>
                <xsl:apply-templates/>
            </sup>
        </xsl:otherwise>
    </xsl:choose>
    </xsl:template>

<xsl:choose>

<xsl:choose> tag specifies a multiple conditional tests against the content of nodes in conjunction with the <xsl:otherwise> and <xsl:when> elements.


<xsl:variable
name="name"
select="expression">

  <!-- Content:template -->

</xsl:variable> 

<xsl:variable>

<xsl:copy-of select="$variable" />

The <xsl:variable> element declares a global or local variable in a stylesheet and gives it a value. Because XSLT permits no side-effects, once the value of the variable has been established, it remains the same until the variable goes out of scope

<xsl:include>

<xsl:include>

The <xsl:include> element merges the contents of one stylesheet with another. Unlike the case of <xsl:import>, the contents of an included stylesheet have exactly the same precedence as the contents of the including stylesheet.

Text

Resources:

Many thanks!

Contacts: @tizmancinelli

tiziana.mancinelli@unive.it

Copy of XPATH - XSLT Bologna, May 2021

By Tiziana Mancinelli

Copy of XPATH - XSLT Bologna, May 2021

  • 537