Pelican Builds

This website is built using Pelican. It is configured in your pelicanconf.py settings.

Pelican Theme

# Theme
THEME = './theme/apache'

See theme for more detail.

Plugins

The Pelican environment is enhanced with Plugins. Our environment has its own copy the asf plugins while the pelican-gfm is provided by the pelican-build.py script.

# Pelican Plugins
# pelican-gfm is installed in the buildbot as part of build_pelican.py. It is an ASF Infra custom plugin.
# other plugins are discoverable and can be installed via pip by mentioning them in requirements.txt
# You can find plugins here: https://github.com/pelican-plugins
# Plugins that are custom for this site are found in PLUGIN_PATHS.
PLUGIN_PATHS = ['./theme/plugins']
PLUGINS = ['asfgenid', 'asfdata', 'pelican-gfm', 'asfreader']

The plugins select which file extensions they read.

pelican-gfm reads .md, .markdown, .mkd, and .mdown extensions. .md is preferred.
asfreader reads .ezmd extensions.

See process for the steps signaled. See plugins for the python code.

Tree Structure

Pages and static content are stored in the same tree. Generated content is output with the same relative path except with an html extension. These are the necessary settings.

PATH = 'content'
# Save pages using full directory preservation
PAGE_PATHS = ['.']
# Path with no extension
PATH_METADATA = '(?P<path_no_ext>.*)\..*'
# We are not slugifying any pages
ARTICLE_URL = ARTICLE_SAVE_AS = PAGE_URL = PAGE_SAVE_AS = '{path_no_ext}.html'
# We want to serve our static files mixed with content.
STATIC_PATHS = ['.']
# we want any html to be served as is
READERS = {'html': None}
# ignore README.md files in the content tree and the interviews and include folders.
IGNORE_FILES = ['README.md','interviews','include']

Process

Pelican uses signals as it goes through the process of reading and generating content. Our plugins

Pelican Signal	Step	GFM Content	EZMD Content	Description
Initialization	Data Model			Read data sources
Reader	Class	GFMReader	ASFReader(GFMReader)	Pelican Reader class
	Read	read_source	super.read_source	read page source and metadata
	Model Metadata		add_data	add asf data to the model and expand any `[{ reference }]`
	Generate		ezt	ezt template transformation
	Render GFM	render	super.render	render GFM/HTML into HTML
Content	Generate ID	generate_id	generate_id	Perform ASF specific HTML enhancements
! Generator	[Template][template]	translate	translate	Create output HTML by pushing the generated content and metadata through the theme's templates.

Read Source

From pelican-gfm

    def read_source(self, source_path):
        "Read metadata and content from the source."
	...
	# Fetch the source content, with a few appropriate tweaks
        with pelican.utils.pelican_open(source_path) as text:

            # Extract the metadata from the header of the text
            lines = text.splitlines()
            for i in range(len(lines)):
                line = lines[i]
                match = GFMReader.RE_METADATA.match(line)
                if match:
                    name = match.group(1).strip().lower()
		    ...
                    metadata[name] = value
                elif not line.strip():
                    # blank line
                    continue
                else:
                    # reached actual content
                    break
	    ...
            # Reassemble content, minus the metadata
            text = '\n'.join(lines[i:])

            return text, metadata

Example:

Title: ASF Export Classifications and Source Links
license: https://www.apache.org/licenses/LICENSE-2.0
asf_headings: False

#### ASF Project
...

The first three lines specify three metadata key value pairs. There is a blank line and the rest is the text

Model Metadata

From asfreader

    def add_data(self, text, metadata):
        "Mix in ASF data as metadata"

        asf_metadata = self.settings.get('ASF_DATA', { }).get('metadata')
        if asf_metadata:
            metadata.update(asf_metadata)
            # insert any direct references
            m = 1
            while m:
                m = METADATA_RE.search(text)
                if m:
                    this_data = m.group(1).strip()
                    format_string = '{{{0}}}'.format(this_data)
                    try:
                        new_string = format_string.format(**metadata)
                        print(f'{{{{{m.group(1)}}}}} -> {new_string}')
                    except Exception:
                        # the data expression was not found
                        new_string = format_string
                        print(f'{{{{{m.group(1)}}}}} is not found')
                    text = re.sub(METADATA_RE, new_string, text, count=1)
        return text, metadata

Content

Content is GitHub Flavored Markdown (GFM) with ASF specific enhancements for Apache CMS style annotations.

Generate ID

The ASF specific enhancements are controlled in pelican settings in the ASF_GENID dictionary.

step	ASF_GENID key	default	process	page override
1	-	True	fixup html that GFM marks as unsafe
2	-	True	convert html into beautiful soup
3	metadata	True	`{{ metadata }}` inclusion of data in the html
4	-	True	inventory of all id attributes, duplicates are invalid
5	elements	True	find all `{#id}` and `{.class}` text and assign attributes
6	headings	True	assign ids to all headings w/o ids already present or assigned with `{#id}` text	asf_headings
	headings_re	`r'^h[1-6]'`	regex for finding headings that require ids
7	tables	True	tables with a class attribute are assgned `class=table`
8	toc	True	generate a table of contents if [TOC] is found
	toc_headers	`r'h[1-6]'`	headings to include in the [TOC]

Data

Data is placed into GFM content with a combination of EZT and ASF python style directives.

Data Model

The data model file specifies three types of data:

Constants. These metadata are made available to ezmd, asfgenid, and pelican templates.

- [{ code_lines }]+ lines of code in&nbsp;stewardship

Sequences. These metadata are used by EZT directives and also the ASF python style directives.
EZT

| Office    | Individual  |
|-----------|-------------|[for projects]
| V.P., [if-any projects.site][[][end]Apache [projects.display_name][if-any projects.site]]([projects.site])[end] | [projects.chair] |[end]

[for featured_projs]<li [if-index featured_projs first]class="active"[end]>
     <a href="#[featured_projs.key_id]" data-toggle="tab">[featured_projs.display_name]</a>
</li>[end]

ASF python style directive

|  |  |  |
|-----------|-----------|-------------|
| [{ board[0].name }] | [{ board[1].name }] | [{ board[2].name }] |
| [{ board[3].name }] | [{ board[4].name }] | [{ board[5].name }] |
| [{ board[6].name }] | [{ board[7].name }] | [{ board[8].name }] |

Dictionaries. These metadata are used by ASF python style directives.
ASF python style directive - we know the names and forcing a sequence is indirect.

| Office    | Individual  |
|-----------|-------------|
| Board Chair |  [{ ci[boardchair][roster] }] |
| Vice Chair |  [{ ci[vicechair][roster] }] |
| President |  [{ ci[president][roster] }] |
| Exec. V.P |  [{ ci[execvp][roster] }] |
| [[]Treasurer](https://treasurer.apache.org/) |  [{ ci[treasurer][roster] }] |
| Assistant Treasurer |  [{ ci[assistanttreasurer][roster] }] |
| Secretary |  [{ ci[secretary][roster] }] |
| Assistant Secretary |  [{ ci[assistantsecretary][roster] }] |
| V.P., [[]Legal Affairs](/legal/) |  [{ ci[legal][chair] }] |
| Assistant V.P., [[]Legal Affairs](/legal/) |  [{ ci[assistantvplegalaffairs][roster] }] |

Include and Insert

Templates

Pelican uses HTML templates. Templates have pelican metadata and pelican settings.

Theme

templates for details.

Data Model How the global data model works and how to enhance the data availble for your content.
EZT How EZ Templates are used convert global data into content.
Markdown How your content is converted to HTML.
Processing How the process works to create the HTML.
Preview Branching How to work on complex changes like updated templates, new data sources, and alternative content.
Local Builds How to setup your local environment to work on a branch.
Error Analysis How to track down errors in your build.