About this document
This document is an introduction to the ReportLab PDF library. Some previous programming experience is presumed and familiarity with the Python Programming language is recommended. If you are new to Python, we tell you in the next section where to go for orientation.
This manual does not cover 100% of the features, but should explain all the main concepts and help you get started, and point you at other learning resources. After working your way through this, you should be ready to begin writing programs to produce sophisticated reports.
In this chapter, we will cover the groundwork: - What is ReportLab all about, and why should I use it? - What is Python? - How do I get everything set up and running?
We need your help to make sure this manual is complete and helpful. Please send any feedback to our user mailing list, which is signposted from www.reportlab.com.
What is the ReportLab PDF Library?
This is a software library that lets you directly create documents in Adobe's Portable Document Format (PDF) using the Python programming language. It also creates charts and data graphics in various bitmap and vector formats as well as PDF.
PDF is the global standard for electronic documents. It supports high-quality printing yet is totally portable across platforms, thanks to the freely available Acrobat Reader. Any application which previously generated hard copy reports or driving a printer can benefit from making PDF documents instead; these can be archived, emailed, placed on the web, or printed out the old-fashioned way. However, the PDF file format is a complex indexed binary format which is impossible to type directly. The PDF format specification is more than 600 pages long and PDF files must provide precise byte offsets -- a single extra character placed anywhere in a valid PDF document can render it invalid. This makes it harder to generate than HTML.
Most of the world's PDF documents have been produced by Adobe's Acrobat tools, or rivals such as JAWS PDF Creator, which act as 'print drivers'. Anyone wanting to automate PDF production would typically use a product like Quark, Word or Framemaker running in a loop with macros or plugins, connected to Acrobat. Pipelines of several languages and products can be slow and somewhat unwieldy.
The ReportLab library directly creates PDF based on your graphics commands. There are no intervening steps. Your applications can generate reports extremely fast - sometimes orders of magnitude faster than traditional report-writing tools. This approach is shared by several other libraries - PDFlib for C, iText for Java, iTextSharp for .NET and others. However, The ReportLab library differs in that it can work at much higher levels, with a full featured engine for laying out documents complete with tables and charts.
In addition, because you are writing a program in a powerful general purpose language, there are no restrictions at all on where you get your data from, how you transform it, and the kind of output you can create. And you can reuse code across whole families of reports.
The ReportLab library is expected to be useful in at least the following contexts:
- Dynamic PDF generation on the web
- High-volume corporate reporting and database publishing
- An embeddable print engine for other applications, including a 'report language' so that users can customize their own reports. This is particularly relevant to cross-platform apps which cannot rely on a consistent printing or previewing API on each operating system.
- A 'build system' for complex documents with charts, tables and text such as management accounts, statistical reports and scientific papers
- Going from XML to PDF in one step
ReportLab's commercial software
The ReportLab library forms the foundation of our commercial solution for PDF generation, Report Markup Language (RML). This is available for evaluation on our web site with full documentation. We believe that RML is the fastest and easiest way to develop rich PDF workflows. You work in a markup language at a similar level to HTML, using your favorite templating system to populate an RML document; then call our rml2pdf API function to generate a PDF. It's what ReportLab staff use to build all of the solutions you can see on reportlab.com. Key differences:
- Fully documented with two manuals, a formal specification (the DTD) and extensive self-documenting tests. (By contrast, we try to make sure the open source documentation isn't wrong, but we don't always keep up with the code)
- Work in high-level markup rather than constructing graphs of Python objects
- Requires no Python expertise - your colleagues may thank you after you've left!'
- Support for vector graphics and inclusion of other PDF documents
- Many more useful features expressed with a single tag, which would need a lot of coding in the open source package
- Commercial support is included
We ask open source developers to consider trying out RML where it is appropriate. You can register on our site and try out a copy before buying. The costs are reasonable and linked to the volume of the project, and the revenue helps us spend more time developing this software.
What is Python?
Python is an interpreted, interactive, object-oriented programming language. It is often compared to Tcl, Perl, Scheme or Java.
Python combines remarkable power with very clear syntax. It has modules, classes, exceptions, very high level dynamic data types, and dynamic typing. There are interfaces to many system calls and libraries, as well as to various windowing systems (X11, Motif, Tk, Mac, MFC). New built-in modules are easily written in C or C++. Python is also usable as an extension language for applications that need a programmable interface.
Python is as old as Java and has been growing steadily in popularity for years; since our library first came out it has entered the mainstream. Many ReportLab library users are already Python devotees, but if you are not, we feel that the language is an excellent choice for document-generation apps because of its expressiveness and ability to get data from anywhere.
Python is copyrighted but freely usable and distributable, even for commercial use.
Many people have contributed to ReportLab. We would like to thank in particular
(in alphabetical order):
Fubu @ bitbucket,
Germán M. Bravo,
Keven D Smith,
Magnus Lie Hetland,
Mark de Wit,
Publio da Costa Melo,
Randolph Bentson, Robert Alsina, Robert Hölzl, Robert Kern, Ron Peleg, Ruby Yocum, Simon King, Stephan Richter, Steve Halasz, Stoneleaf @ bitbucket, T Blatter, Tim Roberts, Tomasz Swiderski, Ty Sarna, Volker Haas, Yoann Roman, and many more.
Special thanks go to Just van Rossum for his valuable assistance with font technicalities.
Moshe Wagner and Hosam Aly deserve a huge thanks for contributing to the RTL patch, which is not yet on the trunk.
Marius Gedminas deserves a big hand for contributing the work on TrueType fonts and we are glad to include these in the toolkit. Finally we thank Michal Kosmulski for the DarkGarden font for and Bitstream Inc. for the Vera fonts.
Installation and Setup
To avoid duplication, the installation instructions are kept in the README file in our distribution, which can be viewed online at https://hg.reportlab.com/hg-public/reportlab/
ReportLab is an Open Source project. Although we are a commercial company we provide the core PDF generation sources freely, even for commercial purposes, and we make no income directly from these modules. We also welcome help from the community as much as any other Open Source project. There are many ways in which you can help:
General feedback on the core API. Does it work for you? Are there any rough edges? Does anything feel clunky and awkward?
New objects to put in reports, or useful utilities for the library. We have an open standard for report objects, so if you have written a nice chart or table class, why not contribute it?
Snippets and Case Studies: If you have produced some nice output, register online on http://www.reportlab.com and submit a snippet of your output (with or without scripts). If ReportLab solved a problem for you at work, write a little 'case study' and submit it. And if your web site uses our tools to make reports, let us link to it. We will be happy to display your work (and credit it with your name and company) on our site!
Working on the core code: we have a long list of things to refine or to implement. If you are missing some features or just want to help out, let us know!
The first step for anyone wanting to learn more or get involved is to join the mailing list. To Subscribe visit http://two.pairlist.net/mailman/listinfo/reportlab-users. From there you can also browse through the group's archives and contributions. The mailing list is the place to report bugs and get support.
The code now lives on our website (http://hg.reportlab.com/hg-public/reportlab) in a Mercurial repository, along with an issue tracker and wiki. Everyone should feel free to contribute, but if you are working actively on some improvements or want to draw attention to an issue, please use the mailing list to let us know.
There are a number of options which most likely need to be configured globally for a site.
The python script module
reportlab/rl_config.py aggregates the various settings files. You may want inspect the file
contains defaults for the currently used variables. There are several overrides for
reportlab_settings (a script file anywhere on the python path)
and finally the file
~/.reportlab_settings (note no .py). Temporary changes can be made using evironment variables which
are the variables from
rl_settings.py prefixed with
Useful rl_config variables
- verbose: set to integer values to control diagnostic output.
- shapeChecking: set this to zero to turn off a lot of error checking in the graphics modules
- defaultEncoding: set this to WinAnsiEncoding or MacRomanEncoding.
- defaultPageSize: set this to one of the values defined in reportlab/lib/pagesizes.py; as delivered it is set to pagesizes.A4; other values are pagesizes.letter etc.
- defaultImageCaching: set to zero to inhibit the creation of .a85 files on your hard-drive. The default is to create these preprocessed PDF compatible image files for faster loading
- T1SearchPath: this is a python list of strings representing directories that may be queried for information on Type 1 fonts
- TTFSearchPath: this is a python list of strings representing directories that may be queried for information on TrueType fonts
- CMapSearchPath: this is a python list of strings representing directories that may be queried for information on font code maps.
- showBoundary: set to non-zero to get boundary lines drawn.
- ZLIB_WARNINGS: set to non-zero to get warnings if the Python compression extension is not found.
- pageCompression: set to non-zero to try and get compressed PDF.
- allowtableBoundsErrors: set to 0 to force an error on very large Platypus table elements
- emptyTableAction: Controls behaviour for empty tables, can be 'error' (default), 'indicate' or 'ignore'.
- trustedHosts: if not
Nonea list of glob patterns of trusted hosts; these may be used in places like <img> tags in paragraph texts.
- trustedSchemes: a list of allowed
URLschemes used with
trustedHostsFor the full list of variables see the file
More complex modifications to the reportlab toolkit environment may be made using one
of the modules
rep[ortlab.local_rl_mods (.py script in reportlab folder),
reportlab_mods (.py file on the python path) or
~/.reportlab_mods (note no .py).
Learning More About Python
If you are a total beginner to Python, you should check out one or more from the growing number of resources on Python programming. The following are freely available on the web:
Python Documentation. A list of documentation on the Python.org web site. http://www.python.org/doc
Python Tutorial. The official Python Tutorial , originally written by Guido van Rossum himself. http://docs.python.org/tutorial
Learning to Program. A tutorial on programming by Alan Gauld. Has a heavy emphasis on Python, but also uses other languages. http://www.freenetpages.co.uk/hp/alan.gauld
Instant Python. A 6-page minimal crash course by Magnus Lie Hetland. https://folk.idi.ntnu.no/mlh/hetland_org/writing/instant-python.html
Dive Into Python. A free Python tutorial for experienced programmers. http://www.diveintopython.net
Goals of the 3.x release series
ReportLab 3.0 has been produced to help in the migration to Python 3.x. Python 3.x will be standard in future Ubuntu releases and is gaining popularity, and a good proportion of major Python packages now run on Python 3.
- Python 3.x compatibility. A single line of code should run on 3.6 and higher
- init.py restricts to >=3.6
- init.py allow the import of on optional reportlab.local_rl_mods to allow monkey patching etc.
- rl_config now imports rl_settings, optionally local_rl_settings, reportlab_settings.py & finally ~/.reportlab_settings
- ReportLab C extensions now live inside reportlab; _rl_accel is no longer required. All _rl_accel imports now pass through reportlab.lib.rl_accel
- xmllib is gone, alongside the paraparser stuff that caused issues in favour of HTMLParser.
- some obsolete C extensions (sgmlop and pyHnj) are gone
- Improved support for multi-threaded systems to the _rl_accel C extension module.
- Removed reportlab/lib/ para.py & pycanvas.py. These would better belong in third party packages, which can make use of the monkeypatching feature above.
- Add ability to output greyscale and 1-bit PIL images without conversion to RGB. (contributed by Matthew Duggan)
- highlight annotation (contributed by Ben Echols)
- full compliance with pip, easy_install, wheels etc
Detailed release notes are available at http://www.reportlab.com/software/documentation/relnotes/30