HTML Tidy  5.9.15
The HTACG Tidy HTML Project
About HTML Tidy

Purpose

HTML Tidy and LibTidy correct and clean up HTML content by fixing markup errors such as mismatched, mis-nested, and missing tags; missing end "/" tags; missing quotations; and many, many more discrepant conditions, and serves as an HTML pretty printer.

For example…

<h1><hr>heading</h1>
<h2>sub<hr>heading</h2>
<a href="#refs">References<a>

…can be converted to

<hr>
<h1>heading</h1>
<h2>sub</h2>
<hr>
<h2>heading</h2>
<a href="#refs">References</a>

About this Documentation

This documentation is intended to serve as reference materials for both developers and LibTidy implementers so that they can have a reference for troubleshooting code, fixing issues with Tidy, and for implementing their own programs with LibTidy’s API.

Consumer documentation can be referenced from tidy -help on all platforms, and via man tidy on Unix platforms.

In addition to the automatically generated documentation from Tidy’s source code, these four major sections provide practical information about using Tidy, how to contribute, and how LibTidy works:

  • General
    • These articles show how HTML Tidy works from a user perspective. Although the console program is described, keep in mind that all of these articles apply to LibTidy, too, as the console is simply one type of clients to the library.
  • LibTidy
    • High level information about LibTidy is discussed in this series of articles, including a sample implementation of the library.
  • Programming
    • These articles detail how to work with the HTML Tidy source code. If you plan to work with HTACG and donate your own contributions, please have a look at these articles so that you have an easy transition to our team!
  • Modules
    • External Public API If you are a user of LibTidy, then this is all of the documentation that is of use to you in order to implement LibTidy in any language that can link to a C library.
    • Internal API New developers of LibTidy may find browsing this section useful in order to become more familiar with the organization of the source code and the files. Note that this Doxygen project is not a source code viewer, this internal documentation is incomplete, and this is not meant to be a substitute for examining the source code.
    • Tidy Console Application We've also organized the console application as its own Doxygen project. This can be useful to new console application tinkers, and also provide insight on how to use LibTidy in your own programs.

Components

HTML Tidy is composed of two major components:

Tidy Console

Tidy’s console program is the piece that most people interact with in their console or terminal application. It implements LibTidy on macOS, Linux and other Unixes, Windows, and more, and is a relatively simple program that can serve as a good reference model for programs wishing to implement LibTidy.

General information about the console application is available in General.

LibTidy

The critical mass of Tidy consists of LibTidy, a C static or dynamic library (your choice) that developers can integrate into their own program in order to bring all of Tidy’s power to your favorite tools.

An introduction to LibTidy is available in LibTidy, and some helpful guides are present in the Programming section, too.