The dynamic binding has no link-time dependency on the FreeType library, so the FreeType shared library must be manually loaded at run time. When configured as a static binding, there is a link-time dependency on the FreeType library—either the static library or the appropriate file for linking with shared libraries on your platform (see below). Install FreeType Library Once you’ve downloaded it, extract the archive and install it as shown below. Tar xvfj freetype-2.5.0.1.tar.bz2 cd freetype-2.5.0.1/./configure -without-png make make install.
Introduction
This is the second section of the FreeType 2 tutorial. It describes how to
- retrieve glyph metrics
- easily manage glyph images
- retrieve global metrics (including kerning)
- render a simple string of text, with kerning
- render a centered string of text (with kerning)
- render a transformed string of text (with centering)
- access metrics in design font units when needed, and how to scale them to device space
1. Glyph metrics
Glyph metrics are, as their name suggests, certain distances associated with each glyph in order to describe how to use it to layout text.
There are usually two sets of metrics for a single glyph: Those used to layout the glyph in horizontal text layouts (Latin, Cyrillic, Arabic, Hebrew, etc.), and those used to layout the glyph in vertical text layouts (Chinese, Japanese, Korean, etc.).
Note that only a few font formats provide vertical metrics. You can test whether a given face object contains them by using the macro FT_HAS_VERTICAL, which is true when appropriate.
Individual glyph metrics can be accessed by first loading the glyph in a face's glyph slot, then accessing them through the face->glyph->metrics structure, whose type is FT_Glyph_Metrics. We will discuss this in more detail below; for now, we only note that it contains the following fields:
width | This is the width of the glyph image's bounding box. It is independent of the layout direction. |
height | This is the height of the glyph image's bounding box. It is independent of the layout direction. Be careful not to confuse it with the ‘height’ field in the FT_Size_Metrics structure. |
horiBearingX | For horizontal text layouts, this is the horizontal distance from the current cursor position to the leftmost border of the glyph image's bounding box. |
horiBearingY | For horizontal text layouts, this is the vertical distance from the current cursor position (on the baseline) to the topmost border of the glyph image's bounding box. |
horiAdvance | For horizontal text layouts, this is the horizontal distance used to increment the pen position when the glyph is drawn as part of a string of text. |
vertBearingX | For vertical text layouts, this is the horizontal distance from the current cursor position to the leftmost border of the glyph image's bounding box. |
vertBearingY | For vertical text layouts, this is the vertical distance from the current cursor position (on the baseline) to the topmost border of the glyph image's bounding box. |
vertAdvance | For vertical text layouts, this is the vertical distance used to increment the pen position when the glyph is drawn as part of a string of text. |
NOTE: As not all fonts do contain vertical metrics, the values of vertBearingX, vertBearingY and vertAdvance should not be considered reliable when FT_HAS_VERTICAL is false.
The following graphics illustrate the metrics more clearly. First, for horizontal metrics, where the baseline is the horizontal axis:
For vertical text layouts, the baseline is vertical, identical to the vertical axis:
The metrics found in face->glyph->metrics are normally expressed in 26.6 pixel format (i.e., 1/64th of pixels), unless you use the FT_LOAD_NO_SCALE flag when calling FT_Load_Glyph or FT_Load_Char. In this case, the metrics will be expressed in original font units.
The glyph slot object has also a few other interesting fields that will ease a developer's work. You can access them through face->glyph->xxx, where xxx is one of the following fields:
advance | This field is a FT_Vector which holds the transformed advance for the glyph. That is useful when you are using a transform through FT_Set_Transform, as shown in the rotated text example of section I. Other than that, its value is by default (metrics.horiAdvance,0), unless you specify FT_LOAD_VERTICAL when loading the glyph image; it will then be (0,metrics.vertAdvance) |
linearHoriAdvance | This field contains the linearly scaled value of the glyph's horizontal advance width. Indeed, the value of metrics.horiAdvance that is returned in the glyph slot is normally rounded to integer pixel coordinates (i.e., it will be a multiple of 64) by the font driver used to load the glyph image. linearHoriAdvance is a 16.16 fixed float number that gives the value of the original glyph advance width in 1/65536th of pixels. It can be use to perform pseudo device-independent text layouts. |
linearVertAdvance | This is the similar to linearHoriAdvance but for the glyph's vertical advance height. Its value is only reliable if the font face contains vertical metrics. |
2. Managing glyph images
The glyph image that is loaded in a glyph slot can be converted into a bitmap, either by using FT_LOAD_RENDER when loading it, or by calling FT_Render_Glyph. Each time you load a new glyph image, the previous one is erased from the glyph slot.
There are situations, however, where you may need to extract this image from the glyph slot in order to cache it within your application, and even perform additional transformations and measures on it before converting it to a bitmap.
The FreeType 2 API has a specific extension which is capable of dealing with glyph images in a flexible and generic way. To use it, you first need to include the FT_GLYPH_H header file, as in:
We will now explain how to use the functions defined in this file:
a. Extracting the glyph image:
You can extract a single glyph image very easily. Here some code that shows how to do it:
As you see, we have:
Created a variable, named glyph, of type FT_Glyph. This is a handle (pointer) to an individual glyph image.
Loaded the glyph image normally in the face's glyph slot. We did not use FT_LOAD_RENDER because we want to grab a scalable glyph image, in order to later transform it.
Copy the glyph image from the slot into a new FT_Glyph object, by calling FT_Get_Glyph. This function returns an error code and sets glyph.
It is important to note that the extracted glyph is in the same format as the original one that is still in the slot. For example, if we are loading a glyph from a TrueType font file, the glyph image will really be a scalable vector outline.
You can access the field glyph->format if you want to know exactly how the glyph is modeled and stored. A new glyph object can be destroyed with a call to FT_Done_Glyph.
The glyph object contains exactly one glyph image and a 2D vector representing the glyph's advance in 16.16 fixed float coordinates. The latter can be accessed directly as glyph->advance
Note that unlike other FreeType objects, the library doesn't keep a list of all allocated glyph objects. This means you have to destroy them yourself instead of relying on FT_Done_FreeType doing all the clean-up.
b. Transforming & copying the glyph image
If the glyph image is scalable (i.e., if glyph->format is not equal to FT_GLYPH_FORMAT_BITMAP), it is possible to transform the image anytime by a call to FT_Glyph_Transform.
You can also copy a single glyph image with FT_Glyph_Copy. Here is some example code:
Note that the 2×2 transform matrix is always applied to the 16.16 advance vector in the glyph; you thus don't need to recompute it.
c. Measuring the glyph image
You can also retrieve the control (bounding) box of any glyph image (scalable or not) through the FT_Glyph_Get_CBox function, as in:
Coordinates are relative to the glyph origin (0,0), using the y upwards convention. This function takes a special argument, the bbox mode, to indicate how box coordinates are expressed.
If the glyph has been loaded with FT_LOAD_NO_SCALE, bbox_mode must be set to FT_GLYPH_BBOX_UNSCALED to get unscaled font units in 26.6 pixel format. The value FT_GLYPH_BBOX_SUBPIXELS is another name for this constant.
Note that the box's maximum coordinates are exclusive, which means that you can always compute the width and height of the glyph image, be in in integer or 26.6 pixels, with:
Note also that for 26.6 coordinates, if FT_GLYPH_BBOX_GRIDFIT is used as the bbox mode, the coordinates will also be grid-fitted, which corresponds to
To get the bbox in integer pixel coordinates, set bbox_mode to FT_GLYPH_BBOX_TRUNCATE.
Finally, to get the bounding box in grid-fitted pixel coordinates, set bbox_mode to FT_GLYPH_BBOX_PIXELS.
d. Converting the glyph image to a bitmap
You may need to convert the glyph object to a bitmap once you have conveniently cached or transformed it. This can be done easily with the FT_Glyph_To_Bitmap function. It is in charge of converting any glyph object into a bitmap, as in:
Some notes:
The first parameter is the address of the source glyph's handle. When the function is called, it reads its to access the source glyph object. After the call, the handle will point to a new glyph object that contains the rendered bitmap.
The second parameter is a standard render mode, that is used to specify what kind of bitmap we want. It can be FT_RENDER_MODE_DEFAULT for an 8-bit anti-aliased pixmap, or FT_RENDER_MODE_MONO for a 1-bit monochrome bitmap.
The third parameter is a pointer to a two-dimensional vector that is used to translate the source glyph image before the conversion. Note that the source image will be translated back to its original position (and will thus be left unchanged) after the call. If you do not need to translate the source glyph before rendering, set this pointer to 0.
The last parameter is a boolean that indicates whether the source glyph object should be destroyed by the function. If false, the original glyph object is never destroyed, even if its handle is lost (it is up to client applications to keep it).
The new glyph object always contains a bitmap (if no error is returned), and you must typecast its handle to the FT_BitmapGlyph type in order to access its content. This type is a sort of ‘subclass’ of FT_Glyph that contains additional fields (see FT_BitmapGlyphRec):
left | Just like the bitmap_left field of a glyph slot, this is the horizontal distance from the glyph origin (0,0) to the leftmost pixel of the glyph bitmap. It is expressed in integer pixels. |
top | Just like the bitmap_top field of a glyph slot, this is the vertical distance from the glyph origin (0,0) to the topmost pixel of the glyph bitmap (more precise, to the pixel just above the bitmap). This distance is expressed in integer pixels, and is positive for upwards y. |
bitmap | This is a bitmap descriptor for the glyph object, just like the bitmap field in a glyph slot. |
3. Global glyph metrics
Unlike glyph metrics, global metrics are used to describe distances and features of a whole font face. They can be expressed either in 26.6 pixel format or in design ‘font units’ for scalable formats.
a. Design global metrics
For scalable formats, all global metrics are expressed in font units in order to be later scaled to the device space, according to the rules described in the last chapter of this section of the tutorial. You can access them directly as simple fields of a FT_Face handle.
However, you need to check that the font face's format is scalable before using them. One can do it by using the macro FT_IS_SCALABLE which returns true when appropriate.
In this case, you can access the global design metrics as:
units_per_EM | This is the size of the EM square for the font face. It is used by scalable formats to scale design coordinates to device pixels, as described in the last chapter of this section. Its value usually is 2048 (for TrueType) or 1000 (for Type 1), but others are possible too. It is set to 1 for fixed-size formats like FNT/FON/PCF/BDF. |
bbox | The global bounding box is defined as the largest rectangle that can enclose all the glyphs in a font face. |
ascender | The ascender is the vertical distance from the horizontal baseline to the highest ‘character’ coordinate in a font face. Unfortunately, font formats define the ascender differently. For some, it represents the ascent of all capital latin characters (without accents), for others it is the ascent of the highest accented character, and finally, other formats define it as being equal to bbox.yMax. |
descender | The descender is the vertical distance from the horizontal baseline to the lowest ‘character’ coordinate in a font face. Unfortunately, font formats define the descender differently. For some, it represents the descent of all capital latin characters (without accents), for others it is the ascent of the lowest accented character, and finally, other formats define it as being equal to bbox.yMin. This field is negative for values below the baseline. |
height | This field is simply used to compute a default line spacing (i.e., the baseline-to-baseline distance) when writing text with this font. Note that it usually is larger than the sum of the ascender and descender taken as absolute values. There is also no guarantee that no glyphs extend above or below subsequent baselines when using this distance. |
max_advance_width | This field gives the maximum horizontal cursor advance for all glyphs in the font. It can be used to quickly compute the maximum advance width of a string of text. It doesn't correspond to the maximum glyph image width! |
max_advance_height | Same as max_advance_width but for vertical text layout. |
underline_position | When displaying or rendering underlined text, this value corresponds to the vertical position, relative to the baseline, of the underline bar's center. It is negative if it is below the baseline. |
underline_thickness | When displaying or rendering underlined text, this value corresponds to the vertical thickness of the underline. |
Notice how, unfortunately, the values of the ascender and the descender are not reliable (due to various discrepancies in font formats).
b. Scaled global metrics
Each size object also contains a scaled versions of some of the global metrics described above. They can be accessed directly through the face->size->metrics structure.
Note that these values correspond to scaled versions of the design global metrics, with no rounding or grid-fitting performed. They are also completely independent of any hinting process. In other words, don't rely on them to get exact metrics at the pixel level. They are expressed in 26.6 pixel format.
ascender | The scaled version of the original design ascender. |
descender | The scaled version of the original design descender. |
height | The scaled version of the original design text height (the vertical distance from one baseline to the next). This is probably the only field you should really use in this structure. Be careful not to confuse it with the ‘height’ field in the FT_Glyph_Metrics structure. |
max_advance | The scaled version of the original design max advance. |
Note that the face->size->metrics structure contains other fields that are used to scale design coordinates to device space. They are described in the last chapter.
c. Kerning
Kerning is the process of adjusting the position of two subsequent glyph images in a string of text in order to improve the general appearance of text. Basically, it means that when the glyph for an ‘A’ is followed by the glyph for a ‘V’, the space between them can be slightly reduced to avoid extra ‘diagonal whitespace’.
Note that in theory kerning can happen both in the horizontal and vertical direction between two glyphs; however, it only happens in the horizontal direction in nearly all cases except really extreme ones.
Not all font formats contain kerning information, and not all kerning formats are supported by FreeType; in particular, for TrueType fonts, the API can only access kerning via the ‘kern’ table; OpenType kerning via the ‘GPOS’ table is not supported. You need a higher-level library like Pango or ICU to handle that.
Sometimes, the font file is associated with an additional file that contains various glyph metrics, including kerning, but no glyph images. A good example is the Type 1 format where glyph images are stored in a file with extension .pfa or .pfb, and where kerning metrics can be found in a file with extension .afm or .pfm.
FreeType 2 allows you to deal with this, by providing the FT_Attach_File and FT_Attach_Stream APIs. Both functions are used to load additional metrics into a face object by reading them from an additional format-specific file. For example, you could open a Type 1 font by doing the following:
Note that FT_Attach_Stream is similar to FT_Attach_File except that it doesn't take a C string to name the extra file but a FT_Stream handle. Also, reading a metrics file is in no way mandatory.
Finally, the file attachment APIs are very generic and can be used to load any kind of extra information for a given face. The nature of the additional content is entirely font format specific.
FreeType 2 allows you to retrieve the kerning information for two glyphs through the FT_Get_Kerning function, whose interface looks like:
As you see, the function takes a handle to a face object, the indices of the left and right glyph for which the kerning value is desired, as well as an integer, called the kerning mode, and a pointer to a destination vector that receives the corresponding distances.
The kerning mode is very similar to the bbox mode described in a previous chapter. It is a enumeration that indicates how the kerning distances are expressed in the target vector.
The default value is FT_KERNING_DEFAULT which has value 0. It corresponds to kerning distances expressed in 26.6 grid-fitted pixels (which means that the values are multiples of 64). For scalable formats, this means that the design kerning distance is scaled, then rounded.
The value FT_KERNING_UNFITTED corresponds to kerning distances expressed in 26.6 unfitted pixels (i.e., that do not correspond to integer coordinates). It is the design kerning distance that is scaled without rounding.
Finally, the value FT_KERNING_UNSCALED is used to return the design kerning distance, expressed in font units. You can later scale it to the device space using the computations explained in the last chapter of this section.
Note that the ‘left’ and ‘right’ positions correspond to the visual order of the glyphs in the string of text. This is important for bidirectional text, or simply when writing right-to-left text.
4. Simple text rendering: kerning + centering
In order to show off what we just learned, we will now demonstrate how to modify the example code that was provided in section I to render a string of text, and enhance it to support kerning and delayed rendering.
a. Kerning support
Adding support for kerning to our code is trivial, as long as we consider that we are still dealing with a left-to-right script like Latin. We simply need to retrieve the kerning distance between two glyphs in order to alter the pen position appropriately. The code looks like:
We are done. Some notes:
As kerning is determined from glyph indices, we need to explicitly convert our character codes into glyph indices, then later call FT_Load_Glyph instead of FT_Load_Char.
We use a boolean named use_kerning which is set with the result of the macro FT_HAS_KERNING. It is certainly faster not to call FT_Get_Kerning when we know that the font face does not contain kerning information.
We move the position of the pen before a new glyph is drawn.
We initialize the variable previous with the value 0, which always corresponds to the ‘missing glyph’ (also called .notdef in the Postscript world). There is never any kerning distance associated with this glyph.
We do not check the error code returned by FT_Get_Kerning. This is because the function always sets the content of delta to (0,0) when an error occurs.
b. Centering
Our code begins to become interesting but it is still a bit too simple for normal use. For example, the position of the pen is determined before we do the rendering; normally, you would rather layout the text and measure it before computing its final position (centering, etc.) or perform things like word-wrapping.
Let us now decompose our text rendering function into two distinct but successive parts: The first one will position individual glyph images on the baseline, while the second one will render the glyphs. As we will see, this has many advantages.
We will thus start by storing individual glyph images, as well as their position on the baseline. This can be done with code like:
This is a very slight variation of our previous code where we extract each glyph image from the slot, and store it, along with the corresponding position, in our tables.
Note also that pen_x contains the total advance for the string of text. We can now compute the bounding box of the text string with a simple function like:
The resulting bounding box dimensions are expressed in integer pixels and can then be used to compute the final pen position before rendering the string as in:
Some remarks:
The pen position is expressed in the cartesian space (i.e., y upwards).
We call FT_Glyph_To_Bitmap with the destroy parameter set to 0 (false), in order to avoid destroying the original glyph image. The new glyph bitmap is accessed through image after the call and is typecast to FT_BitmapGlyph.
We use translation when calling FT_Glyph_To_Bitmap. This ensures that the left and top fields of the bitmap glyph object are already set to the correct pixel coordinates in the cartesian space.
Of course, we still need to convert pixel coordinates from cartesian to device space before rendering, hence the my_target_height - bitmap->top in the call to my_draw_bitmap.
The same loop can be used to render the string anywhere on our display surface, without the need to reload our glyph images each time. We could also decide to implement word wrapping, and only draw
5. Advanced text rendering: transformation + centering + kerning
We are now going to modify our code in order to be able to easily transform the rendered string, for example to rotate it. We will start by performing a few minor improvements:
a. packing & translating glyphs
We start by packing the information related to a single glyph image into a single structure instead of parallel arrays. We thus define the following structure type:
We also translate each glyph image directly after it is loaded to its position on the baseline at load time. As we will see, this has several advantages. Our glyph sequence loader thus becomes:
Note that translating glyphs now has several advantages. The first one is that we don't need to translate the glyph bbox when we compute the string's bounding box. The code becomes:
Now take a closer look: The compute_string_bbox function can now compute the bounding box of a transformed glyph string. For example, we can do something like:
b. Rendering a transformed glyph sequence
However, directly transforming the glyphs in our sequence is not a good idea if we want to reuse them in order to draw the text string with various angles or transformations. It is better to perform the affine transformation just before the glyph is rendered, as in the following code:
There are a few changes compared to the original version of this code:
We keep the original glyph images untouched; instead, we transform a copy.
We perform clipping computations in order to avoid rendering & drawing glyphs that are not within our target surface
We always destroy the copy when calling FT_Glyph_To_Bitmap in order to get rid of the transformed scalable image. Note that the image is not destroyed if the function returns an error code (which is why FT_Done_Glyph is only called within the compound statement.
The translation of the glyph sequence to the start pen position is integrated in the call to FT_Glyph_Transform instead of FT_Glyph_To_Bitmap.
It is possible to call this function several times to render the string width different angles, or even change the way start is computed in order to move it to different place.
This code is the basis of the FreeType 2 demonstration program named ftstring.c. It could be easily extended to perform advanced text layout or word-wrapping in the first part, without changing the second one.
Note, however, that a normal implementation would use a glyph cache in order to reduce memory needs. For example, let us assume that our text string is ‘FreeType&rsquo'. We would store three identical glyph images in our table for the letter ‘e’, which isn't optimal (especially when you consider longer lines of text, or even whole pages).
6. Accessing metrics in design font units, and scaling them
Scalable font formats usually store a single vectorial image, called an outline, for each glyph in a face. Each outline is defined in an abstract grid called the design space, with coordinates expressed in nominal font units. When a glyph image is loaded, the font driver usually scales the outline to device space according to the current character pixel size found in a FT_Size object. The driver may also modify the scaled outline in order to significantly improve its appearance on a pixel-based surface (a process known as hinting or grid-fitting).
This chapter describes how design coordinates are scaled to the device space, and how to read glyph outlines and metrics in font units. This is important for a number of things:
‘true’ WYSIWYG text layout
accessing font content for conversion or analysis purposes
a. Scaling distances to device space
Design coordinates are scaled to the device space using a simple scaling transformation whose coefficients are computed with the help of the character pixel size:
Here, the value EM_size is font-specific and corresponds to the size of an abstract square of the design space (called the EM), which is used by font designers to create glyph images. It is thus expressed in font units. It is also accessible directly for scalable font formats as face->units_per_EM. You should check that a font face contains scalable glyph images by using the FT_IS_SCALABLE macro, which returns true when appropriate.
When you call the function FT_Set_Pixel_Sizes, you are specifying the value of pixel_size_x and pixel_size_y FreeType shall use. The library will immediately compute the values of x_scale and y_scale.
When you call the function FT_Set_Char_Size, you are specifying the character size in physical points, which is used, along with the device's resolutions, to compute the character pixel size and the corresponding scaling factors.
Note that after calling any of these two functions, you can access the values of the character pixel size and scaling factors as fields of the face->size->metrics structure. These fields are:
x_ppem | The field name stands for ‘x pixels per EM’; this is the horizontal size in integer pixels of the EM square, which also is the horizontal character pixel size, called pixel_size_x in the above example. |
y_ppem | The field name stands for ‘y pixels per EM’; this is the vertical size in integer pixels of the EM square, which also is the vertical character pixel size, called pixel_size_y in the above example. |
x_scale | This is a 16.16 fixed float scale that is used to directly scale horizontal distances from design space to 1/64th of device pixels. |
y_scale | This is a 16.16 fixed float scale that is used to directly scale vertical distances from design space to 1/64th of device pixels. |
You can scale a distance expressed in font units to 26.6 pixel format directly with the help of the FT_MulFix function, as in:
However, you can also scale the value directly with more accuracy by using doubles:
b. Accessing design metrics (glyph & global)
You can access glyph metrics in font units simply by specifying the FT_LOAD_NO_SCALE bit flag in FT_Load_Glyph or FT_Load_Char. The metrics returned in face->glyph->metrics will all be in font units.
You can access unscaled kerning data using the FT_KERNING_MODE_UNSCALED mode.
Freetype Font Library
Finally, a few global metrics are available directly in font units as fields of the FT_Face handle, as described in chapter 3 of this section.
Conclusion
This is the end of the second section of the FreeType 2 tutorial. You are now able to access glyph metrics, manage glyph images, and render text much more intelligently (kerning, measuring, transforming & caching).
How To Install Freetype Library
You have now sufficient knowledge to build a pretty decent text service on top of FreeType 2, and you could possibly stop here if you want.
Freetype Library Not Found
The demo programs in the ‘ft2demos’ bundle (especially ‘ftview’) are a kind of reference implementation, and are a good resource to turn to for answers. They also show how to use additional features, such as the glyph stroker and cache.
Freetype Library Centos
The next section will deal with FreeType 2 internals (like modules, vector outlines, font drivers, renderers), as well as a few font format specific issues (mainly, how to access certain TrueType or Type 1 tables).