GNU `gettext' utilities

1 Introduction
- 1.1 The Purpose of GNU gettext
- 1.2 I18n, L10n, and Such
- 1.3 Aspects in Native Language Support
- 1.4 Files Conveying Translations
- 1.5 Overview of GNU gettext
2 The User's View
- 2.1 Operating System Installation
- 2.2 Setting the Locale Used by GUI Programs
- 2.3 Setting the Locale through Environment Variables
- 2.4 Installing Translations for Particular Programs
3 The Format of PO Files
4 Preparing Program Sources
- 4.1 Importing the gettext declaration
- 4.2 Triggering gettext Operations
- 4.3 Preparing Translatable Strings
- 4.4 How Marks Appear in Sources
- 4.5 Marking Translatable Strings
- 4.6 Special Comments preceding Keywords
- 4.7 Special Cases of Translatable Strings
- 4.8 Letting Users Report Translation Bugs
- 4.9 Marking Proper Names for Translation
- 4.10 Preparing Library Sources
5 Making the PO Template File
- 5.1 Invoking the xgettext Program
6 Creating a New PO File
- 6.1 Invoking the msginit Program
- 6.2 Filling in the Header Entry
7 Updating Existing PO Files
- 7.1 Invoking the msgmerge Program
8 Editing PO Files
- 8.1 KDE's PO File Editor
- 8.2 GNOME's PO File Editor
- 8.3 Emacs's PO File Editor
- 8.4 Using Translation Compendia
  - 8.4.1 Creating Compendia
    - 8.4.1.1 Concatenate PO Files
    - 8.4.1.2 Extract a Message Subset from a PO File
  - 8.4.2 Using Compendia
    - 8.4.2.1 Initialize a New Translation File
    - 8.4.2.2 Update an Existing Translation File
9 Manipulating PO Files
- 9.1 Invoking the msgcat Program
- 9.2 Invoking the msgconv Program
- 9.3 Invoking the msggrep Program
- 9.4 Invoking the msgfilter Program
- 9.5 Invoking the msguniq Program
- 9.6 Invoking the msgcomm Program
- 9.7 Invoking the msgcmp Program
- 9.8 Invoking the msgattrib Program
- 9.9 Invoking the msgen Program
- 9.10 Invoking the msgexec Program
- 9.11 Highlighting parts of PO files
- 9.12 Writing your own programs that process PO files
10 Producing Binary MO Files
- 10.1 Invoking the msgfmt Program
- 10.2 Invoking the msgunfmt Program
- 10.3 The Format of GNU MO Files
11 The Programmer's View
- 11.1 About catgets
  - 11.1.1 The Interface
  - 11.1.2 Problems with the catgets Interface?!
- 11.2 About gettext
- 11.3 Comparing the Two Interfaces
- 11.4 Using libintl.a in own programs
- 11.5 Being a gettext grok
- 11.6 Temporary Notes for the Programmers Chapter
12 The Translator's View
- 12.1 Introduction 0
- 12.2 Introduction 1
- 12.3 Discussions
- 12.4 Organization
- 12.5 Information Flow
- 12.6 Translating plural forms
- 12.7 Prioritizing messages: How to determine which messages to translate first
13 The Maintainer's View
- 13.1 Flat or Non-Flat Directory Structures
- 13.2 Prerequisite Works
- 13.3 Invoking the gettextize Program
- 13.4 Files You Must Create or Alter
- 13.5 Autoconf macros for use in configure.ac
- 13.6 Integrating with CVS
- 13.7 Creating a Distribution Tarball
14 The Installer's and Distributor's View
15 Other Programming Languages
- 15.1 The Language Implementor's View
- 15.2 The Programmer's View
- 15.3 The Translator's View
- 15.4 The Maintainer's View
- 15.5 Individual Programming Languages
- 15.6 Internationalizable Data
16 Concluding Remarks
- 16.1 History of GNU gettext
- 16.2 Related Readings
Appendix A Language Codes
- A.1 Usual Language Codes
- A.2 Rare Language Codes
Appendix B Country Codes
Appendix C Licenses
- C.1 GNU GENERAL PUBLIC LICENSE
  - Preamble
  - Appendix: How to Apply These Terms to Your New Programs
- C.2 GNU LESSER GENERAL PUBLIC LICENSE
  - Preamble
  - How to Apply These Terms to Your New Libraries
- C.3 GNU Free Documentation License
  - ADDENDUM: How to use this License for your documents
Program Index
Option Index
Variable Index
PO Mode Index
Autoconf Macro Index
General Index

GNU gettext

This manual documents GNU gettext version 0.18.

1 Introduction

This chapter explains the goals sought in the creation of GNU gettext and the free Translation Project. Then, it explains a few broad concepts around Native Language Support, and positions message translation with regard to other aspects of national and cultural variance, as they apply to programs. It also surveys those files used to convey the translations. It explains how the various tools interact in the initial generation of these files, and later, how the maintenance cycle should usually operate.

In this manual, we use he when speaking of the programmer or maintainer, she when speaking of the translator, and they when speaking of the installers or end users of the translated program. This is only a convenience for clarifying the documentation. It is absolutely not meant to imply that some roles are more appropriate to males or females. Besides, as you might guess, GNU gettext is meant to be useful for people using computers, whatever their sex, race, religion or nationality!

1.1 The Purpose of GNU gettext

Usually, programs are written and documented in English, and use English at execution time to interact with users. This is true not only of GNU software, but also of a great deal of proprietary and free software. Using a common language is quite handy for communication between developers, maintainers and users from all countries. On the other hand, most people are less comfortable with English than with their own native language, and would prefer to use their mother tongue for day-to-day work, as far as possible. Many would simply love to see their computer screen showing a lot less English, and far more of their own language.

However, to many people, this dream might appear so far-fetched that they may believe it is not even worth spending time thinking about it. They have no confidence at all that the dream might ever come true. Yet some have not lost hope, and have organized themselves. The Translation Project is a formalization of this hope into a workable structure, which has a good chance of getting all of us nearer to the achievement of a truly multi-lingual set of programs.

GNU gettext is an important step for the Translation Project, as it is an asset on which we may build many other steps. This package offers programmers, translators, and even users a well-integrated set of tools and documentation. Specifically, the GNU gettext utilities are a set of tools that provides a framework within which other free packages may produce multi-lingual messages.

GNU gettext is designed to minimize the impact of internationalization on program sources, keeping this impact as small and hardly noticeable as possible. Internationalization has a better chance of succeeding if it is very lightweight, or at least appears to be so, when looking at program sources.

The Translation Project also uses the GNU gettext distribution as a vehicle for documenting its structure and methods. This goes beyond the strict technicalities of documenting GNU gettext proper. This way, translators will find in a single place, as far as possible, all they need to know to do their translating work properly. This supplemental documentation might also help programmers, and even curious users, understand how GNU gettext is related to the remainder of the Translation Project, and consequently get a glimpse of the big picture.

1.2 I18n, L10n, and Such

Two long words appear all the time when we discuss support of native language in programs, and these words have a precise meaning, worth explaining here, once and for all in this document. The words are internationalization and localization. Many people, tired of writing these long words over and over again, have taken to writing i18n and l10n instead, quoting the first and last letters of each word, and replacing the run of intermediate letters by a number merely telling how many such letters there are. But in this manual, for the sake of clarity, we will patiently write the names in full, each time...
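The abbreviation rule just described can be sketched in a few lines of C. This is only an illustration: the function name numeronym and its return convention are invented for this example and are not part of GNU gettext.

```c
#include <stdio.h>
#include <string.h>

/* Hypothetical helper, not part of GNU gettext.
   Writes the abbreviated form of `word` into `out`: first letter,
   count of intermediate letters, last letter.
   Returns 0 on success, -1 if the word has fewer than three letters
   or if `out` (of capacity `outsize`) is too small. */
int numeronym(const char *word, char *out, size_t outsize)
{
    size_t len = strlen(word);
    int written;

    if (len < 3)
        return -1;

    /* len - 2 is the number of letters between the first and the last. */
    written = snprintf(out, outsize, "%c%zu%c",
                       word[0], len - 2, word[len - 1]);
    return (written < 0 || (size_t) written >= outsize) ? -1 : 0;
}
```

Called with "internationalization" (20 letters, hence 18 intermediate ones), the function stores "i18n" in the buffer; called with "localization" (12 letters), it stores "l10n".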

By internationalization, one refers to the operation by which a program, or a set of programs turned into a package, is made aware of and able to support multiple languages. This is a generalization process, by which the programs are untied from using only English strings or other English-specific habits, and connected to generic ways of doing the same, instead. Program developers may use various techniques to internationalize their programs. Some of these have been standardized. GNU gettext offers one of these standards. See Programmers.

By localization, one means the operation by which, in a set of programs already internationalized, one gives the program all needed information so that it can adapt itself to handle its input and output in a fashion which is correct for some native language and cultural habits. This is a particularization process, by which generic methods already implemented in an internationalized program are used in specific ways. The programming environment puts several functions at the programmer's disposal which allow this runtime configuration. The formal description of a specific set of cultural habits for some country, together with all associated translations targeted to the same native language, is called the locale for this language or country. Users achieve localization of programs by setting proper values to special environment variables, prior to executing those programs, identifying which locale should be used.
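As a rough sketch of this runtime configuration, the C fragment below lets the environment choose the locale and then looks up a message. It assumes a system whose C library provides <libintl.h> (as GNU libc does). The text domain name "gr-example" and the catalog directory "/usr/share/locale" are placeholders chosen for illustration, and init_nls and greeting are hypothetical helper names.

```c
#include <libintl.h>
#include <locale.h>

/* Conventional shorthand for marking and translating a string. */
#define _(msgid) gettext(msgid)

/* Placeholder values for illustration only. */
#define PACKAGE   "gr-example"
#define LOCALEDIR "/usr/share/locale"

/* Select the locale named by the user's environment variables
   (LC_ALL, LC_MESSAGES, LANG, ...) and tell gettext where the
   message catalogs of this package are expected to live.
   Returns the name of the selected locale, or NULL if the
   requested locale is not available on this system. */
const char *init_nls(void)
{
    const char *locale = setlocale(LC_ALL, "");

    bindtextdomain(PACKAGE, LOCALEDIR);
    textdomain(PACKAGE);
    return locale;
}

/* Returns the translation if a catalog for the current locale
   contains one, and the original English string otherwise. */
const char *greeting(void)
{
    return _("Hello, world!");
}
```

A program would call init_nls once at startup and then print greeting(). Since no catalog exists for the placeholder domain, the English string comes back unchanged whatever the environment says; with a real catalog installed, starting the program with a different LANG value would select a different translation without any change to the source.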

In fact, locale message support is only one component of the cultural data that makes up a particular locale. There is a whole host of routines and functions provided to aid programmers in developing internationalized software and which allow them to access the data stored in a particular locale. When someone refers to a particular locale, they are referring to the data stored within that particular locale. Similarly, if a programmer is referring to “accessing the locale routines”, they are referring to the complete suite of routines that access all of the locale's information.

One uses the expression Native Language Support, or merely NLS, for speaking of the overall activity or feature encompassing both internationalization and localization, allowing for multi-lingual interactions in a program. In a nutshell, one could say that internationalization is the operation by which further localizations are made possible.

Also, very roughly said, when it comes to multi-lingual messages, internationalization is usually taken care of by programmers, and localization is usually taken care of by translators.

1.3 Aspects in Native Language Support

For a totally multi-lingual distribution, there are many things to translate beyond output messages.

As we already stressed, translation is only one aspect of locales. Other internationalization aspects are system services and are handled in GNU libc. There are many attributes that are needed to define a country's cultural conventions. These attributes include, besides the country's native language, the formatting of the date and time, the representation of numbers, the symbols for currency, etc. These local rules are termed the country's locale. The locale represents the knowledge needed to support the country's native attributes.
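To make these non-message attributes concrete, here is a minimal C sketch that queries two of them through the standard C library: the decimal point character and the preferred date representation. The helper names decimal_point and format_local_date are invented for this example; localeconv and strftime are standard C functions.

```c
#include <locale.h>
#include <stddef.h>
#include <time.h>

/* Hypothetical helper: the character sequence the current locale
   uses to separate the integer and fractional parts of a number. */
const char *decimal_point(void)
{
    return localeconv()->decimal_point;
}

/* Hypothetical helper: format the date in `when` using the current
   locale's preferred date representation (the %x conversion).
   Returns the number of characters written, or 0 if `out` is too small. */
size_t format_local_date(char *out, size_t outsize, const struct tm *when)
{
    return strftime(out, outsize, "%x", when);
}
```

In the default "C" locale, the decimal point is "." and %x formats January 2, 2000 as "01/02/00". After a call such as setlocale(LC_ALL, "") in an environment that selects another locale, both results follow that locale's conventions instead.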

There are a few major areas which may vary between countries and hence define what a locale must describe. The following list helps put multi-lingual messages into the proper context of other tasks related to locales. See the GNU libc manual for details.
