Skip to content

A PHP-7 language codes converter, from and to the most common formats (ISO or not)

License

Notifications You must be signed in to change notification settings

whitecube/lingua

Repository files navigation

Lingua - Language Codes Converter

Travis Test Coverage GitHub release Packagist GitHub issues GitHub pull requests license

This package will convert languages from and to some common formats (ISO codes, W3C standards, PHP localization strings), including human-readable strings.

Installation

composer require whitecube/lingua

Content

The package currently supports over 220 languages. Most living languages are included, with country codes, names and possible alphabet sets (Latin, Cyrillic, Arabic, ...).

Available input and output formats are:

  1. name: The english name of the language (most of the time).
  2. native: The autonym of the language.
  3. ISO-639-1: The official two-letter code for the language. Some languages do not have this code.
  4. ISO-639-2t: An official three-letter code for terminology applications (ISO 639-2/T) for the language.
  5. ISO-639-2b: An official three-letter code for bibliographic applications (ISO 639-2/B) for the language.
  6. ISO-639-3: The official three-letter code. This is also the most common terminology and therefore the most complete language representation. In most cases, this is the same format as ISO-639-2t, except for macrolanguages.
  7. W3C: A valid string as described by the BCP 47 specification (used in the W3C's language attributes recommendations).
  8. PHP: A string with the appropriate format for PHP's setlocale(). This does not check if the locale is available on your server.

Note on macrolanguages: The ISO-639-3 output will also indicate the amount of sub-languages represented with the same ISO code.

Usage

The Lingua converter works in two stages: first you'll need to instantiate it by providing the original format, than you can convert this string as many times as you want to any of the available formats.

Note: you can instanciate some languages with their deprecated ISO code. For example, you can instanciate Hebrew with the ISO 639_1 code "iw" instead of "he".

Setters

use WhiteCube\Lingua\Service as Lingua;

// Create a converter, without knowing the original format (this will try to guess it for you)
$language = Lingua::create('en_GB');

// Create a converter from a language name
$language = Lingua::createFromName('french');
$language = (new Lingua())->fromName('german');

// Create a converter from a language's native name
$language = Lingua::createFromNative('nederlands');
$language = (new Lingua())->fromNative('magyar');

// Create a converter from a ISO 639-1 code
$language = Lingua::createFromISO_639_1('mk');
$language = (new Lingua())->fromISO_639_1('ko');

// Create a converter from a ISO 639-2t code
$language = Lingua::createFromISO_639_2t('heb');
$language = (new Lingua())->fromISO_639_2t('gle');

// Create a converter from a ISO 639-2b code
$language = Lingua::createFromISO_639_2b('her');
$language = (new Lingua())->fromISO_639_2b('iku');

// Create a converter from a ISO 639-3 code
$language = Lingua::createFromISO_639_3('aze + 2');
$language = (new Lingua())->fromISO_639_3('asm');

// Create a converter from a valid W3C language string
$language = Lingua::createFromW3C('ae');
$language = (new Lingua())->fromW3C('zh-hans-SG');

// Create a converter from a valid PHP localization string
$language = Lingua::createFromPHP('fr_BE');
$language = (new Lingua())->fromPHP('kk_Cyrl_KZ');

Formatting

use WhiteCube\Lingua\Service as Lingua;

$language = Lingua::createFromNative('français');

// Format a language in a human readable string (the language's english name)
echo $language->toName(); // "french"

// Format a language in its native form
echo $language->toNative(); // "français"

// Format a language in a ISO 639-1 string
echo $language->toISO_639_1(); // "fr"

// Format a language in a ISO 639-2t string
echo $language->toISO_639_2t(); // "fra"

// Format a language in a ISO 639-2b string
echo $language->toISO_639_2b(); // "fre"

// Format a language in a ISO 639-3 string
echo $language->toISO_639_3(); // "fra"

// Format a language in a valid W3C language attribute string (according to BCP 47)
echo $language->toW3C(); // "fr" in this case but could be "fr-BE" if country code was specified

// Format a localization string for PHP's setlocale()
echo $language->toPHP(); // "fr" in this case but could be "fr_BE" if country code was specified

Default formatting

Lingua instances can be automatically transformed to strings without calling any formatting method.

The default format is set to w3c, this means you can use Lingua instances as strings and the toW3C() method will be called out of the box.

use WhiteCube\Lingua\Service as Lingua;

echo Lingua::createFromName('italian'); // "it"

You can change this default behavior by calling the static setFormat method. Available formats are: name, native, iso-639-1, iso-639-2t, iso-639-2b, iso-639-3, w3c and php.

use WhiteCube\Lingua\Service as Lingua;

Lingua::setFormat('native');
$language = Lingua::createFromISO_639_3('ita');
echo $language; // "italiano"

Lingua::setFormat('iso-639-1');
echo $language; // "it"

Additionally it is also possible to specify a desired format during the instantiation of Lingua. This will always ignore the default formatting, even if you just called the static setFormat method.

use WhiteCube\Lingua\Service as Lingua;

$language = new Lingua('native');
$language->fromISO_639_1('ks');
echo $language; // "कश्मीरी, كشميري‎"

// Trying to change the default formatting
Lingua::setFormat('name');

echo $language; // still "कश्मीरी, كشميري‎"
echo $language->toName(); // "kashmiri"

This behavior is also possible with the static create methods. You can specify the desired formatting as second parameter.

use WhiteCube\Lingua\Service as Lingua;

echo Lingua::createFromName('maltese', 'native'); // "malti"

Registering custom languages

Sometimes you'll want to add new languages to the built-in languages repository in order to recognize them later on. Of course, if said languages should be added to the package, you should consider contributing to this repo. In the meantime you can also add them from your code thanks to the LanguagesRepository::register() method.

use WhiteCube\Lingua\Service as Lingua;
use WhiteCube\Lingua\LanguagesRepository;

// Note: all keys of the register-array showed below are optionnal.
LanguagesRepository::register([
    'name' => 'custom',
    'native' => 'custom-language',
    'iso-639-1' => 'cl',
    'iso-639-2t' => 'cla',
    'iso-639-2b' => 'cla',
    'iso-639-3' => 'cla + 2',
    'countries' => ['MY' => 'My Country'],
    'scripts' => ['latn' => 'latin', 'cyrl' => 'cyrillic']
]);

echo Lingua::createFromName('custom')->toNative(); // "custom-language"

This will not modify the ./languages.php file! Registered languages are only stored in memory.

Contributing

Your help is precious in order to make this package more accurate! You can contribute on two levels.

Working on core features

In order to add new features, only well-documented and tested pull-requests will be accepted.

Please open or comment existing issues on this repository in order to report bugs. If you're sure about what you're doing, you can send a pull-request.

Maintaining the languages repository

There are currently some languages marked with a // TODO comment in the ./languages.php file. This means we probably didn't find enough information about these languages. You know how to fill the gaps? Please let us know! You can open an issue, make a pull-request or contact us on any available platform.

And of course you can also add new languages to the repository, but only if you're sure about the terminology.

Please beware that changing some existing values in the languages.php file can result in some PHPUnit test failures, so make sure to run the tests and update them accordingly before submitting your changes.

Thank you!

💖 Sponsorships

If you are reliant on this package in your production applications, consider sponsoring us! It is the best way to help us keep doing what we love to do: making great open source software.