Skip to content

aryamanarora/bhasacitra

Repository files navigation

Bhāṣācitra

Bhāṣācitra (lit. "language map" in Sanskrit) is a database of linguistic resources on South Asian languages. Drawing from location data compiled from the sources, we map language distributions across the Subcontinent. It is a new kind of bibliography, meant to centralise and ease access to useful linguistic work on all languages of the region.

To suggest or add new entries, make a pull request or issue in this repo.

Paper and citation

The paper about this work was published at the LChange 2021 workshop held at ACL, available online open-access.

Here is the citation for that:

Aryaman Arora, Adam Farris, Gopalakrishnan R, and Samopriya Basu. 2021. Bhāṣācitra: Visualising the dialect geography of South Asia. In Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change 2021, pages 51–57, Online. Association for Computational Linguistics.

And the BibTeX:

@inproceedings{arora-etal-2021-bhasacitra,
    title = "Bh{\=a}ṣ{\=a}citra: Visualising the dialect geography of {S}outh {A}sia",
    author = "Arora, Aryaman  and
      Farris, Adam  and
      R, Gopalakrishnan  and
      Basu, Samopriya",
    booktitle = "Proceedings of the 2nd International Workshop on Computational Approaches to Historical Language Change 2021",
    month = aug,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2021.lchange-1.7",
    doi = "10.18653/v1/2021.lchange-1.7",
    pages = "51--57",
    abstract = "We present Bh{\=a}ṣ{\=a}citra, a dialect mapping system for South Asia built on a database of linguistic studies of languages of the region annotated for topic and location data. We analyse language coverage and look towards applications to typology by visualising example datasets. The application is not only meant to be useful for feature mapping, but also serves as a new kind of interactive bibliography for linguists of South Asian languages.",
}

(Feel free to chance the in the title to \d{s} if your LaTeX installation does not permit that Unicode character.)