Skip to content
/ lebmap Public

🇱🇧 A dialectal-feature map of Lebanese Arabic

License

Notifications You must be signed in to change notification settings

lebnxyz/lebmap

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

An interactive map charting dialectal features of Lebanese Arabic. Based on the results of this survey posted to /r/Lebanon on 8/13/19.

Please check out this quick photo album for an intro that's not technical. (The captions do assume knowledge of Arabic, but it should help you understand how the map and querying features work even if you can't read those parts.) If you're looking for a more-boring description, carry on!

How to use

The default tab, Questions, contains a list of everything that was asked on the survey. If you click on any one question, you'll be provided with a list of the different words or situations that were asked about as a part of that question, if any; clicking on one of these will then show you a list of specific options for the realization of those words or situations. This will activate the chart at the top to show how many people answered one way or another. Clicking on one of these specific options will highlight pins on the map.

The pins represent respondents to the survey, where a bigger pin means more respondents at that location. Clicking on a pin selects it, and clicking (& dragging, if you wish) on regions of the map selects all of the pins inside. If you select a pin, several pins, or a bunch of regions, the only survey answers taken into account will be the ones from those specific areas. You can use this to your advantage to compare how many people in different regions picked a particular answer to a question.

The Answers tab currently does nothing, although you can at least use it to check the names of the regions you've selected.

The Query tab is a little bit more advanced, but also more powerful. It lets you search for specific combinations of answers to see how many people in your selected region of the map chose the answers you're looking for. For example, let's say our survey looks like this:

Question #0: How often do you drink coffee?
    Example #0: Espresso
        Option #0: Every day
        Option #1: Semi-weekly
        Option #2: Never
    Example #1: Decaf
        Option #0: Every day
        Option #1: Semi-weekly
        Option #2: Never

Question #1: Do you go to bed early or late?
    Option #0: Early
    Option #1: Late
    Option #2: I don't sleep.

Let's try using the code question number . example number : option number to refer to specific answer choices. The code 0.0 would refer to "Espresso" (question #0 & example #0), 0.1 would refer to "Decaf" (question #0 & example #1), and 0.0:1 would refer to semi-weekly espresso-drinkers (question #0, example #0, option #1). Similarly, 0.1:2 would refer to non-drinkers of decaf (question #0, example #1, option #2). Notice that question #1 has no examples listed, and it instead jumps straight into the options; that means our code for referring to it doesn't have to have the dot in it. 1:0 refers to early sleepers (question #1, option #0), and 1:2 refers to our insomniac friends who don't sleep (question #1, option #2).

So! Under the Query tab, you can make use of these codes along with some operators to phrase a specific question about the survey's respondents. If I want to find out about the number of people who drink espresso daily and don't sleep, I could phrase that question as 0.0:0 & 1:2; we've discussed those two codes, and the operator & says that we're looking for people for whom both the first one and the second one hold true. If I want to find out about people who drink either espresso or decaf a few times a week, I could phrase that question as 0.0:1 | 0.1:1; the operator | means that we want either of the two to be true. Now, if I want to find out about the amount of those people that can't go to sleep at night, I could use the query (0.0:1 | 0.1:1) & 1:2.

On the other hand, if I want to find out about the correlation of those results -- that is, how many people picked the same answer for not being able to sleep as they did for drinking either espresso or decaf semi-weekly -- that's where the = operator comes in. 0.0:0 = 1:2 lets me know how strong the correlation is between espressoholics and non-sleepers, for example. Feel free to play around combining this with other operators to create complex queries.

Lastly, I'm not really sure if the "Out of..." textbox on the Query tab is useful or not. It shouldn't be used for now because it's incomplete, but my reasoning for including it was: by default, when you input a query, it searches for respondents within the region you've selected on the map (or within the whole map if you haven't selected any region), so the idea of putting a second query into the "Out of..." box is that it further restricts the search to only people who match that second query. For example, I can search for 0.0:0 out of 0.0:0 | 0.0:1 to figure out how many people drink espresso daily, out of the amount of people who drink it either daily or semi-weekly. However, this doesn't yet affect what pins are highlighted on the map (it only impacts what's graphed on the bar-chart), and the app doesn't check to make sure there's a valid relationship between your "Search for..." query and your "Out of..." query, so it's up to you to make sure the latter's results actually make up a superset of the former's.

For convenience, you can click around inside the list of questions below the textbox to have question codes inputted automatically, and you can add operators using the row of buttons below the textbox. The full syntax spec for querying is as follows:

  • option = option: Find respondents who picked either both of the two options or neither.
  • option & option: Find respondents who picked both options.
  • option | option: Find respondents who picked either one option or the other.
  • !option: Find respondents who did not pick this option.
  • (some operation goes here): Parentheses are used for grouping, like in PEMDAS. The default order of evaluation is just left-to-right, so parentheses can be necessary.

(There's also a semi-hidden syntax using the asterisk *. If you specify question number : * or question number . example number : *, e.g. 0.0:* or 1:*, it gives you the people who chose "Other" for that question, i.e. wrote in a response that wasn't one of the multiple-choice answers.)

Comments

I procrastinated a lil bit and stretched out less than a month's worth of work over, like, 3/4 of a year from start to finish. But at least we did eventually finish!

There's still a bit more that I can do, anyway. For example, the Lebanon map data I used was apparently specifically intended for tracking voting, so it treats Beirut as three separate parts -- that isn't useful here, and it slightly messes up the way Beirut's pin(s) look(s) when it or its region is selected. I could also try implementing the "Answers" tab, which would show specific respondents and their statistics and answer choices. But the really necessary stuff is all in place, which I'm very happy about.

OTOH, I'm a bit sad that I published the survey before I actually knew enough about Lebanese to ask half of these questions. A lot of them are based on shaky understandings of phonology or regional morphological variation and could have yielded some way-more-useful insights had I waited to ask them until I'd learned a bit more. Oh well. This project is a precursor to and building-block for the remainder of @lebnxyz, which will include a comprehensive online dictionary hooked into a (hopefully better) survey.

Also! Some of the code is hot garbage, especially the parts relating to the query functionality. Pardon me there :)

Build info

I really wanted to love Parcel, but (as of release 1.12.3) it just isn't mature enough yet to be reliable. The issues at parcel-bundler#501, parcel-bundler#1294, and parcel-bundler#2262 slowly added to my mounting frustration, with the third being the absolute last straw, so I switched to Webpack... and everything about the build process is a little bit ickier as a result, but I'll gladly take icky over broken.

So if you'd like to tinker with the project yourself, the following four NPM scripts are relevant:

  • dev: Serves a development build. Automatically opens in default browser.
  • prodserve: Ditto, but serves a production build (just in case).
  • build: Builds for production.
  • devbuild: Bulids for development (just in case).

(And don't look at the remaining scripts. Please. It's for your own good.)

Of note is my patent abuse of Pages' "build from docs/" feature, allowing for local debugging from the project root without requiring maintenance of a separate gh-pages branch to deploy from. If it works...!