GitHub - macs30200-s22/thesis_pranathiiyer: replication-materials-pranathiiyer created by GitHub Classroom

The final thesis can be found under Pranathi_thesis_final_draft. The code and data in this repository are a preliminary step towards the final objective of understanding the structure, trends, relevance,and success of matrimonial advertisements in the Indian context, throught this project. While this repository currently contains advertisements for prospective brides and grooms from the year 2001 to 2009, and 2014, the final project will also have data for 2010, 2011, 2012, and 2013.These years are slightly difficult to scrape owing to changes in the website. The data for grooms can be found under grooms_data in the folder data, and the data for brides can be found under brides_data in the folder data.

The code is written in Python 3.9.10 and all of its dependencies can be installed by running the following in the terminal (with the requirements.txt file included in this repository):

                     pip install -r requirements.txt

Then, you can import the analysis and plot_num_ads module located in this repository to reproduce the analysis in the project that this code supplements (in a Jupyter Notebook, like README.ipynb in this repository, or in any other Python script):

Findings

It is extremely challenging to reproduce the tables generated in first section of the results section. This is because the analysis involves running a code for individual years and then synthesizing relevant words and presenting them in a tabular format. However, you can generate year-wise proper nounns and top three character words for either category of ads for any year as shown below.

analysis.final_func('data/brides_data/brides-wanted_2008.csv')

The above commands can easily be run for any of the files from the data folder to reproduce other analyses.

You can use the file sikh_jat_ads.py to successfully generate the first figure of the paper. You can also run the command below to generate the plot.

import sikh_jat_ads

sikh_jat_ads.plot()

You can run use the file KL_divergence_grooms.py and KL_divergence_brides.py to generate the heat maps in figure 2. You can also run the following command to generate the results.

import KL_divergence_brides

KL_divergence_brides.heat_map()

You can run the same command using KL_divergence_grooms to generate the second heatmap.

You can generate the projections in Figure 3 and Figure 4 using the files groom_projection_adjectives.py, groom_projection_occupations.py, brides_projection_adjectives.py, and bride_projection_occupations.py. You can run the commands below to generate the results.

import groom_projection_adjectives

groom_projection_adjectives.final_plot()

You can run the same command on groom_projection_occupations.py to generate the projection of occupational words.

import brides_projection_adjectives

brides_projection_adjectives.final_plot()

You can run the same command on bride_projection_occupations.py to generate the projection of occupational words.

You can also find the embedding models trained on bride and groom seeking advertisements under brides_wanted and grooms_wanted respectively.

If you choose to cite this work, or repo, please use the citation under "cite this repository" on github.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Findings

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
README_files		README_files
data		data
CITATION.cff		CITATION.cff
KL_divergence_brides.py		KL_divergence_brides.py
KL_divergence_grooms.py		KL_divergence_grooms.py
Pranathi_thesis_final_draft.pdf		Pranathi_thesis_final_draft.pdf
README.ipynb		README.ipynb
README.md		README.md
analysis.py		analysis.py
bride_projections_occupations.py		bride_projections_occupations.py
brides_projection_adjectives.py		brides_projection_adjectives.py
brides_wanted		brides_wanted
groom_projection_adjectives.py		groom_projection_adjectives.py
groom_projection_occupation.py		groom_projection_occupation.py
grooms_wanted		grooms_wanted
requirements.txt		requirements.txt
sikh_jat_ads.py		sikh_jat_ads.py

macs30200-s22/thesis_pranathiiyer

Folders and files

Latest commit

History

Repository files navigation

Findings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages