This repository has been archived by the owner on Jan 31, 2023. It is now read-only.

JSON File Specification

A description of the fields within the JSON file for each case.

TOC

Case Name

Example

Saint Paul Holmes v. John Doe

Field Name

name

Data Type

string

Required

yes

Notes

This will generally just be the names of the two parties, but not always.

Case Number

Example

981428

Field Name

number

Data Type

string

Required

yes

Notes

The format of this varies enormously between courts.

Date Published

Example

1999-04-16

Field Name

date_published

Data Type

date

Required

yes

Notes

Sometimes a decision has multiple dates, such as when it is revised; Noah is evaluating how to handle this. Other candidate date fields include the date on which the decision was published in a law journal (when no other date is provided) and the date on which the decision was revised.
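The example value follows the ISO 8601 calendar-date form (`YYYY-MM-DD`). The following is a minimal sketch of how a consumer might parse it, assuming every file uses that form; the specification only shows a single example, and `parse_date_published` is a hypothetical helper name, not part of this project.

```python
from datetime import date


def parse_date_published(value: str) -> date:
    """Parse a date_published string, assuming the YYYY-MM-DD form shown in the example."""
    # date.fromisoformat raises ValueError when the string is not a valid ISO date.
    return date.fromisoformat(value)


published = parse_date_published("1999-04-16")
print(published.year, published.month, published.day)  # 1999 4 16
```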

Is Published?

Example

true

Field Name

is_published

Data Type

boolean

Required

yes

Notes

There are two kinds of decisions: "published" and "unpublished." The terms don't refer to whether a decision is written down and appears online, but to whether it may be considered a source of precedent. The default state of a decision is published, but it is important to be able to record when an opinion is unpublished.

Court Name

Example

Supreme Court of Virginia

Field Name

court

Data Type

string

Required

yes

Notes

Court Location

Example

Richmond

Field Name

court_location

Data Type

string

Required

no

Notes

While this is unstructured text, it ought to be a placename, and may be geocodable.

Author

Example

Charles L. McCormick, III

Field Name

author

Data Type

string

Required

no

Notes

This is probably the name of one of the judges who heard the case, but theoretically it could be somebody else.

Outcome

Example

Affirmed

Field Name

outcome

Data Type

unknown

Required

no

Notes

It's not yet clear what this is going to look like. Noah should have some ideas about this.

Parties

Example

array(	"plaintiff" => "Saint Paul Holmes",
		"defendant" => "John Doe")

Field Name

parties

Data Type

associative array

Required

yes

Notes

Given the nature of appeals (the role of the parties depends on who appeals, so it can reverse along the way), is there even value in storing whether somebody is the plaintiff or the defendant?

Judges

Example

array("Charles L. McCormick, III")

Field Name

judges

Data Type

array

Required

no

Summary

Example

In a jury trial of an automobile accident liability claim, the trial court properly permitted the introduction of expert testimony concerning the relationship between tire tread depth and hydroplaning by a motor vehicle. The judgment is affirmed.

Field Name

summary

Data Type

string

Required

no

Key Sentence

Example

Lee’s expert testimony was appropriate to aid the jury in putting this factual evidence into an appropriate context from which the jury could draw its own conclusions.

Field Name

key_sentence

Data Type

string

Required

no

Notes

This key sentence is extracted via NLP.

Attorneys

Example

array(	"plaintiff" => "Lionel Hutz, Esq.",
		"defendant" => "Miguel Sanchez")

Field Name

attorneys

Data Type

associative array

Required

no

Notes

Noah is evaluating the viability of identifying the party whom each attorney represents.

Might these actually be nested arrays? Often there will be more than one attorney for a party. Do we want to represent each of those attorneys as their own element in an array, or just make it one long string for each party?
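The question above is still open. Purely as an illustration of the nested-array option, here is a sketch of a reader that accepts either shape and always returns a list of names per party. The function name `normalize_attorneys` and the mixed input are assumptions made for this example, not a decision recorded in this specification.

```python
from typing import Dict, List, Union

# A party's attorneys may arrive as one string or as a list of strings (assumption).
AttorneyValue = Union[str, List[str]]


def normalize_attorneys(attorneys: Dict[str, AttorneyValue]) -> Dict[str, List[str]]:
    """Return a copy in which every party maps to a list of attorney names."""
    normalized = {}
    for party, value in attorneys.items():
        # A bare string is treated as a single attorney; a list is copied as-is.
        normalized[party] = [value] if isinstance(value, str) else list(value)
    return normalized


example = {
    "plaintiff": "Lionel Hutz, Esq.",
    "defendant": ["Miguel Sanchez"],
}
print(normalize_attorneys(example))
```

Storing a list per party keeps each attorney individually addressable, at the cost of a slightly more verbose file.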

Type of Case

Example

civil

Field Name

type

Data Type

enum: civil or criminal

Required

yes
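Because `type` is limited to two values, a consumer could reject anything else when loading a file. A minimal sketch, assuming the values are stored as the lowercase strings shown above; `CaseType` and `parse_case_type` are hypothetical names.

```python
from enum import Enum


class CaseType(Enum):
    CIVIL = "civil"
    CRIMINAL = "criminal"


def parse_case_type(value: str) -> CaseType:
    """Map a raw type string to CaseType."""
    # Looking up an Enum by value raises ValueError for any string
    # other than the two allowed ones.
    return CaseType(value)


print(parse_case_type("civil"))  # CaseType.CIVIL
```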

Cited Laws

Example

array(	"38.2-2206",
		"46.2-1043",
		"8.01-401.3")

Field Name

cited_laws

Data Type

array

Required

no

Notes

Some laws are cited repeatedly. Do we want to store a representation of the unique laws that were cited, or do we want to store a representation of all citations? That is, are we telling people that § 38.2-2206 was cited 3 times, or just that it was cited at all?
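The two options in the note can be compared side by side. The sketch below starts from a hypothetical list of every citation in the order it appears in a decision (the section numbers are taken from the example, the repetition is invented for illustration) and derives both representations.

```python
from collections import Counter

# Hypothetical sequence of citations as they appear in a decision.
all_citations = ["38.2-2206", "46.2-1043", "38.2-2206", "8.01-401.3", "38.2-2206"]

# Option 1: unique laws only, preserving first-seen order.
unique_laws = list(dict.fromkeys(all_citations))

# Option 2: each law together with the number of times it was cited.
citation_counts = Counter(all_citations)

print(unique_laws)                   # ['38.2-2206', '46.2-1043', '8.01-401.3']
print(citation_counts["38.2-2206"])  # 3
```

Storing counts is a superset of storing unique laws, since the unique list can always be recovered from the keys.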

Cited Cases

Example

unclear—working on this

Field Name

cited_cases

Data Type

indexed array

Required

no

Notes

Some cases are cited repeatedly. See Cited Laws for the ramifications of this.

Decision Text

Example

array(	array('format' => 'pdf', 'url' => 'http://www.courts.state.va.us/opinions/opnscvwp/1981428.pdf'),
		array('format' => 'txt', 'url' => '/rulings/1981428.txt'))

Field Name

text

Data Type

array

Required

yes

Notes

This does not store the actual text, but just links to the text in its various formats.
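To tie the fields together, here is a sketch of a complete record built from the example values in this document, with a check for the fields marked "Required: yes". The JSON encoding is an assumption: keyed arrays are written as JSON objects, and `text` is written as a list of `format`/`url` objects. `missing_required_fields` is a hypothetical helper.

```python
import json

# Fields marked "Required: yes" in this specification.
REQUIRED_FIELDS = [
    "name", "number", "date_published", "is_published",
    "court", "parties", "type", "text",
]


def missing_required_fields(record: dict) -> list:
    """Return the required field names that are absent from a case record."""
    return [field for field in REQUIRED_FIELDS if field not in record]


# A record assembled from the example values given in this document.
case_json = """
{
  "name": "Saint Paul Holmes v. John Doe",
  "number": "981428",
  "date_published": "1999-04-16",
  "is_published": true,
  "court": "Supreme Court of Virginia",
  "parties": {"plaintiff": "Saint Paul Holmes", "defendant": "John Doe"},
  "type": "civil",
  "text": [
    {"format": "pdf", "url": "http://www.courts.state.va.us/opinions/opnscvwp/1981428.pdf"},
    {"format": "txt", "url": "/rulings/1981428.txt"}
  ]
}
"""

record = json.loads(case_json)
print(missing_required_fields(record))  # []
```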