-
Sorry, can you clarify exactly which data points you're having trouble getting?
-
There used to be a good research dataset for public company bankruptcies: https://lopucki.law.ufl.edu/index.php
~$Florida-UCLA-LoPucki Attributes.xlsx
It has disappeared, and I want to create an updated academic dataset using your data, perhaps with LLM agents to extract the appropriate data points.
There are a few data attributes I am interested in, as highlighted in the file:
Case Number, Company Name, CityDisposed, Chapter (7/11), ClaimsAgent (Yes/No), Date365Sale (if it happened), DateConfirmed, TortCause, etc.
I have been able to grab docket information very nicely using your API, https://www.courtlistener.com/api/rest/v4/dockets/,
with these case types:
bankruptcy_case_types = ['Chapter 7', 'Chapter 11', 'Chapter 13', 'Chapter 12']
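For reference, paging through that endpoint could be sketched roughly as follows. This is a minimal sketch, not the script actually used: it assumes the API returns paginated JSON with `results` and `next` keys and accepts an `Authorization: Token ...` header. `fetch_json` and `iter_results` are hypothetical helper names, and how the case-type list above is applied as a filter is not shown here.

```python
import json
import urllib.request

DOCKETS_URL = "https://www.courtlistener.com/api/rest/v4/dockets/"


def fetch_json(url, token):
    """GET one page of API results as a dict (performs a network call)."""
    request = urllib.request.Request(
        url, headers={"Authorization": f"Token {token}"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def iter_results(first_url, fetch):
    """Yield every item in `results`, following `next` links until none remain."""
    url = first_url
    while url:
        page = fetch(url)
        yield from page.get("results", [])
        url = page.get("next")
```

Something like `iter_results(DOCKETS_URL, lambda url: fetch_json(url, token))` would then stream dockets one page at a time, with `token` being your own API token.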
I have since gone directly to your bulk dockets file instead: https://storage.courtlistener.com/bulk-data/dockets-2024-10-31.csv.bz2
I have also found complementary data at https://storage.courtlistener.com/bulk-data/fjc-integrated-database-2024-10-31.csv.bz2
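Files like these can be read without decompressing them to disk first. The sketch below assumes the file has already been downloaded locally and parses as ordinary CSV with a header row; the real bulk files may need different quoting or escape settings. `iter_bz2_csv` and `select_rows` are hypothetical helper names, and any column name passed in is a placeholder to be checked against the actual header.

```python
import bz2
import csv


def iter_bz2_csv(path, encoding="utf-8"):
    """Stream rows of a .csv.bz2 file as dicts, decompressing on the fly."""
    with bz2.open(path, mode="rt", encoding=encoding, newline="") as handle:
        yield from csv.DictReader(handle)


def select_rows(rows, column, wanted):
    """Keep only rows whose value in `column` is one of `wanted`."""
    wanted = set(wanted)
    return (row for row in rows if row.get(column) in wanted)
```

Because both functions are generators, only one row is held in memory at a time, which matters for multi-gigabyte dumps.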
As you can see, these datasets do not give me enough information to recover all of these attributes. I believe access to the RECAP Archive is crucial for that, but I am having trouble using the API to download the main (free) PDF documents I need to automate the extraction.
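One possible shape for the download step is sketched below, under assumptions that would need checking against the API documentation: that each RECAP document record carries an `is_available` flag and a `filepath_local` path, and that the file can be fetched by joining that path to `https://storage.courtlistener.com/`. `pdf_url` and `download_pdf` are hypothetical helper names.

```python
import urllib.parse
import urllib.request

STORAGE_BASE = "https://storage.courtlistener.com/"  # assumed file host


def pdf_url(recap_doc):
    """Return a download URL for a document record, or None if no copy is listed."""
    path = recap_doc.get("filepath_local")  # assumed field name
    if not recap_doc.get("is_available") or not path:  # assumed field name
        return None
    return urllib.parse.urljoin(STORAGE_BASE, urllib.parse.quote(path.lstrip("/")))


def download_pdf(recap_doc, destination):
    """Save the PDF to `destination`; return False when nothing is available."""
    url = pdf_url(recap_doc)
    if url is None:
        return False
    urllib.request.urlretrieve(url, destination)  # network call
    return True
```

Records for which `pdf_url` returns None would be the ones with no free copy listed, so they can be skipped or logged.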
Perhaps I have got it all wrong.
Thank you so much for your time and for the remarkable work you continue to do in supporting research and data accessibility. Any bit of advice here would be so greatly appreciated.