jacobmarks/README.md

👋 Hi there!

I'm Jacob, a Senior Machine Learning Engineer & Researcher at Voxel51, where we're revolutionizing AI and computer vision with our powerful open-source toolset, FiftyOne.

🎓 Education

  • Ph.D. in Theoretical Physics, Stanford University
  • B.S. in Intensive Physics, Math & Philosophy, Yale University

πŸ“ Blogging & Writing

I regularly write about technical topics on Medium, where I have over 7,500 followers. My writing covers AI, ML, computer vision, data cleaning and curation, and more!

See popular articles
How I Turned My Company's Docs into a Searchable Database with OpenAI
April 25, 2023 | Towards Data Science
In this article, I discuss how I leveraged OpenAI's GPT-3 to turn my company's documentation into a searchable database. This project simplifies the way we access and interact with internal resources, enhancing productivity.
How I Turned ChatGPT into an SQL-Like Translator for Image and Video Datasets
June 08, 2023 | Towards Data Science
In this article, I discuss how I used GPT-3.5 to create a text-to-query translator that allows users to interact with image and video datasets using natural language.
What I Learned Pushing Prompt Engineering to the Limit
June 12, 2023 | Towards Data Science
In this article, I share my experiences and lessons learned from pushing the boundaries of prompt engineering. Using advanced techniques, I explore how to make the most out of language models for various applications.
AI Telephone — A Battle of Multimodal Models
June 15, 2023 | Towards Data Science
In this article, I explore the competitive landscape of multimodal AI models by setting up an "AI Telephone" experiment. I discuss the intricacies of various models and how they perform in this unique setup.
An Ode to my Physics Ph.D.
July 18, 2023 | Towards Data Science
In this article, I open up about my journey from physics to machine learning, the challenge of transitioning into industry, and the lessons I learned along the way!
How to Build a Semantic Search Engine for Emojis
January 09, 2024 | Towards Data Science
In this article, I detail the process of building a custom vector search pipeline utilizing multimodal data, cross-encoders, and reranking!

🤗 Connect!

LinkedIn Medium Twitter Hugging Face

If you have an idea for an integration, plugin, blog post, or something else you'd like to chat about, feel free to reach out!

Things I Care About

  • ⚛️ Physics
  • 🌎 Climate
  • 📖 Open source | Open science
  • 🫀 Building with purpose

Awesome Open Source Projects

Here is a short list of some open source libraries I love ❤️! I've contributed to some of them, and some I just love using 😎

Data
Models
LLMs
Vector Databases

Pinned

  1. voxel51/fiftyone-docs-search Public

    Search docs.voxel51.com with an LLM!

    Python · 332 stars · 57 forks

  2. awesome-neurips-2023 Public

    Conference schedule, top papers, and analysis of the data for NeurIPS 2023!

    Jupyter Notebook · 100 stars · 6 forks

  3. voxel51/voxelgpt Public

    AI assistant that can query visual datasets, search the FiftyOne docs, and answer general computer vision questions

    Python · 220 stars · 19 forks

  4. image-quality-issues Public

    FiftyOne Plugin for finding common image quality issues

    Python · 21 stars · 2 forks

  5. voxel51/papers-with-data Public

    A curated list of papers that released datasets along with their work

    Python · 124 stars · 8 forks

  6. zero-shot-prediction-plugin Public

    Run zero-shot prediction models on your data

    Python · 26 stars · 2 forks