Skip to content

Collecting and maintaining crucial Julia package data, including Names, UUIDs, and download statistics, for enhanced accessibility, insight, and discoverability.

License

Notifications You must be signed in to change notification settings

analyticsinmotion/julia-packages-data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation


Julia Packages Data Tweet

Julia   MIT license   Status: Stable   Julia Packages Data   Julia Package Requests   Analytics in Motion  


👋 Welcome to the Julia Packages Data Repository on GitHub!

About this Project

This repository offers a systematic and consistent mapping of all Julia package names to their Universally Unique Identifiers (UUIDs). Our aim is to maintain an accurate and current record of these essential identifiers which can then be utilized to enhance other datasets, including the Julia package download statistics. This approach offers valuable insights into the Julia package ecosystem, making it easier for users and developers to access and analyze the relevant information.

Thank you for being a part of the Julia Package Data Repository. Explore, and stay informed with the latest in Julia Packages Data. Your support and involvement are greatly appreciated!


Files available in Julia Packages Data

The following table provides an overview of the data that can be accessed in this repository.

File Name Updated Description
julia_package_names.csv Daily Matches all current Julia Package Names to their UUID.
julia_package_requests.csv Daily Total request count for all Julia Packages.
julia_package_requests_by_date.csv Daily The request counts by date for all Julia Packages.
julia_package_requests_by_region.csv Daily Total requests for each Julia Package, categorized by geographical region.
julia_package_requests_by_region_by_date.csv Daily Request counts for each Julia Package, split by Geographical Region, and request date.

How to access and download data files

There are two methods that can be used to access and download the data files:

Download directly from GitHub

  1. Click on the link from the table able to access the raw version of the data file. Some of the files are quite large so the data may not show in the screen. This is perfectly fine and will not affect the download at all.

  2. Click the "Download" button located at the top right of the screen to download the raw data file to your local machine. If prompted, choose a location to save the file.


Using Julia to access data

  1. Install the required packages:
using Pkg; Pkg.add(["HTTP", "CSV", "DataFrames"])
  1. Import the required modules:
using HTTP, CSV, DataFrames
  1. Create a helper function:
function get_julia_package_data(file_name::String, return_dataframe::Bool=true, download_csv::Bool=false)
    url = string("https://raw.githubusercontent.com/analyticsinmotion/julia-packages-data/main/data/", file_name)
    response = HTTP.get(url)
    response.status == 200 || error("Failed to retrieve data from the URL")
    df_julia_package_data = IOBuffer(response.body) |> CSV.File |> DataFrame    
    download_csv ? CSV.write("julia_package_names.csv", df_julia_package_data) : nothing  
    return return_dataframe ? df_julia_package_data : nothing
end
  1. Call the following helper functions

For Julia Package Names

# Variable values of true, true will return the DataFrame within Julia and also export it as a CSV file
file_name = "julia_package_names.csv"
get_julia_package_data(file_name, true, true)

For Julia Package Requests

# Variable values of true, true will return the DataFrame within Julia and also export it as a CSV file
file_name = "julia_package_requests.csv"
get_julia_package_data(file_name, true, true)

For Julia Package Requests by Date

# Variable values of true, true will return the DataFrame within Julia and also export it as a CSV file
file_name = "julia_package_requests_by_date.csv"
get_julia_package_data(file_name, true, true)

For Julia Package Requests by Region

# Variable values of true, true will return the DataFrame within Julia and also export it as a CSV file
file_name = "julia_package_requests_by_region.csv"
get_julia_package_data(file_name, true, true)

For Julia Package Requests by Region by Date

# Variable values of true, true will return the DataFrame within Julia and also export it as a CSV file
file_name = "julia_package_requests_by_region_by_date.csv"
get_julia_package_data(file_name, true, true)



Data Dictionary

Julia Package Names

The julia_package_names.csv file contains a list of all current Julia package names along with their corresponding Universally Unique Identifier (UUID).


File Details
Filename: julia_package_names
Extension: .csv
Delimiter: Comma (,)
Header: True


File Schema

Column Name Data Type Description
package_name String The name of the Julia Package
package_uuid String The Universally Unique Identifier (UUID) for the Julia Package