How to search for the 100 biggest repositories


#1

I know this search is possible with BigQuery, using gharchive.org’s data from 2010 to 2015, but I was wondering whether it is also available in GraphQL.

Right now I know you can look up diskUsage if you know the name and owner of a repository, but can you search for the 100 repositories with the largest diskUsage values?
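
For reference, the per-repository lookup I mean can be sketched roughly like this. It only builds the JSON body to POST to https://api.github.com/graphql (with an access token in the Authorization header) and does not send anything. The helper name `build_disk_usage_request` and the example owner/name are my own placeholders, not anything from the API.

```python
import json

# GraphQL query for one repository's disk usage (the API reports kilobytes).
DISK_USAGE_QUERY = """
query($owner: String!, $name: String!) {
  repository(owner: $owner, name: $name) {
    nameWithOwner
    diskUsage
  }
}
"""


def build_disk_usage_request(owner: str, name: str) -> str:
    """Return the JSON request body for a single-repository diskUsage lookup.

    Hypothetical helper: it only serializes the query and variables; sending
    the request (and authenticating) is left to the caller.
    """
    payload = {
        "query": DISK_USAGE_QUERY,
        "variables": {"owner": owner, "name": name},
    }
    return json.dumps(payload)


# Example with a placeholder repository.
body = build_disk_usage_request("octocat", "Hello-World")
```

This works when the repository is already known, which is exactly the limitation I am asking about.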


#2

Hi @jackgrantweb,

> you can search the diskUsage based on knowing the name and owner of the repository

Yes. As far as I can tell, this value is calculated when the request is made. We don’t cache it, so we can’t provide a search or filter based on it.

Your best bet is to continue using BigQuery and the gharchive.org data.
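
If you already have a candidate list of repositories from another source and have fetched diskUsage for each one individually, the ranking itself can be done on the client. Below is a minimal sketch of that step only, assuming each result is a dict with `nameWithOwner` and `diskUsage` keys (mirroring the GraphQL field names). The function name and the sample data are made up for illustration, and this does not replace a true server-side search.

```python
import heapq


def top_n_by_disk_usage(repos, n=100):
    """Return the n repositories with the largest diskUsage, biggest first.

    `repos` is assumed to be an iterable of dicts such as
    {"nameWithOwner": "owner/name", "diskUsage": 1234}. Entries whose
    diskUsage is missing or None are skipped, since the field is nullable.
    """
    sized = [r for r in repos if r.get("diskUsage") is not None]
    return heapq.nlargest(n, sized, key=lambda r: r["diskUsage"])


# Made-up sample data, not real measurements.
sample = [
    {"nameWithOwner": "a/x", "diskUsage": 10},
    {"nameWithOwner": "b/y", "diskUsage": None},
    {"nameWithOwner": "c/z", "diskUsage": 300},
    {"nameWithOwner": "d/w", "diskUsage": 42},
]
largest_two = top_n_by_disk_usage(sample, n=2)
```

Keep in mind that this only ranks the repositories you already know about, so the result is only as complete as your candidate list.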