Remove old files from this repository's history to reduce size #120

Open
choldgraf opened this issue Sep 18, 2019 · 0 comments
Currently, cloning the Data 8 textbook repo takes a long time on a slow internet connection. This is because the repo is nearly 140 MB in size!

I ran a quick script to list file sizes across the git history and found the following files that are over 500KB.
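For reference, here is a minimal sketch of how such a scan could be done. It is not necessarily the script used for the list below. It assumes `git` is on the PATH and treats "over 500KB" as more than 500,000 bytes; the function names are hypothetical.

```python
import subprocess


def parse_large_blobs(batch_check_output, min_bytes=500_000):
    """Parse lines of the form '<type> <sha> <size> <path>' and return
    (size, sha, path) tuples for blobs larger than min_bytes, largest first."""
    results = []
    for line in batch_check_output.splitlines():
        parts = line.split(" ", 3)  # maxsplit keeps spaces inside the path
        if len(parts) < 4:
            continue
        objtype, sha, size, path = parts
        if objtype == "blob" and int(size) > min_bytes:
            results.append((int(size), sha, path))
    return sorted(results, reverse=True)


def large_blobs_in_history(repo_path=".", min_bytes=500_000):
    """List every reachable object in the repository, ask git for each
    object's type and size, and keep only the large blobs."""
    objects = subprocess.run(
        ["git", "-C", repo_path, "rev-list", "--objects", "--all"],
        capture_output=True, text=True, check=True,
    ).stdout
    sized = subprocess.run(
        ["git", "-C", repo_path, "cat-file",
         "--batch-check=%(objecttype) %(objectname) %(objectsize) %(rest)"],
        input=objects, capture_output=True, text=True, check=True,
    ).stdout
    return parse_large_blobs(sized, min_bytes)
```

Calling `large_blobs_in_history(".")` inside a clone would return the large blobs sorted by size. Note that `git rev-list --objects` reports each blob once, under a single path, so this sketch would not reproduce the repeated paths seen in the list below.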

What do folks think about going through our git history and removing any file over 500KB that isn't currently in the repository?
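One way to pin down "isn't currently in the repository" is to compare blob ids against the tree at `HEAD`. The sketch below is only an illustration under that assumption: the function names are hypothetical, the threshold is taken as 500,000 bytes, and a blob counts as "current" if its content appears anywhere in `HEAD`, even under a different path.

```python
import subprocess


def parse_ls_tree(ls_tree_output):
    """Parse `git ls-tree -r -l` lines ('<mode> <type> <sha> <size>\\t<path>')
    into (sha, size, path) tuples, skipping non-blob entries such as submodules."""
    entries = []
    for line in ls_tree_output.splitlines():
        if not line.strip():
            continue
        meta, _, path = line.partition("\t")
        _mode, objtype, sha, size = meta.split()
        if objtype == "blob":  # checked before int(): submodules report size "-"
            entries.append((sha, int(size), path))
    return entries


def head_entries(repo_path="."):
    """Return the blobs present in the current HEAD tree."""
    out = subprocess.run(
        ["git", "-C", repo_path, "ls-tree", "-r", "-l", "HEAD"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_ls_tree(out)


def removable_blobs(history_blobs, current_entries, min_bytes=500_000):
    """From (size, sha, path) tuples found in history, keep the large blobs
    whose content is absent from HEAD. Returns unique (size, sha) pairs,
    largest first."""
    current_shas = {sha for sha, _size, _path in current_entries}
    candidates = {
        (size, sha)
        for size, sha, _path in history_blobs
        if size > min_bytes and sha not in current_shas
    }
    return sorted(candidates, reverse=True)
```

The resulting ids could then be handed to a history-rewriting tool; for example, git-filter-repo has a `--strip-blobs-with-ids` option that takes a file of blob ids. Rewriting history changes commit hashes, so existing clones and forks would need to be re-cloned afterwards.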

Here's the list of files (as you can see, many entries are duplicates: the same blob hash listed under several paths):

100755 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   notebooks/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   notebooks/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   data/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   content/chapters/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   _chapters/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   _build/chapters/trip.csv
100644 blob 990ccbf23748db284ec08b606bdae15f9f8eb597 38702692   book.pdf
100644 blob 027fb6107edbc24b42e1a9ecd2792ed2695d6a8f 32346714   notebooks/all-lprs.csv.gz
100644 blob 027fb6107edbc24b42e1a9ecd2792ed2695d6a8f 32346714   data/all-lprs.csv.gz
100644 blob 70b40b57a5245fee78bfe10195a605ddf85a8bc1 10332739   notebooks/san_francisco_2015.csv
100644 blob 70b40b57a5245fee78bfe10195a605ddf85a8bc1 10332739   data/san_francisco_2015.csv
100644 blob 70b40b57a5245fee78bfe10195a605ddf85a8bc1 10332739   content/chapters/san_francisco_2015.csv
100644 blob 70b40b57a5245fee78bfe10195a605ddf85a8bc1 10332739   _chapters/san_francisco_2015.csv
100644 blob 70b40b57a5245fee78bfe10195a605ddf85a8bc1 10332739   _build/chapters/san_francisco_2015.csv
100644 blob b712368a4a34e737ae702864555f3e6917bc7603 8373602    notebooks/airline_ontime.csv
100644 blob 8e1a07f1da16dd544e56c7831352a7eaa1e7553f 7806463    notebooks/airline_ontime.csv
100644 blob 8e1a07f1da16dd544e56c7831352a7eaa1e7553f 7806463    data/airline_ontime.csv
100644 blob 8e1a07f1da16dd544e56c7831352a7eaa1e7553f 7806463    content/chapters/airline_ontime.csv
100644 blob 8e1a07f1da16dd544e56c7831352a7eaa1e7553f 7806463    _chapters/airline_ontime.csv
100644 blob 8e1a07f1da16dd544e56c7831352a7eaa1e7553f 7806463    _build/chapters/airline_ontime.csv
100644 blob 66ad0e165b09d65fcee0b7fc20a6d1dac1ae48bd 2615895    images/function_execution.jpg
100644 blob d2d42784ec852c5e33b7b10e9b00b770bdd685f1 2482316    images/function_definition.key
100644 blob 9634e429713a1664492ed806fd676d66c6c7eaee 2397433    images/minard.png
100644 blob be8e76820b292695b989029b6fafda0530c48670 1994091    images/post_bad_graph.png
100644 blob be8e76820b292695b989029b6fafda0530c48670 1994091    images/bad_post_graph.png
100644 blob 01f9f859cd1e719b46a33e922229038c9406f012 1782803    images/function_definition.jpg
100644 blob 52179b5ff86201ea459d8748079e5fed4a249f0b 1433869    notebooks/Bootstrap.ipynb
100644 blob 55b1bb4982508de51c5a7aad73a15c04178cc9cb 1433827    notebooks/Bootstrap.ipynb
100644 blob 8b1e32ff595eb9dcba006f4f780f11509a75b335 1433799    notebooks/Bootstrap.ipynb
100644 blob 3589108ea3ec9f76e6779f54e9713bb1167179ec 1388718    content/chapters/13/2/Bootstrap.ipynb
100644 blob 5fc6557335b45c20bf79eeb65b61a2cf3a406126 1388666    content/chapters/13/2/Bootstrap.ipynb
100644 blob cd32d34f4ad5b735e3f87d0203f607a08c9b2b7d 1388664    notebooks/13/2/Bootstrap.ipynb
100644 blob 73e96a1e7e6a583e5dd6aba7601bfa00d0acab48 1388663    notebooks/13/2/Bootstrap.ipynb
100644 blob 73e96a1e7e6a583e5dd6aba7601bfa00d0acab48 1388663    content/chapters/13/2/Bootstrap.ipynb
100644 blob 5623842e6f44a495fb3cdbfae179c4e641a802ad 1384088    notebooks/13/2/Bootstrap.ipynb
100644 blob 3eb80feb99a2a234733144e64fa886bf9e024770 1350185    notebooks/13/2/Bootstrap.ipynb
100644 blob 91b4f2d6276856bbcb171c3b6380af8e2b4db359 1350184    notebooks/13/2/Bootstrap.ipynb
100644 blob 9e74f4794a814d5967267002abe752c159253ff1 1053440    notebooks/little_women.txt
100644 blob 9e74f4794a814d5967267002abe752c159253ff1 1053440    data/little_women.txt
100644 blob 9e74f4794a814d5967267002abe752c159253ff1 1053440    chapters/01/3/little_women.txt
100644 blob 8540f026a22f9e9285c13b24292796f20e021011  994676    node_modules/gitbook-plugin-mathjax/node_modules/MathJax-node/node_modules/MathJax/unpacked/jax/output/SVG/fonts/Latin-Modern/NonUnicode/Regular/Main.js
100644 blob 95070752147c526a5e41f2e5fcad6cd378855385  960806    notebooks/house.csv
100644 blob 95070752147c526a5e41f2e5fcad6cd378855385  960806    data/house.csv
100644 blob 95070752147c526a5e41f2e5fcad6cd378855385  960806    content/chapters/house.csv
100644 blob 95070752147c526a5e41f2e5fcad6cd378855385  960806    _chapters/house.csv
100644 blob 95070752147c526a5e41f2e5fcad6cd378855385  960806    _build/chapters/house.csv
100644 blob 7a00cc9302b04ffb2266e9219ba582578642c56f  935467    notebooks/Regression_Line.ipynb
100644 blob 8bff93f3223e9494429ed4c59b53d5f9973a81c5  933159    notebooks/Regression_Line.ipynb
100644 blob 770037c8ce2db302327be1eb0f986c94fffdd864  930533    notebooks/Regression_Line.ipynb
100644 blob 75092fb92406d6e707c39a0bc3ffb6e6fabcfd10  930523    notebooks/Regression_Line.ipynb
100644 blob 0352f684d4ddf765ec333876bd93cbefb014196f  930522    notebooks/Regression_Line.ipynb
100644 blob d639fd67e7e0e6567f4edc48ef5a398cbde9fccf  929381    notebooks/Training_and_Testing.ipynb
100644 blob 38772dbb6b8f2412103d6863ba6f7eef23778826  886388    notebooks/Nearest_Neighbors_old.ipynb
100644 blob 38772dbb6b8f2412103d6863ba6f7eef23778826  886388    notebooks/Nearest_Neighbors.ipynb
100644 blob 38772dbb6b8f2412103d6863ba6f7eef23778826  886388    notebooks/Nearest Neighbors.ipynb
100644 blob 127fc54ac4c5adb52d4a60bab8c2c92b0418fcf8  883590    notebooks/bootstrap_pic.png
100644 blob 127fc54ac4c5adb52d4a60bab8c2c92b0418fcf8  883590    notebooks-images/Bootstrap_25_0.png
100644 blob 127fc54ac4c5adb52d4a60bab8c2c92b0418fcf8  883590    images/bootstrap_pic.png
100644 blob 127fc54ac4c5adb52d4a60bab8c2c92b0418fcf8  883590    _build/images/chapters/13/2/Bootstrap_25_0.png
100644 blob 22ac694112863548689d34e7564eb5e0a27a56e4  825726    images/canada_incomes.png
100644 blob b8ed1ffcef51fa6529f0cb160cd6fbd2e9263bcc  803788    notebooks/Regression_Line.ipynb
100644 blob f7ef14780bd7e522800aa3d70558aec60358fbd8  689645    notebooks/Nearest Neighbors.ipynb
100644 blob b7d09c3b9ce34bf54a38af044bbf5d9901ddc0a2  689644    notebooks/Nearest Neighbors.ipynb
100644 blob 7d7a379eb6369188eec08c103d0f933326247a5e  689587    notebooks/Nearest Neighbors.ipynb
100644 blob fce1af3f2353ca499f123f18df11b08827f5c47a  689542    notebooks/Nearest Neighbors.ipynb
100644 blob 57baad3697af5e398b81a657abcdf14eb8172160  679272    notebooks/RegressionInference.ipynb
100644 blob 88b04b5181c49a9c946490f45ed56cf87e479f1e  675946    notebooks/RegressionInference.ipynb
100644 blob d952b3b12ca185f37d3446954ef35871e638e754  675929    notebooks/RegressionInference.ipynb
100644 blob fc2e4c15346ff60d0e75f0120869d2e78e1d1f33  675894    notebooks/RegressionInference.ipynb
100644 blob c30269edde77753a8539622a13af31f936beef79  669960    notebooks/Regression_Line.ipynb
100644 blob 86797041e16d97ea1a5faead5450245a3d9dec80  669959    notebooks/Regression_Line.ipynb
100644 blob 4fdabd035d2d765a61bbb668d777c9a0d57d11cf  655189    content/chapters/15/2/Regression_Line.ipynb
100644 blob 2389016e7135cbe610be3b945b291d0e08c2eabe  655065    content/chapters/15/2/Regression_Line.ipynb
100644 blob 772af4adf9a420250d4b55ff4d5cb94acd71a939  655062    notebooks/15/2/Regression_Line.ipynb
100644 blob 772af4adf9a420250d4b55ff4d5cb94acd71a939  655062    content/chapters/15/2/Regression_Line.ipynb
100644 blob 6bf18798f9b09585e62893fb6b55d43e47831a33  654310    notebooks/15/2/Regression_Line.ipynb
100644 blob bba2c327792950c4eabbf72499cf67fa722e4d61  648107    notebooks/Classification_Sp16_redone_code.ipynb
100644 blob 27b5285986f5e60a3fa33c224bcfb6cebe09f970  634243    notebooks/Classification_Sp16_redone_code.ipynb
100644 blob b331cfe33588eb972fb5c6f1bdc4d5797d80dfc1  632446    node_modules/gitbook-plugin-mathjax/node_modules/MathJax-node/node_modules/MathJax/unpacked/jax/output/SVG/fonts/Gyre-Pagella/NonUnicode/Regular/Main.js
100644 blob 3f637eb711cfc15fe42e8ac1d42ba30dfccb7542  631659    notebooks/Regression.ipynb
100644 blob 13a0285386f41cb5492e4bd352d3610767da4a1c  621865    notebooks-html/Classification.html
100644 blob 902d9ae38b1d17b464114969cf0f2f3c31fc2c1f  621864    notebooks/Classification.html
100644 blob a49d9426959002355fd09685f946a2acd57f863f  619987    notebooks-html/Classification.html
100644 blob 0a184bebb47b21efd4e21bbcaa77275568e2c752  615045    notebooks/Classification.ipynb
100644 blob 7f0c316cd33e237395403ff4fac322a33336cfd7  611700    notebooks-html/Classification.html
100644 blob 52020276df565ffe07bda9c37755add3669ab64f  611698    notebooks-html/Classification.html
100644 blob 2e9176ca3a2822dc363591fee633a7af80657d79  611698    notebooks-html/Classification.html
100644 blob 449ed075e5883d24b49d2873a6a3ab1e200a068b  611685    notebooks-html/Classification.html
100644 blob e25e15cbf8bd462b98c5879ea6478c1bfc561f04  611654    notebooks-html/Classification.html
100644 blob d015789f5f45804668ce8614aaa913c92445e76d  610155    notebooks/huck_finn.txt
100644 blob d015789f5f45804668ce8614aaa913c92445e76d  610155    data/huck_finn.txt
100644 blob d015789f5f45804668ce8614aaa913c92445e76d  610155    chapters/01/3/huck_finn.txt
100644 blob 15da0d360f57067e7d63cf4f1bd5d465005e1de1  605219    content/chapters/08/5/Bike_Sharing_in_the_Bay_Area.ipynb
100644 blob 6893af1f191e3a3ac8ca458b66503484b723debf  605193    content/chapters/08/5/Bike_Sharing_in_the_Bay_Area.ipynb
100644 blob 058642a3065908f925ea2736b115f9feceacc92c  605190    notebooks/08/5/Bike_Sharing_in_the_Bay_Area.ipynb
100644 blob 058642a3065908f925ea2736b115f9feceacc92c  605190    content/chapters/08/5/Bike_Sharing_in_the_Bay_Area.ipynb
100644 blob b852b3af23cf3f51629c328bab821a76c5aec29b  587918    notebooks/Training_and_Testing.ipynb
100644 blob 9b4468c952222a0f90c96b21b55a007b852bc5f5  587878    notebooks/Training_and_Testing.ipynb
100644 blob 77475e618ac5c67c774b5ac8684d7e898edd088b  579632    node_modules/gitbook-plugin-mathjax/node_modules/MathJax-node/node_modules/MathJax/unpacked/jax/output/SVG/fonts/Gyre-Termes/NonUnicode/Regular/Main.js
100644 blob ed2ece99554ad828198c5dc6c2c61b95aa980c15  577791    notebooks/Regression.ipynb
100644 blob c3f52af60478b42744e74724a251a994f15a2ddd  563815    notebooks/Training_and_Testing.ipynb
100644 blob 06184790bb8e6692b67e37d5baa74577d4ec2127  556545    node_modules/gitbook-plugin-mathjax/node_modules/MathJax-node/node_modules/MathJax/unpacked/jax/output/SVG/fonts/Neo-Euler/NonUnicode/Regular/Main.js
100644 blob a83bc8fedb484a8dc226919b5015e5203ed3f6f1  550930    images/function_execution.pdf
100644 blob 34129de0bbd2ef7d55ac9c7aa4095e311468a018  531189    notebooks-html/Correlation.html
100644 blob 69ee0c3b0be752a3947b9f2b37fe015b4321f5f8  531188    notebooks/Correlation.html
100644 blob e0299441d3b4bad1de663b13707b811685129fe7  526670    notebooks/RegressionInference.ipynb
100644 blob 3e6df8983a803b5e8bb731bc44fad2c3cc2500f4  526566    notebooks/RegressionInference.ipynb
100644 blob 8d517b718aeedf7251d2daed969cb50b5ca9bbc0  526530    notebooks/RegressionInference.ipynb
100644 blob 33a253f86026459b5f02210ca43685bcbbee475c  524883    notebooks-html/Correlation.html
100644 blob 60abc3c2c31c9ed4c3e3ab2445d91138aeb66328  515307    notebooks/Classification_Sp16_redone_code.ipynb
100644 blob 2ef11f4824d562bdb545e6a0ba0128e02c2ab92c  511501    notebooks-html/Correlation.html
100644 blob d13d69e971e0344c79c8654f9412316dc2cea3bd  511499    notebooks-html/Correlation.html
100644 blob 7a89f2b34511e00135286370b19d9fddd998d249  511499    notebooks-html/Correlation.html
100644 blob d87e5dfedb91547d983ec840f838be4225ee5e18  511486    notebooks-html/Correlation.html
100644 blob 720c34cbbd4be027e5ab0ff84299cb91c1b92a7d  511455    notebooks-html/Correlation.html
100644 blob 12973cc36431baeb632a76b2973e73eab891dec3  510114    notebooks/Correlation.ipynb
100644 blob 6865e075fcfa0e1c8f4d370f019e675ee6e4f696  510086    notebooks/Correlation.ipynb
100644 blob 4ed300181311cffc3a2f5d370e0414e50217f016  507662    notebooks/Classification.ipynb
100644 blob 92e8077e43fa85acb5fc760ac2f943007ffc0ce8  507229    notebooks/Classification_Sp16.ipynb
100644 blob 6ce1b40e808c221dcff41ab796235da296f19811  507167    notebooks/Classification.ipynb
100644 blob 5aed03c177e7bd51aa18cbf7c8f21e760b341ebb  505669    notebooks/Correlation.ipynb
100644 blob 03b87b64c70269ebcb4ff4a8087259da2d7f6333  505391    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob e3d5e1f6f35f7db292db257162281a1abda0c075  504140    notebooks-html/Regression.html
100644 blob 9146a15f3402500ada76ab02a7f93d747f7dd40a  504139    notebooks/Regression.html
100644 blob 87740fc3c66c546a8c210ee3ac13537ab1d74c84  502370    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob cf656790bd1fdcc7076ed7d5c4a95cc3eb1b7c29  502349    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob 0d650c6c9959c7ba128dc05b06d94f5b7ae6fbc7  501896    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob 051d2afb28e1d585bfbbd3d3793381263009e76b  501894    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob c8ce709c7d723a90e9e63d3bc8ad9a568ec56f57  501893    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob 80f7f89a4059aa961f2d77804577a89e11dc0820  501840    _build/chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob 4dd0a14ec5a72e42b09df7c45f6c6d54a06c7cbc  501837    _chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob f88404fc31f73f71015c552cb913c59822ee9f04  501809    _chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob fe5729ccb39a504b3923300503b73f55e66342cf  501793    _chapters/08/5/Bike_Sharing_in_the_Bay_Area.md
100644 blob 65ecdea2ca46af06a641cb0c80d8971d1e2932d4  500356    notebooks-html/Regression.html
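Because many entries above share a blob hash, adding up the size column would count the same content several times. A small sketch of grouping the listing by blob id is shown below. It assumes each line has the form `<mode> <type> <sha> <size> <path>`, as in the list above; the function name is hypothetical, and the sizes are the byte counts from the listing, not the space the blobs take up once git has compressed them.

```python
from collections import defaultdict


def group_by_blob(listing):
    """Group listing lines by blob id.

    Returns (paths_by_blob, size_by_blob): every path each blob appeared
    under, and the size recorded for that blob."""
    paths_by_blob = defaultdict(list)
    size_by_blob = {}
    for line in listing.splitlines():
        fields = line.split(None, 4)  # maxsplit keeps spaces inside the path
        if len(fields) != 5:
            continue
        _mode, _objtype, sha, size, path = fields
        paths_by_blob[sha].append(path)
        size_by_blob[sha] = int(size)
    return paths_by_blob, size_by_blob


# Three lines copied from the list above.
sample = """\
100755 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   notebooks/trip.csv
100644 blob d2d18e5b6520d59f2ca1bb9205966901e371c07a 43012650   data/trip.csv
100644 blob 990ccbf23748db284ec08b606bdae15f9f8eb597 38702692   book.pdf
"""

paths, sizes = group_by_blob(sample)
unique_total = sum(sizes.values())  # each distinct blob counted once
print(len(sizes), "distinct blobs,", unique_total, "bytes")
```

Counting each blob once is the relevant figure here, since git stores identical content a single time no matter how many paths it has appeared under.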
@papajohn papajohn added this to the 2nd Edition milestone Mar 22, 2021