Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

- tarball handler py ###########/ |||||||||| - shell script3.sh ########## |||||||||||||||||| - Unsupervised learning Clustering kmeans and hierarchical ##### |||||||||||||||| - topics: | 20-topics | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

License

CoderSales/unsupervised-learning-clustering

Repository files navigation

unsupervised-learning-clustering

Key elements in this repository:

Setup:

  1. References
  2. script3.sh - activates virtual environment (or source scripy3.sh)
  3. .bashrc - for virtual environment
    • in .venv folder
  4. Python Select Interpreter
  5. pip install ipykernel
  6. pip install jupyter
  7. sh installer.sh
  8. python.exe -m pip install --upgrade pip
  9. pip install notebook
  10. pip install pandas
  11. python -m pip install -U pip
  12. python -m pip install -U matplotlib
  13. pip install seaborn
  14. pip install -U scikit-learn
  15. pip install openpyxl
  16. pip install nb-black
  17. pip install xlwings
  18. xlwings addin install
  19. pip install natsort

Save setup

  1. pip freeze > requirements.txt

Load setup

  1. pip install -r requirements.txt

Analysis:

  1. notebooks/K-Means.ipynb
  2. data/technical_support_data-2.csv

Tarball Data Extraction:

  1. python tarball-handler.py

    Add to gitignore:

    1. custom components
    1.1 large files
    1.1.1 tarball

    TCGA-PANCAN-HiSeq-801x20531.tar.gz

    1.1.2 large data from tarball

    data/gene_data/TCGA-PANCAN-HiSeq-801x20531/TCGA-PANCAN-HiSeq-801x20531/data.csv data/gene_data/TCGA-PANCAN-HiSeq-801x20531/TCGA-PANCAN-HiSeq-801x20531/labels.csv

Note on data files and Large Data sets on GitHub:

Add data in own commit in case of 50 MB GitHub warning

update to .gitignore:

*.json
!spec/*.json

adapted to:

!*/ProcessedData.xlsx

git ignore all files of a certain type, except those in a specific subfolder

About

- tarball handler py ###########/ |||||||||| - shell script3.sh ########## |||||||||||||||||| - Unsupervised learning Clustering kmeans and hierarchical ##### |||||||||||||||| - topics: | 20-topics | may-2023-filtered | may-2023-filtered-2 | may-2023-filtered-3 | filtered-4 | frequent

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

AltStyle によって変換されたページ (->オリジナル) /