Feature/scala code/ch02 biman (#8) #9

Merged
deepakmca05 merged 2 commits into master from feature/scala-code/ch02
Jan 7, 2022

@deepakmca05 deepakmca05 commented Jan 7, 2022

  • added kmers for FASTA and FASTQ formats

  • DNA base count in Scala

  • Indentation fix

  • improved documentation

  • ch02

  • Feature/scala code/ch01 (#5)

  • Scala Code for data algorithms with spark-ch01

  • Refactored the package name

  • With Spark context close

  • Feature/scala code/ch01 biman (#3)

  • fold by key

  • added comments for fold by key

  • added Average by Group By Key

  • added Average by ReduceByKey

  • dataframe action describe

  • dataframe add column

  • dataframe drop column

  • dataframe filter rows

  • dataframe join

  • dataframe join inner

  • dataframe join left

  • dataframe join right

  • dataframe sql

  • rdd map transformations

  • rdd filter transformations

  • rdd flatMap transformations

  • rdd transformations groupByKey

  • rdd transformations Join

  • rdd transformations Cartesian

  • rdd transformations Combine By Key

  • rdd transformations MapPartitions

  • rdd transformations Sort By

  • rdd action take ordered

  • Feature/scala code/ch01 deepak (#4)

  • fold by key

  • added comments for fold by key

  • added Average by Group By Key

  • added Average by ReduceByKey

  • dataframe action describe

  • dataframe add column

  • dataframe drop column

  • dataframe filter rows

  • dataframe join

  • dataframe join inner

  • dataframe join left

  • Refactored code + RDD Transformation

  • INDENTATION CORRECT

  • ReduceByKey

  • ReduceByKey1

Co-authored-by: bimanmandal <biman29@gmail.com>

  • added gradle shorthands

  • Feature/generate sh files (#6)

  • added gradle shorthands

  • added the shell script

  • added the shell script

  • updated the shell commands

Co-authored-by: Biman Mandal <43241698+bimanmandal@users.noreply.github.com>
Co-authored-by: bimanmandal <biman29@gmail.com>

  • updated README.md

  • updated README.md

  • ch02-changes

  • ch02-changes

  • Feature/scala code/ch01 missing class gradle (#7)

  • gradle wrapper fix

  • gradle wrapper fix

  • gradle wrapper fix

  • gradle wrapper fix

  • added bonus chapter correlation

  • added bonus chapter correlation

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • updated docs

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • improved documentation

  • DNABaseCountFastq

  • added the code changes for chapter 2

  • added the run_spark_applications_scripts

  • added the conditions for 1GB data

  • added the readme file

Co-authored-by: Mahmoud Parsian <mahmoud.parsian@yahoo.com>
Co-authored-by: deepakmca05 <deepakmca05@gmail.com>

bimanmandal and others added 2 commits January 7, 2022 10:47
@deepakmca05 deepakmca05 merged commit 5b951fe into master Jan 7, 2022
@deepakmca05 deepakmca05 deleted the feature/scala-code/ch02 branch January 7, 2022 10:01