Here I will be sharing my journey, comprising code, documentation, and algorithms.
Task1: Visualise how data is distributed as an RDD across a cluster of 5 machines with 4 cores each. Describe the complete picture of RDDs, partitions, the lambda applied to each partition, and how they correlate with the executor JVMs.
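To make Task1 concrete, here is a minimal, library-free sketch of the idea: data is chunked into partitions (one per core slot in this toy setup), and the lambda from an operation like `rdd.map(...)` is applied element-by-element within each partition. This is plain Python illustrating the concept, not Spark itself; real Spark chooses partition counts and boundaries with its own partitioners.

```python
# Sketch: how an RDD's data might be split into partitions across a
# 5-machine x 4-core cluster (20 slots), and how a mapped lambda runs
# per partition. Illustrative only -- not actual Spark behaviour.

def make_partitions(data, num_partitions):
    """Chunk data into roughly equal partitions (Spark decides this split)."""
    size, rem = divmod(len(data), num_partitions)
    parts, start = [], 0
    for i in range(num_partitions):
        end = start + size + (1 if i < rem else 0)
        parts.append(data[start:end])
        start = end
    return parts

machines, cores_per_machine = 5, 4
num_partitions = machines * cores_per_machine  # one partition per core slot

data = list(range(100))
partitions = make_partitions(data, num_partitions)

# The lambda (e.g. from rdd.map(lambda x: x * 2)) is shipped to each
# executor JVM and applied to every element of its partition.
doubled = [[(lambda x: x * 2)(x) for x in p] for p in partitions]

print(len(partitions))  # 20
print(partitions[0])    # [0, 1, 2, 3, 4]
print(doubled[0])       # [0, 2, 4, 6, 8]
```

In real Spark, each of the 5 worker machines runs an executor JVM; with 4 cores each, up to 20 partition tasks execute in parallel, and the driver JVM only ships the serialized lambda, never the partition data.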
Task2: Learn about Scala and Java Future objects for achieving concurrency in Spark.
Task3: Practically implement and code Scala and Java Future objects to achieve concurrency in Spark.
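Tasks 2-3 target Scala/Java Futures specifically; as a language-neutral sketch of the same submit-then-await pattern, here is a stdlib `concurrent.futures` analog in Python. The `spark_job` function and its job names are placeholders standing in for independent Spark actions (e.g. counts on different DataFrames) fired concurrently from the driver.

```python
# Sketch: the submit-then-await pattern behind Scala/Java Futures, shown
# with Python's stdlib concurrent.futures. In Spark, several independent
# jobs can be submitted this way and their results collected as they finish.
from concurrent.futures import ThreadPoolExecutor, as_completed

def spark_job(name, n):
    """Placeholder for an independent Spark action (e.g. df.count())."""
    return name, sum(range(n))

with ThreadPoolExecutor(max_workers=3) as pool:
    # submit() returns immediately with a Future, like Scala's Future { ... }
    futures = [pool.submit(spark_job, f"job{i}", i * 10) for i in range(1, 4)]
    # as_completed() yields futures in finish order, like Scala's onComplete
    results = dict(f.result() for f in as_completed(futures))

print(results)  # completion order varies; job1=45, job2=190, job3=435
```

The Scala equivalent wraps each action in `Future { ... }` with an implicit `ExecutionContext` and gathers them with `Future.sequence`; the design point is identical: the driver thread is not blocked while independent jobs run.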
Task4: Work on analysis using Presidio. For example, given an email address in the text, it should automatically detect the PII and encrypt/mark it.
Task5: Work on anonymization using Presidio. For example, given the statement "hello I am taking disprin and my age is 25 and i am having bipolar disorder", it should automatically detect the sensitive information and encrypt/mask it.
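As a library-free sketch of the detect-and-mask idea behind Tasks 4-5: Presidio's `AnalyzerEngine` and `AnonymizerEngine` do this properly with NER models and pattern recognizers; here two toy regexes (my own, not Presidio's) stand in for its recognizers, and detected spans are replaced with entity-type placeholders.

```python
# Minimal stand-in for Presidio-style detect-and-anonymize.
# The two regexes below are illustrative toys, not Presidio recognizers.
import re

PATTERNS = {
    "EMAIL_ADDRESS": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "AGE": re.compile(r"(?<=age is )\d{1,3}\b"),
}

def anonymize(text):
    """Replace each detected PII span with an <ENTITY_TYPE> placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text

sample = "Hello, I am john.doe@example.com and my age is 25."
print(anonymize(sample))
# -> Hello, I am <EMAIL_ADDRESS> and my age is <AGE>.
```

Presidio generalizes this: the analyzer returns typed spans with confidence scores, and the anonymizer applies a configurable operator per entity type (replace, mask, hash, or encrypt), which is what the "encrypt/mark" requirement maps to.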
Task6: Create a model using spaCy and train it to detect the medicine, age, and disease in a given statement.
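Training a custom spaCy NER model starts from annotated examples of the form `(text, {"entities": [(start, end, LABEL)]})`. The helper below builds those character offsets from substrings; the label names `MEDICINE`, `AGE`, and `DISEASE` are my own choices for this task, and spaCy itself is not imported here, only the training tuples its `Example.from_dict` pipeline consumes.

```python
# Sketch: building spaCy-style NER training annotations for Task6.
# Only stdlib is used; labels MEDICINE/AGE/DISEASE are assumed names.

def annotate(text, spans):
    """spans: list of (substring, label); returns a spaCy-style training tuple
    (text, {"entities": [(start_char, end_char, label), ...]})."""
    entities = []
    for phrase, label in spans:
        start = text.index(phrase)  # first occurrence; fine for this sketch
        entities.append((start, start + len(phrase), label))
    return (text, {"entities": entities})

example = annotate(
    "hello I am taking disprin and my age is 25 and i am having bipolar disorder",
    [("disprin", "MEDICINE"), ("25", "AGE"), ("bipolar disorder", "DISEASE")],
)
print(example[1]["entities"])
```

From here, the usual spaCy workflow would add the three labels to an `ner` pipeline component and loop over such examples with `nlp.update`, so the model learns to tag unseen statements with the same entity types.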