Sørensen Dice Coefficient Algorithm Project

Using the Sørensen-Dice coefficient algorithm (https://en.wikipedia.org/wiki/Dice%27s_coefficient)

Save Time On Research and Writing
Hire a Pro to Write You a 100% Plagiarism-Free Paper.
Get My Paper

You must create a combosquatting detector that aims to find the similar names % rate of a domain and compare to the other domain with in the domain lists provided from a txt file.

You must be able to insert the domain in mind and then its compared to a list of other domain that are in .txt file. after that it must print the dice similarity rate and the string it has been compared to. The required to be done is to drop the tld; that is the www and the .com or .net or any other thing and compare what is in between. Example:

www.xxx.com

I want to compare xxx with the other xxx from the list of websites,

Save Time On Research and Writing
Hire a Pro to Write You a 100% Plagiarism-Free Paper.
Get My Paper

www.xxx.net

or

www.cxx.com

with www.xxx.com and like that. After that it must save the result into a new txt file.

the result should be something like: www.xxx.com and www.xxx.net has a similarity rate of: xx% and so on.

this implementation should be in parallel and in single python documented commented file. You must not use any API and instead implement the Sørensen-Dice coefficient algorithm directly.

You must check first which of the domians are less in length and then jump into the compare, just so that is not so expensive when dealing with huge list.

www.smascoa.com
www.cosmas.com
www.cnn.com

Still stressed from student homework?
Get quality assistance from academic writers!

Order your essay today and save 25% with the discount code LAVENDER