Wordlist taken from: mit.edu/~ecprice/wordlist.100000
This is a small project that determines whether a given word appears in a randomized list of 100,000 words split across multiple files.
Learning objectives:
- Learn to use Makefiles to build the project
- Work with C file readers and file I/O
- Use OpenMP to parallelize the search and make it more efficient
The file randomize.py contains a function randomize_wordlists, which splits dictionary_large.txt into N smaller dictionary files, each with an independent random word ordering. This makes it easy to test the C wordsearch program at different scales, e.g. searching for a given word across 2 text files vs. 200 text files.
With 400 dictionary files and 8 threads on my laptop, the average speedup is ~2x with a simple OpenMP parallelization.
Unoptimized: a serial approach that loops through every word in each file until the word is found (or, in the worst case, every file has been scanned without a match); see the sketch below.
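A minimal sketch of the serial search, assuming one word per line and dictionary files named dict_0.txt, dict_1.txt, ...; the file naming and the search_file()/serial_search() helpers are illustrative assumptions, not the project's actual code.

```c
#include <stdio.h>
#include <string.h>

/* Return 1 if `target` appears (one word per line) in the file at `path`. */
static int search_file(const char *path, const char *target) {
    FILE *fp = fopen(path, "r");
    if (!fp)
        return 0;
    char line[128];
    int found = 0;
    while (fgets(line, sizeof line, fp)) {
        line[strcspn(line, "\r\n")] = '\0';   /* strip the trailing newline */
        if (strcmp(line, target) == 0) {
            found = 1;
            break;
        }
    }
    fclose(fp);
    return found;
}

/* Scan the dictionary files one after another, stopping at the first match. */
static int serial_search(int num_files, const char *target) {
    char path[64];
    for (int i = 0; i < num_files; i++) {
        snprintf(path, sizeof path, "dict_%d.txt", i);
        if (search_file(path, target))
            return 1;             /* found: skip the remaining files */
    }
    return 0;                     /* worst case: every file was read in full */
}

int main(int argc, char **argv) {
    const char *word = argc > 1 ? argv[1] : "zebra";   /* example query word */
    printf("%s %s\n", word, serial_search(200, word) ? "found" : "not found");
    return 0;
}
```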
Optimized: splits the work among min(num_dictionaries, num_threads) threads with OpenMP, each thread searching its own subset of files, resulting in a performance gain; a sketch follows.
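A minimal sketch of the parallelized search under the same assumptions (one word per line, files named dict_0.txt, dict_1.txt, ..., and an illustrative parallel_search() helper). OpenMP cannot break out of a parallel for loop, so a shared flag lets threads skip their remaining files once any thread finds the word.

```c
#include <stdio.h>
#include <string.h>
#include <omp.h>

/* Same per-file scan as in the serial sketch. */
static int search_file(const char *path, const char *target) {
    FILE *fp = fopen(path, "r");
    if (!fp)
        return 0;
    char line[128];
    int found = 0;
    while (fgets(line, sizeof line, fp)) {
        line[strcspn(line, "\r\n")] = '\0';
        if (strcmp(line, target) == 0) {
            found = 1;
            break;
        }
    }
    fclose(fp);
    return found;
}

/* Split the files among min(num_files, num_threads) threads. */
static int parallel_search(int num_files, const char *target, int num_threads) {
    int nthreads = num_files < num_threads ? num_files : num_threads;
    int found = 0;

    #pragma omp parallel for num_threads(nthreads) schedule(dynamic)
    for (int i = 0; i < num_files; i++) {
        int already_found;
        #pragma omp atomic read
        already_found = found;
        if (already_found)
            continue;             /* another thread already found the word */

        char path[64];
        snprintf(path, sizeof path, "dict_%d.txt", i);
        if (search_file(path, target)) {
            #pragma omp atomic write
            found = 1;
        }
    }
    return found;
}

int main(int argc, char **argv) {
    const char *word = argc > 1 ? argv[1] : "zebra";   /* example query word */
    printf("%s %s\n", word,
           parallel_search(200, word, 8) ? "found" : "not found");
    return 0;
}
```

Compile with something like `gcc -fopenmp wordsearch.c` (filename assumed); `schedule(dynamic)` helps balance the load when some files finish faster than others.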