ConcurrentProject

CSE305 Project: Parallel sequence alignment

Authors: Andreea Patarlageanu, Marta Teodora Trales, Joanne Jegou

In this project, we worked on parallel local sequence alignment with an affine gap penalty over the alphabet {A, C, G, T}. We focused on computing the alignment score rather than the alignment itself. We implemented the Smith-Waterman algorithm, a dynamic programming algorithm in which each cell depends on previously computed cells, so parallelizing it requires some care. We first implemented a simple sequential version on CPU, then introduced lazy computation of the matrices. We then parallelized these algorithms and ran them on GPU.
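As an illustration of the score computation described above, here is a minimal sequential sketch of Smith-Waterman with affine gaps that keeps only two rows in memory. It is not the code from main.cpp: the function name sw_affine_score is hypothetical, and the gap convention (the first gapped position costs GAP_INIT, each further one costs GAP_EXT) is an assumption, since the convention is not specified here.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Scoring values follow the defaults mentioned in this README; storing the
// penalties as positive numbers is an assumption of this sketch.
constexpr int MATCH = 1;
constexpr int MISMATCH = -1;
constexpr int GAP_INIT = 1;  // assumed: cost of the first gapped position
constexpr int GAP_EXT = 1;   // assumed: cost of each further gapped position

// Hypothetical helper: best local alignment score of a and b (score only,
// no traceback), computed with two rolling rows instead of full matrices.
int sw_affine_score(const std::string& a, const std::string& b) {
    const std::size_t n = a.size(), m = b.size();
    const int NEG = -1000000;  // stands in for minus infinity
    std::vector<int> H_prev(m + 1, 0), H_cur(m + 1, 0);
    std::vector<int> F_prev(m + 1, NEG), F_cur(m + 1, NEG);
    int best = 0;
    for (std::size_t i = 1; i <= n; ++i) {
        int E = NEG;  // best score ending in a gap that runs along row i
        H_cur[0] = 0;
        for (std::size_t j = 1; j <= m; ++j) {
            // Extend an existing gap or open a new one, in each direction.
            E = std::max(E - GAP_EXT, H_cur[j - 1] - GAP_INIT);
            F_cur[j] = std::max(F_prev[j] - GAP_EXT, H_prev[j] - GAP_INIT);
            const int diag =
                H_prev[j - 1] + (a[i - 1] == b[j - 1] ? MATCH : MISMATCH);
            // Local alignment: a score never drops below 0.
            H_cur[j] = std::max({0, diag, E, F_cur[j]});
            best = std::max(best, H_cur[j]);
        }
        std::swap(H_prev, H_cur);
        std::swap(F_prev, F_cur);
    }
    return best;
}
```

Under these assumptions, sw_affine_score("AAAACCCC", "AAAAGCCCC") gives 7: eight matches minus one gap of length one.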

We worked together on the implementation of the algorithms, starting with the CPU code and then moving on to the GPU code. We decided on the work to be done as the project advanced, and met in person at least once a week.

Organisation of the project and comments

Please note that the commits on GitHub are not fully representative of the work done by each group member, as Andreea and Joanne wrote some of the files together but committed from a single computer.

Different files

CPU

main.cpp is the first, sequential implementation of the sequence alignment on CPU
lazySmith.cpp is the sequential implementation of the Lazy Smith algorithm
lazySmith_parallel_threads.cpp is the parallel version of the Lazy Smith algorithm using threads
lazySmith_parallel_futures.cpp is the parallel version of the Lazy Smith algorithm using futures, but it is NOT WORKING

GPU

simpleGPU.cu is the simple sequential implementation on GPU with the wavefront approach on anti-diagonals
cudaSmithM.cu is a second version of the simple sequential implementation on GPU (we had a miscommunication and both of us took that approach on GPU)
cudaLazy.cu is the parallelized lazy implementation of the algorithm on GPU
SmithDiagonalGPU.cu is the working version of the wavefront approach on anti-diagonals on GPU, with high performance (and speed-up)
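
The wavefront idea behind the anti-diagonal versions can be sketched on CPU as follows. A cell (i, j) only depends on cells of the two previous anti-diagonals (i + j - 1 and i + j - 2), so all cells of one anti-diagonal are independent of each other. The sketch below is plain sequential C++ that only shows the traversal order; it is not the code of the .cu files. The name sw_antidiagonal_score and the gap convention (first gapped position costs gap_init, each further one gap_ext) are assumptions.

```cpp
#include <algorithm>
#include <string>
#include <vector>

// Hypothetical helper: Smith-Waterman score with affine gaps, filled one
// anti-diagonal at a time. Scoring values follow the README defaults.
int sw_antidiagonal_score(const std::string& a, const std::string& b) {
    const int match = 1, mismatch = -1, gap_init = 1, gap_ext = 1;
    const int NEG = -1000000;  // stands in for minus infinity
    const int n = static_cast<int>(a.size());
    const int m = static_cast<int>(b.size());
    if (n == 0 || m == 0) return 0;
    const int W = m + 1;  // row width of the flattened matrices
    std::vector<int> H((n + 1) * W, 0);
    std::vector<int> E((n + 1) * W, NEG);
    std::vector<int> F((n + 1) * W, NEG);
    int best = 0;
    // d = i + j indexes the anti-diagonal; diagonals are processed in order.
    for (int d = 2; d <= n + m; ++d) {
        const int first = std::max(1, d - m);
        const int last = std::min(n, d - 1);
        // No iteration of this loop reads a value written by another one:
        // on GPU, each iteration could be handled by its own thread.
        for (int i = first; i <= last; ++i) {
            const int j = d - i;
            const int c = i * W + j;
            E[c] = std::max(E[c - 1] - gap_ext, H[c - 1] - gap_init);
            F[c] = std::max(F[c - W] - gap_ext, H[c - W] - gap_init);
            const int diag =
                H[c - W - 1] + (a[i - 1] == b[j - 1] ? match : mismatch);
            H[c] = std::max({0, diag, E[c], F[c]});
            best = std::max(best, H[c]);
        }
    }
    return best;
}
```

In a parallel version, the end of each outer iteration acts as a synchronization point: diagonal d + 1 may only start once diagonal d is complete.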

Testing

Makefile1 is the makefile for testing on CPU only (run make -f Makefile1)
Makefile2 is the makefile for testing on CPU and GPU (run make -f Makefile2)
TestFile.cpp is the testing file for CPU (run ./test_runner1 after compiling)
TestFileWithGPU.cpp is the testing file for CPU and GPU (run ./test_runner2 after compiling)
algoCPU.h is the header file for .cpp files
algoGPU.h is the header file for .cu files

Testing

We have two main testing files, which report the SUCCESS of the computations (success means that all computed scores are equal; otherwise we get an ERROR), the computation time, and the speed-up of each method compared to the simple sequential version. To build the test files, we commented out the main() functions in the individual files. To run those files independently, you have to uncomment their main() functions.

Computations and testing on GPU are executed through SSH on the school's computers. The testing is as follows:

TestFile.cpp is the testing file for CPU computations only. It is compiled with the command make -f Makefile1 and run with ./test_runner1
TestFileWithGPU.cpp is the testing file that also includes the comparison with GPU computations. It is compiled with the command make -f Makefile2 and run with ./test_runner2

The test parameters can be changed directly in the main() function of the test files: the score computation is tested for several sequence lengths N, which are defined in the sequence_lengths vector, and num_tests is the number of tests performed for each N. By default, the scoring parameters MATCH, MISMATCH, GAP_INIT and GAP_EXT are set to 1, -1, 1 and 1 respectively; they have to be changed directly in the function files.

When running the test files, the user will be prompted to enter 1 or 2:

1 outputs the time and speed-up per test for each N (it gives the result of all tests)
2 outputs the average speed-up for each method for each N (to compare the improvement in performance for each method depending on the sequence length N).
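
The measurements described above (time per test and speed-up relative to the simple sequential version) could be gathered with helpers like the ones below. This is only a sketch, not the content of TestFile.cpp: the names random_dna, time_ms and speedup are hypothetical.

```cpp
#include <chrono>
#include <random>
#include <string>

// Hypothetical helper: random sequence of length n over {A, C, G, T}.
std::string random_dna(std::size_t n, std::mt19937& rng) {
    static const char alphabet[] = {'A', 'C', 'G', 'T'};
    std::uniform_int_distribution<int> pick(0, 3);
    std::string s(n, 'A');
    for (char& c : s) c = alphabet[pick(rng)];
    return s;
}

// Hypothetical helper: wall-clock time of one call to f, in milliseconds.
template <typename F>
double time_ms(F&& f) {
    const auto start = std::chrono::steady_clock::now();
    f();
    const auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

// Speed-up of a method relative to the baseline (above 1 means faster).
double speedup(double baseline_ms, double method_ms) {
    return baseline_ms / method_ms;
}
```

For each N in sequence_lengths, a test would generate two sequences with random_dna, time every scoring method on them with time_ms, check that all returned scores are equal, and report speedup with the simple sequential time as the baseline.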

