Skip to content

srbarden/TextMining

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Long German Words (lange deutsche Wörter)

What it Does

To English speakers, very long German words often look silly. This software analyzes some of the works of Johann Wolfgang von Goethe. It uses the texts from Project Gutenberg (in German) to find the three longest words in each of several of his works.

How to Use It

Install pickle and requests libraries before using. Run "textmining_requests.py" to get the text files from Project Gutenberg. Then, run "textmining_processing.py" to find the longest German words! They will print in the terminal.

Progress

This project was originally written in Week 6 of Software Design, and I reworked it in Week 13. It is now written in an object-oriented style and has some updated text mining processes.

Some long words found

  • freundschaftliche - friendly
  • Allerdurchlauchtigster - most gracious/serene
  • unwahrscheinlichem - unlikely
  • zusammenschrumpfte - shriveled up
  • Freundschaftsbezeigungen - demonstrations of friendship
  • durcheinandergeschüttelt - agitated
  • Amtschreiberstochter - office clerk's daughter

About

This is the base repo for the text mining and analysis project for Software Design, Spring 2017 at Olin College.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%