dataset keras and some questions about preprocessing #3

@itaowei

Thanks for your wonderful work!

Could you share the raw data for the keras dataset mentioned in your paper?
When I try to preprocess the data, I find that it is missing.
I have already downloaded the other data from https://osf.io/67kyc/?view_only=ad588fe5d1a14dd795553fb4951b5bf9

By the way, when I run the preprocessing (./preprocess.py FOLDER_NAME --language LANGUAGE), I get a lot of warnings.
For example, when I run python2 preprocess.py Theano_commits --language python, there are many warnings like .../commitgen/commitgen/code.py:209: UserWarning: 'ascii' codec can't encode character u'\u2014' in position 61: ordinal not in range(128) warnings.warn(str(e))
Meanwhile, the number of commits parsed is smaller than the number extracted.
Does this affect the final results?
Could you tell me about your development environment?
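
For reference, here is a minimal sketch of what I believe triggers this kind of warning. It assumes the text being processed contains an em dash (U+2014) and is encoded as ASCII at some point. The helper names and the sample message are hypothetical and are not taken from commitgen, so the actual cause in code.py may differ.

```python
# -*- coding: utf-8 -*-
from __future__ import print_function


def to_ascii_strict(text):
    """Hypothetical helper: strict ASCII encoding, fails on non-ASCII input."""
    return text.encode("ascii")


def to_utf8(text):
    """Hypothetical helper: UTF-8 encoding, which can represent U+2014."""
    return text.encode("utf-8")


# Made-up sample text containing an em dash (U+2014).
message = u"Fix typo \u2014 update docs"

try:
    to_ascii_strict(message)
    strict_ok = True
except UnicodeEncodeError as e:
    # Same family of error as the one reported in the warning.
    strict_ok = False
    error_text = str(e)
    print(error_text)

# Encoding as UTF-8 keeps the character and round-trips cleanly.
utf8_bytes = to_utf8(message)

# Alternatively, "replace" substitutes '?' for characters ASCII cannot hold.
ascii_lossy = message.encode("ascii", "replace")
print(ascii_lossy)
```

If something like this is what happens, the affected commits would presumably be skipped rather than parsed, which might explain why fewer commits are parsed than extracted, but I have not confirmed this.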

Looking forward to your reply!

Thanks a lot!
