AWS file structure
/nlp_data
|
├── NLP_dhruv_harshit
|
├── DataStats_ofChat
|   |
│   ├── words_not_in_glove_6B.txt
│   ├── words_not_in_glove840_cased.txt
│   ├── words_not_in_glove840.txt
│   ├── words_not_in_glove_twitter_cased.txt
│   ├── words_not_in_glove_twitter.txt
|
├── emotion_analysis
|   |
│   ├── emotion_analysis.ipynb
│   ├── imdb_emotions.csv
|
├── sentiment_analysis
|
├── Dataset (Contains the chat dataset & glove)
|   |
│   ├── beam_cable_google_tagged_data.json
│   ├── droom_google_tagged_data.json
|   |
│   ├── data.csv
│   ├── train.csv
│   ├── test.csv
│   ├── valid.csv
|   |
│   ├── glove.6B.300d.txt
│   ├── glove.840B.300d.txt
|   |
│   ├── test_x.pkl
│   ├── test_y.pkl
│   ├── train_x.pkl
│   ├── train_y.pkl
│   ├── val_x.pkl
│   ├── val_y.pkl
|   |
|   ├── Message_VS_sentiment&Emotion - Sheet1.csv
|
├── GloveDataDistribution_4B_and_840B
│   |
│   ├── glove6B
│   │   ├── conv_length2count.jpg
│   │   ├── create_embedding.ipynb
│   │   ├── glove_vocab_comparison.ipynb
│   │   ├── no.ofMsgs_vs_scores_afterSubsampling.png
│   │   ├── no.ofMsgs_vs_scores_beforeSubsampling.png
│   │   ├── no.ofMsgs_vs_scores_both.png
│   │   ├── plot_NoOfMsgAboveThreshold_vs_NoOfConversations_HA.ipynb
│   │   ├── plot_NoOfMsg_vs_NoOfConversations_And_Polarity_vs_NoOfConversations_DS.ipynb
│   │   ├── samarth_data.ipynb
│   │   ├── samarth_data.py
│   │   ├── score2freq.png
│   │
│   ├── glove6B_DataInput_and_DataStats_and_DataDistributionPlot_DS.ipynb
│   ├── glove6B_DataStats_HA.ipynb
│   ├── glove6B_OldPreprocessor.ipynb
│   ├── glove840_vocab_comparison.py
│   ├── vocab_of_chat.txt
│   ├── words_of_glovetwitter6B.txt
|
|
├── milestone-1 (IMDB & Feed forward Net)
│   ├── data
│   │   ├── labeledTrainData.tsv
│   │   ├── testData.tsv
│   ├── M1_code .ipynb
│   ├── sentiment_nn.py
|
├── milestone-2 (IMDB & RNN)
|   |
│   ├── M2_code-harshit.ipynb
│   ├── M2_code.ipynb
│   ├── M2_code.py
|
├── milestone-3 (Chat & RNN & GloVe.6B)
|   |
│   ├── data_input.ipynb
│   ├── GeneralityCheck_HS.ipynb
│   ├── glove840_vocab_comparison.py
│   ├── M2_code-harshit-without-dropout.ipynb
│   ├── M2_code-harshit-without-dropout-new+(5).ipynb
│   ├── M2_code-harshit-without-dropout-new.ipynb
│   ├── M2_code.ipynb
│   ├── unique_words.py
|
├── milestone-3.5 (Chat & LR)
│   ├── LR.ipynb
│   ├── LR.py
|
├── milestone-4 (Chat & GloVe.840B)
|   |
│   ├── LeftPadding_6B.ipynb
│   ├── main_hyperparameter_tuning.ipynb
│   ├── main_hyperparameter_tuning.py
│   ├── make_pkl_files.py
│   ├── Make_TrainValTest_pickle.ipynb
│   ├── samarth_data.ipynb
│
│   ├── wrong_result.ipynb
│
├── serving (Contains Production code)
|   ├── tensorflow_serving
|   ├── example
|   ├── BUILD
|   ├── sentiment_saved_mode.py
|   ├── sentiment_client.py
|
|
├── Results (Contains results of Hyperparameter tuning)
│   ├── Result6B
│   ├── Result840B
|   ├── Results.md
|   ├── FinalReport = https://docs.google.com/document/d/1HlEqMnx75JYu33f9TFHU_EpVRqIOqaX05_EnjXmBVUk/edit
|   ├── result sheet = https://docs.google.com/spreadsheets/d/1CdMQFgi3VF_tO60C0EBcSVn9nyZDFxIPCRVJ5xw9G2Y/edit
|
├── FinalCode-part0.ipynb
├── FinalCode-part1.ipynb
├── FinalCode-part2.ipynb