Editing Twitter Analysis DB Details

Jump to navigation Jump to search

Warning: You are not logged in. Your IP address will be publicly visible if you make any edits. If you log in or create an account, your edits will be attributed to your username, along with other benefits.

The edit can be undone. Please check the comparison below to verify that this is what you want to do, and then save the changes below to finish undoing the edit.

Latest revision Your text
Line 63: Line 63:
 
This is a concordance ( for background a concordance is an index of all words used in a body of text, here tweets, cross referenced to the tweets text, perhaps first produced as a reference to the bible.  For more information see: *'''[https://en.wikipedia.org/wiki/Concordance_%28publishing%29 Concordance (publishing) - Wikipedia ]''' )  A concordance gives insight into the vocabulary of the author, and since tweets are dated we can also see how he vocabulary changes over time.  While the concordance table is only generated at database load time each select against the table can be viewed as creating a new concordance.
 
This is a concordance ( for background a concordance is an index of all words used in a body of text, here tweets, cross referenced to the tweets text, perhaps first produced as a reference to the bible.  For more information see: *'''[https://en.wikipedia.org/wiki/Concordance_%28publishing%29 Concordance (publishing) - Wikipedia ]''' )  A concordance gives insight into the vocabulary of the author, and since tweets are dated we can also see how he vocabulary changes over time.  While the concordance table is only generated at database load time each select against the table can be viewed as creating a new concordance.
  
Commonly abbreviated in the app as concord, it is a table generated from the tweets by extracting each "word" of the tweet. "Word" is in quotes because quite a bit of processing is involved in determining what is a word.  The simple way would be to just break the tweets up on blank spaces, but what is actually done is a bit more complicated as described in see '''Building a Database''' section below.
+
Commonly abbreviated in the app as concord, it is a table generated from the tweets by extracting each "word" of the tweet. "Word" is in quotes because quite a bit of processing is involved in determining what is a word.  The simple way would be to just break the tweets up on blank spaces, but what is actually done is a bit more complicated as described in [[TBD]]
  
  

Please note that all contributions to OpenCircuits may be edited, altered, or removed by other contributors. If you do not want your writing to be edited mercilessly, then do not submit it here.
You are also promising us that you wrote this yourself, or copied it from a public domain or similar free resource (see OpenCircuits:Copyrights for details). Do not submit copyrighted work without permission!

Cancel Editing help (opens in new window)