Register and Try Sentimetrix
home > help

faq

SentiMetrix™ offers the SentiGrade™ family of services for tracking opinions towards arbitrary topics expressed in the online news media, blogs, message boards, customer reviews and client-provided data.

1. Dashboard help

Query Structure:

  •      honda  hybrid - will find the scores for the entities with ALL terms, like "honda hybrid" and "hybrid sedan by honda"
  •      "honda hybrid" - an exact match for "honda hybrid"
  •      honda + hybrid sedan, mileage - accord,civic - will find the scores for the entities containing "honda" in the documents including either BOTH words "hybrid" and "sedan" OR "milage" but containing NEITHER the word "accord" NOR "civic"
  •      Both positive(+) and negative(-) contextual terms are optional. You can  use exact match (enclose the term phrase in double quotes) with them just as you would with the main term. For example honda + "good mileage", hybrid, "best value" - "four wheel drive"

 Speed:

  •  Stemming algorithm makes Sentimetrix search for the words with the common stem. For example, if you specify the word 'politics', the Slower option will be searching for the words 'poitics', 'political', 'politician' while the Faster one only look for 'politics' . This applies to all the words you put in your search term: both main term and positive and negative contextual terms (those would be the terms following the main term prefixed with the '+' and '-' sign respectively. To search for an exact phrase enclose it in the double quotes.

Sources:

  • You can restrict your queries to only News or Blog sources, or select 'All' to get complete coverage

Languages:

  • For a limited trial period,  you can search in several languages in addition to English. Non-English languages are a premium feature, so if you are interested in working with them going forward, please contact us

Date format:

  • To switch to European date format, put a non-US country in your profile

 

 

2. How can we be sure that we are measuring sentiment accurately?

Using a series of control experiments, we have carefully verified the accuracy of our measurements where the same documents have been graded both by our system and by human subjects. Because different people may assign somewhat different grades to the same documents,  we have used the Pearson product-moment correlation coefficient as the measure of the tendency of the grades assigned by our system and by individuals to be similar. We have determined that the variance between sentiment grades assigned by our system and the average grade assigned by people differs only slightly from the variance between individuals themselves. Our conclusions were validated by an independent third party for the US Intelligence community, as well as an academic institution.

3. How do we maintain our sentiment vocabulary?

We understand the importance of keeping the sentiment vocabulary database up to date. As our systems process data from the sources we track, they also note terms not previously encountered. As new terms occur with greater frequency,  they are automatically added into the vocabulary scoring pipeline. We also re-process our vocabulary with the words already in the database to detect subtle nuances and changes in word usage over time. We employ qualified analysts to provide grading data for texts containing words that are of interest to us, and process this scoring data using our proprietary algorithms to extract word scores.

4. How the data sources are selected and maintained?

To represent the broadest and most popular market segments, we track over 70,00 news content sources and augment these sources with one million blogs. As we add or remove sources to our library of content, we do so with the intent to provide a representative sample of the most widely accessed content. In cases where we build an application for a client, we either access sources they provide to us, or crawl additional industry specific sources from the web.

5. Can I limit the analysis to the sources that interest me?

You can limit the sources by type in the free version. Signing up for premium services will allow you to select sources by location (including country), select individual sources from the list or work with us to create a new collection or category grouping.

6. Can I use SentiGrade to analyze my proprietary data?

Yes, you can. Our service allows clients to send the documents to us for scoring, and later track sentiments expressed in those documents. Please contact us for more information.

7. What’s so special about SentiMetrix technology?

We are able to track the full spectrum of the Internet media, not just one part of it, such as news sites, newspapers, or blogs.  We believe that this is the only way our customers can get a balanced, objective view of issues that affect their brands, and understand the trends that will shape tomorrow’s events.

We measure sentiments on a continuous scale, not just “good” vs. “bad”.  Our system provides the level of granularity on par with traditional marketing research methods.  We have taken great care to ensure that these measurements are indeed accurate.

We provide access to full range of our features and functionality via a Web site and an API, and do not require our customers to sign long-term consulting agreements.

Finally, each member of the SentiMetrix team has many years if not decades of experience dealing with large amounts of Internet content using natural language processing and machine learning.  Our expertise in developing text mining methods is second to none, and we fully understand the urgency of our clients’ need for information and insight into the sentiments of the online community.