Thursday, 6 November 2014

Difference between tableau extensions .twb and .twbx



Twbx:
  1. .twbx file is a Tableau Packaged Workbook, meaning it is the original .twb file grouped together with the datasource(s) in one package. 
  2.  .twbx files can be considered analogous to specialized zip files, in which these “zip” files contain all the information necessary to work in Tableau.   
  3. The primary advantage to using .twbx files is that analysis can be performed without network/internet connections to your data because your data is already present on your computer in this packaged file.
Twb:
  1. The .twb file alone is not enough to perform any analysis because it only contains Tableau’s instructions for interacting with a datasource.  
  2. In actuality, .twb files are XML files specially tailored to interact with datasources.
  3. They are custom built to make the awesome visualizations that Tableau generates.  Here you can see a picture of a .twb file opened in Notepad++.

What is the Dimensions and Measures?

DimensionA dimension is a field that can be considered an independent variable.

Dimensions typically produce headers when added to the rows or columns shelves in the view. By default, Tableau treats any field containing qualitative, categorical information as a dimension. This includes, for instance, any field with text or dates values.

This means that a measure can be aggregated for each value of the dimension. For instance, you might calculate the Sum of “Sales” for every “State”. In this case the State field is acting as a dimension because you want to aggregate sales for each state. The values of Sales are dependent on the State, so State is an independent field and Sales is a dependent field.

Measure:A measure is a field that is a dependent variable; that is, its value is a function of one or more dimensions.

Measures typically produce axes when added to the rows or columns shelves. By default, Tableau treats any field containing numeric (quantitative) information as a measure.

This means that a measure is a function of other dimensions placed on the worksheet. For instance, you might calculate the Sum of “Sales” for every “State”. In this case, the Sales field is acting as a measure because you want to aggregate the field for each state. But measures could also result in a non-numeric result. For instance, you might create a calculated measure called “Sales Rating” that results in the word “Good” if sales are good and “Bad” otherwise. In this case the “Sales Rating” field acts as a measure even though it produces a non-numeric result. It is considered a measure because it is a function of the dimensions in the view.

Friday, 11 July 2014

What is KPI in Tableau.?

We can easily create a view that shows Key Progress Indicators (KPIs). To do this, you complete the following tasks:
  • Create the base view with the fields you want to measure.
  • Build a calculated field that establishes the figure from which you measure progress for the data you’re measuring.
  • Use shapes that Tableau provides that are designed specifically for KPIs.
This example shows how to build a KPI view that shows a green check mark for any sales figure over $125,000, and a red X for any sales figure under $125,000.


Wednesday, 2 July 2014

Preparing data for Tableau.

  1. Cleanup dimensions and measure names.
  2. Set attribute aliases.
  3. Set default colors
  4. Set default measure aggregations.
  5. Create calculated fields.

Wednesday, 25 June 2014

Is Parameter have it's dropdown list..?

Yes, But it will be called as Compact list.


What is the criteria to blend the data from multiple data sources.?

There should be a common dimension to blend the data source into single worksheet.

For example, when blending Acutal and Target sales data, the two data sources may have a Date field in common. The Date field must be used on the sheet. Then when you switch to the secondary data source in the Data window, Tableau automatically links fields that have the same name. If they don’t have the same name, you can define a custom relationship that creates the correct mapping between fields.

Can we use Groups and Sets in calculation field.?

  • Groups: No, we can not use Groups in calculation fields.
  • Sets: Yes, we can use Sets in calculation fields.

Difference between Grouping and Sets.?

  • Groups – Combine dimension members into higher level categories.
  • Sets – Create a custom field based on existing dimensions that can be used to encode the view with multiple dimension members across varying dimension levels.

What is context filter.?

If you are applying filters to a large data source, you can improve performance by setting up context filters. A context filter is applied to the data source first, and then the other filters are applied only to the resulting records. This sequence avoids applying each filter to each record in the data source.

You may create a context filter to:

  • Improve performance – If you set a lot of filters or have a large data source, the queries can be slow. You can set one or more context filters to improve performance.
  • Create a dependent numerical or top N filter – You can set a context filter to include only the data of interest, and then set a numerical or a top N filter.

What is Dual Axis.?

You can compare multiple measures using dual axes, which are two independent axes that are layered on top of each other. Dual axes are useful when you have two measures that have different scales.

For example, the view below shows Dow Jones and NASDAQ close values over time.

To add the measure as dual axis drag the field to the right side of the view and drop it when you see a black dashed line. You can also select Dual Axis on the field menu for the measure.



The two axes are independent scales but the marks are layered in the same pane.

Can we use Parameter in Filter.?

Yes, We can use.






Friday, 20 June 2014

What is page self..?

The Pages shelf is a powerful part of Tableau that you can use to control the display of output as well as the printed result of that output. 


Monday, 7 April 2014

How to Improve Performance in Tableau.?

1. Use an extract.
There is nothing else that comes close to the efficiency gained using an extract.  If you don't absolutely need live data, extracting is the best bet.

2. Limit your dashboard to fully answering only one scenario.
At it's simplest, a dashboard should be able to fully explore a single scenario.  If your dashboard has six sheets, five actions, and 3 quick filters, you might not be looking at only one scenario.  Remember, no matter how elegant and comprehensive your solution is, if it doesn't run as quickly as the user would like it to, he or she will not use it.  I would not recommend butchering your dashboard so heavily that it cannot fully handle a scenario.  If the user has to go somewhere else to find the answer, why did they use your dashboard at all?

3. Limit the data being introduced to each worksheet.
If you are not planning on using a set of rows, you should filter them out of the data set as early as possible.  If your table contains all sales, and you only want to look at US sales, create a Custom SQL query that filters it out.  If the filter is worksheet dependent, try using a Context Filter.  For more information on filtering, check out my other post Types of Filters in Tableau.  You can also click the Down Arrow beside the word "Dimension" and Select "Hide All Unused Fields" to hide any fields you are not using in any of your worksheets.  I'm not sure if this improves efficiency; but I'd have to imagine that it does, less data should always improve performance.

4. Remove components that add no value.
While aesthetics are very important to building a usable dashboard, unimportant objects aren't worth losing efficiency over.  In fact, you would be better off adding more functionality than you would by adding a purely aesthetic object.

5. Eliminate any non-essential components from the visualization.
This refers to values that would appear on the Pages, Filters, and Level of Detail Shelves.  If they are purely there for the user to see if they scroll over a point, then they aren't adding any value to the initial glance.  However, I leave this as the last step because it should be a last resort.

In most cases, a little forethought can save you a lot of heartache when you are creating dashboards.  Decide exactly what story you want to tell, and tell only that.  It is much easier to add functionality to a small dashboard, than to butcher a large one.  Thanks for reading.

How many types of filters are there in Tableau.?

In Tableau, there are three types of filters.  More explicitly, there are three different ways to limit the data that is displayed by your graph. Each of these has its own strengths and weaknesses, and we will look at them one at a time.

These types are
  1. Custom SQL "Filters"
  2. Context Filters
  3. Traditional Filters.  
Custom SQL Filters:
Custom SQL "Filter" is a WHERE clause that is placed in the SQL that queries the data to be used in the workbook.  "Filter" is a Tableau term that technically applies only to Context and Traditional Filters; however, the Custom SQL "Filter" emulates the behavior of a global Context Filter, so we will refer to it as such.  By construction, Custom SQL "Filters" are always global.  The most common reason for using a Custom SQL "Filter" is to limit the size of a data extract.  The smaller your data extract, the more quickly your charts will load.  In other words, you can make more complex charts without sacrificing efficiency.

One of the ways to create a Custom SQL "Filter" is during the Server Connection process.

Context Filters:
a Context Filter is a filter in Tableau that affects the data that is transferred to each individual worksheet.  Context Filters are great when you want to limit the data seen by the worksheet.  When a worksheet queries the data source, it creates a temporary, flat table that is uses to compute the chart.  This temporary table includes all values that are not filtered out by either the Custom SQL or the Context Filter.  Just like with Custom SQL "Filters", your goal is to make this temporary table as small as possible.

Context Filters have a few advantages over Traditional Filters.  First, they execute more quickly than Traditional Filters.  They are also executed before Traditional Filters and can be executed all at once, which further improves efficiency.  However, they do have one drawback.  It takes time for the filter to be placed into context.  A rule of thumb, from Tableau's KnowledgeBase, is to only place a filter into context if it reduces the data by at least 10%.

A Context Filter is created by dragging a field onto the "Filters" Shelf and editing the filter.  Then, you can Right-Click the field on the shelf and select "Add to Context."  If you have multiple context filters, you can CTRL-Select them all and add them to context in a batch.  This will improve the efficiency of your filter.

Traditional Filters:
Traditional Filter is exactly what most people think of when they think of filters.  When Tableau is creating the visualization, it will check to see if a value is filtered out by a Traditional Filter.  Since this is not performed at the table level, it is the slowest of all filter types.  However, it does have the advantage of being performed after the Context Filters.  This is a necessity if you are dealing with complex "Top N" filters.  A Traditional Filter can be created by simply dragging a field onto the "Filters" Shelf.

Monday, 17 March 2014

Tableau Interview Questions -FAQs

  1. Is there any new features implemented in tableau 8.0 regarding the tableau server performance improvement?
  2. What are the other settings I need to reconfigure to get better performance as I am using 7.0 tableau server and planning to upgrade to latest versions?Suggest best configurations based on the provided server details?
  3. How many viz SQL process should I run?
  4. How many extracts (extract type) can be used on a single server(without effecting server performance like memory) ?
  5. What are the possible reasons for slow performance in Tableau?
  6. How to check the performance step by step manner(DB, Report side, Network) in tableau report ?
  7. How to improve the tableau report performance?
  8. How we can find the tableau Report Rendering Time.?
  9. What kind of technology is a Tableau Viz?
  10. Is there a limit on storage space for the data?
  11. How do you create dashboard.? Can you explain the life cycle.?
  12. Can you explain about table calculations?
  13. I have one scenario like Year in integer and week in String and wanted to calculate the YTD.. how to do this.?
  14. What kind of join do you see in data blending?
  15. What is data blending..? When do you use this.?
  16. When do you use horizontal and vertical components?
  17. Name the components of dashboard?
  18. Can we have multiple value selection in parameter?
  19. What are parameters and when do you use it?
  20. What is the difference between tableau 7.0 and 8.0 versions?
  21. How Does Tableau Work?
  22. What is Tableau Reader?
  23. What is Tableau Server?
  24. What is Tableau Desktop?
  25. What is the Defference between quick filter and Normal filter.?
  26. What is the Difference between connect live and import all data and Import some data.?
  27. How do I automate reports using Tableau software?
  28. How does Tableau perform with huge datasets?
  29. How is Tableau so fast when working with databases?
  30. How many types of filters are there in Tableau.?

Is there any new features implemented in tableau 8.0 regarding the tableau server performance improvement?

1. Use an extract:
2. Limit your dashboard to fully answering only one scenario:
3. Limit the data being introduced to each worksheet:
4. Remove components that add no value:
5. Eliminate any non-essential components from the visualization

What are the other settings I need to reconfigure to get better performance as I am using 7.0 tableau server and planning to upgrade to latest versions?Suggest best configurations based on the provided server details?

Tableau 8,8.1 and 8.2 also supported for 4GB ram and core processors.

How many viz SQL process should I run?

Ddepending on Data Capacity 

How many extracts (extract type) can be used on a single server(without effecting server performance like memory) ?

Better 10

What are the possible reasons for slow performance in Tableau?

More Extracts, filters and depends on data sources

How to check the performance step by step manner(DB, Report side, Network) in tableau report ?

Go to help menu and select performance tuning option 

How to improve the tableau report performance?

If you are not planning on using a set of rows, you should filter them out of the data set as early as possible. If your table contains all sales, and you only want to look at US sales, create a Custom SQL query that filters it out. If the filter is worksheet dependent, try using a Context Filter. For more information on filtering, check out my other post Types of Filters in Tableau. You can also click the Down Arrow beside the word "Dimension" and Select "Hide All Unused Fields" to hide any fields you are not using in any of your worksheets. I'm not sure if this improves efficiency; but I'd have to imagine that it does, less data should always improve performance.

How we can find the tableau Report Rendering Time.?

Report rendering time=Network time(request from URL to Report server) +Query execution time + Network time(response from SQL Server)+calculations(table column)+time taken to display the report in desired format(html/ pdf/ excel)

What kind of technology is a Tableau Viz?

It is a thin AJAX based JavaScript application

Is there a limit on storage space for the data?

Yes, there is a 1 gigabyte limit on storage space for data. For the vast majority of users, we expect that 1 GB will be more space than needed.

Sunday, 16 March 2014

How do you create dashboard.? Can you explain the life cycle.?

Once we have proper requirement we will create the worksheet in Developer environment and create the dashboard with all the worksheets and then do the unit testing and if everything looks good then we will publish the same in to the dev  server with valid permission and the migration will be taken care by our tableau administrator

Can you explain about table calculations?

These are inbuilt calculations in tableau which we normally use to calculate Percentange from or for YTD and other calculations like the measure across table, below table and etc..

I have one scenario like Year in integer and week in String and wanted to calculate the YTD.. how to do this.?

In Tableau, the relative date filter enables flexible analysis of time periods. Sometimes, however, you might want to see both year-to-date (YTD) and month-to-date (MTD) values for a particular measure on the same view. To accomplish this task, you can create date calculations.

Create a calculated column which replaces week from string to integer and make use this in another calculation for YTD.

YTD:

MTD:




What kind of join do you see in data blending?

There won't be any joins as such but we will just give the column references like primary and foreign key relation.

What is data blending..? When do you use this.?

Data blending is when you blend data from multiple data sources on a single worksheet. The data is joined on common dimensions. Data Blending does not create row level joins and is not a way to add new dimensions or rows to your data.

We use this when we want to fetch data from different sources and make use in single worksheet.



When do you use horizontal and vertical components?

We can use these when we want to have all sheets or filter to move in single shot.. however we can still  create the dashboard without this also.. this allows us to make our work simple

Name the components of dashboard?

  • Horizontal
  • Vertical
  • Text
  • Images etc

Can we have multiple value selection in parameter?

No

What are parameters and when do you use it?

Parameters are dynamic values that can replace constant values in calculations.
or
Parameters are used when you want to change the static values.

How to use parameters -1:


How to use parameters - 2:

What is the difference between tableau 7.0 and 8.0 versions

  • New visualizations are introduced like treemap, bubble chart and box and whisker plot
  • We can copy worksheet directly from one workbook to another workbook
  • Introduced R script

Thursday, 13 March 2014

How Does Tableau Work?

While Tableau lets you analyze databases and spreadsheets like never before, you don’t need to know anything about databases to use Tableau. In fact, Tableau is designed to allow business people with no technical training to analyze their data efficiently.
Tableau is based on three simple concepts:
  1. Connect - Connect Tableau to any database that you want to analyze. Note that Tableau does not import the data. Instead it queries to the database directly.
  2. Analyze - Analyzing data means viewing it, filtering it, sorting it, performing calculations on it, reorganizing it, summarizing it, and so on.
    Using Tableau you can do all of these things by simply arranging fields of your data source on a Tableau worksheet. When you drop a field on a worksheet, Tableau queries the data using standard drivers and query languages (like SQL and MDX) and presents a visual analysis of the data.
  3. Share - You can share results with others either by sharing workbooks with other Tableau users, by pasting results into applications such as Microsoft Office, printing to PDF or by using Tableau Server to publish or embed your views across your organization.

What is Tableau Reader?

Tableau Reader is a free viewing application that lets anyone read and interact with packaged workbooks created by Tableau Desktop

What is Tableau Server?

Tableau Server is a business intelligence solution that provides browser-based visual analytics anyone can use at just a fraction of the cost of typical BI software. With just a few clicks, you can publish or embed live, interactive graphs, dashboards and reports with current data automatically customized to the needs of everyone across your organization. It deploys in minutes and users can produce thousands of reports without the need of IT services — all within your IT infrastructure.

What is Tableau Desktop?

Tableau Desktop is a data visualization application that lets you analyze virtually any type of structured data and produce highly interactive, beautiful graphs, dashboards, and reports in just minutes. After a quick installation, you can connect to virtually any data source from spreadsheets to data warehouses and display information in multiple graphic perspectives. Designed to be easy to use, you’ll be working faster than ever before.

Tuesday, 11 March 2014

What is the difference between Quick Filter and Normal filter.?

Normal Filter:- This is used to restrict the data from database based on selected dimension or measure. If you drag a dimension into filter shelf you can filter the data by selecting list of values. when dragging by  measure you can select a range of values.

Quick Filters:- what ever you have applied filter, if you want give a chance to user for dynamically changing data members at run time this is useful. In this you are able to see only non restricted data members only.

OR

The quick filter is a representation of filter to play around and normal filter will not be shown in the workbook to play around.


What is the Difference between connect live and import all data and Import some data.?


  • Connect live – Creates a direct connect to your data. The speed of your data source will determine performance.
  • Import all data – Imports the entire data source into Tableau’s fast data engine as an extract. The extract is saved with the workbook.
  • Import some data – Imports a subset of your data into Tableau’s fast data engine as an extract. This option requires you to specify what data you want to extract using filters.

How do I automate reports using Tableau software?


How does Tableau perform with huge datasets?


How is Tableau so fast when working with databases?

Tableau compiles the elements of your visual canvas into a SQL or MDX query for the remote database to process. Since a database typically runs on more powerful hardware than the laptops / workstations used by analysts, you should generally expect the database to handle queries much faster than most in-memory BI applications limited by end-user hardware. Tableau's ability to push computation (queries) close to the data is increasingly important for large data sets, which may reside on a fast cluster and may be too large to bring in-memory.

Another factor in performance relates to data transfer, or in Tableau's case resultset transfer. Since Tableau visualizations are designed for human consumption, they are tailored to the capabilities and limits of the human perception system. This generally means that the amount of data in a query resultset is small relative to the size of the underlying data, and visualizations focus on aggregation and filtering to identify trends and outliers. The small resultsets require little network bandwidth, so Tableau is able to fetch and render the resultset very quickly. And, as Ross mentioned, Tableau will cache query results for fast reuse.

The last factor as mentioned by Eriglen involves Tableau's ability to use in-memory acceleration as needed (for example, when working with very slow databases, text files, etc.). Tableau's Data Engine uses memory-mapped I/O, so while it takes advantage of in-memory acceleration it can easily work with large data sets which cannot fit in memory. The Data Engine will work only with the subsets of data on disk which are needed for a given query, and the data subsets are mapped into memory as needed.

OR

Tableau does do some amount of in-memory storage to increase speed (when extracted), but a great portion of its speed actually comes from not having to store data in memory.

This is because Tableau only stores the data relevant to your queries in-memory, whereas other solutions will store the entire set in memory, which can take more time to load.

OR

Tableau main feature "data engine" is really a cool feature. If you work with a large amount of data it takes some time to import, create indexes and sort data but after that every thing speedup. Tableau data engine is not really in-memory technology. The data is stored in disk after imported and then RAM is hardly utilized. This conception brings the desired performance.