fbpx
Wikipedia

Pivot table

A pivot table is a table of values which are aggregations of groups of individual values from a more extensive table (such as from a database, spreadsheet, or business intelligence program) within one or more discrete categories. The aggregations or summaries of the groups of the individual terms might include sums, averages, counts, or other statistics. A pivot table is the outcome of the statistical processing of tabularized raw data and can be used for decision-making.

Although pivot table is a generic term, Microsoft held a trademark on the term in the United States from 1994 to 2020.[1]

History edit

In their book Pivot Table Data Crunching,[2] Bill Jelen and Mike Alexander refer to Pito Salas as the "father of pivot tables". While working on a concept for a new program that would eventually become Lotus Improv, Salas noted that spreadsheets have patterns of data. A tool that could help the user recognize these patterns would help to build advanced data models quickly. With Improv, users could define and store sets of categories, then change views by dragging category names with the mouse. This core functionality would provide the model for pivot tables.

Lotus Development released Improv in 1991 on the NeXT platform. A few months after the release of Improv, Brio Technology published a standalone Macintosh implementation, called DataPivot (with technology eventually patented in 1999).[3] Borland purchased the DataPivot technology in 1992 and implemented it in their own spreadsheet application, Quattro Pro.

In 1993 the Microsoft Windows version of Improv appeared. Early in 1994 Microsoft Excel 5[4] brought a new functionality called a "PivotTable" to market. Microsoft further improved this feature in later versions of Excel:

  • Excel 97 included a new and improved PivotTable Wizard, the ability to create calculated fields, and new pivot cache objects that allow developers to write Visual Basic for Applications macros to create and modify pivot tables
  • Excel 2000 introduced "Pivot Charts" to represent pivot-table data graphically

In 2007 Oracle Corporation made PIVOT and UNPIVOT operators available in Oracle Database 11g.[5]

Mechanics edit

For typical data entry and storage, data usually appear in flat tables, meaning that they consist of only columns and rows, as in the following portion of a sample spreadsheet showing data on shirt types:

A B C D E F G
1 Region Gender Style Ship date Units Price Cost
2 East Boy Tee 2005-01-31 12 11.04 10.42
3 East Boy Golf 2005-01-31 12 13.00 12.60
4 East Boy Fancy 2005-01-31 12 11.96 11.74
5 East Girl Tee 2005-01-31 10 11.27 10.56
6 East Girl Golf 2005-01-31 10 12.12 11.95
7 East Girl Fancy 2005-01-31 10 13.74 13.33
8 West Boy Tee 2005-01-31 11 11.44 10.94
9 West Boy Golf 2005-01-31 11 12.63 11.73
10 West Boy Fancy 2005-01-31 11 12.06 11.51
11 West Girl Tee 2005-01-31 15 13.42 13.29
12 West Girl Golf 2005-01-31 15 11.48 10.67

While tables such as these can contain many data items, it can be difficult to get summarized information from them. A pivot table can help quickly summarize the data and highlight the desired information. The usage of a pivot table is extremely broad and depends on the situation. The first question to ask is, "What am I seeking?" In the example here, let us ask, "How many Units did we sell in each Region for every Ship Date?":

Sum of units Ship date ▼
Region ▼ 2005-01-31 2005-02-28 2005-03-31 2005-04-30 2005-05-31 2005-06-30
East 66 80 102 116 127 125
North 96 117 138 151 154 156
South 123 141 157 178 191 202
West 78 97 117 136 150 157
(blank)
Grand total 363 435 514 581 622 640

A pivot table usually consists of row, column and data (or fact) fields. In this case, the column is ship date, the row is region and the data we would like to see is (sum of) units. These fields allow several kinds of aggregations, including: sum, average, standard deviation, count, etc. In this case, the total number of units shipped is displayed here using a sum aggregation.

Implementation edit

Using the example above, the software will find all distinct values for Region. In this case, they are: North, South, East, West. Furthermore, it will find all distinct values for Ship date. Based on the aggregation type, sum, it will summarize the fact, the quantities of Unit, and display them in a multidimensional chart. In the example above, the first datum is 66. This number was obtained by finding all records where both Region was East and Ship Date was 2005-01-31, and adding the Units of that collection of records (i.e., cells E2 to E7) together to get a final result.

Pivot tables are not created automatically. For example, in Microsoft Excel one must first select the entire data in the original table and then go to the Insert tab and select "Pivot Table" (or "Pivot Chart"). The user then has the option of either inserting the pivot table into an existing sheet or creating a new sheet to house the pivot table. A pivot table field list is provided to the user which lists all the column headers present in the data. For instance, if a table represents sales data of a company, it might include Date of sale, Sales person, Item sold, Color of item, Units sold, Per unit price, and Total price. This makes the data more readily accessible.

Date of sale Sales person Item sold Color of item Units sold Per unit price Total price
2013-10-01 Jones Notebook Black 8 25000 200000
2013-10-02 Prince Laptop Red 4 35000 140000
2013-10-03 George Mouse Red 6 850 5100
2013-10-04 Larry Notebook White 10 27000 270000
2013-10-05 Jones Mouse Black 4 700 2800

The fields that would be created will be visible on the right hand side of the worksheet. By default, the pivot table layout design will appear below this list.

Pivot Table fields are the building blocks of pivot tables. Each of the fields from the list can be dragged on to this layout, which has four options:

  1. Filters
  2. Columns
  3. Rows
  4. Values

Some uses of pivot tables are related to the analysis of questionnaires with optional responses but some implementations of pivot tables do not allow these use cases. For example the implementation in LibreOffice Calc since 2012 is not able to process empty cells.[6][7]

Filters edit

Report filter is used to apply a filter to an entire table. For example, if the "Color of Item" field is dragged to this area, then the table constructed will have a report filter inserted above the table. This report filter will have drop-down options (Black, Red, and White in the example above). When an option is chosen from this drop-down list ("Black" in this example), then the table that would be visible will contain only the data from those rows that have the "Color of Item= Black".

Columns edit

Column labels are used to apply a filter to one or more columns that have to be shown in the pivot table. For instance if the "Salesperson" field is dragged to this area, then the table constructed will have values from the column "Sales Person", i.e., one will have a number of columns equal to the number of "Salesperson". There will also be one added column of Total. In the example above, this instruction will create five columns in the table — one for each salesperson, and Grand Total. There will be a filter above the data — column labels — from which one can select or deselect a particular salesperson for the pivot table.

This table will not have any numerical values as no numerical field is selected but when it is selected, the values will automatically get updated in the column of "Grand total".

Rows edit

Row labels are used to apply a filter to one or more rows that have to be shown in the pivot table. For instance, if the "Salesperson" field is dragged on this area then the other output table constructed will have values from the column "Salesperson", i.e., one will have a number of rows equal to the number of "Sales Person". There will also be one added row of "Grand Total". In the example above, this instruction will create five rows in the table — one for each salesperson, and Grand Total. There will be a filter above the data — row labels — from which one can select or deselect a particular salesperson for the Pivot table.

This table will not have any numerical values, as no numerical field is selected, but when it is selected, the values will automatically get updated in the Row of "Grand Total".

Values edit

This usually takes a field that has numerical values that can be used for different types of calculations. However, using text values would also not be wrong; instead of Sum, it will give a count. So, in the example above, if the "Units sold" field is dragged to this area along with the row label of "Salesperson", then the instruction will add a new column, "Sum of units sold", which will have values against each salesperson.

Row labels Sum of units sold
Jones 12
Prince 4
George 6
Larry 10
Grand total 32

Application support edit

Pivot tables or pivot functionality are an integral part of many spreadsheet applications and some database software, as well as being found in other data visualization tools and business intelligence packages.

Spreadsheets edit

Database support edit

  • PostgreSQL, an object–relational database management system, allows the creation of pivot tables using the tablefunc module.[13]
  • MariaDB, a MySQL fork, allows pivot tables using the CONNECT storage engine.[14]
  • Microsoft Access supports pivot queries under the name "crosstab" query. [citation needed]
  • Microsoft SQL Server supports pivot as of SQL Server 2016 with the FROM...PIVOT keywords[15]
  • Oracle Database supports the PIVOT operation.[citation needed]
  • Some popular databases that do not directly support pivot functionality, such as SQLite, can usually simulate pivot functionality using embedded functions, dynamic SQL or subqueries. The issue with pivoting in such cases is usually that the number of output columns must be known at the time the query starts to execute; for pivoting this is not possible as the number of columns is based on the data itself. Therefore, the names must be hard coded or the query to be executed must itself be created dynamically (meaning, prior to each use) based upon the data.[citation needed]

Web applications edit

  • ZK, an Ajax framework, also allows the embedding of pivot tables in Web applications.[citation needed]

Programming languages and libraries edit

Programming languages and libraries suited to work with tabular data contain functions that allow the creation and manipulation of pivot tables.

  • Python data analysis toolkit pandas has the function pivot_table[16] and the xs method useful to obtain sections of pivot tables.[citation needed]
  • R has the Tidyverse metapackage, which contains a collection of tools providing pivot table functionality,[17][18] as well as the pivottabler package.[19]

Online analytical processing edit

Excel pivot tables include the feature to directly query an online analytical processing (OLAP) server for retrieving data instead of getting the data from an Excel spreadsheet. On this configuration, a pivot table is a simple client of an OLAP server. Excel's PivotTable not only allows for connecting to Microsoft's Analysis Service, but to any XML for Analysis (XMLA) OLAP standard-compliant server.

See also edit

References edit

  1. ^ "United States Trademark Serial Number 74472929". 1994-12-27. Retrieved 2022-03-23.
  2. ^ Jelen, Bill; Alexander, Michael (2006). Pivot table data crunching. Indianapolis: Que. pp. 274. ISBN 0-7897-3435-4.
  3. ^ Gartung, Daniel L.; Edholm, Yorgen H.; Edholm, Kay-Martin; McNall, Kristen N.; Lew, Karl M., Patent #5915257, retrieved 2010-02-16
  4. ^ Darlington, Keith (2012-08-06). VBA For Excel Made Simple. Routledge (published 2012). p. 19. ISBN 9781136349775. Retrieved 2014-09-10. [...] Excel 5, released in early 1994, included the first version of VBA.
  5. ^ Shah, Sharanam; Shah, Vaishali (2008). Oracle for Professionals - Covers Oracle 9i, 10g and 11g. Shroff Publishing Series. Navi Mumbai: Shroff Publishers (published July 2008). p. 549. ISBN 9788184045260. Retrieved 2014-09-10. One of the most useful new features of the Oracle Database 11g from the SQL perspective is the introduction of Pivot and Unpivot operators.
  6. ^ "LibreOffice Calc and Pivot table with empty cells". StackOverflow. 2021-06-17. Retrieved 2021-06-17.
  7. ^ "Functionality request for PIVOTTABLE". LibreOffice bugs. 2012-03-19. Retrieved 2021-06-17.
  8. ^ Dalgleish, Debra (2007). Beginning PivotTables in Excel 2007: From Novice to Professional. Apress. pp. 233–257. ISBN 9781430204336. Retrieved 18 September 2018.
  9. ^ "Busy Developers' Guide to HSSF and XSSF Features". poi.apache.org. Retrieved 2022-12-09.
  10. ^ "Pivot Tables".
  11. ^ "Create & use pivot tables". Docs Editors Help. Google Inc. Retrieved 6 August 2020.
  12. ^ "iWork update brings major changes to Mac, iPhone, and iPad apps". Macworld. Retrieved 2021-09-28.
  13. ^ "PostgreSQL: Documentation: 9.2: tablefunc". postgresql.org. 9 November 2017.
  14. ^ "CONNECT Table Types - PIVOT Table Type". mariadb.com.
  15. ^ "FROM clause plus JOIN, APPLY, PIVOT (T-SQL) - SQL Server".
  16. ^ "pandas.pivot_table". Retrieved 21 November 2023.
  17. ^ dplyr and Pivot Tables.
  18. ^ Pivoting.
  19. ^ "pivottabler".

Further reading edit

pivot, table, cross, tabulation, that, aggregates, only, counting, rather, than, summing, averaging, contingency, table, pivot, table, table, values, which, aggregations, groups, individual, values, from, more, extensive, table, such, from, database, spreadshe. For cross tabulation that aggregates only by counting rather than summing averaging etc see Contingency table A pivot table is a table of values which are aggregations of groups of individual values from a more extensive table such as from a database spreadsheet or business intelligence program within one or more discrete categories The aggregations or summaries of the groups of the individual terms might include sums averages counts or other statistics A pivot table is the outcome of the statistical processing of tabularized raw data and can be used for decision making Although pivot table is a generic term Microsoft held a trademark on the term in the United States from 1994 to 2020 1 Contents 1 History 2 Mechanics 3 Implementation 3 1 Filters 3 2 Columns 3 3 Rows 3 4 Values 4 Application support 4 1 Spreadsheets 4 2 Database support 4 3 Web applications 4 4 Programming languages and libraries 5 Online analytical processing 6 See also 7 References 8 Further readingHistory editIn their book Pivot Table Data Crunching 2 Bill Jelen and Mike Alexander refer to Pito Salas as the father of pivot tables While working on a concept for a new program that would eventually become Lotus Improv Salas noted that spreadsheets have patterns of data A tool that could help the user recognize these patterns would help to build advanced data models quickly With Improv users could define and store sets of categories then change views by dragging category names with the mouse This core functionality would provide the model for pivot tables Lotus Development released Improv in 1991 on the NeXT platform A few months after the release of Improv Brio Technology published a standalone Macintosh implementation called DataPivot with technology eventually patented in 1999 3 Borland purchased the DataPivot technology in 1992 and implemented it in their own spreadsheet application Quattro Pro In 1993 the Microsoft Windows version of Improv appeared Early in 1994 Microsoft Excel 5 4 brought a new functionality called a PivotTable to market Microsoft further improved this feature in later versions of Excel Excel 97 included a new and improved PivotTable Wizard the ability to create calculated fields and new pivot cache objects that allow developers to write Visual Basic for Applications macros to create and modify pivot tables Excel 2000 introduced Pivot Charts to represent pivot table data graphicallyIn 2007 Oracle Corporation made PIVOT and UNPIVOT operators available in Oracle Database 11g 5 Mechanics editFor typical data entry and storage data usually appear in flat tables meaning that they consist of only columns and rows as in the following portion of a sample spreadsheet showing data on shirt types A B C D E F G1 Region Gender Style Ship date Units Price Cost2 East Boy Tee 2005 01 31 12 11 04 10 423 East Boy Golf 2005 01 31 12 13 00 12 604 East Boy Fancy 2005 01 31 12 11 96 11 745 East Girl Tee 2005 01 31 10 11 27 10 566 East Girl Golf 2005 01 31 10 12 12 11 957 East Girl Fancy 2005 01 31 10 13 74 13 338 West Boy Tee 2005 01 31 11 11 44 10 949 West Boy Golf 2005 01 31 11 12 63 11 7310 West Boy Fancy 2005 01 31 11 12 06 11 5111 West Girl Tee 2005 01 31 15 13 42 13 2912 West Girl Golf 2005 01 31 15 11 48 10 67 While tables such as these can contain many data items it can be difficult to get summarized information from them A pivot table can help quickly summarize the data and highlight the desired information The usage of a pivot table is extremely broad and depends on the situation The first question to ask is What am I seeking In the example here let us ask How many Units did we sell in each Region for every Ship Date Sum of units Ship date Region 2005 01 31 2005 02 28 2005 03 31 2005 04 30 2005 05 31 2005 06 30East 66 80 102 116 127 125North 96 117 138 151 154 156South 123 141 157 178 191 202West 78 97 117 136 150 157 blank Grand total 363 435 514 581 622 640A pivot table usually consists of row column and data or fact fields In this case the column is ship date the row is region and the data we would like to see is sum of units These fields allow several kinds of aggregations including sum average standard deviation count etc In this case the total number of units shipped is displayed here using a sum aggregation Implementation editUsing the example above the software will find all distinct values for Region In this case they are North South East West Furthermore it will find all distinct values for Ship date Based on the aggregation type sum it will summarize the fact the quantities of Unit and display them in a multidimensional chart In the example above the first datum is 66 This number was obtained by finding all records where both Region was East and Ship Date was 2005 01 31 and adding the Units of that collection of records i e cells E2 to E7 together to get a final result Pivot tables are not created automatically For example in Microsoft Excel one must first select the entire data in the original table and then go to the Insert tab and select Pivot Table or Pivot Chart The user then has the option of either inserting the pivot table into an existing sheet or creating a new sheet to house the pivot table A pivot table field list is provided to the user which lists all the column headers present in the data For instance if a table represents sales data of a company it might include Date of sale Sales person Item sold Color of item Units sold Per unit price and Total price This makes the data more readily accessible Date of sale Sales person Item sold Color of item Units sold Per unit price Total price2013 10 01 Jones Notebook Black 8 25000 2000002013 10 02 Prince Laptop Red 4 35000 1400002013 10 03 George Mouse Red 6 850 51002013 10 04 Larry Notebook White 10 27000 2700002013 10 05 Jones Mouse Black 4 700 2800The fields that would be created will be visible on the right hand side of the worksheet By default the pivot table layout design will appear below this list Pivot Table fields are the building blocks of pivot tables Each of the fields from the list can be dragged on to this layout which has four options Filters Columns Rows ValuesSome uses of pivot tables are related to the analysis of questionnaires with optional responses but some implementations of pivot tables do not allow these use cases For example the implementation in LibreOffice Calc since 2012 is not able to process empty cells 6 7 Filters edit Report filter is used to apply a filter to an entire table For example if the Color of Item field is dragged to this area then the table constructed will have a report filter inserted above the table This report filter will have drop down options Black Red and White in the example above When an option is chosen from this drop down list Black in this example then the table that would be visible will contain only the data from those rows that have the Color of Item Black Columns edit Column labels are used to apply a filter to one or more columns that have to be shown in the pivot table For instance if the Salesperson field is dragged to this area then the table constructed will have values from the column Sales Person i e one will have a number of columns equal to the number of Salesperson There will also be one added column of Total In the example above this instruction will create five columns in the table one for each salesperson and Grand Total There will be a filter above the data column labels from which one can select or deselect a particular salesperson for the pivot table This table will not have any numerical values as no numerical field is selected but when it is selected the values will automatically get updated in the column of Grand total Rows edit Row labels are used to apply a filter to one or more rows that have to be shown in the pivot table For instance if the Salesperson field is dragged on this area then the other output table constructed will have values from the column Salesperson i e one will have a number of rows equal to the number of Sales Person There will also be one added row of Grand Total In the example above this instruction will create five rows in the table one for each salesperson and Grand Total There will be a filter above the data row labels from which one can select or deselect a particular salesperson for the Pivot table This table will not have any numerical values as no numerical field is selected but when it is selected the values will automatically get updated in the Row of Grand Total Values edit This usually takes a field that has numerical values that can be used for different types of calculations However using text values would also not be wrong instead of Sum it will give a count So in the example above if the Units sold field is dragged to this area along with the row label of Salesperson then the instruction will add a new column Sum of units sold which will have values against each salesperson Row labels Sum of units soldJones 12Prince 4George 6Larry 10Grand total 32Application support editPivot tables or pivot functionality are an integral part of many spreadsheet applications and some database software as well as being found in other data visualization tools and business intelligence packages Spreadsheets edit Microsoft Excel supports PivotTables which can be visualized through PivotCharts 8 Apache POI 9 LibreOffice Calc and Openoffice Calc support pivot tables Prior to version 3 4 this feature was named DataPilot citation needed Calligra Sheets supports pivot tables 10 Google Sheets natively supports pivot tables 11 Numbers from Apple Inc gained pivot table support in version 11 2 12 Database support edit PostgreSQL an object relational database management system allows the creation of pivot tables using the tablefunc module 13 MariaDB a MySQL fork allows pivot tables using the CONNECT storage engine 14 Microsoft Access supports pivot queries under the name crosstab query citation needed Microsoft SQL Server supports pivot as of SQL Server 2016 with the FROM PIVOT keywords 15 Oracle Database supports the PIVOT operation citation needed Some popular databases that do not directly support pivot functionality such as SQLite can usually simulate pivot functionality using embedded functions dynamic SQL or subqueries The issue with pivoting in such cases is usually that the number of output columns must be known at the time the query starts to execute for pivoting this is not possible as the number of columns is based on the data itself Therefore the names must be hard coded or the query to be executed must itself be created dynamically meaning prior to each use based upon the data citation needed Web applications edit ZK an Ajax framework also allows the embedding of pivot tables in Web applications citation needed Programming languages and libraries edit Programming languages and libraries suited to work with tabular data contain functions that allow the creation and manipulation of pivot tables Python data analysis toolkit pandas has the function pivot table 16 and the xs method useful to obtain sections of pivot tables citation needed R has the Tidyverse metapackage which contains a collection of tools providing pivot table functionality 17 18 as well as the pivottabler package 19 Online analytical processing editExcel pivot tables include the feature to directly query an online analytical processing OLAP server for retrieving data instead of getting the data from an Excel spreadsheet On this configuration a pivot table is a simple client of an OLAP server Excel s PivotTable not only allows for connecting to Microsoft s Analysis Service but to any XML for Analysis XMLA OLAP standard compliant server See also editAggregate function Business reporting Comparison of office suites Comparison of OLAP servers Contingency table a crosstab that tallies counts rather than totals Data drilling Data mining Data visualization Data warehouse Extract transform load Fold higher order function OLAP cube Relational algebra Wide and narrow dataReferences edit United States Trademark Serial Number 74472929 1994 12 27 Retrieved 2022 03 23 Jelen Bill Alexander Michael 2006 Pivot table data crunching Indianapolis Que pp 274 ISBN 0 7897 3435 4 Gartung Daniel L Edholm Yorgen H Edholm Kay Martin McNall Kristen N Lew Karl M Patent 5915257 retrieved 2010 02 16 Darlington Keith 2012 08 06 VBA For Excel Made Simple Routledge published 2012 p 19 ISBN 9781136349775 Retrieved 2014 09 10 Excel 5 released in early 1994 included the first version of VBA Shah Sharanam Shah Vaishali 2008 Oracle for Professionals Covers Oracle 9i 10g and 11g Shroff Publishing Series Navi Mumbai Shroff Publishers published July 2008 p 549 ISBN 9788184045260 Retrieved 2014 09 10 One of the most useful new features of the Oracle Database 11g from the SQL perspective is the introduction of Pivot and Unpivot operators LibreOffice Calc and Pivot table with empty cells StackOverflow 2021 06 17 Retrieved 2021 06 17 Functionality request for PIVOTTABLE LibreOffice bugs 2012 03 19 Retrieved 2021 06 17 Dalgleish Debra 2007 Beginning PivotTables in Excel 2007 From Novice to Professional Apress pp 233 257 ISBN 9781430204336 Retrieved 18 September 2018 Busy Developers Guide to HSSF and XSSF Features poi apache org Retrieved 2022 12 09 Pivot Tables Create amp use pivot tables Docs Editors Help Google Inc Retrieved 6 August 2020 iWork update brings major changes to Mac iPhone and iPad apps Macworld Retrieved 2021 09 28 PostgreSQL Documentation 9 2 tablefunc postgresql org 9 November 2017 CONNECT Table Types PIVOT Table Type mariadb com FROM clause plus JOIN APPLY PIVOT T SQL SQL Server pandas pivot table Retrieved 21 November 2023 dplyr and Pivot Tables Pivoting pivottabler Further reading editA Complete Guide to PivotTables A Visual Approach ISBN 1 59059 432 0 in depth review at slashdot org Excel 2007 PivotTables and PivotCharts Visual blueprint ISBN 978 0 470 13231 9 Pivot Table Data Crunching Business Solutions ISBN 0 7897 3435 4 Beginning Pivot Tables in Excel 2007 ISBN 1 59059 890 3 Retrieved from https en wikipedia org w index php title Pivot table amp oldid 1204741963, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.