-<!-- $PostgreSQL: pgsql/doc/src/sgml/pgbench.sgml,v 1.12 2009/12/15 15:59:57 tgl Exp $ -->
-
-<sect1 id="pgbench">
- <title>pgbench</title>
-
- <indexterm zone="pgbench">
- <primary>pgbench</primary>
- </indexterm>
-
- <para>
- <application>pgbench</application> is a simple program for running benchmark
- tests on <productname>PostgreSQL</>. It runs the same sequence of SQL
- commands over and over, possibly in multiple concurrent database sessions,
- and then calculates the average transaction rate (transactions per second).
- By default, <application>pgbench</application> tests a scenario that is
- loosely based on TPC-B, involving five <command>SELECT</>,
- <command>UPDATE</>, and <command>INSERT</> commands per transaction.
- However, it is easy to test other cases by writing your own transaction
- script files.
- </para>
-
- <para>
- Typical output from pgbench looks like:
-
- <programlisting>
-transaction type: TPC-B (sort of)
-scaling factor: 10
-query mode: simple
-number of clients: 10
-number of threads: 1
-number of transactions per client: 1000
-number of transactions actually processed: 10000/10000
-tps = 85.184871 (including connections establishing)
-tps = 85.296346 (excluding connections establishing)
- </programlisting>
-
- The first six lines report some of the most important parameter
- settings. The next line reports the number of transactions completed
- and intended (the latter being just the product of number of clients
- and number of transactions per client); these will be equal unless the run
- failed before completion. The last two lines report the TPS rate,
- figured with and without counting the time to start database sessions.
- </para>
-
- <sect2>
- <title>Overview</title>
-
- <para>
- The default TPC-B-like transaction test requires specific tables to be
- set up beforehand. <application>pgbench</> should be invoked with
- the <literal>-i</> (initialize) option to create and populate these
- tables. (When you are testing a custom script, you don't need this
- step, but will instead need to do whatever setup your test needs.)
- Initialization looks like:
-
- <programlisting>
-pgbench -i <optional> <replaceable>other-options</> </optional> <replaceable>dbname</>
- </programlisting>
-
- where <replaceable>dbname</> is the name of the already-created
- database to test in. (You may also need <literal>-h</>,
- <literal>-p</>, and/or <literal>-U</> options to specify how to
- connect to the database server.)
- </para>
-
- <caution>
- <para>
- <literal>pgbench -i</> creates four tables <structname>pgbench_accounts</>,
- <structname>pgbench_branches</>, <structname>pgbench_history</>, and
- <structname>pgbench_tellers</>,
- destroying any existing tables of these names.
- Be very careful to use another database if you have tables having these
- names!
- </para>
- </caution>
-
- <para>
- At the default <quote>scale factor</> of 1, the tables initially
- contain this many rows:
- </para>
- <programlisting>
-table # of rows
----------------------------------
-pgbench_branches 1
-pgbench_tellers 10
-pgbench_accounts 100000
-pgbench_history 0
- </programlisting>
- <para>
- You can (and, for most purposes, probably should) increase the number
- of rows by using the <literal>-s</> (scale factor) option. The
- <literal>-F</> (fillfactor) option might also be used at this point.
- </para>
-
- <para>
- Once you have done the necessary setup, you can run your benchmark
- with a command that doesn't include <literal>-i</>, that is
-
- <programlisting>
-pgbench <optional> <replaceable>options</> </optional> <replaceable>dbname</>
- </programlisting>
-
- In nearly all cases, you'll need some options to make a useful test.
- The most important options are <literal>-c</> (number of clients),
- <literal>-t</> (number of transactions), <literal>-T</> (time limit),
- and <literal>-f</> (specify a custom script file).
- See below for a full list.
- </para>
-
- <para>
- <xref linkend="pgbench-init-options"> shows options that are used
- during database initialization, while
- <xref linkend="pgbench-run-options"> shows options that are used
- while running benchmarks, and
- <xref linkend="pgbench-common-options"> shows options that are useful
- in both cases.
- </para>
-
- <table id="pgbench-init-options">
- <title><application>pgbench</application> initialization options</title>
- <tgroup cols="2">
- <thead>
- <row>
- <entry>Option</entry>
- <entry>Description</entry>
- </row>
- </thead>
-
- <tbody>
- <row>
- <entry><literal>-i</literal></entry>
- <entry>
- Required to invoke initialization mode.
- </entry>
- </row>
- <row>
- <entry><literal>-s</literal> <replaceable>scale_factor</></entry>
- <entry>
- Multiply the number of rows generated by the scale factor.
- For example, <literal>-s 100</> will create 10,000,000 rows
- in the <structname>pgbench_accounts</> table. Default is 1.
- </entry>
- </row>
- <row>
- <entry><literal>-F</literal> <replaceable>fillfactor</></entry>
- <entry>
- Create the <structname>pgbench_accounts</>,
- <structname>pgbench_tellers</> and
- <structname>pgbench_branches</> tables with the given fillfactor.
- Default is 100.
- </entry>
- </row>
- </tbody>
- </tgroup>
- </table>
-
- <table id="pgbench-run-options">
- <title><application>pgbench</application> benchmarking options</title>
- <tgroup cols="2">
- <thead>
- <row>
- <entry>Option</entry>
- <entry>Description</entry>
- </row>
- </thead>
-
- <tbody>
- <row>
- <entry><literal>-c</literal> <replaceable>clients</></entry>
- <entry>
- Number of clients simulated, that is, number of concurrent database
- sessions. Default is 1.
- </entry>
- </row>
- <row>
- <entry><literal>-j</literal> <replaceable>threads</></entry>
- <entry>
- Number of worker threads within <application>pgbench</application>.
- Using more than one thread can be helpful on multi-CPU machines.
- The number of clients must be a multiple of the number of threads,
- since each thread is given the same number of client sessions to manage.
- Default is 1.
- </entry>
- </row>
- <row>
- <entry><literal>-t</literal> <replaceable>transactions</></entry>
- <entry>
- Number of transactions each client runs. Default is 10.
- </entry>
- </row>
- <row>
- <entry><literal>-T</literal> <replaceable>seconds</></entry>
- <entry>
- Run the test for this many seconds, rather than a fixed number of
- transactions per client. <literal>-t</literal> and
- <literal>-T</literal> are mutually exclusive.
- </entry>
- </row>
- <row>
- <entry><literal>-M</literal> <replaceable>querymode</></entry>
- <entry>
- Protocol to use for submitting queries to the server:
- <itemizedlist>
- <listitem>
- <para><literal>simple</>: use simple query protocol.</para>
- </listitem>
- <listitem>
- <para><literal>extended</>: use extended query protocol.</para>
- </listitem>
- <listitem>
- <para><literal>prepared</>: use extended query protocol with prepared statements.</para>
- </listitem>
- </itemizedlist>
- The default is simple query protocol. (See <xref linkend="protocol">
- for more information.)
- </entry>
- </row>
- <row>
- <entry><literal>-N</literal></entry>
- <entry>
- Do not update <structname>pgbench_tellers</> and
- <structname>pgbench_branches</>.
- This will avoid update contention on these tables, but
- it makes the test case even less like TPC-B.
- </entry>
- </row>
- <row>
- <entry><literal>-S</literal></entry>
- <entry>
- Perform select-only transactions instead of TPC-B-like test.
- </entry>
- </row>
- <row>
- <entry><literal>-f</literal> <replaceable>filename</></entry>
- <entry>
- Read transaction script from <replaceable>filename</>.
- See below for details.
- <literal>-N</literal>, <literal>-S</literal>, and <literal>-f</literal>
- are mutually exclusive.
- </entry>
- </row>
- <row>
- <entry><literal>-n</literal></entry>
- <entry>
- Perform no vacuuming before running the test.
- This option is <emphasis>necessary</>
- if you are running a custom test scenario that does not include
- the standard tables <structname>pgbench_accounts</>,
- <structname>pgbench_branches</>, <structname>pgbench_history</>, and
- <structname>pgbench_tellers</>.
- </entry>
- </row>
- <row>
- <entry><literal>-v</literal></entry>
- <entry>
- Vacuum all four standard tables before running the test.
- With neither <literal>-n</> nor <literal>-v</>, pgbench will vacuum the
- <structname>pgbench_tellers</> and <structname>pgbench_branches</>
- tables, and will truncate <structname>pgbench_history</>.
- </entry>
- </row>
- <row>
- <entry><literal>-D</literal> <replaceable>varname</><literal>=</><replaceable>value</></entry>
- <entry>
- Define a variable for use by a custom script (see below).
- Multiple <literal>-D</> options are allowed.
- </entry>
- </row>
- <row>
- <entry><literal>-C</literal></entry>
- <entry>
- Establish a new connection for each transaction, rather than
- doing it just once per client session.
- This is useful to measure the connection overhead.
- </entry>
- </row>
- <row>
- <entry><literal>-l</literal></entry>
- <entry>
- Write the time taken by each transaction to a logfile.
- See below for details.
- </entry>
- </row>
- <row>
- <entry><literal>-s</literal> <replaceable>scale_factor</></entry>
- <entry>
- Report the specified scale factor in <application>pgbench</>'s
- output. With the built-in tests, this is not necessary; the
- correct scale factor will be detected by counting the number of
- rows in the <structname>pgbench_branches</> table. However, when testing
- custom benchmarks (<literal>-f</> option), the scale factor
- will be reported as 1 unless this option is used.
- </entry>
- </row>
- <row>
- <entry><literal>-d</literal></entry>
- <entry>
- Print debugging output.
- </entry>
- </row>
- </tbody>
- </tgroup>
- </table>
-
- <table id="pgbench-common-options">
- <title><application>pgbench</application> common options</title>
- <tgroup cols="2">
- <thead>
- <row>
- <entry>Option</entry>
- <entry>Description</entry>
- </row>
- </thead>
-
- <tbody>
- <row>
- <entry><literal>-h</literal> <replaceable>hostname</></entry>
- <entry>database server's host</entry>
- </row>
- <row>
- <entry><literal>-p</literal> <replaceable>port</></entry>
- <entry>database server's port</entry>
- </row>
- <row>
- <entry><literal>-U</literal> <replaceable>login</></entry>
- <entry>username to connect as</entry>
- </row>
- </tbody>
- </tgroup>
- </table>
- </sect2>
-
- <sect2>
- <title>What is the <quote>transaction</> actually performed in pgbench?</title>
-
- <para>
- The default transaction script issues seven commands per transaction:
- </para>
-
- <orderedlist>
- <listitem><para><literal>BEGIN;</literal></para></listitem>
- <listitem><para><literal>UPDATE pgbench_accounts SET abalance = abalance + :delta WHERE aid = :aid;</literal></para></listitem>
- <listitem><para><literal>SELECT abalance FROM pgbench_accounts WHERE aid = :aid;</literal></para></listitem>
- <listitem><para><literal>UPDATE pgbench_tellers SET tbalance = tbalance + :delta WHERE tid = :tid;</literal></para></listitem>
- <listitem><para><literal>UPDATE pgbench_branches SET bbalance = bbalance + :delta WHERE bid = :bid;</literal></para></listitem>
- <listitem><para><literal>INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (:tid, :bid, :aid, :delta, CURRENT_TIMESTAMP);</literal></para></listitem>
- <listitem><para><literal>END;</literal></para></listitem>
- </orderedlist>
-
- <para>
- If you specify <literal>-N</>, steps 4 and 5 aren't included in the
- transaction. If you specify <literal>-S</>, only the <command>SELECT</> is
- issued.
- </para>
- </sect2>
-
- <sect2>
- <title>Custom Scripts</title>
-
- <para>
- <application>pgbench</application> has support for running custom
- benchmark scenarios by replacing the default transaction script
- (described above) with a transaction script read from a file
- (<literal>-f</literal> option). In this case a <quote>transaction</>
- counts as one execution of a script file. You can even specify
- multiple scripts (multiple <literal>-f</literal> options), in which
- case a random one of the scripts is chosen each time a client session
- starts a new transaction.
- </para>
-
- <para>
- The format of a script file is one SQL command per line; multi-line
- SQL commands are not supported. Empty lines and lines beginning with
- <literal>--</> are ignored. Script file lines can also be
- <quote>meta commands</>, which are interpreted by <application>pgbench</>
- itself, as described below.
- </para>
-
- <para>
- There is a simple variable-substitution facility for script files.
- Variables can be set by the command-line <literal>-D</> option,
- explained above, or by the meta commands explained below.
- In addition to any variables preset by <literal>-D</> command-line options,
- the variable <literal>scale</> is preset to the current scale factor.
- Once set, a variable's
- value can be inserted into a SQL command by writing
- <literal>:</><replaceable>variablename</>. When running more than
- one client session, each session has its own set of variables.
- </para>
-
- <para>
- Script file meta commands begin with a backslash (<literal>\</>).
- Arguments to a meta command are separated by white space.
- These meta commands are supported:
- </para>
-
- <variablelist>
- <varlistentry>
- <term>
- <literal>\set <replaceable>varname</> <replaceable>operand1</> [ <replaceable>operator</> <replaceable>operand2</> ]</literal>
- </term>
-
- <listitem>
- <para>
- Sets variable <replaceable>varname</> to a calculated integer value.
- Each <replaceable>operand</> is either an integer constant or a
- <literal>:</><replaceable>variablename</> reference to a variable
- having an integer value. The <replaceable>operator</> can be
- <literal>+</>, <literal>-</>, <literal>*</>, or <literal>/</>.
- </para>
-
- <para>
- Example:
- <programlisting>
-\set ntellers 10 * :scale
- </programlisting>
- </para>
- </listitem>
- </varlistentry>
-
- <varlistentry>
- <term>
- <literal>\setrandom <replaceable>varname</> <replaceable>min</> <replaceable>max</></literal>
- </term>
-
- <listitem>
- <para>
- Sets variable <replaceable>varname</> to a random integer value
- between the limits <replaceable>min</> and <replaceable>max</> inclusive.
- Each limit can be either an integer constant or a
- <literal>:</><replaceable>variablename</> reference to a variable
- having an integer value.
- </para>
-
- <para>
- Example:
- <programlisting>
-\setrandom aid 1 :naccounts
- </programlisting>
- </para>
- </listitem>
- </varlistentry>
-
- <varlistentry>
- <term>
- <literal>\sleep <replaceable>number</> [ us | ms | s ]</literal>
- </term>
-
- <listitem>
- <para>
- Causes script execution to sleep for the specified duration in
- microseconds (<literal>us</>), milliseconds (<literal>ms</>) or seconds
- (<literal>s</>). If the unit is omitted then seconds are the default.
- <replaceable>number</> can be either an integer constant or a
- <literal>:</><replaceable>variablename</> reference to a variable
- having an integer value.
- </para>
-
- <para>
- Example:
- <programlisting>
-\sleep 10 ms
- </programlisting>
- </para>
- </listitem>
- </varlistentry>
-
- <varlistentry>
- <term>
- <literal>\setshell <replaceable>varname</> <replaceable>command</> [ <replaceable>argument</> ... ]</literal>
- </term>
-
- <listitem>
- <para>
- Sets variable <replaceable>varname</> to the result of the shell command
- <replaceable>command</>. The command must return an integer value
- through its standard output.
- </para>
-
- <para>
- <replaceable>argument</> can be either a text constant or a
- <literal>:</><replaceable>variablename</> reference to a variable of
- any types. If you want to use <replaceable>argument</> starting with
- colons, you need to add an additional colon at the beginning of
- <replaceable>argument</>.
- </para>
-
- <para>
- Example:
- <programlisting>
-\setshell variable_to_be_assigned command literal_argument :variable ::literal_starting_with_colon
- </programlisting>
- </para>
- </listitem>
- </varlistentry>
-
- <varlistentry>
- <term>
- <literal>\shell <replaceable>command</> [ <replaceable>argument</> ... ]</literal>
- </term>
-
- <listitem>
- <para>
- Same as <literal>\setshell</literal>, but the result is ignored.
- </para>
-
- <para>
- Example:
- <programlisting>
-\shell command literal_argument :variable ::literal_starting_with_colon
- </programlisting>
- </para>
- </listitem>
- </varlistentry>
- </variablelist>
-
- <para>
- As an example, the full definition of the built-in TPC-B-like
- transaction is:
-
- <programlisting>
-\set nbranches :scale
-\set ntellers 10 * :scale
-\set naccounts 100000 * :scale
-\setrandom aid 1 :naccounts
-\setrandom bid 1 :nbranches
-\setrandom tid 1 :ntellers
-\setrandom delta -5000 5000
-BEGIN;
-UPDATE pgbench_accounts SET abalance = abalance + :delta WHERE aid = :aid;
-SELECT abalance FROM pgbench_accounts WHERE aid = :aid;
-UPDATE pgbench_tellers SET tbalance = tbalance + :delta WHERE tid = :tid;
-UPDATE pgbench_branches SET bbalance = bbalance + :delta WHERE bid = :bid;
-INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (:tid, :bid, :aid, :delta, CURRENT_TIMESTAMP);
-END;
- </programlisting>
-
- This script allows each iteration of the transaction to reference
- different, randomly-chosen rows. (This example also shows why it's
- important for each client session to have its own variables —
- otherwise they'd not be independently touching different rows.)
- </para>
-
- </sect2>
-
- <sect2>
- <title>Per-transaction logging</title>
-
- <para>
- With the <literal>-l</> option, <application>pgbench</> writes the time
- taken by each transaction to a logfile. The logfile will be named
- <filename>pgbench_log.<replaceable>nnn</></filename>, where
- <replaceable>nnn</> is the PID of the pgbench process.
- The format of the log is:
-
- <programlisting>
- <replaceable>client_id</> <replaceable>transaction_no</> <replaceable>time</> <replaceable>file_no</> <replaceable>time_epoch</> <replaceable>time_us</>
- </programlisting>
-
- where <replaceable>time</> is the elapsed transaction time in microseconds,
- <replaceable>file_no</> identifies which script file was used
- (useful when multiple scripts were specified with <literal>-f</>),
- and <replaceable>time_epoch</>/<replaceable>time_us</> are a
- UNIX epoch format timestamp and an offset
- in microseconds (suitable for creating a ISO 8601
- timestamp with fractional seconds) showing when
- the transaction completed.
- </para>
-
- <para>
- Here are example outputs:
- <programlisting>
- 0 199 2241 0 1175850568 995598
- 0 200 2465 0 1175850568 998079
- 0 201 2513 0 1175850569 608
- 0 202 2038 0 1175850569 2663
- </programlisting>
- </para>
- </sect2>
-
- <sect2>
- <title>Good Practices</title>
-
- <para>
- It is very easy to use <application>pgbench</> to produce completely
- meaningless numbers. Here are some guidelines to help you get useful
- results.
- </para>
-
- <para>
- In the first place, <emphasis>never</> believe any test that runs
- for only a few seconds. Use the <literal>-t</> or <literal>-T</> option
- to make the run last at least a few minutes, so as to average out noise.
- In some cases you could need hours to get numbers that are reproducible.
- It's a good idea to try the test run a few times, to find out if your
- numbers are reproducible or not.
- </para>
-
- <para>
- For the default TPC-B-like test scenario, the initialization scale factor
- (<literal>-s</>) should be at least as large as the largest number of
- clients you intend to test (<literal>-c</>); else you'll mostly be
- measuring update contention. There are only <literal>-s</> rows in
- the <structname>pgbench_branches</> table, and every transaction wants to
- update one of them, so <literal>-c</> values in excess of <literal>-s</>
- will undoubtedly result in lots of transactions blocked waiting for
- other transactions.
- </para>
-
- <para>
- The default test scenario is also quite sensitive to how long it's been
- since the tables were initialized: accumulation of dead rows and dead space
- in the tables changes the results. To understand the results you must keep
- track of the total number of updates and when vacuuming happens. If
- autovacuum is enabled it can result in unpredictable changes in measured
- performance.
- </para>
-
- <para>
- A limitation of <application>pgbench</> is that it can itself become
- the bottleneck when trying to test a large number of client sessions.
- This can be alleviated by running <application>pgbench</> on a different
- machine from the database server, although low network latency will be
- essential. It might even be useful to run several <application>pgbench</>
- instances concurrently, on several client machines, against the same
- database server.
- </para>
- </sect2>
-
-</sect1>
+<!-- $PostgreSQL: pgsql/doc/src/sgml/pgbench.sgml,v 1.13 2010/03/23 01:29:22 itagaki Exp $ -->\r
+\r
+<sect1 id="pgbench">\r
+ <title>pgbench</title>\r
+\r
+ <indexterm zone="pgbench">\r
+ <primary>pgbench</primary>\r
+ </indexterm>\r
+\r
+ <para>\r
+ <application>pgbench</application> is a simple program for running benchmark\r
+ tests on <productname>PostgreSQL</>. It runs the same sequence of SQL\r
+ commands over and over, possibly in multiple concurrent database sessions,\r
+ and then calculates the average transaction rate (transactions per second).\r
+ By default, <application>pgbench</application> tests a scenario that is\r
+ loosely based on TPC-B, involving five <command>SELECT</>,\r
+ <command>UPDATE</>, and <command>INSERT</> commands per transaction.\r
+ However, it is easy to test other cases by writing your own transaction\r
+ script files.\r
+ </para>\r
+\r
+ <para>\r
+ Typical output from pgbench looks like:\r
+\r
+ <programlisting>\r
+transaction type: TPC-B (sort of)\r
+scaling factor: 10\r
+query mode: simple\r
+number of clients: 10\r
+number of threads: 1\r
+number of transactions per client: 1000\r
+number of transactions actually processed: 10000/10000\r
+tps = 85.184871 (including connections establishing)\r
+tps = 85.296346 (excluding connections establishing)\r
+ </programlisting>\r
+\r
+ The first six lines report some of the most important parameter\r
+ settings. The next line reports the number of transactions completed\r
+ and intended (the latter being just the product of number of clients\r
+ and number of transactions per client); these will be equal unless the run\r
+ failed before completion. The last two lines report the TPS rate,\r
+ figured with and without counting the time to start database sessions.\r
+ </para>\r
+\r
+ <sect2>\r
+ <title>Overview</title>\r
+\r
+ <para>\r
+ The default TPC-B-like transaction test requires specific tables to be\r
+ set up beforehand. <application>pgbench</> should be invoked with\r
+ the <literal>-i</> (initialize) option to create and populate these\r
+ tables. (When you are testing a custom script, you don't need this\r
+ step, but will instead need to do whatever setup your test needs.)\r
+ Initialization looks like:\r
+\r
+ <programlisting>\r
+pgbench -i <optional> <replaceable>other-options</> </optional> <replaceable>dbname</>\r
+ </programlisting>\r
+\r
+ where <replaceable>dbname</> is the name of the already-created\r
+ database to test in. (You may also need <literal>-h</>,\r
+ <literal>-p</>, and/or <literal>-U</> options to specify how to\r
+ connect to the database server.)\r
+ </para>\r
+\r
+ <caution>\r
+ <para>\r
+ <literal>pgbench -i</> creates four tables <structname>pgbench_accounts</>,\r
+ <structname>pgbench_branches</>, <structname>pgbench_history</>, and\r
+ <structname>pgbench_tellers</>,\r
+ destroying any existing tables of these names.\r
+ Be very careful to use another database if you have tables having these\r
+ names!\r
+ </para>\r
+ </caution>\r
+\r
+ <para>\r
+ At the default <quote>scale factor</> of 1, the tables initially\r
+ contain this many rows:\r
+ </para>\r
+ <programlisting>\r
+table # of rows\r
+---------------------------------\r
+pgbench_branches 1\r
+pgbench_tellers 10\r
+pgbench_accounts 100000\r
+pgbench_history 0\r
+ </programlisting>\r
+ <para>\r
+ You can (and, for most purposes, probably should) increase the number\r
+ of rows by using the <literal>-s</> (scale factor) option. The\r
+ <literal>-F</> (fillfactor) option might also be used at this point.\r
+ </para>\r
+\r
+ <para>\r
+ Once you have done the necessary setup, you can run your benchmark\r
+ with a command that doesn't include <literal>-i</>, that is\r
+\r
+ <programlisting>\r
+pgbench <optional> <replaceable>options</> </optional> <replaceable>dbname</>\r
+ </programlisting>\r
+\r
+ In nearly all cases, you'll need some options to make a useful test.\r
+ The most important options are <literal>-c</> (number of clients),\r
+ <literal>-t</> (number of transactions), <literal>-T</> (time limit),\r
+ and <literal>-f</> (specify a custom script file).\r
+ See below for a full list.\r
+ </para>\r
+\r
+ <para>\r
+ <xref linkend="pgbench-init-options"> shows options that are used\r
+ during database initialization, while\r
+ <xref linkend="pgbench-run-options"> shows options that are used\r
+ while running benchmarks, and\r
+ <xref linkend="pgbench-common-options"> shows options that are useful\r
+ in both cases.\r
+ </para>\r
+\r
+ <table id="pgbench-init-options">\r
+ <title><application>pgbench</application> initialization options</title>\r
+ <tgroup cols="2">\r
+ <thead>\r
+ <row>\r
+ <entry>Option</entry>\r
+ <entry>Description</entry>\r
+ </row>\r
+ </thead>\r
+\r
+ <tbody>\r
+ <row>\r
+ <entry><literal>-i</literal></entry>\r
+ <entry>\r
+ Required to invoke initialization mode.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-s</literal> <replaceable>scale_factor</></entry>\r
+ <entry>\r
+ Multiply the number of rows generated by the scale factor.\r
+ For example, <literal>-s 100</> will create 10,000,000 rows\r
+ in the <structname>pgbench_accounts</> table. Default is 1.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-F</literal> <replaceable>fillfactor</></entry>\r
+ <entry>\r
+ Create the <structname>pgbench_accounts</>,\r
+ <structname>pgbench_tellers</> and\r
+ <structname>pgbench_branches</> tables with the given fillfactor.\r
+ Default is 100.\r
+ </entry>\r
+ </row>\r
+ </tbody>\r
+ </tgroup>\r
+ </table>\r
+\r
+ <table id="pgbench-run-options">\r
+ <title><application>pgbench</application> benchmarking options</title>\r
+ <tgroup cols="2">\r
+ <thead>\r
+ <row>\r
+ <entry>Option</entry>\r
+ <entry>Description</entry>\r
+ </row>\r
+ </thead>\r
+\r
+ <tbody>\r
+ <row>\r
+ <entry><literal>-c</literal> <replaceable>clients</></entry>\r
+ <entry>\r
+ Number of clients simulated, that is, number of concurrent database\r
+ sessions. Default is 1.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-j</literal> <replaceable>threads</></entry>\r
+ <entry>\r
+ Number of worker threads within <application>pgbench</application>.\r
+ Using more than one thread can be helpful on multi-CPU machines.\r
+ The number of clients must be a multiple of the number of threads,\r
+ since each thread is given the same number of client sessions to manage.\r
+ Default is 1.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-t</literal> <replaceable>transactions</></entry>\r
+ <entry>\r
+ Number of transactions each client runs. Default is 10.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-T</literal> <replaceable>seconds</></entry>\r
+ <entry>\r
+ Run the test for this many seconds, rather than a fixed number of\r
+ transactions per client. <literal>-t</literal> and\r
+ <literal>-T</literal> are mutually exclusive.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-M</literal> <replaceable>querymode</></entry>\r
+ <entry>\r
+ Protocol to use for submitting queries to the server:\r
+ <itemizedlist>\r
+ <listitem>\r
+ <para><literal>simple</>: use simple query protocol.</para>\r
+ </listitem>\r
+ <listitem>\r
+ <para><literal>extended</>: use extended query protocol.</para>\r
+ </listitem>\r
+ <listitem>\r
+ <para><literal>prepared</>: use extended query protocol with prepared statements.</para>\r
+ </listitem>\r
+ </itemizedlist>\r
+ The default is simple query protocol. (See <xref linkend="protocol">\r
+ for more information.)\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-N</literal></entry>\r
+ <entry>\r
+ Do not update <structname>pgbench_tellers</> and\r
+ <structname>pgbench_branches</>.\r
+ This will avoid update contention on these tables, but\r
+ it makes the test case even less like TPC-B.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-S</literal></entry>\r
+ <entry>\r
+ Perform select-only transactions instead of TPC-B-like test.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-f</literal> <replaceable>filename</></entry>\r
+ <entry>\r
+ Read transaction script from <replaceable>filename</>.\r
+ See below for details.\r
+ <literal>-N</literal>, <literal>-S</literal>, and <literal>-f</literal>\r
+ are mutually exclusive.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-n</literal></entry>\r
+ <entry>\r
+ Perform no vacuuming before running the test.\r
+ This option is <emphasis>necessary</>\r
+ if you are running a custom test scenario that does not include\r
+ the standard tables <structname>pgbench_accounts</>,\r
+ <structname>pgbench_branches</>, <structname>pgbench_history</>, and\r
+ <structname>pgbench_tellers</>.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-v</literal></entry>\r
+ <entry>\r
+ Vacuum all four standard tables before running the test.\r
+ With neither <literal>-n</> nor <literal>-v</>, pgbench will vacuum the\r
+ <structname>pgbench_tellers</> and <structname>pgbench_branches</>\r
+ tables, and will truncate <structname>pgbench_history</>.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-D</literal> <replaceable>varname</><literal>=</><replaceable>value</></entry>\r
+ <entry>\r
+ Define a variable for use by a custom script (see below).\r
+ Multiple <literal>-D</> options are allowed.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-C</literal></entry>\r
+ <entry>\r
+ Establish a new connection for each transaction, rather than\r
+ doing it just once per client session.\r
+ This is useful to measure the connection overhead.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-l</literal></entry>\r
+ <entry>\r
+ Write the time taken by each transaction to a logfile.\r
+ See below for details.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-s</literal> <replaceable>scale_factor</></entry>\r
+ <entry>\r
+ Report the specified scale factor in <application>pgbench</>'s\r
+ output. With the built-in tests, this is not necessary; the\r
+ correct scale factor will be detected by counting the number of\r
+ rows in the <structname>pgbench_branches</> table. However, when testing\r
+ custom benchmarks (<literal>-f</> option), the scale factor\r
+ will be reported as 1 unless this option is used.\r
+ </entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-d</literal></entry>\r
+ <entry>\r
+ Print debugging output.\r
+ </entry>\r
+ </row>\r
+ </tbody>\r
+ </tgroup>\r
+ </table>\r
+\r
+ <table id="pgbench-common-options">\r
+ <title><application>pgbench</application> common options</title>\r
+ <tgroup cols="2">\r
+ <thead>\r
+ <row>\r
+ <entry>Option</entry>\r
+ <entry>Description</entry>\r
+ </row>\r
+ </thead>\r
+\r
+ <tbody>\r
+ <row>\r
+ <entry><literal>-h</literal> <replaceable>hostname</></entry>\r
+ <entry>database server's host</entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-p</literal> <replaceable>port</></entry>\r
+ <entry>database server's port</entry>\r
+ </row>\r
+ <row>\r
+ <entry><literal>-U</literal> <replaceable>login</></entry>\r
+ <entry>username to connect as</entry>\r
+ </row>\r
+ </tbody>\r
+ </tgroup>\r
+ </table>\r
+ </sect2>\r
+\r
+ <sect2>\r
+ <title>What is the <quote>transaction</> actually performed in pgbench?</title>\r
+\r
+ <para>\r
+ The default transaction script issues seven commands per transaction:\r
+ </para>\r
+\r
+ <orderedlist>\r
+ <listitem><para><literal>BEGIN;</literal></para></listitem>\r
+ <listitem><para><literal>UPDATE pgbench_accounts SET abalance = abalance + :delta WHERE aid = :aid;</literal></para></listitem>\r
+ <listitem><para><literal>SELECT abalance FROM pgbench_accounts WHERE aid = :aid;</literal></para></listitem>\r
+ <listitem><para><literal>UPDATE pgbench_tellers SET tbalance = tbalance + :delta WHERE tid = :tid;</literal></para></listitem>\r
+ <listitem><para><literal>UPDATE pgbench_branches SET bbalance = bbalance + :delta WHERE bid = :bid;</literal></para></listitem>\r
+ <listitem><para><literal>INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (:tid, :bid, :aid, :delta, CURRENT_TIMESTAMP);</literal></para></listitem>\r
+ <listitem><para><literal>END;</literal></para></listitem>\r
+ </orderedlist>\r
+\r
+ <para>\r
+ If you specify <literal>-N</>, steps 4 and 5 aren't included in the\r
+ transaction. If you specify <literal>-S</>, only the <command>SELECT</> is\r
+ issued.\r
+ </para>\r
+ </sect2>\r
+\r
+ <sect2>\r
+ <title>Custom Scripts</title>\r
+\r
+ <para>\r
+ <application>pgbench</application> has support for running custom\r
+ benchmark scenarios by replacing the default transaction script\r
+ (described above) with a transaction script read from a file\r
+ (<literal>-f</literal> option). In this case a <quote>transaction</>\r
+ counts as one execution of a script file. You can even specify\r
+ multiple scripts (multiple <literal>-f</literal> options), in which\r
+ case a random one of the scripts is chosen each time a client session\r
+ starts a new transaction.\r
+ </para>\r
+\r
+ <para>\r
+ The format of a script file is one SQL command per line; multi-line\r
+ SQL commands are not supported. Empty lines and lines beginning with\r
+ <literal>--</> are ignored. Script file lines can also be\r
+ <quote>meta commands</>, which are interpreted by <application>pgbench</>\r
+ itself, as described below.\r
+ </para>\r
+\r
+ <para>\r
+ There is a simple variable-substitution facility for script files.\r
+ Variables can be set by the command-line <literal>-D</> option,\r
+ explained above, or by the meta commands explained below.\r
+ In addition to any variables preset by <literal>-D</> command-line options,\r
+ the variable <literal>scale</> is preset to the current scale factor.\r
+ Once set, a variable's\r
+ value can be inserted into a SQL command by writing\r
+ <literal>:</><replaceable>variablename</>. When running more than\r
+ one client session, each session has its own set of variables.\r
+ </para>\r
+\r
+ <para>\r
+ Script file meta commands begin with a backslash (<literal>\</>).\r
+ Arguments to a meta command are separated by white space.\r
+ These meta commands are supported:\r
+ </para>\r
+\r
+ <variablelist>\r
+ <varlistentry>\r
+ <term>\r
+ <literal>\set <replaceable>varname</> <replaceable>operand1</> [ <replaceable>operator</> <replaceable>operand2</> ]</literal>\r
+ </term>\r
+\r
+ <listitem>\r
+ <para>\r
+ Sets variable <replaceable>varname</> to a calculated integer value.\r
+ Each <replaceable>operand</> is either an integer constant or a\r
+ <literal>:</><replaceable>variablename</> reference to a variable\r
+ having an integer value. The <replaceable>operator</> can be\r
+ <literal>+</>, <literal>-</>, <literal>*</>, or <literal>/</>.\r
+ </para>\r
+\r
+ <para>\r
+ Example:\r
+ <programlisting>\r
+\set ntellers 10 * :scale\r
+ </programlisting>\r
+ </para>\r
+ </listitem>\r
+ </varlistentry>\r
+\r
+ <varlistentry>\r
+ <term>\r
+ <literal>\setrandom <replaceable>varname</> <replaceable>min</> <replaceable>max</></literal>\r
+ </term>\r
+\r
+ <listitem>\r
+ <para>\r
+ Sets variable <replaceable>varname</> to a random integer value\r
+ between the limits <replaceable>min</> and <replaceable>max</> inclusive.\r
+ Each limit can be either an integer constant or a\r
+ <literal>:</><replaceable>variablename</> reference to a variable\r
+ having an integer value.\r
+ </para>\r
+\r
+ <para>\r
+ Example:\r
+ <programlisting>\r
+\setrandom aid 1 :naccounts\r
+ </programlisting>\r
+ </para>\r
+ </listitem>\r
+ </varlistentry>\r
+\r
+ <varlistentry>\r
+ <term>\r
+ <literal>\sleep <replaceable>number</> [ us | ms | s ]</literal>\r
+ </term>\r
+\r
+ <listitem>\r
+ <para>\r
+ Causes script execution to sleep for the specified duration in\r
+ microseconds (<literal>us</>), milliseconds (<literal>ms</>) or seconds\r
+ (<literal>s</>). If the unit is omitted then seconds are the default.\r
+ <replaceable>number</> can be either an integer constant or a\r
+ <literal>:</><replaceable>variablename</> reference to a variable\r
+ having an integer value.\r
+ </para>\r
+\r
+ <para>\r
+ Example:\r
+ <programlisting>\r
+\sleep 10 ms\r
+ </programlisting>\r
+ </para>\r
+ </listitem>\r
+ </varlistentry>\r
+\r
+ <varlistentry>\r
+ <term>\r
+ <literal>\setshell <replaceable>varname</> <replaceable>command</> [ <replaceable>argument</> ... ]</literal>\r
+ </term>\r
+\r
+ <listitem>\r
+ <para>\r
+ Sets variable <replaceable>varname</> to the result of the shell command\r
+ <replaceable>command</>. The command must return an integer value\r
+ through its standard output.\r
+ </para>\r
+\r
+ <para>\r
+ <replaceable>argument</> can be either a text constant or a\r
+ <literal>:</><replaceable>variablename</> reference to a variable of\r
+ any types. If you want to use <replaceable>argument</> starting with\r
+ colons, you need to add an additional colon at the beginning of\r
+ <replaceable>argument</>.\r
+ </para>\r
+\r
+ <para>\r
+ Example:\r
+ <programlisting>\r
+\setshell variable_to_be_assigned command literal_argument :variable ::literal_starting_with_colon\r
+ </programlisting>\r
+ </para>\r
+ </listitem>\r
+ </varlistentry>\r
+\r
+ <varlistentry>\r
+ <term>\r
+ <literal>\shell <replaceable>command</> [ <replaceable>argument</> ... ]</literal>\r
+ </term>\r
+\r
+ <listitem>\r
+ <para>\r
+ Same as <literal>\setshell</literal>, but the result is ignored.\r
+ </para>\r
+\r
+ <para>\r
+ Example:\r
+ <programlisting>\r
+\shell command literal_argument :variable ::literal_starting_with_colon\r
+ </programlisting>\r
+ </para>\r
+ </listitem>\r
+ </varlistentry>\r
+ </variablelist>\r
+\r
+ <para>\r
+ As an example, the full definition of the built-in TPC-B-like\r
+ transaction is:\r
+\r
+ <programlisting>\r
+\set nbranches :scale\r
+\set ntellers 10 * :scale\r
+\set naccounts 100000 * :scale\r
+\setrandom aid 1 :naccounts\r
+\setrandom bid 1 :nbranches\r
+\setrandom tid 1 :ntellers\r
+\setrandom delta -5000 5000\r
+BEGIN;\r
+UPDATE pgbench_accounts SET abalance = abalance + :delta WHERE aid = :aid;\r
+SELECT abalance FROM pgbench_accounts WHERE aid = :aid;\r
+UPDATE pgbench_tellers SET tbalance = tbalance + :delta WHERE tid = :tid;\r
+UPDATE pgbench_branches SET bbalance = bbalance + :delta WHERE bid = :bid;\r
+INSERT INTO pgbench_history (tid, bid, aid, delta, mtime) VALUES (:tid, :bid, :aid, :delta, CURRENT_TIMESTAMP);\r
+END;\r
+ </programlisting>\r
+\r
+ This script allows each iteration of the transaction to reference\r
+ different, randomly-chosen rows. (This example also shows why it's\r
+ important for each client session to have its own variables —\r
+ otherwise they'd not be independently touching different rows.)\r
+ </para>\r
+\r
+ </sect2>\r
+\r
+ <sect2>\r
+ <title>Per-transaction logging</title>\r
+\r
+ <para>\r
+ With the <literal>-l</> option, <application>pgbench</> writes the time\r
+ taken by each transaction to a logfile. The logfile will be named\r
+ <filename>pgbench_log.<replaceable>nnn</></filename>, where\r
+ <replaceable>nnn</> is the PID of the pgbench process.\r
+ If the <literal>-j</> option is 2 or higher, creating multiple worker\r
+ threads, each will have its own log file. The first worker will use the\r
+ the same name for its log file as in the standard single worker case. \r
+ The additional log files for the other workers will be named\r
+ <filename>pgbench_log.<replaceable>nnn</>.<replaceable>mmm</></filename>,\r
+ where <replaceable>mmm</> is a sequential number for each worker starting\r
+ with 1.\r
+ </para>\r
+\r
+ <para>\r
+ The format of the log is:\r
+\r
+ <programlisting>\r
+ <replaceable>client_id</> <replaceable>transaction_no</> <replaceable>time</> <replaceable>file_no</> <replaceable>time_epoch</> <replaceable>time_us</>\r
+ </programlisting>\r
+\r
+ where <replaceable>time</> is the elapsed transaction time in microseconds,\r
+ <replaceable>file_no</> identifies which script file was used\r
+ (useful when multiple scripts were specified with <literal>-f</>),\r
+ and <replaceable>time_epoch</>/<replaceable>time_us</> are a\r
+ UNIX epoch format timestamp and an offset\r
+ in microseconds (suitable for creating a ISO 8601\r
+ timestamp with fractional seconds) showing when\r
+ the transaction completed.\r
+ </para>\r
+\r
+ <para>\r
+ Here are example outputs:\r
+ <programlisting>\r
+ 0 199 2241 0 1175850568 995598\r
+ 0 200 2465 0 1175850568 998079\r
+ 0 201 2513 0 1175850569 608\r
+ 0 202 2038 0 1175850569 2663\r
+ </programlisting>\r
+ </para>\r
+ </sect2>\r
+\r
+ <sect2>\r
+ <title>Good Practices</title>\r
+\r
+ <para>\r
+ It is very easy to use <application>pgbench</> to produce completely\r
+ meaningless numbers. Here are some guidelines to help you get useful\r
+ results.\r
+ </para>\r
+\r
+ <para>\r
+ In the first place, <emphasis>never</> believe any test that runs\r
+ for only a few seconds. Use the <literal>-t</> or <literal>-T</> option\r
+ to make the run last at least a few minutes, so as to average out noise.\r
+ In some cases you could need hours to get numbers that are reproducible.\r
+ It's a good idea to try the test run a few times, to find out if your\r
+ numbers are reproducible or not.\r
+ </para>\r
+\r
+ <para>\r
+ For the default TPC-B-like test scenario, the initialization scale factor\r
+ (<literal>-s</>) should be at least as large as the largest number of\r
+ clients you intend to test (<literal>-c</>); else you'll mostly be\r
+ measuring update contention. There are only <literal>-s</> rows in\r
+ the <structname>pgbench_branches</> table, and every transaction wants to\r
+ update one of them, so <literal>-c</> values in excess of <literal>-s</>\r
+ will undoubtedly result in lots of transactions blocked waiting for\r
+ other transactions.\r
+ </para>\r
+\r
+ <para>\r
+ The default test scenario is also quite sensitive to how long it's been\r
+ since the tables were initialized: accumulation of dead rows and dead space\r
+ in the tables changes the results. To understand the results you must keep\r
+ track of the total number of updates and when vacuuming happens. If\r
+ autovacuum is enabled it can result in unpredictable changes in measured\r
+ performance.\r
+ </para>\r
+\r
+ <para>\r
+ A limitation of <application>pgbench</> is that it can itself become\r
+ the bottleneck when trying to test a large number of client sessions.\r
+ This can be alleviated by running <application>pgbench</> on a different\r
+ machine from the database server, although low network latency will be\r
+ essential. It might even be useful to run several <application>pgbench</>\r
+ instances concurrently, on several client machines, against the same\r
+ database server.\r
+ </para>\r
+ </sect2>\r
+\r
+</sect1>\r