Bo Peng [Wed, 18 Nov 2020 05:11:51 +0000 (14:11 +0900)]
Prepare 3.7.17.
Bo Peng [Wed, 18 Nov 2020 05:07:54 +0000 (14:07 +0900)]
Doc: update doc version.
Bo Peng [Tue, 17 Nov 2020 08:23:53 +0000 (17:23 +0900)]
Doc: add release note.
Bo Peng [Tue, 17 Nov 2020 03:45:37 +0000 (12:45 +0900)]
Fix query rewrite syntax error of "INSERT ... ON CONFLICT" in native replication mode.
per bug 654.
Bo Peng [Tue, 17 Nov 2020 02:11:19 +0000 (11:11 +0900)]
Doc: update "Aurora Configuration Example".
Tatsuo Ishii [Thu, 29 Oct 2020 20:59:23 +0000 (05:59 +0900)]
Fix connection count when query is canceled.
Connection counter was not counted down when a query is canceled.
Per bug 656.
Tatsuo Ishii [Sat, 3 Oct 2020 01:29:30 +0000 (10:29 +0900)]
Doc: mention that GSSAPI is not supported.
Tatsuo Ishii [Sat, 3 Oct 2020 00:09:41 +0000 (09:09 +0900)]
Deal with GSSAPI encrypted connection request from frontend.
If a GSSAPI connection request came from a frontend, Pgpool-II responded
in an inappropriate way. For example,
psql: error: could not connect to server: server closed the connection unexpectedly
This probably means the server terminated abnormally
before or while processing the request.
or even:
FATAL: client authentication failed
DETAIL: no pool_hba.conf entry for host "x.x.x.x", user "", database "", SSL off
(https://www.pgpool.net/pipermail/pgpool-general/2020-September/007353.html)
With this commit Pgpool-II responds better, i.e. it replies to the
frontend with "we don't support GSSAPI" and expects that the frontend
falls back to an SSL connection (if the frontend's gssencmode parameter
is "prefer" (the default) and SSL was configured when Pgpool-II was
built) or falls back to a non-SSL connection.
Patch by me, reviewed and tested by Umar Hayat.
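The negotiation described in this commit can be sketched roughly as follows. This is a minimal illustration, not Pgpool-II's actual code: the function name negotiation_reply and the ssl_available flag are hypothetical. The request codes and the single-byte replies ('N' for "not supported", 'S' for "SSL accepted") are those of the PostgreSQL frontend/backend protocol.

```c
#include <assert.h>
#include <stdint.h>

/* Request codes defined by the PostgreSQL frontend/backend protocol. */
#define SSL_REQUEST_CODE    80877103u   /* (1234 << 16) | 5679 */
#define GSSENC_REQUEST_CODE 80877104u   /* (1234 << 16) | 5680 */

/*
 * Hypothetical helper: choose the single-byte reply to an encryption
 * negotiation request. Returns 'N' (not supported), 'S' (SSL accepted),
 * or 0 if the code is not a negotiation request at all.
 */
static char
negotiation_reply(uint32_t request_code, int ssl_available)
{
    assert(ssl_available == 0 || ssl_available == 1);

    if (request_code == GSSENC_REQUEST_CODE)
        return 'N';     /* GSSAPI encryption is not supported */
    if (request_code == SSL_REQUEST_CODE)
        return ssl_available ? 'S' : 'N';
    return 0;           /* ordinary startup packet, handled elsewhere */
}
```

After an 'N' reply to the GSSAPI request, a frontend using gssencmode=prefer is expected to retry with an SSL request or a plain startup packet.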
Tatsuo Ishii [Wed, 30 Sep 2020 04:44:20 +0000 (13:44 +0900)]
Doc: fix usable versions of PostgreSQL.
Make usable versions of PostgreSQL to be 7.4 or later.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-September/003836.html
Bo Peng [Wed, 16 Sep 2020 08:20:46 +0000 (17:20 +0900)]
Doc: add more release note.
Bo Peng [Wed, 16 Sep 2020 08:29:04 +0000 (17:29 +0900)]
Doc: fix the incorrect description regarding the running modes of Pgpool-II in which online recovery is available.
Bo Peng [Wed, 16 Sep 2020 02:04:53 +0000 (11:04 +0900)]
Prepare 3.7.16.
Bo Peng [Wed, 16 Sep 2020 01:46:37 +0000 (10:46 +0900)]
Doc: Add release note 3.5.27 - 4.1.4.
Tatsuo Ishii [Sun, 13 Sep 2020 23:09:18 +0000 (08:09 +0900)]
Remove unnecessary checks in some code path.
Patch contributed by Hou, Zhijie.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-September/003805.html
Tatsuo Ishii [Sun, 13 Sep 2020 11:14:42 +0000 (20:14 +0900)]
Follow the guideline of PostgreSQL 12.4 regarding extensions.
CREATE OR REPLACE FUNCTION should be avoided. Use CREATE FUNCTION instead.
Tatsuo Ishii [Sun, 6 Sep 2020 10:52:56 +0000 (19:52 +0900)]
Fix relcache query sometimes sent to other than primary.
In streaming replication mode, relcache queries are supposed to be
sent to the primary node. But actually they were not sent to the primary
node if primary node was not the master node. Typically this could
happen when the primary is not node 0.
Tatsuo Ishii [Tue, 1 Sep 2020 03:40:18 +0000 (12:40 +0900)]
Fix connection_life_time not working when serialize_accept is enabled.
If serialize_accept is enabled, pgpool child process tries to acquire
semaphore locking so that there's only one process which can issue
accept(2). Unfortunately if connection_life_time is enabled, an alarm
is set right before the semaphore locking. So when the alarm fired,
nothing happened because the process was looping to acquire the
semaphore lock in pool_semaphore_lock().
To fix this, a new function pool_semaphore_lock_allow_interrupt() is
introduced, which returns immediately if interrupted by a signal. The
caller, wait_for_new_connections(), checks whether the
connection_life_time alarm has fired. If so, it calls backend_timer()
to close expired backend connection caches.
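The difference between the two locking styles can be sketched as below. This is only an illustration under the assumption of System V semaphores (semop(2)); the function names, the alarm handler, and wait_for_lock() are hypothetical stand-ins, not the Pgpool-II functions named above.

```c
#define _XOPEN_SOURCE 700       /* for sigaction() and System V IPC */
#include <assert.h>
#include <errno.h>
#include <signal.h>
#include <sys/types.h>
#include <sys/ipc.h>
#include <sys/sem.h>
#include <unistd.h>

static volatile sig_atomic_t alarm_fired = 0;

static void
on_alarm(int signo)
{
    (void) signo;
    alarm_fired = 1;
}

/* Decrement semaphore 0 of the set; blocks while its value is 0. */
static int
sem_down(int semid)
{
    struct sembuf op;

    op.sem_num = 0;
    op.sem_op = -1;
    op.sem_flg = 0;
    return semop(semid, &op, 1);
}

/* Retrying style: EINTR is swallowed, so a fired alarm goes unnoticed. */
static int
semaphore_lock(int semid)
{
    while (sem_down(semid) < 0)
        if (errno != EINTR)
            return -1;
    return 0;
}

/* Interruptible style: returns -1 with errno == EINTR on a signal. */
static int
semaphore_lock_allow_interrupt(int semid)
{
    return sem_down(semid);
}

/* Returns 1 if the lock was taken, 0 if the timer expired first, -1 on error. */
static int
wait_for_lock(int semid, unsigned int life_time_sec)
{
    struct sigaction sa;
    int rc, saved_errno;

    assert(life_time_sec > 0);
    sa.sa_handler = on_alarm;
    sigemptyset(&sa.sa_mask);
    sa.sa_flags = 0;
    if (sigaction(SIGALRM, &sa, NULL) < 0)
        return -1;

    alarm_fired = 0;
    alarm(life_time_sec);       /* timer is armed before taking the lock */
    rc = semaphore_lock_allow_interrupt(semid);
    saved_errno = errno;
    alarm(0);
    if (rc == 0)
        return 1;
    if (saved_errno == EINTR && alarm_fired)
        return 0;               /* caller can now close expired connections */
    return -1;
}
```

With the retrying style in place of semaphore_lock_allow_interrupt(), wait_for_lock() would simply go back to sleep after the alarm, which is the symptom described in this commit.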
Discussion: https://www.pgpool.net/pipermail/pgpool-general/2020-August/007233.html
Bo Peng [Mon, 31 Aug 2020 01:56:30 +0000 (10:56 +0900)]
Add .gitignore file.
Tatsuo Ishii [Sun, 30 Aug 2020 02:35:14 +0000 (11:35 +0900)]
Display more informative error message in authentication process.
When backends mistakenly offer different authentication methods,
pgpool just showed:
"unable to read message length"
"message length (%d) in slot %d does not match with slot 0(%d)", length, i, length0)));
because pool_read_message_length() called ereport(ERROR) in this
case. Actually the caller pool_do_auth() prepared more informative
message:
ereport(ERROR,
(errmsg("invalid authentication packet from backend"),
errdetail("failed to get the authentication packet length"),
errhint("This is likely caused by the inconsistency of auth method among DB nodes. \
Please check the previous error messages (hint: length field) \
from pool_read_message_length and recheck the pg_hba.conf settings.")));
Change ereport(ERROR) to ereport(LOG) in pool_read_message_length() so
that the informative message actually shows up.
Bo Peng [Mon, 24 Aug 2020 12:39:38 +0000 (21:39 +0900)]
Replace "PGBIN" and "LPATH" in pgpool_setup and watchdog_setup using PostgreSQL's bin path and lib path.
Tatsuo Ishii [Fri, 21 Aug 2020 10:51:04 +0000 (19:51 +0900)]
Fix segfault in pgpool child process in certain case.
When all backends go down, pgpool refuses to accept connections from
clients and tries to send a message in
validate_backend_connectivity(). For this purpose it connects to the
client and tries to read the startup packet. This is done in a PG_TRY
block. The startup packet is read into memory pointed to by "sp", which
is declared as "StartupPacket *volatile". This is fine. But the value
was never initialized with NULL. As a result, if reading the
startup packet fails, sp would be a garbage pointer and pgpool segfaults
later in pool_free_startup_packet(sp).
The fix is to initialize "sp" with NULL.
I have found this accidentally in following way:
1) shutdown all backends.
2) connect to pgpool with invalid client. I have used pcp_attach_node
with pgpool's port number, not pcp's.
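The failure mode and the fix can be sketched with plain setjmp/longjmp standing in for PG_TRY and ereport(ERROR). Everything here is a hypothetical stand-in (the StartupPacket layout, read_startup_packet(), free_startup_packet(), validate_client()); only the idea of initializing the volatile pointer with NULL comes from the commit message.

```c
#include <assert.h>
#include <setjmp.h>
#include <stdlib.h>

/* Hypothetical stand-ins for Pgpool-II's types and error handling. */
typedef struct { int len; } StartupPacket;
static jmp_buf error_jump;          /* plays the role of PG_TRY/PG_CATCH */

static StartupPacket *
read_startup_packet(int simulate_failure)
{
    StartupPacket *sp;

    if (simulate_failure)
        longjmp(error_jump, 1);     /* like ereport(ERROR) inside PG_TRY */
    sp = malloc(sizeof(*sp));
    if (sp != NULL)
        sp->len = 8;
    return sp;
}

static void
free_startup_packet(StartupPacket *sp)
{
    free(sp);                       /* free(NULL) is a no-op */
}

/* Returns 1 if the packet was read, 0 if reading failed. */
static int
validate_client(int simulate_failure)
{
    StartupPacket *volatile sp = NULL;  /* the fix: never left uninitialized */
    volatile int ok = 0;

    if (setjmp(error_jump) == 0)
    {
        sp = read_startup_packet(simulate_failure);
        ok = (sp != NULL);
    }
    assert(ok == (sp != NULL));     /* sp is either NULL or a valid packet */
    free_startup_packet(sp);        /* safe on both paths */
    return ok;
}
```

Without the "= NULL" initializer, the error path would hand an indeterminate pointer to free_startup_packet(), which matches the crash described above.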
Bo Peng [Thu, 20 Aug 2020 05:51:05 +0000 (14:51 +0900)]
Fix pcp_unix_domain_path.patch error.
Bo Peng [Wed, 19 Aug 2020 15:24:29 +0000 (00:24 +0900)]
Fix pgpool.spec.
Bo Peng [Wed, 19 Aug 2020 12:47:08 +0000 (21:47 +0900)]
Prepare 3.7.15.
Bo Peng [Tue, 18 Aug 2020 02:07:46 +0000 (11:07 +0900)]
Doc: add release note.
Tatsuo Ishii [Tue, 18 Aug 2020 03:05:58 +0000 (12:05 +0900)]
Doc: mention that pgpool_setup requires passwordless ssh login to localhost to be configured.
Bo Peng [Mon, 17 Aug 2020 07:46:10 +0000 (16:46 +0900)]
Doc: fix indent.
Bo Peng [Mon, 17 Aug 2020 05:58:41 +0000 (14:58 +0900)]
Doc: change repository rpm urls.
Bo Peng [Mon, 17 Aug 2020 04:46:36 +0000 (13:46 +0900)]
Use pg_config to get PostgreSQL lib path and add this path to LD_LIBRARY_PATH environment variable in regress.sh.
Bo Peng [Sun, 16 Aug 2020 04:15:56 +0000 (13:15 +0900)]
Fix pgpool_setup to get correct PostgreSQL version.
Tatsuo Ishii [Sat, 15 Aug 2020 09:14:11 +0000 (18:14 +0900)]
Fix pgpool_setup so that native replication mode and SI mode work with PostgreSQL 12.
In these modes, it created recovery.conf, which is not allowed in
PostgreSQL 12.
Tatsuo Ishii [Sat, 15 Aug 2020 03:37:39 +0000 (12:37 +0900)]
Fix oversight in connection_life_time fix.
Commit
39eb90fe199d03253b99bec38d85c40f66aa55ab did not consider the case
where a node is down and its connection struct is NULL. As a result, a segfault occurred.
Tatsuo Ishii [Fri, 14 Aug 2020 05:15:22 +0000 (14:15 +0900)]
Check panic or fatal error by using 'V' token if available.
To identify panic or fatal message in is_panic_or_fatal_error(), 'S'
token was used. However 'S' could be localized and simply comparing
"PANIC" or "FATAL" with the message may not work. Instead 'V' should be
used if possible because it's not localized.
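A minimal sketch of this check follows. The helper names are hypothetical and this is not the Pgpool-II implementation; it relies only on the protocol's ErrorResponse layout, in which each field is a one-byte type code followed by a NUL-terminated string and a zero byte ends the list.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical helper: find the value of the field with the given type
 * code in an ErrorResponse body, or NULL if it is absent.
 */
static const char *
find_field(const char *body, char code)
{
    assert(body != NULL);
    while (*body != '\0')
    {
        const char *value = body + 1;

        if (*body == code)
            return value;
        body = value + strlen(value) + 1;   /* skip to the next field */
    }
    return NULL;
}

static int
is_panic_or_fatal(const char *body)
{
    /* Prefer 'V' (never localized); fall back to 'S' if 'V' is absent. */
    const char *severity = find_field(body, 'V');

    if (severity == NULL)
        severity = find_field(body, 'S');
    return severity != NULL &&
        (strcmp(severity, "PANIC") == 0 || strcmp(severity, "FATAL") == 0);
}
```

With a localized 'S' value the comparison against "FATAL" would fail, but the 'V' field still matches, which is the point of the change.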
Bo Peng [Fri, 14 Aug 2020 02:18:25 +0000 (11:18 +0900)]
Fix 031.connection_life_time regression test failure.
Bo Peng [Thu, 13 Aug 2020 06:35:34 +0000 (15:35 +0900)]
Before running regression test clean the remaining processes and sockets.
Tatsuo Ishii [Wed, 12 Aug 2020 11:17:44 +0000 (20:17 +0900)]
Doc: mention that ssl_ciphers only affects TLS 1.2 and lower.
The changes are imported from PostgreSQL manual.
Bo Peng [Wed, 12 Aug 2020 07:07:04 +0000 (16:07 +0900)]
At pgpool startup, if pid file exists, truncate pid file to zero length before writing.
Patch is created by maliangzhu.
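One way to get this effect is to open the file with O_TRUNC, as sketched below. The function name write_pid_file() is hypothetical and the actual patch may use a different call (for example ftruncate()); the sketch only shows why truncation matters when the new pid is shorter than the stale one.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <fcntl.h>
#include <stdio.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical sketch: write a pid to `path`, discarding any stale
 * content first. Returns 0 on success, -1 on failure.
 */
static int
write_pid_file(const char *path, long pid)
{
    char buf[32];
    int fd, len;

    assert(path != NULL);
    /* O_TRUNC drops leftover bytes from an older, longer pid. */
    fd = open(path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (fd < 0)
        return -1;
    len = snprintf(buf, sizeof(buf), "%ld\n", pid);
    if (write(fd, buf, (size_t) len) != (ssize_t) len)
    {
        close(fd);
        return -1;
    }
    return close(fd);
}
```

Without truncation, writing "42" over a file that held "1234567" would leave trailing digits from the old pid behind.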
Bo Peng [Tue, 11 Aug 2020 14:58:51 +0000 (23:58 +0900)]
Doc: fix failback_command documentation error.
Bo Peng [Tue, 11 Aug 2020 09:40:34 +0000 (18:40 +0900)]
Make RPMs to include pcp.conf and pool_hba.conf sample files.
Tatsuo Ishii [Tue, 11 Aug 2020 01:26:43 +0000 (10:26 +0900)]
Fix connection_life_time does not work.
If master node is not 0 (this could happen in the case when primary
node is not 0 in streaming replication mode for example), pgpool
failed to find connection_life_time timer because the timer was set
only in virtual_master_node, which could be changed after session ends
since query context is not there any more.
To fix this, connection_life_time timer is set to all valid nodes.
Regression test is also added.
Discussion: https://www.pgpool.net/pipermail/pgpool-general/2020-August/007242.html
Tatsuo Ishii [Fri, 7 Aug 2020 00:43:34 +0000 (09:43 +0900)]
Revert "Fix pgpool_setup problem with -r option and PostgreSQL 12."
This reverts commit
566a48e8c5a1ed8efbcef143322ac1bea54882f9.
Since pg_rewind is supported in Pgpool-II 3.7 or before, the fix is
not necessary.
Tatsuo Ishii [Wed, 5 Aug 2020 05:42:18 +0000 (14:42 +0900)]
Fix query cache bug.
If EXPLAIN ANALYZE takes data-modifying SQL, it must invalidate query
cache, but actually it was missed.
Author: Hou, Zhijie
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-August/003767.html
Tatsuo Ishii [Wed, 5 Aug 2020 01:47:06 +0000 (10:47 +0900)]
Fix pgpool_setup problem with -r option and PostgreSQL 12.
pgpool_setup -r uses pg_rewind if it's possible. After rewinding,
pgpool_setup failed to create standby.signal file, which is required in
PostgreSQL 12 or later, on the recovered target node. As a result, the
recovered node booted up as a primary server, rather than a standby
server.
Discussion: https://www.pgpool.net/mantisbt/view.php?id=624
Bo Peng [Fri, 31 Jul 2020 00:45:39 +0000 (09:45 +0900)]
Change PCP UNIX_DOMAIN_PATH of RPM package to "/var/run/postgresql".
Tatsuo Ishii [Tue, 28 Jul 2020 02:36:11 +0000 (11:36 +0900)]
Doc: clarify what the command actually does.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-July/003744.html
Reviewed-by: Bo Peng
Bo Peng [Mon, 27 Jul 2020 02:40:25 +0000 (11:40 +0900)]
Rename src/redhat/pgpool_rhel7.sysconfig to src/redhat/pgpool_rhel.sysconfig to make this file available on RHEL8/CentOS8.
Bo Peng [Wed, 22 Jul 2020 01:23:13 +0000 (10:23 +0900)]
Fix typos.
Patch is created by Lu, Chenyang.
Tatsuo Ishii [Thu, 16 Jul 2020 10:46:33 +0000 (19:46 +0900)]
Doc: fix typo in memqcache doc.
Patch contributed by Hou, Zhijie.
Discussion: https://www.pgpool.net/pipermail/pgpool-general/2020-July/007204.html
Tatsuo Ishii [Tue, 14 Jul 2020 13:07:59 +0000 (22:07 +0900)]
Prevent data-modifying CTEs from being cached.
Data modifying CTE was mistakenly treated as normal read only CTE and
result query was created. As a result subsequent CTE was not
executed.
Problem reported and patch created by Hou, Zhijie.
Subtle changes to the regression test by me.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-July/003705.html
Tatsuo Ishii [Thu, 9 Jul 2020 00:11:03 +0000 (09:11 +0900)]
Fix pgpool hang in a corner case.
It is possible that an "out of band" message from backend has been
read into buffer at the time when a ready for query message is
processed. If the messages are from all backends, there should be no
problem because ProcessBackendResponse() will read the messages from
all backends by using read_kind_from_backend(). However there could be
a corner case: 1) If the message is coming from only one of backend
(this could happen when recovery conflict or backend receiving SIGTERM
and so on) and 2) the message is already in the backend read
buffer. In this case pgpool will hang in pool_read() called by
read_kind_from_backend() at either: 1)
read_kind_from_one_backend(frontend, backend, (char *) &kind,
MASTER_NODE_ID) (the message is not coming from master backend) or 2)
pool_read(CONNECTION(backend, i), &kind, 1) (the message is not coming
from a backend other than master).
Note: if the message is not in the buffer, there should be no problem
since read_packets_and_process() will take care of such "out of band"
messages.
The solution is to read and discard such a message in ReadyForQuery(),
emitting a log, to make sure that the read buffer is empty after
returning from ReadyForQuery(). (Remember that unless the ready for
query message is returned to the frontend, the frontend will not issue
the next query and there should be no response from the backend except
the out of band messages.)
If the message was FATAL, the backend will disconnect from pgpool. So
next time pgpool should notice that the connection is closed anyway.
For the master branch, probably we should treat that kind of FATAL
message in a same way as read_packets_and_process() already does. This
requires some code refactoring and I would like to leave the job
separated from this commit.
Tatsuo Ishii [Wed, 8 Jul 2020 01:25:12 +0000 (10:25 +0900)]
Fix per_node_error_log() so that it respects unread parameter.
per_node_error_log() unconditionally set the "unread" parameter to
true when it should have respected the unread parameter given by the caller.
It seems all callers of per_node_error_log() set the unread parameter
to true anyway. The only exception is pool_do_auth. As far as the
03[0-4] regression tests show, the change is ok.
In summary this change will not affect Pgpool-II, but a bug is a bug.
Tatsuo Ishii [Tue, 7 Jul 2020 03:35:10 +0000 (12:35 +0900)]
Fix lazy coding of ereport in pgpool_adm extension.
Previously ereport's "domain" parameters were just coded as "0".
This generated lots of compiler warnings. To fix this, give plausible
error codes to each ereport call.
Tatsuo Ishii [Mon, 29 Jun 2020 23:07:12 +0000 (08:07 +0900)]
Doc: fix typo in the explanation of pgpool_adm.
Patch provided by sherlockcpp.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-June/003679.html
Tatsuo Ishii [Mon, 29 Jun 2020 00:35:23 +0000 (09:35 +0900)]
Fix typo in query cache module.
Tatsuo Ishii [Fri, 26 Jun 2020 11:46:33 +0000 (20:46 +0900)]
Fix startup packet reading process.
If there are parameters other than "user", "database" and
"application_name", reading startup packet resulted in failing to
extract user: "no PostgreSQL user name specified in startup
packet". This was found while a frontend was trying to speak
replication protocol by using "replication=database".
Discussion: https://www.pgpool.net/pipermail/pgpool-general/2020-June/007162.html
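A parser that tolerates extra parameters can be sketched as below. The function startup_param() is hypothetical and not the Pgpool-II code; it assumes only the protocol's startup packet layout, where the parameter area is a list of NUL-terminated name/value pairs ended by an empty name.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Hypothetical sketch: look up `key` in the parameter area of a startup
 * packet. `len` is the size of the parameter area in bytes. Returns the
 * value, or NULL if the key is absent or the area is malformed.
 */
static const char *
startup_param(const char *params, size_t len, const char *key)
{
    const char *p = params;
    const char *end = params + len;

    assert(params != NULL && key != NULL);
    while (p < end && *p != '\0')
    {
        const char *name = p;
        const char *value;

        p = memchr(p, '\0', (size_t) (end - p));
        if (p == NULL || p + 1 >= end)
            return NULL;            /* malformed: name without a value */
        value = ++p;
        p = memchr(p, '\0', (size_t) (end - p));
        if (p == NULL)
            return NULL;            /* malformed: unterminated value */
        p++;
        if (strcmp(name, key) == 0)
            return value;           /* unknown names are simply skipped */
    }
    return NULL;
}
```

Because every pair is consumed as a unit, a leading "replication=database" pair no longer prevents "user" from being found.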
Takuma Hoshiai [Mon, 22 Jun 2020 02:41:54 +0000 (11:41 +0900)]
Fix miscount of connection_count
There is a possibility that pgpool counts down connections twice
when a query is canceled by something such as the pg_terminate_backend() command.
Tatsuo Ishii [Thu, 4 Jun 2020 02:32:33 +0000 (11:32 +0900)]
Merge branch 'V3_7_STABLE' of ssh://git.postgresql.org/pgpool2 into V3_7_STABLE
Tatsuo Ishii [Thu, 4 Jun 2020 02:29:33 +0000 (11:29 +0900)]
Fix segmentation fault when application name is included in log_line_prefix.
In Pgpool-II 4.1 or before log_line_prefix unconditionally tried to
fetch the application name from backend info (slots array).
Unfortunately in certain corner cases this was not possible. When
connection_cache is off, pgpool resets connection slots
(session->backend->slots[]) at the time when a client sends a termination
request to pgpool. If log_min_messages is DEBUG5, pgpool wants to emit
this log message:
DEBUG: RESET ALL CONFIG VARIABLE
which caused a segfault because pgpool tried to dereference a NULL pointer.
Fix is, if session->backend->slots[] is NULL, do not try to fetch
application name from the variable.
Per bug 615.
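The guard can be sketched as follows. The structures are heavily simplified, hypothetical stand-ins for Pgpool-II's session and backend data, and app_name_for_log() is an invented name; the sketch only illustrates checking for NULL before reading the application name.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, heavily simplified stand-ins for Pgpool-II structures. */
typedef struct { const char *application_name; } Slot;
typedef struct { Slot **slots; } Backend;
typedef struct { Backend *backend; } Session;

/* Return the application name for the log prefix, or "" if unavailable. */
static const char *
app_name_for_log(const Session *session, int node)
{
    assert(node >= 0);
    if (session == NULL || session->backend == NULL ||
        session->backend->slots == NULL ||
        session->backend->slots[node] == NULL ||
        session->backend->slots[node]->application_name == NULL)
        return "";      /* slots already reset: log without the name */
    return session->backend->slots[node]->application_name;
}
```

A log line emitted after the slots have been reset then simply carries an empty application name instead of crashing.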
Takuma Hoshiai [Tue, 2 Jun 2020 08:08:00 +0000 (17:08 +0900)]
Fix incorrect execution of failover process
This problem happens when executing the pg_terminate_backend command in native replication mode.
In addition, add regression test for pg_terminate_backend command.
Tatsuo Ishii [Wed, 27 May 2020 06:24:07 +0000 (15:24 +0900)]
Fix 004.watchdog test crash on IBM Z hardware.
When watchdog copies primary node id on the master watchdog node, it
did not consider the case that primary node id on the shared memory
(Req_info->primary_node_id) is remaining in the initial value (-2),
which causes out of range subscript access to backend info
array. Interestingly accessing array[-2] does not crash on Intel
architecture but does crash on IBM Z hardware. Anyway the reason why the
value remains in -2 is that the regression test is performed in raw
mode. I think the code block handling the primary node id should only
be executed in streaming or logical replication mode.
Bug report and patch provided by gregn123, slightly modified by me.
Mantis bug report: https://www.pgpool.net/mantisbt/view.php?id=614
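The out-of-range access can be illustrated with a defensive lookup like the one below. Note that this is a different technique from the actual fix, which restricts the code block to streaming or logical replication mode; the names primary_backend(), BackendInfo, and NUM_BACKENDS are hypothetical.

```c
#include <assert.h>
#include <stddef.h>

#define NUM_BACKENDS 4          /* hypothetical array size */

typedef struct { int status; } BackendInfo;     /* hypothetical */

/*
 * Return the backend entry for the primary node, or NULL when the id is
 * still at its initial value (-2 in the commit above) or otherwise out
 * of range.
 */
static BackendInfo *
primary_backend(BackendInfo *backends, int primary_node_id)
{
    assert(backends != NULL);
    if (primary_node_id < 0 || primary_node_id >= NUM_BACKENDS)
        return NULL;            /* never evaluate backends[-2] */
    return &backends[primary_node_id];
}
```

Indexing with -2 is undefined behavior in C, which is why it may appear to work on one architecture and crash on another.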
Bo Peng [Fri, 22 May 2020 07:32:27 +0000 (16:32 +0900)]
Doc: Improve the description of wd_priority.
Patch is provided by Kenichiro Tanaka.
Tatsuo Ishii [Fri, 22 May 2020 01:58:48 +0000 (10:58 +0900)]
Add mention about hostssl/hostnossl to pool_hba.conf sample file.
Although hostssl/hostnossl are supported, the pool_hba.conf comments did
not mention them.
Bo Peng [Tue, 19 May 2020 09:38:20 +0000 (18:38 +0900)]
Prepare 3.7.14
Bo Peng [Mon, 18 May 2020 20:16:27 +0000 (05:16 +0900)]
Doc: Add release-note.
Tatsuo Ishii [Sat, 16 May 2020 00:14:09 +0000 (09:14 +0900)]
Remove more duplicate global data.
This causes a link error on some systems (Fedora 32).
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-April/003593.html
Tatsuo Ishii [Sun, 26 Apr 2020 01:22:22 +0000 (10:22 +0900)]
Remove duplicate global data.
This causes a link error on some systems (Fedora 32).
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-April/003593.html
Tatsuo Ishii [Sun, 26 Apr 2020 00:58:50 +0000 (09:58 +0900)]
Remove duplicate global data.
This causes a link error on some systems (Fedora 32).
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-April/003593.html
Tatsuo Ishii [Thu, 23 Apr 2020 12:32:28 +0000 (21:32 +0900)]
Fix unnecessary checks.
Patch contributed by sherlockcpp.
Discussion: https://www.pgpool.net/pipermail/pgpool-general/2020-April/007062.html
Tatsuo Ishii [Thu, 23 Apr 2020 05:40:23 +0000 (14:40 +0900)]
Doc: fix typo.
Patch contributed by Umar Hayat.
Discussion: https://www.pgpool.net/pipermail/pgpool-hackers/2020-April/003587.html
Muhammad Usama [Thu, 16 Apr 2020 10:09:14 +0000 (15:09 +0500)]
Fix for segmentation fault in PCP processor: Bug 598
The crash was caused by an unconditional pfree on buf without verifying whether it
was actually allocated. Freeing the buf was not actually required in
the first place, since the associated MemoryContext always gets reset anyway
after each loop iteration.
reference: https://www.pgpool.net/mantisbt/view.php?id=598
Tatsuo Ishii [Sun, 12 Apr 2020 09:15:05 +0000 (18:15 +0900)]
Doc: fix typo in failover.sgml.
Patch provided by sherlockcpp.
Tatsuo Ishii [Sat, 11 Apr 2020 05:34:40 +0000 (14:34 +0900)]
Fix pool show command code.
Enhance common function send_complete_and_ready() so that it uses
local variable in more sane way.
Also fix unnecessary limitation of number of rows affected by the
command.
Patch provided by sherlockcpp and Tatsuo Ishii.
Muhammad Usama [Tue, 7 Apr 2020 16:01:16 +0000 (21:01 +0500)]
Minor improvement in configure script
While checking for libpq the configure script was not honoring the LDFLAGS
settings. This sometimes breaks configure when libraries are not present at
the default locations.
Bo Peng [Sun, 5 Apr 2020 16:10:37 +0000 (01:10 +0900)]
Doc: fix document typos.
Patch is provided by sherlockcpp@foxmail.com.
Tatsuo Ishii [Tue, 31 Mar 2020 07:58:01 +0000 (16:58 +0900)]
Fix bug with query cache.
If an execute message is issued right after a Sync message and the query
cache hits, Pgpool-II hangs in inject_cached_message() because there's
no data to be read from the backend. The fix is to move the code block
which checks the data availability in the backend from the end of the
loop to the beginning of the loop. Also teach is_cache_empty() to call
pool_is_suspend_reading_from_frontend() to check whether reading from
the frontend is suspended. If so, it returns that the cache is empty even
if there's data in the frontend cache, so that a ready for query message
can be received from the backend.
Tatsuo Ishii [Fri, 13 Mar 2020 01:36:17 +0000 (10:36 +0900)]
Fix problems in watchdog processing json data.
Comment on the patch from the author:
In the watchdog source code (src/watchdog/wd_json_data.c), there are some instances of bad handling of values read from json data.
For example:
1) The boolean pool configuration settings "load_balance_mode" and "master_slave_mode" are read using json_get_int_value_for_key(), resulting in 4-bytes being written into their location within the POOL_CONFIG, yet (being bool) they are only 1-byte long. This corrupts the values of the structure members following them.
2) Similarly, when parsing node function json data, "Flags" is read using json_get_int_value_for_key(), resulting in 4-bytes being written into an "unsigned char flags" variable on the stack, overwriting 3-bytes of stack memory following it. On a big-endian system (e.g. Solaris-sparc or Linux for IBM Z), this causes regression test "013.watchdog_failover_require_consensus" to fail, since 0 is written into Flags, rather than the intended value which is in the least significant byte of the int value written.
Bug reported in:
https://www.pgpool.net/mantisbt/view.php?id=596
Patch author:
Greg Nancarrow (Fujitsu Australia)
Tatsuo Ishii [Wed, 11 Mar 2020 04:13:28 +0000 (13:13 +0900)]
Merge branch 'V3_7_STABLE' of ssh://git.postgresql.org/pgpool2 into V3_7_STABLE
Tatsuo Ishii [Wed, 11 Mar 2020 04:05:43 +0000 (13:05 +0900)]
Fix possible data inconsistency in native replication mode.
There is a long standing bug with native replication mode. As reported
in pgpool-general, it is possible to lose sync of database if slave
DB's postgres process is killed. This is due to an oversight in
read_packets_and_process().
In replication mode if slave server's postgres is killed, then local
backend status is set to down.
*(my_backend_status[i]) = CON_DOWN;
So next DDL/DML in the same session is only issued to master node (and
other slave if there are multiple slave nodes). Of course this leads
to serious data inconsistency problem because in native replication
mode all DB nodes must receive DDL/DML at the same time.
The fix is to trigger failover in this case.
Discussions:
https://www.pgpool.net/pipermail/pgpool-general/2020-March/006954.html
https://www.pgpool.net/pipermail/pgpool-hackers/2020-March/003540.html
pengbo [Thu, 5 Mar 2020 22:39:58 +0000 (07:39 +0900)]
Fix watchdog ping probes fail with long hostnames due to small buffer.
per 516.
Tatsuo Ishii [Wed, 26 Feb 2020 09:16:18 +0000 (18:16 +0900)]
Revert "Fix last status changed timestamp is not set."
This reverts commit
a9f5a9b4e3f081822e454e7dab04a64b0e398e24.
Tatsuo Ishii [Wed, 26 Feb 2020 01:31:37 +0000 (10:31 +0900)]
Fix last status changed timestamp is not set.
If there's no status file or -D is specified when starting up
Pgpool-II, the last status changed timestamp was not set and
pcp_node_info command for example did not show the proper timestamp,
rather showed 1970/1/1, which is the Unix timestamp epoch.
Note that correct timestamp is set once clients connect to Pgpool-II
because at that time the status is changed from "waiting" to "up". So
the phenomena is only observed before any client connects to the
server.
Bo Peng [Sat, 22 Feb 2020 00:59:10 +0000 (09:59 +0900)]
Update copyright year.
Bo Peng [Wed, 19 Feb 2020 07:44:16 +0000 (16:44 +0900)]
Update pgpool.spec file.
Bo Peng [Wed, 19 Feb 2020 06:38:15 +0000 (15:38 +0900)]
Prepare 3.7.13.
Bo Peng [Wed, 19 Feb 2020 06:25:42 +0000 (15:25 +0900)]
Doc: Update doc version.
Bo Peng [Wed, 19 Feb 2020 06:19:53 +0000 (15:19 +0900)]
Doc: add release-notes.
Bo Peng [Wed, 19 Feb 2020 06:12:35 +0000 (15:12 +0900)]
Doc: update japanese documentation.
Muhammad Usama [Tue, 18 Feb 2020 12:23:10 +0000 (17:23 +0500)]
Disallowing the quorum aware failover option for the native replication mode.
In native replication mode, Pgpool-II is responsible for replicating the data
on all backend nodes, and if a node becomes quarantined then Pgpool-II stops
sending the writes to that node and when that quarantined node becomes available
again there is no way to figure out which portion of data was not replicated
during the quarantine period. This is dangerous since it can cause
data inconsistency.
So as per the discussion, we reached the conclusion to disallow failover require
consensus and quorum existence with the native replication mode so that backend
node should never get into quarantine when Pgpool-II is configured in the
replication mode.
Bo Peng [Tue, 18 Feb 2020 08:43:23 +0000 (17:43 +0900)]
Fix rewriting query errors in native replication mode.
per bug551.
Tatsuo Ishii [Tue, 18 Feb 2020 01:28:39 +0000 (10:28 +0900)]
Doc: clarify the session disconnection conditions while failover.
Bo Peng [Mon, 17 Feb 2020 13:14:53 +0000 (22:14 +0900)]
Doc: Fix documentation error regarding delegate_IP.
Bo Peng [Wed, 12 Feb 2020 01:03:13 +0000 (10:03 +0900)]
Fix child process segfault after reload if "health_check_database" is empty.
per bug571.
Tatsuo Ishii [Sat, 8 Feb 2020 10:39:14 +0000 (19:39 +0900)]
Fix oversight to adopt PostgreSQL 12.
From PostgreSQL 12 WAL dir is renamed to "pg_wal". pgpool_setup should
have dealt with this.
Takuma Hoshiai [Tue, 4 Feb 2020 02:51:33 +0000 (11:51 +0900)]
Fix error message typo
Bo Peng [Tue, 4 Feb 2020 00:04:05 +0000 (09:04 +0900)]
Doc: fix doc typo.
Bo Peng [Tue, 28 Jan 2020 06:20:46 +0000 (15:20 +0900)]
Feature: Check if socket file exists at startup and remove them if PID file doesn't exist.
If the previous socket files exist, pgpool startup fails due to bind() failure.
This commit removes socket files at startup if the PID file doesn't exist.
Also, I found that some messages are effectively ignored because the log system is not yet initialized at the starting point.
I changed them to use standard fprintf(stderr...).
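The startup cleanup can be sketched roughly as below. The function name remove_stale_socket() and its return convention are hypothetical; the sketch only shows the rule from the commit message (remove the socket file when no PID file exists) and reporting through stderr before logging is available.

```c
#define _POSIX_C_SOURCE 200809L
#include <assert.h>
#include <errno.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/*
 * Hypothetical sketch: remove a stale socket file at startup, but only
 * when no PID file exists. Returns 1 if the socket file was removed,
 * 0 if it was left alone or absent, -1 on error.
 */
static int
remove_stale_socket(const char *pid_path, const char *sock_path)
{
    assert(pid_path != NULL && sock_path != NULL);

    if (access(pid_path, F_OK) == 0)
        return 0;       /* PID file present: another instance may be running */
    if (unlink(sock_path) == 0)
        return 1;
    if (errno == ENOENT)
        return 0;       /* nothing to clean up */
    /* The log system may not be initialized yet, so report on stderr. */
    fprintf(stderr, "could not remove socket file \"%s\": %s\n",
            sock_path, strerror(errno));
    return -1;
}
```

Leaving the socket alone while a PID file exists avoids pulling the socket out from under a running instance.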
Tatsuo Ishii [Sat, 11 Jan 2020 11:49:29 +0000 (20:49 +0900)]
Fix 001.load_balance failure caused by previous commit.
Tatsuo Ishii [Fri, 10 Jan 2020 05:50:38 +0000 (14:50 +0900)]
Attempt to fix 001.load_balance test failure.
It is likely that sleep time after reload was not enough. Change them
from 1 second to 10 seconds.
Tatsuo Ishii [Sat, 4 Jan 2020 01:30:22 +0000 (10:30 +0900)]
More fix for occasional regression test 003.failover failure.
Comparison between the show pool_nodes result and the expected file failed
because of replication delay (the expected file sets the replication
delay column to 0). The fix is disabling replication delay.
Tatsuo Ishii [Tue, 31 Dec 2019 03:06:50 +0000 (12:06 +0900)]
Fix occasional regression test 003.failover failure.
The test compares the result of show pool_nodes against pre-computed
expected text file. After primary node goes down and old standby
promotes to primary, it is possible that streaming replication worker
process stores the replication state (async, streaming) in shared
memory before the failover but fails to update the state after the
failover. Fix is, replace "async", "streaming" with space characters
in the result file to match the expected file.