David's Astronomy Pages
Notes - Session 1242 (2024-09-12)

 
Bullet Session Aims & Highlights
 - Observing Result
 - Night Summary Plot
 - Session Event Log
Bullet Operational Issues
  - Critical Issues (0),  Major Issues (0),  Minor Issues (2),  Small Defects (1),  Continuous Improvement (0)
Bullet Images from 2024-09-12 >>         
   
Bullet Investigation - Job Queue freezing whilst performing a Locate Frame Image Link, leading to Session Suspension
   

Session Aims & Highlights (2024-09-12)

Main aims

  1. Targets.  Acquire images of a selection of variable stars, nearby stars, comets & deep sky targets as allowed by sky conditions.

 Equipment & Software

Highlights-

Notes:

Summary Plots & Logs

Observing Plan
Image
  
Observing Result
Image
   
Image
 
Dome & Scope Slewing Performance
Image
  
Slew/Centering Performance
Image
  
Guiding Performance
Image
  
Sky Conditions (Locate Frames)
Image 
  
Night Sky Summary Plot
Top axis: Sky Brightness at Zenith (in ADU/s)
Lefthand axis: Local Time (hh LT). Righthand axis: Sun Altitude (degs)
Image   
  
Actual Weather vs Pre-Session Weather Forecast
Image
Image
  
Session Event Log
Time     Event Detail
22:15:32 Session Monitoring AutoStart monitoring for Live Session opportunity between 22:15 & 04:40
22:15:33 Session AutoStarting Session autostarting (22:15)
22:15:55   CCDSoft Restarting CCDSoft being restarted (to set AutoSave No.)
22:16:23   Camera1 Connected SBIG Camera connected (Set point -20°C)
22:16:25 Session Created Session Created (Live, 2024-09-12 S01242, ImageSaveNum: 1242001)
22:16:30   Scope Switched On Telescope Power has been switched on via UPB Powerbox.
22:18:27   Services Started Observatory Services started
22:18:34 Observatory (Auto) Observatory placed in Fully-Automated Mode
22:18:38 Session Pending Session pending (2024-09-12)
22:18:40 Session Initiating Session initiating (2024-09-12)
22:18:49   Camera1 Connected SBIG Camera connected (Set point -20°C)
22:18:56   Plan Requested Observing Plan requested from AstroPlan (1.40.8)
22:19:43   Plan Loaded Observing Plan loaded to queue (Plan ID: 1017)
22:20:23   Telescope Connected Telescope connected (TheSky6)
22:20:47 Session Equilibration Session ready to Open Dome
22:21:37   Dome Opened Dome opened (Opening time 50s, Zigbee 40s)
22:21:57 Session Running Session running
22:21:59   Queue Started Observing Queue started (24 targets selected)
22:22:02     Target Started (NrZen) Target started (Focus Field 21, HIP 103850)
22:22:15       Dome Unparked Dome unparked
22:26:51       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
22:29:39       Focusing Completed Foc1 AutoFocus Completed (Profile No 1, wide)
22:31:51       Focusing Completed Foc1 AutoFocus Completed (Profile No 1)
22:31:54       Focusing Started-Foc2 Foc2 Focusing Started (Secondary Scope, using ShCap)
22:33:55       Focusing Completed Foc2 AutoFocus Completed (Profile No 2, wide)
22:35:39       Focusing Completed Foc2 AutoFocus Completed (Profile No 2)
22:35:56     Target Completed (NrZen) Target completed (Focus Field 21, HIP 103850)
22:36:01     Target Started (1/24) Target started (1/24, KIC 8462852)
22:39:15       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
22:41:46       Focusing Completed Foc1 AutoFocus Completed (Profile No 3)
22:45:42     Target Completed Target completed (1/24, KIC 8462852)
22:45:46     Target Started (2/24) Target started (2/24, GCVS T CrB)
22:49:17       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
22:51:45       Focusing Completed Foc1 AutoFocus Completed (Profile No 4)
22:57:51     Target Completed Target completed (2/24, GCVS T CrB)
22:57:55     Target Started (3/24) Target started (3/24, GCVS CY Lyr)
23:01:11       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
23:03:36       Focusing Completed Foc1 AutoFocus Completed (Profile No 5)
23:07:32     Target Completed Target completed (3/24, GCVS CY Lyr)
23:07:56     Target Started (4/24) Target started (4/24, AT2024jqo (CV))
23:10:51       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
23:13:18       Focusing Completed Foc1 AutoFocus Completed (Profile No 6)
23:19:11     Target Completed Target completed (4/24, AT2024jqo (CV))
23:19:56     Target Started (5/24) Target started (5/24, AT2023csw (CV))
23:20:58   ScopeMonitor Frozen ScopeMonitor appears to be frozen at around 23:20
23:22:39   HardSuspend Called Hard Suspend called due to TheSky6 failure
23:24:40 Session Suspending Session suspending
23:25:30   Dome Closed Dome closed (Closing time 50s, Zigbee 43s)
23:25:33 Session Suspended Session suspended
23:26:07     Target Aborted (5/24) Target aborted (5/24, AT2023csw (CV)) due to Obs.Manager Request
23:26:09   Queue Paused Queue paused for Session Suspension
23:27:35 Session Resuming Session resuming
23:28:20   Dome Opened Dome opened (Opening time 45s, Zigbee 40s)
23:28:23 Session Running Session running
23:28:25   Queue Resumed Observing Queue resumed
23:28:28     Target Started (5/24) Target started (5/24, AT2023csw (CV))
23:36:24     Target Completed Target completed (5/24, AT2023csw (CV))
23:36:28     Target Started (6/25) Target started (6/25, GCVS T CrB, ToO)
23:38:22   ScopeMonitor Frozen ScopeMonitor appears to be frozen at around 23:38
23:40:06   HardSuspend Called Hard Suspend called due to TheSky6 failure
23:42:07 Session Suspending Session suspending
23:42:57   Dome Closed Dome closed (Closing time 50s, Zigbee 43s)
23:43:00 Session Suspended Session suspended
23:43:29     Target Aborted (6/25) Target aborted (6/25, GCVS T CrB, ToO) due to Obs.Manager Request
23:43:31   Queue Paused Queue paused for Session Suspension
23:45:02 Session Resuming Session resuming
23:45:47   Dome Opened Dome opened (Opening time 45s, Zigbee 41s)
23:45:50 Session Running Session running
23:45:52   Queue Resumed Observing Queue resumed
23:45:55     Target Started (6/25) Target started (6/25, GCVS T CrB, ToO)
23:51:24     Target Completed Target completed (6/25, GCVS T CrB, ToO)
23:51:26     Target Missed (7/25) Target's time slot was missed (7/25, Nova Vul 2024)
23:51:29     Target Started (8/25) Target started (8/25, AT2023dwm (CV))
23:57:59     Target Completed Target partially completed (8/25, AT2023dwm (CV))
23:58:03     Target Started (9/25) Target started (9/25, GCVS BL Lac)
00:05:43     Target Completed Target completed (9/25, GCVS BL Lac)
00:05:47     Target Started (10/25) Target started (10/25, NGC 3073 w/AT2024unf)
00:10:18       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
00:12:53       Focusing Completed Foc1 AutoFocus Completed (Profile No 7)
00:29:27     Target Completed Target completed (10/25, NGC 3073 w/AT2024unf)
00:29:31     Target Started (11/25) Target started (11/25, NGC 3206 w/SN2024bch)
00:32:45       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
00:49:19     Target Completed Target completed (11/25, NGC 3206 w/SN2024bch)
00:49:24     Target Started (12/25) Target started (12/25, AT2023abqc (CV))
00:52:51       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
00:55:13       Focusing Completed Foc1 AutoFocus Completed (Profile No 8)
01:01:05     Target Completed Target completed (12/25, AT2023abqc (CV))
01:01:09     Target Started (13/25) Target started (13/25, NGC 7253B w/SN2023wuk)
01:03:40       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
01:06:19       Focusing Completed Foc1 AutoFocus Completed (Profile No 9)
01:22:10     Target Completed Target completed (13/25, NGC 7253B w/SN2023wuk)
01:23:04     Target Started (14/25) Target started (14/25, GCVS SS Cyg)
01:25:48       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
01:28:10       Focusing Completed Foc1 AutoFocus Completed (Profile No 10)
01:32:28     Target Completed Target completed (14/25, GCVS SS Cyg)
01:40:49     Target Started (15/25) Target started (15/25, AT2021afdl (CV))
01:43:42       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
01:59:42     Target Completed Target completed (15/25, AT2021afdl (CV))
02:03:09     Target Started (16/25) Target started (16/25, AT2024koe)
02:05:41       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
02:08:23       Focusing Completed Foc1 AutoFocus Completed (Profile No 11)
02:24:45     Target Completed Target completed (16/25, AT2024koe)
02:25:09     Target Started (17/25) Target started (17/25, WDS GRB 34)
02:27:35       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
02:29:52       Focusing Completed Foc1 AutoFocus Completed (Profile No 12)
02:31:59     Target Completed Target completed (17/25, WDS GRB 34)
02:33:37     Target Started (18/25) Target started (18/25, AT2024jri)
02:36:30       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
02:39:18       Focusing Completed Foc1 AutoFocus Completed (Profile No 13)
02:55:26     Target Completed Target completed (18/25, AT2024jri)
02:55:37     Target Started (19/25) Target started (19/25, M31 w/Nova M31 AT2023yoa)
02:58:14       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
03:14:16     Target Completed Target completed (19/25, M31 w/Nova M31 AT2023yoa)
03:17:57     Target Started (20/25) Target started (20/25, AT2022absk (CV))
03:20:41       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
03:26:36     Target Completed Target completed (20/25, AT2022absk (CV))
03:30:17     Target Started (21/25) Target started (21/25, GCVS RX And)
03:32:47       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
03:35:11       Focusing Completed Foc1 AutoFocus Completed (Profile No 14)
03:38:19     Target Completed Target completed (21/25, GCVS RX And)
03:39:37     Target Started (22/25) Target started (22/25, AT2023uqd)
03:43:13       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
03:45:55       Focusing Completed Foc1 AutoFocus Completed (Profile No 15)
04:01:47     Target Completed Target completed (22/25, AT2023uqd)
04:01:51     Target Started (23/25) Target started (23/25, 237P/LINEAR)
04:05:38       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
04:16:08     Target Completed Target completed (23/25, 237P/LINEAR)
04:18:21     Target Started (24/25) Target started (24/25, M33 w/Nova M33 2024-08a?)
04:21:10       Focusing Started-Foc1 Foc1 Focusing Started (TCF-S)
04:24:06       Focusing Completed Foc1 AutoFocus Completed (Profile No 16)
04:40:10     Target Completed Target completed (24/25, M33 w/Nova M33 2024-08a?)
04:40:42     Target Started (25/25) Target started (25/25, AT2024eja)
04:46:03       Focusing Skipped Foc1 focusing skipped - star is too dim (TCF-S)
05:02:46     Target Completed Target completed (25/25, AT2024eja)
05:02:48   Queue Completed Job Queue completed
05:02:50 Session Closing Session closing
05:03:44   Dome Closed Dome closed (Closing time 50s, Zigbee 44s)
05:04:16   Dome Parked Dome parked (parking time 25s), Az: 90.0 deg
05:05:31   Telescope Parked Telescope parked (parking time 71s total)
05:05:46   Telescope State Handbox reads 'Scope parked Turn scope off.'
05:06:17   Telescope Switched Off Telescope Power has been switched off via UPB Switch.
05:07:41   Services Stopped Night Services stopped
05:07:44 Session Finishing Session Finishing started (Create Fits Summary, Transfer Files)
05:08:11     Dome (Find Park) Find Park started (Search Az 88.0 to 92.0°, Step 0.2°, Narrow)
05:09:09     Dome Find Park found (Best Park Az 90.6°)
05:09:15     Dome Dome has been synced at park position (Az 90.0°, Adjustment: -0.6°)
05:09:17   Dome Parked Dome parked
05:09:19   Dome Charging Dome is parked and re-charging Ok (12mA)
05:09:21 Session Finished Session Finished

 
Session Alerts & Alarms
Time     Type       Name Detail
22:18:38 Yellow Alert Disk Space Free Space on C:\ Drive (47.56 GB) is less than 50 GB
23:22:39 Red Alert ScopeMonitor ScopeMonitor appears to be frozen (last cycle at 23:20:58)
23:22:41 Red Alert TheSky6 Alert TheSky6 isn't running or isn't working correctly
23:26:09 Green Clear ScopeMonitor ScopeMonitor has resumed again (23:26) after 0.0 mins
23:40:06 Red Alert ScopeMonitor ScopeMonitor appears to be frozen (last cycle at 23:38:22)
23:40:08 Red Alert TheSky6 Alert TheSky6 isn't running or isn't working correctly
23:43:31 Green Clear ScopeMonitor ScopeMonitor has resumed again (23:43) after 0.0 mins
23:52:05 Yellow Alert Anomalous Dome Slew Dome slew resumed at Az 282.0° (T+31.5s)
00:07:15 Yellow Alert Dome Discrepency Difference between Dome Az and Slave Az is > 1 deg (NGC 3073)
00:50:42 Yellow Alert Dome Discrepency Difference between Dome Az and Slave Az is > 1 deg (AT2023abqc)
05:08:04 Yellow Alert Disk Space Free Space on C:\ Drive (46.18 GB) is less than 50 GB
05:09:19 Green Clear Dome Charging Dome is parked and re-charging Ok (12mA)
 

Back to Top


Operational Issues (2024-09-12 S1242)

[ Prev | Next ]

Critical Issues

Major Issues

Minor Issues

Small Defects

Continuous Improvement

[ Prev | Next ]


Fig 1.  Strange Light moving north of the Observatory (2024-09-13 20:27)
After review this was concluded to a tractor or other farm equipment being used in fields north of the Observavtory

Image
 

[ Prev | Next ]

Back to Top


Investigation - Job Queue freezing whilst performing a Locate Frame Image Link, leading to Session Suspension

Issue

1a) Job Queue froze whilst performing an Image Link on Locate Frame (23:22, S1242
1b) Job Queue froze whilst performing an Image Link on Locate Frame (23:40, S1242)
2)  Alerts raised that ScopeMonitor was frozen and TheSky6 isn't running (2024-09-10 22:28, S1240 ). 
3) Session interrupted by a session suspension when Image Link operation in CCDSoft/TheSky6 took an unusually long time (2024-08-31 03:40, S1237). 
4)  Session interrupted by a session suspension when Image Link operation in CCDSoft/TheSky6 took an unusually long time, TheSky6 appeared to 'hang' and AstroMain's scope monitor froze (2024-08-29 00:01, S1235)
5) Three Job Queue freezes during session S1243 (2024-09-12) at 21:12, 00:38 & 04:05
6) Two Job Queue freezes during session S1244 (2024-09-15) at 00:21 & 04:38

Description

1a) S1242 2024-09-12 23:22.  Whilst performing plate solution on Locate 1 frame (#1242034, 111 stars) for Target 5/24 (AT2023csw) Obs.Overseer gave a warning "ScopeMonitor is frozen (23:22) ", followed by a Red 'TheSky6' Alert with the message "TheSky6 isn't running or isn't working correctly" and a warning "Task in ImageAndLink/LinkImage is taking a long time? (23:23)". The ObsManager aborted the target and the session was suspended.

1b)  S1242 2024-09-12 23:40.  Whilst performing plate solution on Locate 2 frame (#1242043, 14 stars) for Target 6/25 (GCVS T CrB) Obs.Overseer gave a warning "ScopeMonitor appears to be frozen (last cycle at 23:38:22)  ", a TheSky6 Alert with message "TheSky6 isn't running or isn't working correctly" and a warning "Task in ImageAndLink/LinkImage is taking a long time? (23:40)" . The ObsManager aborted the Target and the session suspended.

In both cases the issue resolved itself after a short while, with the results of Image Link (solved in 309.9s and solved in 305.2s respectively ) being posted whilst ObsCam was taking a 'Supended Session' image. Whilst the session then continued on ok , the issue causes a significant delay, and lost observations for the target affected.

Similar problems have occurred on two other occasions recently.  Is there a common factor ? 

2) S1240 2024-09-10 22:28. Whilst performing plate solution on Locate 1 frame (#01240030, 136 stars) for Target 4/24 (AT2023exh ) Obs.Overseer gave a warning "ScopeMonitor is frozen (22:29) ", followed by a Red 'TheSky6' Alert with the message "TheSky6 isn't running or isn't working correctly" and a warning "Task in ImageAndLink/LinkImage is taking a long time? (22:30)". The ObsManager aborted the target and the session was suspended.  In this case Scope Monitor resumed again at 22:33, indicating a freeze of around 4-5 minutes. The results of a successful image link ("Solved in 310.3s  (Stars 117, Scale 1.406, Angle 180.1°) " )  were reported whilst ObsCam was taking a 'Suspended' Image.

3) S1237 2024-08-31 03:40 Session interrupted by a session suspension when Image Link operation in CCDSoft/TheSky6 took an unusually long time (03:40).  TheSky6 appeared to 'hang' and AstroMain's scope monitor froze. The ScopeMonitor was identified to be frozen and a red alert raised at 03:40:38, after the Monitor had been frozen since it's last completed cycle at 03:38:53.  This was during Image Linking of Locate 2 frame (#01237205 ) for Target 25/27 (M33 w/AT2023nou) , and the following warning was reported "TheSky6 isn't running or isn't working correctly ". ObsManager intervened and raised a bHardSuspend flag (03:40:42 ) in order to suspend the session.  ObsOverseer warned that "a task in ImageAndLink/LinkImage is taking a long time?" (03:41:28).   After a prescribed amount of waiting ObsManager gave the following warning "Timed out waiting for JobQueue to raise bSuspendOk" (03:42:43 ) and set session state to Suspending. This  indicates that JobExecution was held up (almost certainly held up waiting for TheSky6 to complete the ImageLink action). After closing the dome and whilst Session State was suspended  the Image Linking operation eventually completed with the success message "Ok    Solved in 293.5s  (Stars 11, Scale 1.408, Angle 179.9°) " (03:43:46 ). At the same time the Scope Monitor was recorded as having resumed (03:43:46). Inspecting the #01237205 frame it seems normal with plenty of stars and should be easy to link. Indeed during Post-Session 'Reduction/Analyse All Frames' the frame was image linked in only 0.6s.  This suggests that the delay was likely not caused by the specific frame been image linked at the time, but was due to some other hold-up with the TheSky6 program.

4) S1235 2024-08-29 00:01.  The ScopeMonitor was identified to be frozen and a red alert raised at 00:01:53, after the Monitor had been frozen since it's last completed cycle at 00:00:12.  This was during Image Linking of Locate 2 frame (#1235037) for Target 4/19 (NGC 7053), and the following warning was reported "TheSky6 isn't running or isn't working correctly ". ObsManager intervened and raised a bHardSuspend flag (00:01:58) in order to suspend the session.  ObsOverseer warned that "a task in ImageAndLink/LinkImage is taking a long time?" (00:02:48).  After a prescribed amount of waiting ObsManager gave the following warning "Timed out waiting for JobQueue to raise bSuspendOk" (00:03:58) and set session state to Suspending. This indicates that JobExecution was held up (almost certainly held up waiting for TheSky6 to complete the ImageLink action). After closing the dome and whilst Session State was suspended  the Image Linking operation eventually completed with the success message "Ok     Solved in 341.5s (Stars 57, Scale 1.407, Angle 180.0°) "  (00:05:54). At the same time the Scope Monitor was recorded as having resumed (00:05:54). Inspecting the #1235037 frame it seems normal with plenty of stars and should be easy to link. Indeed during Post-Session 'Reduction/Analyse All Frames' the frame was image linked in only 0.6s.  This suggests that the delay was likely not caused by the specific frame been image linked at the time, but was due to some other hold-up with the TheSky6 program. Issue reviewed and analysed as far as it could be taken.   No identified action is required. 

5) S1243 2024-09-14. Three cases with very similar descriptions to earlier cases, except that two of them were associated to image linking the Cam2 Reference Frame. (see Minor Issues 2024-09-14). Collectively the 3 cases were recorded as a Major Issue (see Major Issues 2024-09-15).

6) S1244 2024-09-15. Two cases with very similar descriptions to earlier cases (see Minor Issues 2024-09-15)

Analysis

The descriptions of the 9 cases are almost identical, and point to a common cause. However it obviously doesn't happen for every Locate frame that is being image linked nor does it happen for every target and clearly some sessions are unaffected.  Session S1242 appears to have unlucky by having two occurances of the problem (though there turned out later on that there were 3 cases & 2 cases in Session S1243 &  S1244 respectively .

Some images can take a very long time to image link (often because they have a very large number of stars or hot pixels).  Examing the post-session results from 'Analyse Reduced Folder' operation shows that the locate frame from case 1a & 1b took only 1.6s only 0.7s to image link, strongly suggesting that there was no fundamental issue that caused image linking to take so long during session execution.

At around the time of the 1a incident an attempt was being made to access the ToO form in AstroPlan for Target 'GCVS T CrB (centered)', with the aim of adding it as a ToO target. The AstroPlan program began to run very slowly & stopped responding. It was presumed at the time that the user action taken in AstroPlan with its calls to TheSky6 method/properties led to TheSky6 program to stop responding & thereby caused the ImageLink process/routine to hang which then caused the Job Queue to hand/freeze.   However with no similar user involvement in the 1b incident it is concluded that AstroPlan was most likely hanging/running slowing in the 1a case because TheSky6 was already not responding.

It is hypothesized that the issue occurs because there are calls on TheSky6 from certain routines whilst TheSky is busy performing ImageLinking.
Disabling/deferring calls to TheSky6 whilst Image Linking is being undertaken might fix the problem.

The ScopeMonitor was implemented around 2022-10-24 (CI S1048) and it is unclear why the issue has only appeared in S1235 (2024-08-29).
ScopeMonitor is a loop that cycles every 2.5s that reads the value of TheSky6.Telescope.IsConnected, and checks whether the Telescope is connected or not in TheSky6

If execution holds up it usually means that TheSky6 is too busy to respond or is hung up in someway.   When TheSky6 starts responding again, the ScopeMonitor loop continues again.

There is possibility that it an initial freeze in CCDSoft.Image.InsertWCS routine that is at the route of the problem.  This is probably unlikely and it certain be shown that TheSky6 is effectivity frozen during the problem intervals, and has therefore likely received the Imageand begun the Image Linking process.

The DeviceHub.Telescope log of messages to/from TheSky6 during the S1243 session was examined to see if there are any anomalies at the time of the 3 ImageLink freeze incidents (21:12:49, 00:38:08 & 04:05:35). Normally there are 4 to 6 messages records every second, however for during each of the 3 cases there is a 5 minutes or so gap in records (21:12:50 to 21:18:09 for first incident,  00:38:10 to 00:43:24 for second case,  04:05:37 to 04:10:29 for third case).   Are these gaps due to TheSky6 being effectivity frozen during this time (so not making Ra/Dec requests) or were issues with Telescope causing TheSky and Job Queue to lock up.  The latter explanation seems a very unlikely multiple coincidence, and the presence of unniterrupted message records in the DeviceHub.Telescope logs of messages to/from AstroMain and to/from AstroGuard rule out a problem with DeviceHub.Telescope, MeadeGeneric Driver or the LX200 Telescope.

Using TheSky6.Telescope.Abort is unlikely to be effective as the Telescope isn't being slewed at the time of the incidents, and may in any case cause the calling routine in AstroMain to freeze.

Locate Frame image linking uses the PlateSolveImage() routine, which calls SolvePlateWithTheSky()  [ via a call to SolvePlate() ].   Image Link is performed by call to DoImageLink(x, y)

Examining the Job Queue freeze whilst performing an Image Link on Locate 2 Frame for Target 23/25 (SN 2022qwx ) during sesion S1243 (04:05) the log messages were :

  04:05:36.66 | [ ImageAndLink() LinkImage ]
  04:05:36.68 | Stars= 8 (Objects= 8)
  04:05:36.69 | StarInventory Ok, nObjects = 8, nStars = 8
  04:05:36.69 | 2 - USCA2/USNOB
  04:05:36.74 | 2 - USCA2/USNOB
  04:05:36.91 | NEW\2024-09-14 S01243\01243148.SN_2022qwx.FIT
  04:05:36.91 | [01243148]
  04:05:36.91 | ImageScale(Original)Data | 1.404"/pixel
  04:05:36.92 | CCDXBIN= 3(will assume Image Scale Of 1.404"/pixel)
  04:05:36.92 | Show Image Inventory
  04:05:36.93 | Stars= 8 (Objects= 8)
  04:05:36.93 | Stars= 8 (Stars >=5 is True)
  04:05:36.93 | Hide Image Inventory
  04:05:36.94 | Scale = 1.404 | ModRA 6.81' | ModDec 4.59' |
  04:05:36.94 | Searching for potential Plate Solutions (Radius 0)
  04:05:37.30 | InsertWCS (0\0)   | Stars 5 | Rms 0.23" | Angle 179.87 | Scale 1.416 | RA= 02 02 26.506 Dec=-20 23 48.41
  04:05:50.60 | ScopeMonitor is Pending (15s)
  04:05:55.62 | ScopeMonitor is Pending (20s)
  04:06:00.62 | ScopeMonitor is Pending (25s)

This shows that DoImageLink was passed through

Common Factor is that the first call to DoImageLink (x,y) with x=-, y=0 executes ok and produces a log entry like

 Report  InsertWCS (0\0) Data    | Link | Stars 96 | Rms 0.35" | Angle 180.08 | Scale 1.405 | RA= 19 40 08.177 Dec=+23 26 59.53
Report  InsertWCS (0\0) Data    | Link | Stars 12 | Rms 0.15" | Angle 180.01 | Scale 1.408 | RA= 15 59 37.283 Dec=+25 50 36.44

Report  InsertWCS (0\0) Data    | Link | Stars 10 | Rms 0.53" | Angle 359.74 | Scale 1.013 | RA= 10 21 48.894 Dec=+56 55 43.34
Report  InsertWCS (0\0) Data    | Link | Stars 37 | Rms 0.40" | Angle 0.44   | Scale 1.012 | RA= 00 13 18.903 Dec=+36 50 33.03
Report  InsertWCS (0\0) Data    | Link | Stars 5  | Rms 0.23" | Angle 179.87 | Scale 1.416 | RA= 02 02 26.506 Dec=-20 23 48.41 

this means that the first call to Image.InsertWCS(1)  works ok.


During Session S1244 (2024-09-15) there were 2 further incidents (00:21 & 04:38)

The following trace data was collected for the 00:21 case (AT2021aeft)

00:21:03.00 |     Report  SearchParams       Info    | Scale = 1.404 | ModRA 6.81' | ModDec 4.59' |
00:21:03.00 |     Start   Search Distance #0 Data    | Searching for potential Plate Solutions (Radius 0)
00:21:03.00 |     (trace) ImageLinking       (trace) | Calling DoImageLink (0 \ 0)
00:21:03.02 |     (trace) ImageLinking       (trace) | Calling Image.InsertWCS(1)
00:21:03.55 |     (trace) ImageLinking       (trace) | Back from Image.InsertWCS(1)
00:21:03.57 |     (trace) ImageLinking       (trace) | Image.WcsArray checked
00:21:03.59 |     Report  InsertWCS (0\0)    Data    | Link | Stars 18 | Rms 0.13" | Angle 180.09 | Scale 1.406 | RA=..
00:21:03.59 |     (trace) ImageLinking       (trace) | Back from DoImageLink (0 \ 0)
00:21:03.59 |     (trace) ImageLinking       (trace) | Getting TheSky.GetObjectRaDec2000("Image Link Information")
00:21:03.65 |     (trace) ImageLinking       (trace) | Back from TheSky.GetObjectRaDec2000("Image Link Information")
00:21:21.76 | Ovs Check   ScopeMonitor       Info    | ScopeMonitor is Pending (19s)
00:21:26.78 | Ovs Check   ScopeMonitor       Info    | ScopeMonitor is Pending (24s)
00:22:46.92 | Ovs Check   ScopeMonitor       Alert   | ScopeMonitor is Frozen (104s)
00:22:50.44 |     Set     ScopeMode(7)       Info    | Scope Mode : Suspending
00:22:50.44 | Mgr Check   TheSky6            Fail    | Raising bHardSuspend (TheSky6 program isn't responding)
00:22:50.52 |     (trace) CallMethod         Ok      | Called CCDCamera.Abort
00:22:50.52 |     (trace) CallMethod         Ok      | Call Abort took 0.036s
00:23:37.00 | Ovs Check   JobQueue           Info    | Task in ImageAndLink/LinkImage is taking a long time? ()
00:24:50.65 | Mgr Call    SetState_Suspended Info    | Calling SetState_Suspended due to overdue response to bHardSuspend
00:24:50.88 | Mgr Change  Session.State      Info    | Session : Suspending
00:25:40.99 | Mgr Change  Session.State      Info    | Session : Suspended
00:25:41.03 | Mgr Start   SuspendedStage 3   Info    | Suspended Stage 3 (Wait on ObsCam Image)
00:25:52.25 | Ovs Check   ScopeMonitor       Alert   | ScopeMonitor is Frozen (289s)
00:25:57.27 | Osr Check   ScopeMonitor       Alert   | ScopeMonitor is Frozen (294s)
00:25:58.60 | Mon Check   MonitorTheSkyScope Info    | TheSkyScope.Monitor has resumed (cycle = 0.0s)
00:25:58.66 |     (trace) ImageLinking       (trace) | Back from TheSky.GetObjectRaDec("Image Link Information")
00:25:58.66 |     (trace) ImageLinking       (trace) | Finished x/y loops
00:25:58.66 |     (trace) ImageLinking       (trace) | BestActiveStars= 18, Match=True
00:25:58.66 |     (trace) ImageLinking       (trace) | Logging BestMatch
00:25:58.66 |     Report  Best Match         Data    | Best | Stars 18 | Rms 0.13" | Angle 180.09 | Scale 1.406 | Offset 0/0
00:25:58.66 |     Report  ImageCoords(Orig)  Data    | RA= 23 19 26.481 Dec= +33 11 29.97 | Epoch 2000
00:25:58.66 |     Report  ImageCoords(Image) Data    | RA= 23 19 05.781 Dec= +33 15 50.56 | Epoch 2000
00:25:58.66 |     (trace) ImageLinking       (trace) | Reporting Link Result
00:25:58.66 |     (trace) ImageLinking       (trace) | Calling ValidateWcsCenter
00:25:58.72 |     Check   WcsCenter          Ok      | WCS center matches that from image link
00:25:58.72 |     (trace) ImageLinking       (trace) | Back from ValidateWcsCenter
00:25:58.73 |     Report  LinkingTime        Info    | 296.0s
00:25:58.73 |     Select  StarCatalog        Info    | 1 - GSC Catalog
00:25:58.77 |     (trace) Continue           (trace) | [ CentreTarget() Continue ]
00:25:58.77 |     Set     ObsCamON           Ok      | Sending 'Camera On' request to AstroObsCam
00:25:58.78 |     (trace) CenterTarget       (trace) | [ ExecuteTarget() CenterTarget ]
00:25:58.79 | Exc Abort   Target             Info    | Target (AT2021aeft (CV)) has been aborted due to Obs.Manager Request
00:25:58.81 |     (trace) AddAzAltResult     (trace) | Target No: 15, AT2021aeft (CV), Az:166.4°, Alt: 65.6°,
00:25:58.83 | Exc Suspend JobQueue           Info    | Waiting for bSuspend to clear in order to proceed to next target.
00:25:58.84 |     (trace) Paused ForSusps'n  (trace) | [ ExecuteQueue() Paused ForSuspension ]
00:26:02.26 | Osr Check   ScopeMonitor       Ok      | ScopeMonitor has resumed again (00:26) after 5.0 mins
00:26:10.08 | Zgb Event   AuxLight           Info    | Aux Light has switched off (00:26:10, on time 28 sec)
00:26:10.79 | Mgr (trace) SessionSuspended   Trace   | SuspendedStageNo set to 4

The trace information shows that point of freeze came in between the two following trace points
              - Back from TheSky.GetObjectRaDec2000("Image Link Information")
             -  Back from TheSky.GetObjectRaDec("Image Link Information")

This narrows the point of freeze down to being in SolvePlateWithTheSky() routine in the code between the two tracing statement

  WriteLogFileRecord(szTargetName, "(trace)", "(trace)", "ImageLinking",
                                                "Back from TheSky.GetObjectRaDec2000(""Image Link Information"")")

  TheSky.GetObjectRaDec("Image Link Information")
  BestRA = TheSky.dObjectRa
  BestDec = TheSky.dObjectDec

  WriteLogFileRecord(szTargetName, "(trace)", "(trace)", "ImageLinking",
                                                 "Back from TheSky.GetObjectRaDec(""Image Link Information"")")

The following abbreviated trace data was collected for the 04:38 case (GCVS RR Tau), which confirms the exact same freeze point.

04:39:00.17 |    ( trace) ImageLinking      (trace)  | Getting TheSky.GetObjectRaDec2000("Image Link Information")
04:39:00.24 |    (trace)  ImageLinking      (trace)  | Back from TheSky.GetObjectRaDec2000("Image Link Information")
04:39:14.77 | Osr Check   ScopeMonitor       Info    | ScopeMonitor is Pending (17s)
04:39:19.77 | Ovs Check   ScopeMonitor       Info    | ScopeMonitor is Pending (22s)

04:43:40.18 | Ovs Check   ScopeMonitor       Alert   | ScopeMonitor is Frozen (282s)
04:43:45.19 | Ovs Check   ScopeMonitor       Alert   | ScopeMonitor is Frozen (287s)
04:43:47.87 | Mon Check   MonitorTheSkyScope Info    | TheSkyScope.Monitor has resumed (cycle = 0.0s)
04:43:47.88 |     (trace) ImageLinking      (trace)  | Back from TheSky.GetObjectRaDec("Image Link Information")
04:43:47.88 |     (trace) ImageLinking      (trace)  | Finished x/y loops
04:43:47.88 |     (trace) ImageLinking      (trace)  | BestActiveStars= 32, Match=True

This raises the two following questions
 -  why is freezing occuring during the call to GetObjectRaDec when it appears to have no problem getting TheSky.GetObjectRaDec2000() immediately beforehand. ?
 -  why does 'Get TheSky.GetObjectRaDec()' cause the freeze, but only sometimes ?

Only 2 cases have been traced so far and they show the same freeze point. However it leaves open the possibility that the freeze due to TheSky.GetObjectRaDec2000 simply hasn't been observed yet. If they had equal probability of occurance (we simply don't know at this time), then getting two out of two freezes from GetObjectRaDec() only has a 25% probability outcome.

More 'traced cases' need to be logged and examined.

Despite the absence of a complete confirmation of where the freezes are happening a number of hypothesis can be constructed based on information to date:

  1. 1) The 'TheSky.GetObjectRaDec()' call somehow clash in time with the TheSky's own call to Get Telescope Ra/Dec from the scope 
     
  2.  The 'TheSky.dObjectRa' and/or  'TheSky.dObjectDec' property reads somehow clash in time with updates to TheSky's own internal property values as a result of Ra/Dec updates from the scope.
     
  3.  The 'TheSky.GetObjectRaDec()' call somehow comes too soon after the call to TheSky.GetObjectRaDec2000(), or TheSky.dObjectRa comes too soon after the retrospective call to TheSky.GetObjectRaDec() or TheSky.GetObjectRaDec2000()

Previously (though not last since session S1229) ScopeMonitor would record periods of around 90s duration where TheSky6 would stop responding. It was annoying but not cirtical.   This 90s duration was very consistent across the number of cases that this earlier problem occurred, so much so that only when the non-response had lasted more than 100s would ScopeMonitor'  be given a 'Frozen' Status,  before that it was given a 'Pending' Status.

In session S1237, the ScopeMonitor was Pending for 68s (at 01:07) whilst waiting for Target 15/27 to start.

Proposals:

The other potential course of action is to allow ObsManger to allow the session to continue to run on when the ImageLinking appears to have frozen, and only intervene to Abort Target and Suspend Session after the freeze has lasted more than 6 minutes (the typical freeze time is 5 minutes).

Conclusion

Actions:

Update  2024-09-18

Two sessions (S1245 2024-09-16 & S1246 2024-09-17) have been now been conducted with AstroMain 3.69.8 which contained a potential fix for the 'Frozen ImageLinking / Non-Responsive TheSky'  Issue,  by the insertion of Thread.Sleep(500) delays into the Plate Solving routine to slow down the calls sent to TheSky.

It wasn't certain that this potential fix would work, but there was a possibility and one worth trying based on a hunch.

In the event the fix seems to have significantly reduced the incidence of issue, but it is too soon to know if the fix is absolute.

Update  2024-09-19

Update  2024-09-29

Update  2024-10-15


Back to Top