GLEAN3_01420	SPU_001420		\N
GLEAN3_27359	SPU_027359		\N
GLEAN3_04327	SPU_004327		\N
GLEAN3_13381	SPU_013381		\N
GLEAN3_19817	SPU_019817		\N
GLEAN3_18732	SPU_018732		\N
GLEAN3_04783	SPU_004783		\N
GLEAN3_28130	SPU_028130		\N
GLEAN3_04858	SPU_004858		\N
GLEAN3_12291	SPU_012291		\N
GLEAN3_10711	SPU_010711		\N
GLEAN3_00278	SPU_000278		\N
GLEAN3_02534	SPU_002534		\N
GLEAN3_07568	SPU_007568		\N
GLEAN3_11814	SPU_011814		\N
GLEAN3_25682	SPU_025682		\N
GLEAN3_26118	SPU_026118		\N
GLEAN3_07634	SPU_007634		\N
GLEAN3_04713	SPU_004713		\N
GLEAN3_04042	SPU_004042		\N
GLEAN3_04519	SPU_004519		\N
GLEAN3_19647	SPU_019647		\N
GLEAN3_06044	SPU_006044		\N
GLEAN3_24797	SPU_024797		\N
GLEAN3_01310	SPU_001310		\N
GLEAN3_28817	SPU_028817		\N
GLEAN3_20631	SPU_020631		\N
GLEAN3_15355	SPU_015355		\N
Sp-185/333-04	SPU_030262		\N
Sp185/333-05	SPU_030263		\N
Sp185/333-06	SPU_030264		\N
GLEAN3_03182	SPU_003182		\N
GLEAN3_05334	SPU_005334		\N
GLEAN3_08434	SPU_008434		\N
GLEAN3_19718	SPU_019718		\N
GLEAN3_00812	SPU_000812		\N
GLEAN3_26430	SPU_026430		\N
GLEAN3_27932	SPU_027932		\N
GLEAN3_01282	SPU_001282		\N
GLEAN3_01408	SPU_001408		\N
GLEAN3_06601	SPU_006601		\N
GLEAN3_09636	SPU_009636		\N
GLEAN3_17436	SPU_017436		\N
GLEAN3_21962	SPU_021962		\N
GLEAN3_22557	SPU_022557	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it appears that both scaffolds do not have any gaps or repeats present and if the two were combined, the entire sequence would have an orderly and continuous arrangement. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong.	\N
GLEAN3_27090	SPU_027090	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When examining the excel data to that of the BLAST results, it appears that if the 3 scaffolds were to be combined, the sequence would have an orderly and continuous arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with all of the values being greater than10	\N
GLEAN3_21829	SPU_021829		\N
GLEAN3_18085	SPU_018085		\N
GLEAN3_15719	SPU_015719		\N
GLEAN3_27015	SPU_027015		\N
GLEAN3_00426	SPU_000426		\N
GLEAN3_04823	SPU_004823		\N
GLEAN3_05839	SPU_005839		\N
GLEAN3_07247	SPU_007247		\N
GLEAN3_11768	SPU_011768		\N
GLEAN3_12471	SPU_012471		\N
GLEAN3_22172	SPU_022172		\N
GLEAN3_25307	SPU_025307		\N
GLEAN3_08692	SPU_008692		\N
GLEAN3_14202	SPU_014202		\N
GLEAN3_14067	SPU_014067		\N
GLEAN3_28033	SPU_028033		\N
GLEAN3_00563	SPU_000563		\N
GLEAN3_28771	SPU_028771		\N
GLEAN3_13464	SPU_013464		\N
GLEAN3_24766	SPU_024766		\N
GLEAN3_17373	SPU_017373		\N
GLEAN3_18184	SPU_018184		\N
GLEAN3_27006	SPU_027006		\N
GLEAN3_28053	SPU_028053		\N
GLEAN3_15607	SPU_015607		\N
GLEAN3_05992	SPU_005992		\N
GLEAN3_20096	SPU_020096		\N
GLEAN3_00611	SPU_000611		\N
GLEAN3_10592	SPU_010592		\N
GLEAN3_24559	SPU_024559		\N
GLEAN3_11645	SPU_011645		\N
GLEAN3_13508	SPU_013508		\N
GLEAN3_16631	SPU_016631		\N
GLEAN3_01357	SPU_001357		\N
25424	SPU_030181		\N
GLEAN3_02555	SPU_002555		\N
GLEAN3_21739	SPU_021739		\N
GLEAN3_18457	SPU_018457		\N
GLEAN3_03402	SPU_003402		\N
GLEAN3_20451	SPU_020451		\N
GLEAN3_02398	SPU_002398		\N
GLEAN3_27734	SPU_027734		\N
GLEAN3_03591	SPU_003591		\N
GLEAN3_22053	SPU_022053		\N
GLEAN3_10946	SPU_010946		\N
GLEAN3_06847	SPU_006847		\N
GLEAN3_00861	SPU_000861		\N
GLEAN3_13301	SPU_013301		\N
GLEAN3_08290	SPU_008290		\N
GLEAN3_14588	SPU_014588		\N
GLEAN3_00001	SPU_000001		\N
GLEAN3_10846	SPU_010846		\N
GLEAN3_15466	SPU_015466		\N
GLEAN3_01752	SPU_001752		\N
GLEAN3_19033	SPU_019033		\N
GLEAN3_01628	SPU_001628		\N
GLEAN3_27618	SPU_027618		\N
GLEAN3_07553	SPU_007553		\N
GLEAN3_06795	SPU_006795		\N
GLEAN3_11755	SPU_011755		\N
GLEAN3_04924	SPU_004924		\N
GLEAN3_00007	SPU_000007		\N
GLEAN3_13570	SPU_013570		\N
GLEAN3_11130	SPU_011130		\N
GLEAN3_09756	SPU_009756		\N
GLEAN3_13399	SPU_013399		\N
GLEAN3_16128	SPU_016128		\N
GLEAN3_02615	SPU_002615		\N
GLEAN3_01113	SPU_001113		\N
GLEAN3_25676	SPU_025676		\N
GLEAN3_07437	SPU_007437		\N
GLEAN3_06912	SPU_006912		\N
GLEAN3_24715	SPU_024715		\N
GLEAN3_01452	SPU_001452		\N
GLEAN3_13436	SPU_013436		\N
GLEAN3_02866	SPU_002866		\N
GLEAN3_28469	SPU_028469		\N
GLEAN3_09346	SPU_009346		\N
GLEAN3_16729	SPU_016729		\N
GLEAN3_28477	SPU_028477		\N
GLEAN3_05702	SPU_005702		\N
GLEAN3_02849	SPU_002849		\N
GLEAN3_02552	SPU_002552		\N
GLEAN3_20925	SPU_020925		\N
GLEAN3_15108	SPU_015108		\N
GLEAN3_16934	SPU_016934		\N
GLEAN3_01293	SPU_001293		\N
GLEAN3_11064	SPU_011064		\N
GLEAN3_09911	SPU_009911		\N
GLEAN3_12252	SPU_012252		\N
GLEAN3_21428	SPU_021428		\N
GLEAN3_19300	SPU_019300		\N
GLEAN3_12329	SPU_012329		\N
GLEAN3_01979	SPU_001979		\N
GLEAN3_03835	SPU_003835		\N
GLEAN3_11029	SPU_011029		\N
GLEAN3_18026	SPU_018026		\N
GLEAN3_12715	SPU_012715		\N
GLEAN3_06176	SPU_006176	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it appears that there are 2 overlaps between subject gb|DS002008| and gb|DS007834| from 297-369 and from 219-301. Besides the overlaps between the 2 scaffolds, the sequence has an orderly arrangement without any gaps. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak (all the values were less than 6.) 	\N
GLEAN3_24339	SPU_024339		\N
GLEAN3_14785	SPU_014785		\N
GLEAN3_06370	SPU_006370		\N
GLEAN3_03290	SPU_003290		\N
GLEAN3_03114	SPU_003114		\N
GLEAN3_00841	SPU_000841		\N
GLEAN3_16443	SPU_016443		\N
GLEAN3_17877	SPU_017877		\N
GLEAN3_01000	SPU_001000		\N
GLEAN3_10168	SPU_010168		\N
GLEAN3_18326	SPU_018326		\N
GLEAN3_21599	SPU_021599		\N
GLEAN3_22846	SPU_022846		\N
GLEAN3_26898	SPU_026898		\N
GLEAN3_14722	SPU_014722		\N
GLEAN3_14721	SPU_014721		\N
GLEAN3_15829	SPU_015829		\N
GLEAN3_15828	SPU_015828		\N
GLEAN3_06676	SPU_006676		\N
GLEAN3_26910	SPU_026910		\N
GLEAN3_04245	SPU_004245		\N
GLEAN3_16907	SPU_016907	After reviewing the data and performing a BLAST search, it appears that the data is distributed onto 2 different scaffolds. When examining both scaffolds individually it is evident that they both have and orderly arrangement without any gaps or repeats present. It appears that if the two scaffolds were to be combined, the overall sequence would have an orderly and continuous arrangement. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most several values that we greater than 10.	\N
GLEAN3_01838	SPU_001838		\N
GLEAN3_01846	SPU_001846		\N
GLEAN3_21462	SPU_021462		\N
GLEAN3_01398	SPU_001398		\N
GLEAN3_14934	SPU_014934		\N
GLEAN3_13549	SPU_013549		\N
GLEAN3_21991	SPU_021991		\N
GLEAN3_19777	SPU_019777		\N
GLEAN3_26095	SPU_026095		\N
GLEAN3_20680	SPU_020680		\N
GLEAN3_06573	SPU_006573		\N
GLEAN3_13699	SPU_013699	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When examining the excel data, it appears that there are several internal repeats present within one of the scaffolds (v2.1_scaffold34023) but the sequence has an orderly arrangement without any gaps present. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of the values ranging <5	\N
GLEAN3_09250	SPU_009250		\N
GLEAN3_18848	SPU_018848		\N
GLEAN3_25776	SPU_025776		\N
GLEAN3_10740	SPU_010740		\N
GLEAN3_10745	SPU_010745		\N
GLEAN3_03470	SPU_003470		\N
GLEAN3_20581	SPU_020581		\N
GLEAN3_09263	SPU_009263		\N
GLEAN3_25954	SPU_025954		\N
GLEAN3_03348	SPU_003348		\N
GLEAN3_15090	SPU_015090		\N
GLEAN3_03333	SPU_003333		\N
GLEAN3_04551	SPU_004551		\N
GLEAN3_09155	SPU_009155		\N
GLEAN3_18010	SPU_018010		\N
GLEAN3_07939	SPU_007939		\N
GLEAN3_18768	SPU_018768		\N
GLEAN3_23083	SPU_023083		\N
GLEAN3_23295	SPU_023295		\N
GLEAN3_17201	SPU_017201		\N
GLEAN3_19021	SPU_019021		\N
GLEAN3_05277	SPU_005277		\N
GLEAN3_07456	SPU_007456		\N
GLEAN3_28348	SPU_028348		\N
GLEAN3_13140	SPU_013140		\N
GLEAN3_10595	SPU_010595		\N
GLEAN3_09838	SPU_009838		\N
GLEAN3_23813	SPU_023813		\N
GLEAN3_14766	SPU_014766		\N
GLEAN3_28710	SPU_028710		\N
GLEAN3_20918	SPU_020918		\N
GLEAN3_09867	SPU_009867		\N
GLEAN3_14263	SPU_014263		\N
GLEAN3_22284	SPU_022284		\N
GLEAN3_08218	SPU_008218		\N
GLEAN3_26972	SPU_026972		\N
GLEAN3_15424	SPU_015424		\N
GLEAN3_10329	SPU_010329		\N
GLEAN3_15253	SPU_015253		\N
GLEAN3_08133	SPU_008133		\N
GLEAN3_10403	SPU_010403		\N
GLEAN3_21407	SPU_021407		\N
GLEAN3_12548	SPU_012548		\N
GLEAN3_14000	SPU_014000		\N
GLEAN3_09528	SPU_009528	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. It appears that if both scaffolds were to be combined, the entire sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be  somewhat strong with most of the values being greater than 10.	\N
GLEAN3_09527	SPU_009527		\N
GLEAN3_19553	SPU_019553		\N
GLEAN3_03107	SPU_003107		\N
GLEAN3_12084	SPU_012084		\N
GLEAN3_05276	SPU_005276		\N
GLEAN3_04252	SPU_004252		\N
GLEAN3_05105	SPU_005105		\N
GLEAN3_18692	SPU_018692		\N
GLEAN3_08768	SPU_008768		\N
GLEAN3_08115	SPU_008115		\N
GLEAN3_01603	SPU_001603		\N
GLEAN3_14534	SPU_014534		\N
GLEAN3_24903	SPU_024903		\N
GLEAN3_12469	SPU_012469		\N
GLEAN3_14628	SPU_014628		\N
GLEAN3_18483	SPU_018483		\N
GLEAN3_14576	SPU_014576		\N
GLEAN3_21557	SPU_021557		\N
GLEAN3_08528	SPU_008528		\N
GLEAN3_00584	SPU_000584		\N
GLEAN3_02578	SPU_002578		\N
GLEAN3_13804	SPU_013804		\N
GLEAN3_18806	SPU_018806		\N
GLEAN3_27156	SPU_027156		\N
GLEAN3_01649	SPU_001649		\N
GLEAN3_22435	SPU_022435		\N
GLEAN3_17649	SPU_017649		\N
GLEAN3_24769	SPU_024769		\N
GLEAN3_18248	SPU_018248		\N
Sp-carboxypeptiaseA-like	SPU_030029		\N
GLEAN3_02795	SPU_002795		\N
GLEAN3_15599	SPU_015599		\N
GLEAN3_26137	SPU_026137		\N
GLEAN3_01072	SPU_001072		\N
GLEAN3_26216	SPU_026216		\N
GLEAN3_18705	SPU_018705	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When examining the excel data it is apparent that there is a gap within the second scaffold that spans from 1026-1149. However, there were no internal repeats present in either scaffold. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of the values ranging greater than 10.	\N
GLEAN3_08567	SPU_008567		\N
GLEAN3_16761	SPU_016761		\N
GLEAN3_25629	SPU_025629		\N
GLEAN3_04875	SPU_004875		\N
GLEAN3_00950	SPU_000950		\N
GLEAN3_08876	SPU_008876		\N
GLEAN3_22974	SPU_022974		\N
GLEAN3_19024	SPU_019024		\N
GLEAN3_24469	SPU_024469		\N
GLEAN3_21416	SPU_021416		\N
GLEAN3_11937	SPU_011937		\N
GLEAN3_09517	SPU_009517		\N
GLEAN3_08746	SPU_008746		\N
GLEAN3_09516	SPU_009516		\N
GLEAN3_20636	SPU_020636		\N
GLEAN3_10641	SPU_010641		\N
GLEAN3_21328	SPU_021328		\N
GLEAN3_03324	SPU_003324		\N
GLEAN3_09883	SPU_009883		\N
GLEAN3_02933	SPU_002933		\N
GLEAN3_01933	SPU_001933		\N
GLEAN3_14404	SPU_014404		\N
GLEAN3_25896	SPU_025896		\N
GLEAN3_24139	SPU_024139		\N
GLEAN3_25633	SPU_025633		\N
GLEAN3_24948	SPU_024948		\N
GLEAN3_27486	SPU_027486		\N
GLEAN3_01975	SPU_001975		\N
GLEAN3_24838	SPU_024838		\N
GLEAN3_01640	SPU_001640		\N
GLEAN3_20199	SPU_020199		\N
GLEAN3_00975	SPU_000975		\N
GLEAN3_09771	SPU_009771		\N
GLEAN3_23894	SPU_023894		\N
GLEAN3_27969	SPU_027969		\N
GLEAN3_04148	SPU_004148		\N
GLEAN3_06786	SPU_006786		\N
GLEAN3_05569	SPU_005569		\N
GLEAN3_25632	SPU_025632		\N
GLEAN3_16449	SPU_016449		\N
GLEAN3_15982	SPU_015982		\N
GLEAN3_03166	SPU_003166		\N
GLEAN3_23928	SPU_023928		\N
GLEAN3_15221	SPU_015221		\N
GLEAN3_22779	SPU_022779		\N
GLEAN3_02553	SPU_002553		\N
GLEAN3_17627	SPU_017627		\N
GLEAN3_18744	SPU_018744		\N
GLEAN3_17579	SPU_017579		\N
GLEAN3_28897	SPU_028897		\N
GLEAN3_10797	SPU_010797		\N
GLEAN3_22371	SPU_022371		\N
GLEAN3_10468	SPU_010468		\N
GLEAN3_23688	SPU_023688		\N
GLEAN3_22708	SPU_022708		\N
GLEAN3_16379	SPU_016379		\N
GLEAN3_00566	SPU_000566		\N
GLEAN3_22209	SPU_022209		\N
GLEAN3_27032	SPU_027032		\N
GLEAN3_01362	SPU_001362		\N
GLEAN3_25010	SPU_025010		\N
GLEAN3_01624	SPU_001624		\N
GLEAN3_08338	SPU_008338		\N
GLEAN3_19454	SPU_019454		\N
GLEAN3_24918	SPU_024918		\N
GLEAN3_14164	SPU_014164		\N
GLEAN3_13427	SPU_013427		\N
GLEAN3_19803	SPU_019803	From the BLAST results and the excel data, it was evident that the sequence is distributed onto two different scaffolds. When examining the excel data in comparison with the BLAST results, it was apparent that the sequence had an orderly arrangement without any internal repeats or large gaps present.  There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with the majority of the values being greater than 10.	\N
GLEAN3_14809	SPU_014809		\N
GLEAN3_25590	SPU_025590		\N
GLEAN3_25738	SPU_025738		\N
GLEAN3_02320	SPU_002320		\N
GLEAN3_26555	SPU_026555		\N
GLEAN3_15243	SPU_015243		\N
GLEAN3_21790	SPU_021790		\N
GLEAN3_02165	SPU_002165		\N
GLEAN3_02164	SPU_002164		\N
GLEAN3_09234	SPU_009234		\N
GLEAN3_04069	SPU_004069		\N
GLEAN3_03925	SPU_003925		\N
GLEAN3_14808	SPU_014808		\N
GLEAN3_10511	SPU_010511		\N
GLEAN3_12169	SPU_012169		\N
GLEAN3_17726	SPU_017726		\N
GLEAN3_24128	SPU_024128		\N
GLEAN3_16716	SPU_016716		\N
GLEAN3_09345	SPU_009345		\N
GLEAN3_16700	SPU_016700		\N
GLEAN3_11063	SPU_011063		\N
GLEAN3_03915	SPU_003915		\N
GLEAN3_19695	SPU_019695		\N
GLEAN3_00839	SPU_000839		\N
GLEAN3_05127	SPU_005127		\N
GLEAN3_10635	SPU_010635		\N
GLEAN3_00222	SPU_000222		\N
GLEAN3_25657	SPU_025657		\N
GLEAN3_23014	SPU_023014		\N
GLEAN3_08022	SPU_008022		\N
GLEAN3_09140	SPU_009140		\N
GLEAN3_22274	SPU_022274		\N
GLEAN3_26745	SPU_026745		\N
GLEAN3_12961	SPU_012961		\N
GLEAN3_19345	SPU_019345		\N
GLEAN3_12384	SPU_012384		\N
GLEAN3_19002	SPU_019002		\N
GLEAN3_28297	SPU_028297		\N
GLEAN3_06499	SPU_006499		\N
GLEAN3_11271	SPU_011271		\N
GLEAN3_10098	SPU_010098		\N
GLEAN3_22179	SPU_022179		\N
GLEAN3_22178	SPU_022178		\N
GLEAN3_27631	SPU_027631		\N
GLEAN3_01366	SPU_001366		\N
GLEAN3_27235	SPU_027235		\N
GLEAN3_13254	SPU_013254		\N
GLEAN3_00651	SPU_000651		\N
GLEAN3_20975	SPU_020975		\N
GLEAN3_04628	SPU_004628		\N
GLEAN3_03937	SPU_003937		\N
GLEAN3_04702	SPU_004702		\N
GLEAN3_12431	SPU_012431		\N
GLEAN3_23819	SPU_023819		\N
GLEAN3_26450	SPU_026450		\N
GLEAN3_10533	SPU_010533		\N
GLEAN3_00544	SPU_000544		\N
GLEAN3_24094	SPU_024094		\N
GLEAN3_18823	SPU_018823		\N
GLEAN3_19078	SPU_019078		\N
GLEAN3_12506	SPU_012506		\N
GLEAN3_24994	SPU_024994		\N
GLEAN3_01676	SPU_001676		\N
GLEAN3_19389	SPU_019389		\N
GLEAN3_28218	SPU_028218		\N
GLEAN3_00255	SPU_000255		\N
GLEAN3_24640	SPU_024640		\N
GLEAN3_02840	SPU_002840		\N
GLEAN3_15317	SPU_015317		\N
GLEAN3_28828	SPU_028828		\N
GLEAN3_04110	SPU_004110		\N
GLEAN3_06811	SPU_006811		\N
GLEAN3_20920	SPU_020920		\N
GLEAN3_10554	SPU_010554		\N
GLEAN3_06991	SPU_006991		\N
GLEAN3_19494	SPU_019494		\N
GLEAN3_25345	SPU_025345		\N
GLEAN3_18193	SPU_018193		\N
GLEAN3_20669	SPU_020669		\N
GLEAN3_22456	SPU_022456		\N
GLEAN3_25249	SPU_025249		\N
GLEAN3_14614	SPU_014614		\N
GLEAN3_07248	SPU_007248		\N
GLEAN3_10054	SPU_010054		\N
GLEAN3_05576	SPU_005576		\N
GLEAN3_18833	SPU_018833		\N
GLEAN3_10430	SPU_010430		\N
GLEAN3_25584	SPU_025584		\N
GLEAN3_11693	SPU_011693		\N
GLEAN3_16087	SPU_016087		\N
GLEAN3_04619	SPU_004619		\N
GLEAN3_07014	SPU_007014		\N
GLEAN3_25162	SPU_025162		\N
GLEAN3_11223	SPU_011223		\N
GLEAN3_15741	SPU_015741		\N
GLEAN3_01711	SPU_001711		\N
GLEAN3_23723	SPU_023723		\N
GLEAN3_06876	SPU_006876		\N
GLEAN3_01879	SPU_001879		\N
GLEAN3_05300	SPU_005300		\N
GLEAN3_03828	SPU_003828		\N
GLEAN3_07820	SPU_007820		\N
GLEAN3_24346	SPU_024346		\N
GLEAN3_01725	SPU_001725		\N
GLEAN3_07815	SPU_007815		\N
GLEAN3_24342	SPU_024342		\N
GLEAN3_08789	SPU_008789		\N
GLEAN3_21290	SPU_021290		\N
GLEAN3_00012	SPU_000012		\N
GLEAN3_00022	SPU_000022		\N
GLEAN3_00026	SPU_000026		\N
GLEAN3_00036	SPU_000036		\N
GLEAN3_00048	SPU_000048		\N
GLEAN3_00063	SPU_000063		\N
GLEAN3_00077	SPU_000077		\N
GLEAN3_00092	SPU_000092		\N
GLEAN3_00117	SPU_000117		\N
GLEAN3_00173	SPU_000173		\N
GLEAN3_00174	SPU_000174		\N
GLEAN3_00191	SPU_000191		\N
GLEAN3_00236	SPU_000236		\N
GLEAN3_00275	SPU_000275		\N
GLEAN3_00281	SPU_000281		\N
GLEAN3_00364	SPU_000364		\N
GLEAN3_00458	SPU_000458		\N
GLEAN3_00484	SPU_000484		\N
GLEAN3_00522	SPU_000522		\N
GLEAN3_00561	SPU_000561		\N
GLEAN3_00639	SPU_000639		\N
GLEAN3_00641	SPU_000641		\N
GLEAN3_00661	SPU_000661		\N
GLEAN3_00692	SPU_000692		\N
GLEAN3_00694	SPU_000694		\N
GLEAN3_00709	SPU_000709		\N
GLEAN3_00710	SPU_000710		\N
GLEAN3_00761	SPU_000761		\N
GLEAN3_00798	SPU_000798		\N
GLEAN3_00939	SPU_000939		\N
GLEAN3_00951	SPU_000951		\N
GLEAN3_00970	SPU_000970		\N
GLEAN3_00988	SPU_000988		\N
GLEAN3_01038	SPU_001038		\N
GLEAN3_01079	SPU_001079		\N
GLEAN3_01085	SPU_001085		\N
GLEAN3_01098	SPU_001098		\N
GLEAN3_01123	SPU_001123		\N
GLEAN3_01128	SPU_001128	After reviewing the data performing a BLAST search, it appears that the data is distributed onto 2 different scaffolds. When examining the excel data and comparing it to the BLAST results, it is evident that if these 2 scaffolds were combined the sequence would have an orderly and continuous arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	\N
GLEAN3_01144	SPU_001144		\N
GLEAN3_01174	SPU_001174		\N
GLEAN3_01181	SPU_001181		\N
GLEAN3_01183	SPU_001183		\N
GLEAN3_01191	SPU_001191		\N
GLEAN3_01211	SPU_001211		\N
GLEAN3_01264	SPU_001264		\N
GLEAN3_01287	SPU_001287		\N
GLEAN3_01311	SPU_001311		\N
GLEAN3_01356	SPU_001356		\N
GLEAN3_01387	SPU_001387		\N
GLEAN3_01481	SPU_001481		\N
GLEAN3_01491	SPU_001491		\N
GLEAN3_01559	SPU_001559		\N
GLEAN3_01584	SPU_001584		\N
GLEAN3_01596	SPU_001596		\N
GLEAN3_01668	SPU_001668		\N
GLEAN3_01788	SPU_001788		\N
GLEAN3_01817	SPU_001817		\N
GLEAN3_01831	SPU_001831		\N
GLEAN3_01924	SPU_001924		\N
GLEAN3_01925	SPU_001925		\N
GLEAN3_01986	SPU_001986		\N
GLEAN3_02124	SPU_002124		\N
GLEAN3_02128	SPU_002128		\N
GLEAN3_02182	SPU_002182		\N
GLEAN3_02226	SPU_002226		\N
GLEAN3_02250	SPU_002250	After reviewing the excel data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. There appears to be one sequence overlap within >v2.1_scaffold84012, however, if this overlap is discarded and the two scaffolds were combined the sequence would have an orderly, continuous arrangement. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed with but most of the values are >5. 	\N
GLEAN3_02270	SPU_002270		\N
GLEAN3_02293	SPU_002293		\N
GLEAN3_02307	SPU_002307		\N
GLEAN3_02308	SPU_002308		\N
GLEAN3_02375	SPU_002375		\N
GLEAN3_02376	SPU_002376		\N
GLEAN3_02385	SPU_002385		\N
GLEAN3_02479	SPU_002479		\N
GLEAN3_02481	SPU_002481		\N
GLEAN3_02482	SPU_002482		\N
GLEAN3_02490	SPU_002490		\N
GLEAN3_02516	SPU_002516		\N
GLEAN3_02520	SPU_002520		\N
GLEAN3_02522	SPU_002522		\N
GLEAN3_02558	SPU_002558		\N
GLEAN3_02636	SPU_002636		\N
GLEAN3_02649	SPU_002649		\N
GLEAN3_02661	SPU_002661		\N
GLEAN3_02698	SPU_002698		\N
GLEAN3_02725	SPU_002725		\N
GLEAN3_02743	SPU_002743		\N
GLEAN3_02744	SPU_002744		\N
GLEAN3_02756	SPU_002756		\N
GLEAN3_02777	SPU_002777		\N
GLEAN3_02807	SPU_002807		\N
GLEAN3_02887	SPU_002887		\N
GLEAN3_02927	SPU_002927		\N
GLEAN3_02930	SPU_002930		\N
GLEAN3_02932	SPU_002932		\N
GLEAN3_03033	SPU_003033		\N
GLEAN3_03071	SPU_003071		\N
GLEAN3_03073	SPU_003073		\N
GLEAN3_03085	SPU_003085		\N
GLEAN3_03104	SPU_003104		\N
GLEAN3_03134	SPU_003134		\N
GLEAN3_03135	SPU_003135		\N
GLEAN3_03174	SPU_003174		\N
GLEAN3_03210	SPU_003210		\N
GLEAN3_03229	SPU_003229		\N
GLEAN3_03266	SPU_003266		\N
GLEAN3_03357	SPU_003357		\N
GLEAN3_03432	SPU_003432		\N
GLEAN3_03511	SPU_003511		\N
GLEAN3_03556	SPU_003556		\N
GLEAN3_03642	SPU_003642		\N
GLEAN3_03653	SPU_003653		\N
GLEAN3_03803	SPU_003803		\N
GLEAN3_03823	SPU_003823		\N
GLEAN3_03867	SPU_003867		\N
GLEAN3_03941	SPU_003941		\N
GLEAN3_03972	SPU_003972		\N
GLEAN3_03990	SPU_003990		\N
GLEAN3_04004	SPU_004004		\N
GLEAN3_04025	SPU_004025		\N
GLEAN3_04057	SPU_004057		\N
GLEAN3_04131	SPU_004131		\N
GLEAN3_04137	SPU_004137		\N
GLEAN3_04209	SPU_004209		\N
GLEAN3_04210	SPU_004210		\N
GLEAN3_04235	SPU_004235		\N
GLEAN3_04236	SPU_004236		\N
GLEAN3_04390	SPU_004390		\N
GLEAN3_04404	SPU_004404		\N
GLEAN3_04407	SPU_004407		\N
GLEAN3_04467	SPU_004467		\N
GLEAN3_04473	SPU_004473		\N
GLEAN3_04493	SPU_004493		\N
GLEAN3_04503	SPU_004503		\N
GLEAN3_04504	SPU_004504		\N
GLEAN3_04506	SPU_004506		\N
GLEAN3_04511	SPU_004511		\N
GLEAN3_04592	SPU_004592		\N
GLEAN3_04625	SPU_004625		\N
GLEAN3_04797	SPU_004797		\N
GLEAN3_04798	SPU_004798		\N
GLEAN3_04813	SPU_004813		\N
GLEAN3_04855	SPU_004855		\N
GLEAN3_04856	SPU_004856		\N
GLEAN3_04876	SPU_004876		\N
GLEAN3_04944	SPU_004944		\N
GLEAN3_05054	SPU_005054		\N
GLEAN3_05119	SPU_005119		\N
GLEAN3_05152	SPU_005152		\N
GLEAN3_05201	SPU_005201		\N
GLEAN3_05226	SPU_005226		\N
GLEAN3_05237	SPU_005237		\N
GLEAN3_05244	SPU_005244		\N
GLEAN3_05256	SPU_005256		\N
GLEAN3_05269	SPU_005269		\N
GLEAN3_05290	SPU_005290		\N
GLEAN3_05337	SPU_005337		\N
GLEAN3_05342	SPU_005342		\N
GLEAN3_05507	SPU_005507		\N
GLEAN3_05527	SPU_005527		\N
GLEAN3_05539	SPU_005539		\N
GLEAN3_05540	SPU_005540		\N
GLEAN3_05585	SPU_005585		\N
GLEAN3_05672	SPU_005672	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When comparing the BLAST results with the excel data, it is apparent that both scaffolds have an orderly continuous arrangement with out any repeats or gaps present and if the two scaffolds were combined, the sequence would have good coverage. There was some Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be somewhat weak, with most of the values being less than 5.	\N
GLEAN3_05691	SPU_005691		\N
GLEAN3_05698	SPU_005698		\N
GLEAN3_05806	SPU_005806		\N
GLEAN3_05833	SPU_005833		\N
GLEAN3_05851	SPU_005851		\N
GLEAN3_05895	SPU_005895		\N
GLEAN3_05954	SPU_005954		\N
GLEAN3_05963	SPU_005963		\N
GLEAN3_05978	SPU_005978		\N
GLEAN3_06002	SPU_006002		\N
GLEAN3_06031	SPU_006031		\N
GLEAN3_06048	SPU_006048		\N
GLEAN3_06111	SPU_006111		\N
GLEAN3_06143	SPU_006143		\N
GLEAN3_06153	SPU_006153		\N
GLEAN3_06162	SPU_006162		\N
GLEAN3_06261	SPU_006261		\N
GLEAN3_06308	SPU_006308	From the BLAST results and the excel data it is evident that this sequence is distributed onto 2 different scaffolds. After reviewing the excel data, it was determined that the both scaffolds had an orderly arrangement without any gaps present. However, there is a sequence overlap within the first scaffold that occurs from 512-573 that does appear to be apart of the sequence. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	\N
GLEAN3_06401	SPU_006401		\N
GLEAN3_06452	SPU_006452		\N
GLEAN3_06486	SPU_006486		\N
GLEAN3_06545	SPU_006545		\N
GLEAN3_06576	SPU_006576		\N
GLEAN3_06599	SPU_006599		\N
GLEAN3_06650	SPU_006650		\N
GLEAN3_06664	SPU_006664		\N
GLEAN3_06682	SPU_006682		\N
GLEAN3_06752	SPU_006752		\N
GLEAN3_06759	SPU_006759		\N
GLEAN3_06772	SPU_006772	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it was apparent that there were numerous repeats and gaps present within both scaffolds, resulting in poor overall coverage. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak	\N
GLEAN3_06800	SPU_006800		\N
GLEAN3_06809	SPU_006809		\N
GLEAN3_06903	SPU_006903		\N
GLEAN3_06937	SPU_006937		\N
GLEAN3_06938	SPU_006938		\N
GLEAN3_06944	SPU_006944		\N
GLEAN3_07161	SPU_007161		\N
GLEAN3_07214	SPU_007214		\N
GLEAN3_07260	SPU_007260		\N
GLEAN3_07267	SPU_007267		\N
GLEAN3_07283	SPU_007283		\N
GLEAN3_07294	SPU_007294		\N
GLEAN3_07407	SPU_007407		\N
GLEAN3_07492	SPU_007492		\N
GLEAN3_07502	SPU_007502		\N
GLEAN3_07683	SPU_007683		\N
GLEAN3_07696	SPU_007696		\N
GLEAN3_07775	SPU_007775		\N
GLEAN3_07825	SPU_007825		\N
GLEAN3_07919	SPU_007919		\N
GLEAN3_08073	SPU_008073		\N
GLEAN3_08159	SPU_008159		\N
GLEAN3_08195	SPU_008195		\N
GLEAN3_08250	SPU_008250		\N
GLEAN3_08389	SPU_008389		\N
GLEAN3_08575	SPU_008575		\N
GLEAN3_08776	SPU_008776		\N
GLEAN3_08825	SPU_008825		\N
GLEAN3_08871	SPU_008871		\N
GLEAN3_08984	SPU_008984		\N
GLEAN3_09108	SPU_009108		\N
GLEAN3_09130	SPU_009130		\N
GLEAN3_09182	SPU_009182		\N
GLEAN3_09269	SPU_009269		\N
GLEAN3_09360	SPU_009360		\N
GLEAN3_09407	SPU_009407		\N
GLEAN3_09417	SPU_009417		\N
GLEAN3_09495	SPU_009495		\N
GLEAN3_09588	SPU_009588		\N
GLEAN3_09738	SPU_009738		\N
GLEAN3_09866	SPU_009866		\N
GLEAN3_09961	SPU_009961		\N
GLEAN3_10120	SPU_010120		\N
GLEAN3_10174	SPU_010174		\N
GLEAN3_10293	SPU_010293		\N
GLEAN3_10302	SPU_010302		\N
GLEAN3_10427	SPU_010427		\N
GLEAN3_10520	SPU_010520		\N
GLEAN3_10636	SPU_010636		\N
GLEAN3_10650	SPU_010650		\N
GLEAN3_10651	SPU_010651		\N
GLEAN3_10687	SPU_010687		\N
GLEAN3_10704	SPU_010704		\N
GLEAN3_10705	SPU_010705		\N
GLEAN3_10733	SPU_010733		\N
GLEAN3_10764	SPU_010764		\N
GLEAN3_10775	SPU_010775		\N
GLEAN3_10847	SPU_010847		\N
GLEAN3_10958	SPU_010958		\N
GLEAN3_10969	SPU_010969		\N
GLEAN3_11058	SPU_011058		\N
GLEAN3_11093	SPU_011093		\N
GLEAN3_11119	SPU_011119		\N
GLEAN3_11127	SPU_011127		\N
GLEAN3_11282	SPU_011282		\N
GLEAN3_11293	SPU_011293		\N
GLEAN3_11313	SPU_011313		\N
GLEAN3_11357	SPU_011357		\N
GLEAN3_11416	SPU_011416		\N
GLEAN3_11605	SPU_011605		\N
GLEAN3_11649	SPU_011649		\N
GLEAN3_11650	SPU_011650		\N
GLEAN3_11656	SPU_011656		\N
GLEAN3_11727	SPU_011727		\N
GLEAN3_11778	SPU_011778		\N
GLEAN3_11784	SPU_011784		\N
GLEAN3_11796	SPU_011796		\N
GLEAN3_11828	SPU_011828		\N
GLEAN3_11832	SPU_011832		\N
GLEAN3_11884	SPU_011884		\N
GLEAN3_11933	SPU_011933		\N
GLEAN3_11956	SPU_011956		\N
GLEAN3_12042	SPU_012042		\N
GLEAN3_12049	SPU_012049		\N
GLEAN3_12070	SPU_012070		\N
GLEAN3_12095	SPU_012095		\N
GLEAN3_12097	SPU_012097		\N
GLEAN3_12173	SPU_012173		\N
GLEAN3_12183	SPU_012183		\N
GLEAN3_12199	SPU_012199		\N
GLEAN3_12229	SPU_012229		\N
GLEAN3_12242	SPU_012242		\N
GLEAN3_12250	SPU_012250		\N
GLEAN3_12255	SPU_012255		\N
GLEAN3_12368	SPU_012368		\N
GLEAN3_12373	SPU_012373		\N
GLEAN3_12488	SPU_012488		\N
GLEAN3_12521	SPU_012521		\N
GLEAN3_12545	SPU_012545		\N
GLEAN3_12573	SPU_012573		\N
GLEAN3_12579	SPU_012579		\N
GLEAN3_12589	SPU_012589		\N
GLEAN3_12628	SPU_012628		\N
GLEAN3_12630	SPU_012630		\N
GLEAN3_12636	SPU_012636		\N
GLEAN3_12651	SPU_012651		\N
GLEAN3_12677	SPU_012677		\N
GLEAN3_12690	SPU_012690		\N
GLEAN3_12702	SPU_012702		\N
GLEAN3_12732	SPU_012732		\N
GLEAN3_12735	SPU_012735		\N
GLEAN3_12793	SPU_012793		\N
GLEAN3_12804	SPU_012804	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 3 different scaffolds. However, if the 2 scaffolds were combined there would be several sequence overlaps between v2.1_scaffold68484 and v2.1_scaffold100953 due to similar base pairing between the two. There was also an internal repeat within v2.1_scaffold100953 , but this may have been apart of the sequence since there was only one repeat of the same base (563-609). There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the scores being above 10.	\N
GLEAN3_12839	SPU_012839		\N
GLEAN3_12846	SPU_012846		\N
GLEAN3_12875	SPU_012875		\N
GLEAN3_12921	SPU_012921		\N
GLEAN3_13041	SPU_013041		\N
GLEAN3_13087	SPU_013087		\N
GLEAN3_13096	SPU_013096		\N
GLEAN3_13100	SPU_013100		\N
GLEAN3_13125	SPU_013125		\N
GLEAN3_13135	SPU_013135		\N
GLEAN3_13166	SPU_013166		\N
GLEAN3_13198	SPU_013198		\N
GLEAN3_13225	SPU_013225		\N
GLEAN3_13228	SPU_013228		\N
GLEAN3_13231	SPU_013231		\N
GLEAN3_13245	SPU_013245		\N
GLEAN3_13298	SPU_013298		\N
GLEAN3_13326	SPU_013326		\N
GLEAN3_13380	SPU_013380		\N
GLEAN3_13429	SPU_013429		\N
GLEAN3_13460	SPU_013460		\N
GLEAN3_13501	SPU_013501		\N
GLEAN3_13502	SPU_013502		\N
GLEAN3_13565	SPU_013565		\N
GLEAN3_13644	SPU_013644		\N
GLEAN3_13660	SPU_013660		\N
GLEAN3_13742	SPU_013742		\N
GLEAN3_13754	SPU_013754		\N
GLEAN3_13769	SPU_013769		\N
GLEAN3_13772	SPU_013772		\N
GLEAN3_13819	SPU_013819		\N
GLEAN3_13838	SPU_013838		\N
GLEAN3_13931	SPU_013931		\N
GLEAN3_13944	SPU_013944		\N
GLEAN3_14002	SPU_014002		\N
GLEAN3_14011	SPU_014011		\N
GLEAN3_14046	SPU_014046		\N
GLEAN3_14059	SPU_014059		\N
GLEAN3_14244	SPU_014244		\N
GLEAN3_14287	SPU_014287		\N
GLEAN3_14303	SPU_014303		\N
GLEAN3_14312	SPU_014312		\N
GLEAN3_14339	SPU_014339		\N
GLEAN3_14343	SPU_014343		\N
GLEAN3_14398	SPU_014398		\N
GLEAN3_14399	SPU_014399		\N
GLEAN3_14478	SPU_014478		\N
GLEAN3_14562	SPU_014562		\N
GLEAN3_14564	SPU_014564		\N
GLEAN3_14566	SPU_014566		\N
GLEAN3_14567	SPU_014567		\N
GLEAN3_14568	SPU_014568		\N
GLEAN3_14584	SPU_014584		\N
GLEAN3_14660	SPU_014660		\N
GLEAN3_14665	SPU_014665		\N
GLEAN3_14791	SPU_014791		\N
GLEAN3_14804	SPU_014804		\N
GLEAN3_14938	SPU_014938		\N
GLEAN3_14947	SPU_014947		\N
GLEAN3_14986	SPU_014986		\N
GLEAN3_15142	SPU_015142		\N
GLEAN3_15241	SPU_015241		\N
GLEAN3_15292	SPU_015292		\N
GLEAN3_15423	SPU_015423		\N
GLEAN3_15472	SPU_015472		\N
GLEAN3_15489	SPU_015489		\N
GLEAN3_15514	SPU_015514		\N
GLEAN3_15526	SPU_015526		\N
GLEAN3_15576	SPU_015576		\N
GLEAN3_15691	SPU_015691		\N
GLEAN3_15826	SPU_015826		\N
GLEAN3_15832	SPU_015832		\N
GLEAN3_15836	SPU_015836		\N
GLEAN3_15959	SPU_015959		\N
GLEAN3_15987	SPU_015987		\N
GLEAN3_16090	SPU_016090		\N
GLEAN3_16096	SPU_016096		\N
GLEAN3_16097	SPU_016097		\N
GLEAN3_16110	SPU_016110		\N
GLEAN3_16111	SPU_016111		\N
GLEAN3_16271	SPU_016271		\N
GLEAN3_16298	SPU_016298		\N
GLEAN3_16414	SPU_016414		\N
GLEAN3_16432	SPU_016432		\N
GLEAN3_16441	SPU_016441		\N
GLEAN3_16538	SPU_016538		\N
GLEAN3_16574	SPU_016574		\N
GLEAN3_16587	SPU_016587		\N
GLEAN3_16609	SPU_016609		\N
GLEAN3_16706	SPU_016706		\N
GLEAN3_16739	SPU_016739		\N
GLEAN3_16758	SPU_016758		\N
GLEAN3_16762	SPU_016762		\N
GLEAN3_16767	SPU_016767		\N
GLEAN3_16799	SPU_016799		\N
GLEAN3_16823	SPU_016823		\N
GLEAN3_16900	SPU_016900		\N
GLEAN3_16948	SPU_016948		\N
GLEAN3_16957	SPU_016957		\N
GLEAN3_16993	SPU_016993		\N
GLEAN3_17016	SPU_017016		\N
GLEAN3_17051	SPU_017051		\N
GLEAN3_17159	SPU_017159		\N
GLEAN3_17252	SPU_017252		\N
GLEAN3_17280	SPU_017280		\N
GLEAN3_17338	SPU_017338		\N
GLEAN3_17374	SPU_017374		\N
GLEAN3_17437	SPU_017437		\N
GLEAN3_17601	SPU_017601		\N
GLEAN3_17605	SPU_017605		\N
GLEAN3_17721	SPU_017721		\N
GLEAN3_17751	SPU_017751		\N
GLEAN3_17798	SPU_017798		\N
GLEAN3_17799	SPU_017799		\N
GLEAN3_17976	SPU_017976		\N
GLEAN3_18046	SPU_018046		\N
GLEAN3_18132	SPU_018132		\N
GLEAN3_18329	SPU_018329		\N
GLEAN3_18545	SPU_018545		\N
GLEAN3_18615	SPU_018615		\N
GLEAN3_18624	SPU_018624		\N
GLEAN3_18701	SPU_018701		\N
GLEAN3_18702	SPU_018702		\N
GLEAN3_18765	SPU_018765		\N
GLEAN3_18769	SPU_018769		\N
GLEAN3_18791	SPU_018791		\N
GLEAN3_18922	SPU_018922		\N
GLEAN3_18924	SPU_018924		\N
GLEAN3_18962	SPU_018962		\N
GLEAN3_19040	SPU_019040		\N
GLEAN3_19046	SPU_019046		\N
GLEAN3_19166	SPU_019166		\N
GLEAN3_19206	SPU_019206	The BLAST results showed that only 213 bases were mapped out of a total of 375 indicating that the sequence did not map very well. Sea urchin GBrowse (assembly 2.1) was used to examine the scaffold, which revealed that the sequence had gaps and wasn't covered very well. This is an un-annotated gene so no additional comments were available on the Baylor page. Some EST information was available on the GBrowse V0.5 assembly as well. The transcriptome scores appeared to be a little weak with intensities averaging a little below 10. 	\N
GLEAN3_19216	SPU_019216		\N
GLEAN3_19227	SPU_019227		\N
GLEAN3_19257	SPU_019257		\N
GLEAN3_19258	SPU_019258		\N
GLEAN3_19312	SPU_019312		\N
GLEAN3_19318	SPU_019318		\N
GLEAN3_19356	SPU_019356		\N
GLEAN3_19470	SPU_019470		\N
GLEAN3_19592	SPU_019592		\N
GLEAN3_19593	SPU_019593		\N
GLEAN3_19624	SPU_019624		\N
GLEAN3_19645	SPU_019645		\N
GLEAN3_19685	SPU_019685		\N
GLEAN3_19713	SPU_019713		\N
GLEAN3_19778	SPU_019778		\N
GLEAN3_19809	SPU_019809		\N
GLEAN3_19836	SPU_019836		\N
GLEAN3_19876	SPU_019876		\N
GLEAN3_19896	SPU_019896		\N
GLEAN3_19911	SPU_019911		\N
GLEAN3_19957	SPU_019957		\N
GLEAN3_22006	SPU_022006		\N
GLEAN3_22066	SPU_022066		\N
GLEAN3_22215	SPU_022215		\N
GLEAN3_22239	SPU_022239		\N
GLEAN3_22240	SPU_022240		\N
GLEAN3_22267	SPU_022267		\N
GLEAN3_22370	SPU_022370		\N
GLEAN3_22414	SPU_022414		\N
GLEAN3_22483	SPU_022483		\N
GLEAN3_22596	SPU_022596		\N
GLEAN3_22627	SPU_022627		\N
GLEAN3_22646	SPU_022646		\N
GLEAN3_22652	SPU_022652		\N
GLEAN3_22761	SPU_022761		\N
GLEAN3_22765	SPU_022765		\N
GLEAN3_22783	SPU_022783		\N
GLEAN3_22842	SPU_022842		\N
GLEAN3_22877	SPU_022877		\N
GLEAN3_22888	SPU_022888		\N
GLEAN3_22924	SPU_022924		\N
GLEAN3_23025	SPU_023025		\N
GLEAN3_23099	SPU_023099		\N
GLEAN3_23123	SPU_023123		\N
GLEAN3_23124	SPU_023124		\N
GLEAN3_23134	SPU_023134		\N
GLEAN3_23224	SPU_023224		\N
GLEAN3_23353	SPU_023353		\N
GLEAN3_23420	SPU_023420		\N
GLEAN3_23434	SPU_023434		\N
GLEAN3_23604	SPU_023604		\N
GLEAN3_23609	SPU_023609		\N
GLEAN3_23631	SPU_023631		\N
GLEAN3_23648	SPU_023648		\N
GLEAN3_23662	SPU_023662		\N
GLEAN3_23669	SPU_023669		\N
GLEAN3_23718	SPU_023718		\N
GLEAN3_23762	SPU_023762		\N
GLEAN3_23963	SPU_023963		\N
GLEAN3_20001	SPU_020001		\N
GLEAN3_20006	SPU_020006		\N
GLEAN3_20084	SPU_020084		\N
GLEAN3_20212	SPU_020212		\N
GLEAN3_20549	SPU_020549		\N
GLEAN3_20550	SPU_020550		\N
GLEAN3_20578	SPU_020578		\N
GLEAN3_20592	SPU_020592		\N
GLEAN3_20609	SPU_020609		\N
GLEAN3_20709	SPU_020709		\N
GLEAN3_20788	SPU_020788		\N
GLEAN3_20928	SPU_020928		\N
GLEAN3_20987	SPU_020987		\N
GLEAN3_21000	SPU_021000		\N
GLEAN3_21046	SPU_021046		\N
GLEAN3_21085	SPU_021085		\N
GLEAN3_21171	SPU_021171		\N
GLEAN3_21359	SPU_021359		\N
GLEAN3_21470	SPU_021470		\N
GLEAN3_21473	SPU_021473		\N
GLEAN3_21524	SPU_021524		\N
GLEAN3_21564	SPU_021564		\N
GLEAN3_21682	SPU_021682		\N
GLEAN3_21693	SPU_021693		\N
GLEAN3_21719	SPU_021719		\N
GLEAN3_21752	SPU_021752		\N
GLEAN3_21753	SPU_021753		\N
GLEAN3_21759	SPU_021759		\N
GLEAN3_21773	SPU_021773		\N
GLEAN3_21793	SPU_021793		\N
GLEAN3_21871	SPU_021871		\N
GLEAN3_21888	SPU_021888		\N
GLEAN3_21966	SPU_021966		\N
GLEAN3_24071	SPU_024071		\N
GLEAN3_24102	SPU_024102		\N
GLEAN3_24115	SPU_024115		\N
GLEAN3_24173	SPU_024173		\N
GLEAN3_24328	SPU_024328		\N
GLEAN3_24379	SPU_024379		\N
GLEAN3_24405	SPU_024405		\N
GLEAN3_24446	SPU_024446		\N
GLEAN3_24495	SPU_024495		\N
GLEAN3_24496	SPU_024496		\N
GLEAN3_24657	SPU_024657		\N
GLEAN3_24929	SPU_024929		\N
GLEAN3_24933	SPU_024933		\N
GLEAN3_24967	SPU_024967		\N
GLEAN3_25015	SPU_025015		\N
GLEAN3_25063	SPU_025063		\N
GLEAN3_25182	SPU_025182		\N
GLEAN3_25301	SPU_025301		\N
GLEAN3_25367	SPU_025367		\N
GLEAN3_25397	SPU_025397		\N
GLEAN3_25527	SPU_025527		\N
GLEAN3_25535	SPU_025535		\N
GLEAN3_25636	SPU_025636		\N
GLEAN3_25686	SPU_025686		\N
GLEAN3_25705	SPU_025705		\N
GLEAN3_25730	SPU_025730		\N
GLEAN3_25812	SPU_025812		\N
GLEAN3_25815	SPU_025815		\N
GLEAN3_25891	SPU_025891		\N
GLEAN3_25952	SPU_025952		\N
GLEAN3_25953	SPU_025953		\N
GLEAN3_26003	SPU_026003		\N
GLEAN3_26050	SPU_026050		\N
GLEAN3_26156	SPU_026156		\N
GLEAN3_26206	SPU_026206		\N
GLEAN3_26226	SPU_026226	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. It appears that both scaffolds contain an orderly arrangement and if the two scaffolds were combined the sequence would have good overall coverage (excluding a small overlap from about 571-686.) There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	\N
GLEAN3_26278	SPU_026278		\N
GLEAN3_26295	SPU_026295		\N
GLEAN3_26386	SPU_026386		\N
GLEAN3_26436	SPU_026436		\N
GLEAN3_26439	SPU_026439		\N
GLEAN3_26493	SPU_026493		\N
GLEAN3_26618	SPU_026618		\N
GLEAN3_26624	SPU_026624		\N
GLEAN3_26834	SPU_026834		\N
GLEAN3_26835	SPU_026835		\N
GLEAN3_26943	SPU_026943		\N
GLEAN3_27059	SPU_027059		\N
GLEAN3_27216	SPU_027216		\N
GLEAN3_27236	SPU_027236		\N
GLEAN3_27238	SPU_027238		\N
GLEAN3_27240	SPU_027240		\N
GLEAN3_27252	SPU_027252		\N
GLEAN3_27281	SPU_027281		\N
GLEAN3_27448	SPU_027448		\N
GLEAN3_27457	SPU_027457		\N
GLEAN3_27545	SPU_027545		\N
GLEAN3_27599	SPU_027599		\N
GLEAN3_27635	SPU_027635		\N
GLEAN3_27682	SPU_027682		\N
GLEAN3_27710	SPU_027710		\N
GLEAN3_27946	SPU_027946		\N
GLEAN3_27986	SPU_027986		\N
GLEAN3_28004	SPU_028004		\N
GLEAN3_28168	SPU_028168		\N
GLEAN3_28431	SPU_028431		\N
GLEAN3_28491	SPU_028491		\N
GLEAN3_28548	SPU_028548		\N
GLEAN3_28633	SPU_028633		\N
GLEAN3_28857	SPU_028857		\N
GLEAN3_28873	SPU_028873		\N
GLEAN3_28917	SPU_028917		\N
GLEAN3_05541	SPU_005541		\N
GLEAN3_15059	SPU_015059		\N
GLEAN3_16486	SPU_016486		\N
GLEAN3_19133	SPU_019133		\N
GLEAN3_19563	SPU_019563		\N
GLEAN3_19889	SPU_019889		\N
GLEAN3_26712	SPU_026712		\N
GLEAN3_27054	SPU_027054		\N
GLEAN3_27520	SPU_027520		\N
GLEAN3_02808	SPU_002808		\N
GLEAN3_03010	SPU_003010		\N
GLEAN3_06374	SPU_006374		\N
GLEAN3_06524	SPU_006524		\N
GLEAN3_06843	SPU_006843		\N
GLEAN3_06961	SPU_006961		\N
GLEAN3_12646	SPU_012646		\N
GLEAN3_22050	SPU_022050		\N
GLEAN3_22898	SPU_022898		\N
GLEAN3_22899	SPU_022899		\N
GLEAN3_22901	SPU_022901		\N
GLEAN3_24866	SPU_024866		\N
GLEAN3_10067	SPU_010067		\N
GLEAN3_10371	SPU_010371		\N
GLEAN3_12663	SPU_012663		\N
GLEAN3_12776	SPU_012776		\N
GLEAN3_14278	SPU_014278		\N
GLEAN3_14897	SPU_014897		\N
GLEAN3_15434	SPU_015434		\N
GLEAN3_15678	SPU_015678		\N
GLEAN3_15795	SPU_015795		\N
GLEAN3_15928	SPU_015928		\N
GLEAN3_06394	SPU_006394		\N
GLEAN3_07121	SPU_007121		\N
GLEAN3_07273	SPU_007273		\N
GLEAN3_07392	SPU_007392		\N
GLEAN3_07601	SPU_007601		\N
GLEAN3_07907	SPU_007907		\N
GLEAN3_08249	SPU_008249		\N
GLEAN3_08593	SPU_008593		\N
GLEAN3_08694	SPU_008694		\N
GLEAN3_08803	SPU_008803		\N
GLEAN3_08804	SPU_008804		\N
GLEAN3_08971	SPU_008971		\N
GLEAN3_09067	SPU_009067		\N
GLEAN3_09142	SPU_009142		\N
GLEAN3_09214	SPU_009214		\N
GLEAN3_09217	SPU_009217		\N
GLEAN3_09628	SPU_009628		\N
GLEAN3_09872	SPU_009872		\N
GLEAN3_14203	SPU_014203		\N
GLEAN3_14634	SPU_014634		\N
GLEAN3_15084	SPU_015084		\N
GLEAN3_16693	SPU_016693		\N
GLEAN3_17752	SPU_017752		\N
GLEAN3_17768	SPU_017768		\N
GLEAN3_17892	SPU_017892		\N
GLEAN3_19212	SPU_019212		\N
GLEAN3_19514	SPU_019514		\N
GLEAN3_19603	SPU_019603		\N
GLEAN3_19637	SPU_019637		\N
GLEAN3_20699	SPU_020699		\N
GLEAN3_21226	SPU_021226		\N
GLEAN3_21623	SPU_021623		\N
GLEAN3_22845	SPU_022845		\N
GLEAN3_22914	SPU_022914		\N
GLEAN3_23133	SPU_023133		\N
GLEAN3_23137	SPU_023137		\N
GLEAN3_23325	SPU_023325	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it is apparent that both scaffolds contain an orderly arrangement without any internal repeats or gaps present. If the two scaffolds were to be combined, the overall sequence would have good coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_23362	SPU_023362		\N
GLEAN3_23531	SPU_023531		\N
GLEAN3_23810	SPU_023810		\N
GLEAN3_23812	SPU_023812		\N
GLEAN3_24232	SPU_024232		\N
GLEAN3_24478	SPU_024478		\N
GLEAN3_24604	SPU_024604		\N
GLEAN3_24699	SPU_024699		\N
GLEAN3_25045	SPU_025045		\N
GLEAN3_25055	SPU_025055		\N
GLEAN3_25300	SPU_025300		\N
GLEAN3_25947	SPU_025947		\N
GLEAN3_26221	SPU_026221		\N
GLEAN3_26440	SPU_026440		\N
GLEAN3_26441	SPU_026441		\N
GLEAN3_26694	SPU_026694		\N
GLEAN3_26928	SPU_026928		\N
GLEAN3_27224	SPU_027224		\N
GLEAN3_27229	SPU_027229	After reviewing the data it was apparent that there was no sufficient GLEAN model that matched. There was only one set of data (subject hit) that was available from the Excel spreadsheet. The sequence had an orderly arrangement but had very poor coverage.	\N
GLEAN3_27502	SPU_027502		\N
GLEAN3_27563	SPU_027563		\N
GLEAN3_27649	SPU_027649		\N
GLEAN3_27673	SPU_027673		\N
GLEAN3_27748	SPU_027748		\N
GLEAN3_27799	SPU_027799		\N
GLEAN3_27891	SPU_027891		\N
GLEAN3_27965	SPU_027965		\N
GLEAN3_28526	SPU_028526		\N
GLEAN3_28569	SPU_028569		\N
GLEAN3_28697	SPU_028697		\N
GLEAN3_28780	SPU_028780		\N
GLEAN3_02990	SPU_002990		\N
GLEAN3_25931	SPU_025931		\N
GLEAN3_00118	SPU_000118		\N
GLEAN3_01704	SPU_001704		\N
GLEAN3_09358	SPU_009358		\N
GLEAN3_09713	SPU_009713		\N
GLEAN3_10959	SPU_010959		\N
GLEAN3_11560	SPU_011560		\N
GLEAN3_11733	SPU_011733		\N
GLEAN3_14563	SPU_014563		\N
GLEAN3_16201	SPU_016201		\N
GLEAN3_17449	SPU_017449		\N
GLEAN3_17652	SPU_017652		\N
GLEAN3_17722	SPU_017722		\N
GLEAN3_22477	SPU_022477		\N
GLEAN3_22478	SPU_022478		\N
GLEAN3_27028	SPU_027028		\N
GLEAN3_11962	SPU_011962	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. If the 3 scaffolds were combined, the overall sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be somewhat strong with most of the values being <5.	\N
GLEAN3_12222	SPU_012222		\N
GLEAN3_12358	SPU_012358		\N
GLEAN3_12745	SPU_012745		\N
GLEAN3_12746	SPU_012746		\N
GLEAN3_13052	SPU_013052		\N
GLEAN3_13168	SPU_013168		\N
GLEAN3_13441	SPU_013441		\N
GLEAN3_13530	SPU_013530		\N
GLEAN3_13532	SPU_013532		\N
GLEAN3_13533	SPU_013533		\N
GLEAN3_13613	SPU_013613		\N
GLEAN3_14195	SPU_014195		\N
GLEAN3_14517	SPU_014517		\N
GLEAN3_15945	SPU_015945		\N
GLEAN3_16581	SPU_016581		\N
GLEAN3_16595	SPU_016595		\N
GLEAN3_17175	SPU_017175		\N
GLEAN3_17336	SPU_017336		\N
GLEAN3_17896	SPU_017896		\N
GLEAN3_17964	SPU_017964		\N
GLEAN3_18444	SPU_018444		\N
GLEAN3_18563	SPU_018563		\N
GLEAN3_19132	SPU_019132		\N
GLEAN3_19192	SPU_019192		\N
GLEAN3_19388	SPU_019388		\N
GLEAN3_19536	SPU_019536		\N
GLEAN3_19880	SPU_019880		\N
GLEAN3_20332	SPU_020332		\N
GLEAN3_20391	SPU_020391		\N
GLEAN3_20717	SPU_020717		\N
GLEAN3_21105	SPU_021105		\N
GLEAN3_21132	SPU_021132		\N
GLEAN3_21350	SPU_021350		\N
GLEAN3_21360	SPU_021360		\N
GLEAN3_21377	SPU_021377	After reviewing the data and performing the BLAST search it appears that there is no good GLEAN model fit for GLEAN3_21377. There are numerous repeats and poor sequence coverage. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values above 5 and several that were above 10.	\N
GLEAN3_21388	SPU_021388		\N
GLEAN3_21500	SPU_021500		\N
GLEAN3_21967	SPU_021967		\N
GLEAN3_22047	SPU_022047		\N
GLEAN3_22623	SPU_022623		\N
GLEAN3_22976	SPU_022976		\N
GLEAN3_23013	SPU_023013		\N
GLEAN3_23238	SPU_023238		\N
GLEAN3_23290	SPU_023290		\N
GLEAN3_23361	SPU_023361		\N
GLEAN3_23363	SPU_023363		\N
GLEAN3_23380	SPU_023380		\N
GLEAN3_23403	SPU_023403		\N
GLEAN3_23538	SPU_023538		\N
GLEAN3_23547	SPU_023547		\N
GLEAN3_23589	SPU_023589	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is apparent that there are several gaps within each scaffold. However, the entire sequence is covered between the two scaffolds. There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most the values being greater than 10. 	\N
GLEAN3_24112	SPU_024112		\N
GLEAN3_24473	SPU_024473		\N
GLEAN3_24547	SPU_024547		\N
GLEAN3_24837	SPU_024837		\N
GLEAN3_24884	SPU_024884		\N
GLEAN3_25145	SPU_025145		\N
GLEAN3_25228	SPU_025228		\N
GLEAN3_26452	SPU_026452		\N
GLEAN3_27561	SPU_027561		\N
GLEAN3_27816	SPU_027816		\N
GLEAN3_27923	SPU_027923		\N
GLEAN3_28239	SPU_028239		\N
GLEAN3_28389	SPU_028389		\N
GLEAN3_00471	SPU_000471		\N
GLEAN3_00676	SPU_000676		\N
GLEAN3_00776	SPU_000776		\N
GLEAN3_00931	SPU_000931		\N
GLEAN3_01279	SPU_001279		\N
GLEAN3_02356	SPU_002356		\N
GLEAN3_02969	SPU_002969		\N
GLEAN3_03380	SPU_003380		\N
GLEAN3_03827	SPU_003827		\N
GLEAN3_04243	SPU_004243		\N
GLEAN3_05296	SPU_005296		\N
GLEAN3_05865	SPU_005865		\N
GLEAN3_08075	SPU_008075		\N
GLEAN3_08083	SPU_008083		\N
GLEAN3_10335	SPU_010335		\N
GLEAN3_10710	SPU_010710		\N
GLEAN3_10761	SPU_010761		\N
GLEAN3_10990	SPU_010990		\N
GLEAN3_11562	SPU_011562		\N
GLEAN3_12472	SPU_012472		\N
GLEAN3_12743	SPU_012743		\N
GLEAN3_12784	SPU_012784		\N
GLEAN3_12820	SPU_012820		\N
GLEAN3_12871	SPU_012871		\N
GLEAN3_13770	SPU_013770		\N
GLEAN3_15274	SPU_015274		\N
GLEAN3_17604	SPU_017604		\N
GLEAN3_17639	SPU_017639		\N
GLEAN3_19141	SPU_019141		\N
GLEAN3_22997	SPU_022997		\N
GLEAN3_23265	SPU_023265		\N
GLEAN3_20564	SPU_020564	After reviewing the data and performing a BLAST search, it was determined that this is the best match for GLEAN3_20564. Before a BLAST search was done, it was initially thought that the best match was subject gb|DS006697| do to the orderly arrangement of the sequence and the coverage. However, the BLAST search indicated that the best match was subject AAGJ02027540 due to a high bit score and a low E value even though the coverage only spanned a total of 291/960. 	\N
GLEAN3_21056	SPU_021056		\N
GLEAN3_21315	SPU_021315	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. There is a large gap that occurs between the two scaffolds that spans from1032-1188. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 10 (excluding one outlier value at 72.)	\N
GLEAN3_24431	SPU_024431		\N
GLEAN3_26261	SPU_026261		\N
GLEAN3_26580	SPU_026580		\N
GLEAN3_26801	SPU_026801		\N
GLEAN3_27069	SPU_027069		\N
GLEAN3_27483	SPU_027483		\N
GLEAN3_18443	SPU_018443		\N
GLEAN3_18542	SPU_018542		\N
GLEAN3_28035	SPU_028035		\N
GLEAN3_12792	SPU_012792		\N
GLEAN3_17147	SPU_017147		\N
GLEAN3_08543	SPU_008543		\N
GLEAN3_09430	SPU_009430		\N
GLEAN3_06810	SPU_006810		\N
GLEAN3_12157	SPU_012157		\N
GLEAN3_20954	SPU_020954	After reviewing the data from the excel file and performing a BLAST search it appears that there is no sufficient GLEAN model that fits. The sequence is distributed onto 3 different scaffolds. There are also numerous gaps within the sequence and poor sequence coverage. There are also several repeats apparent. There is no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong with values ranging from about 5-54. This is an un-annotated gene so no additional information was available from Baylor annotations (gene comments).	\N
GLEAN3_21873	SPU_021873		\N
GLEAN3_21970	SPU_021970		\N
GLEAN3_00167	SPU_000167		\N
GLEAN3_01130	SPU_001130		\N
GLEAN3_02379	SPU_002379		\N
GLEAN3_05926	SPU_005926		\N
GLEAN3_07210	SPU_007210		\N
GLEAN3_09139	SPU_009139		\N
GLEAN3_15974	SPU_015974		\N
GLEAN3_19094	SPU_019094		\N
GLEAN3_19701	SPU_019701		\N
GLEAN3_24200	SPU_024200		\N
GLEAN3_25575	SPU_025575		\N
GLEAN3_26135	SPU_026135		\N
GLEAN3_26583	SPU_026583		\N
GLEAN3_26458	SPU_026458		\N
GLEAN3_12799	SPU_012799		\N
GLEAN3_23223	SPU_023223		\N
GLEAN3_22468	SPU_022468		\N
GLEAN3_22376	SPU_022376		\N
GLEAN3_24924	SPU_024924		\N
GLEAN3_15259	SPU_015259		\N
GLEAN3_06927	SPU_006927		\N
GLEAN3_24231	SPU_024231		\N
GLEAN3_22317	SPU_022317		\N
GLEAN3_16590	SPU_016590		\N
GLEAN3_27227	SPU_027227		\N
GLEAN3_21291	SPU_021291		\N
GLEAN3_01536	SPU_001536		\N
GLEAN3_01850	SPU_001850		\N
GLEAN3_26012	SPU_026012		\N
GLEAN3_01531	SPU_001531		\N
GLEAN3_12318	SPU_012318		\N
GLEAN3_28207	SPU_028207		\N
GLEAN3_15676	SPU_015676		\N
GLEAN3_23661	SPU_023661		\N
GLEAN3_23663	SPU_023663		\N
GLEAN3_19138	SPU_019138		\N
GLEAN3_24343	SPU_024343		\N
GLEAN3_16846	SPU_016846		\N
GLEAN3_17154	SPU_017154		\N
GLEAN3_16845	SPU_016845		\N
GLEAN3_03782	SPU_003782		\N
GLEAN3_06828	SPU_006828		\N
GLEAN3_02809	SPU_002809		\N
GLEAN3_02818	SPU_002818		\N
GLEAN3_00972	SPU_000972		\N
GLEAN3_00973	SPU_000973		\N
GLEAN3_12626	SPU_012626		\N
GLEAN3_15391	SPU_015391		\N
GLEAN3_13837	SPU_013837		\N
GLEAN3_12575	SPU_012575		\N
GLEAN3_07533	SPU_007533		\N
GLEAN3_04602	SPU_004602		\N
GLEAN3_21504	SPU_021504		\N
GLEAN3_17407	SPU_017407		\N
GLEAN3_28524	SPU_028524		\N
GLEAN3_06230	SPU_006230		\N
SpBDNFpseudo	SPU_030083		\N
GLEAN3_06427	SPU_006427		\N
GLEAN3_12098	SPU_012098		\N
GLEAN3_06484	SPU_006484		\N
GLEAN3_06497	SPU_006497		\N
GLEAN3_06637	SPU_006637		\N
GLEAN3_07108	SPU_007108		\N
GLEAN3_07440	SPU_007440		\N
GLEAN3_07636	SPU_007636		\N
GLEAN3_07957	SPU_007957		\N
GLEAN3_09053	SPU_009053		\N
GLEAN3_09427	SPU_009427		\N
GLEAN3_09646	SPU_009646		\N
GLEAN3_09847	SPU_009847		\N
GLEAN3_10383	SPU_010383		\N
GLEAN3_10426	SPU_010426		\N
GLEAN3_10542	SPU_010542		\N
GLEAN3_10649	SPU_010649		\N
GLEAN3_10873	SPU_010873		\N
GLEAN3_11189	SPU_011189		\N
GLEAN3_11226	SPU_011226		\N
GLEAN3_11260	SPU_011260		\N
GLEAN3_11502	SPU_011502		\N
GLEAN3_11611	SPU_011611		\N
GLEAN3_11687	SPU_011687		\N
GLEAN3_11720	SPU_011720		\N
GLEAN3_11761	SPU_011761		\N
GLEAN3_11879	SPU_011879		\N
GLEAN3_12050	SPU_012050		\N
GLEAN3_12111	SPU_012111		\N
GLEAN3_12198	SPU_012198		\N
GLEAN3_12307	SPU_012307		\N
GLEAN3_12315	SPU_012315		\N
GLEAN3_12448	SPU_012448		\N
GLEAN3_12583	SPU_012583		\N
GLEAN3_12724	SPU_012724		\N
GLEAN3_12948	SPU_012948		\N
GLEAN3_12949	SPU_012949		\N
GLEAN3_13123	SPU_013123	From the BLAST results as well as the excel data it is clear that this is the best fit for this particular GLEAN model. When examining the excel data, in conjunction with the BLAST results, it is apparent that there is a large gap present that spans from 1065-1126. The entire scaffold ends at 3183 but is truncated at 1126 resulting in a significant loss of sequence information. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the values being about 5 (excluding 2 outliers at 18 and 30). 	\N
GLEAN3_13307	SPU_013307		\N
GLEAN3_13415	SPU_013415		\N
GLEAN3_13576	SPU_013576		\N
GLEAN3_13611	SPU_013611		\N
GLEAN3_13861	SPU_013861		\N
GLEAN3_13902	SPU_013902		\N
GLEAN3_13904	SPU_013904		\N
GLEAN3_14045	SPU_014045		\N
GLEAN3_14055	SPU_014055		\N
GLEAN3_14256	SPU_014256		\N
GLEAN3_14817	SPU_014817		\N
GLEAN3_14923	SPU_014923		\N
GLEAN3_15362	SPU_015362		\N
GLEAN3_15503	SPU_015503		\N
GLEAN3_15682	SPU_015682		\N
GLEAN3_15683	SPU_015683		\N
GLEAN3_15751	SPU_015751		\N
GLEAN3_15767	SPU_015767		\N
GLEAN3_16114	SPU_016114		\N
GLEAN3_16149	SPU_016149		\N
GLEAN3_17441	SPU_017441		\N
GLEAN3_17470	SPU_017470		\N
GLEAN3_17947	SPU_017947		\N
GLEAN3_18075	SPU_018075		\N
GLEAN3_18393	SPU_018393		\N
GLEAN3_18673	SPU_018673		\N
GLEAN3_18680	SPU_018680		\N
GLEAN3_18797	SPU_018797		\N
GLEAN3_18888	SPU_018888		\N
GLEAN3_18909	SPU_018909		\N
GLEAN3_19067	SPU_019067		\N
GLEAN3_19089	SPU_019089		\N
GLEAN3_19365	SPU_019365		\N
GLEAN3_19407	SPU_019407		\N
GLEAN3_19666	SPU_019666		\N
GLEAN3_19915	SPU_019915		\N
GLEAN3_19959	SPU_019959		\N
GLEAN3_20478	SPU_020478		\N
GLEAN3_20480	SPU_020480		\N
GLEAN3_20563	SPU_020563		\N
GLEAN3_20568	SPU_020568		\N
GLEAN3_20570	SPU_020570		\N
GLEAN3_20726	SPU_020726		\N
GLEAN3_21562	SPU_021562		\N
GLEAN3_21757	SPU_021757		\N
GLEAN3_21851	SPU_021851		\N
GLEAN3_21947	SPU_021947		\N
GLEAN3_22242	SPU_022242		\N
GLEAN3_22487	SPU_022487		\N
GLEAN3_22697	SPU_022697		\N
GLEAN3_22800	SPU_022800		\N
GLEAN3_22841	SPU_022841		\N
GLEAN3_23522	SPU_023522		\N
GLEAN3_24189	SPU_024189		\N
GLEAN3_24315	SPU_024315		\N
GLEAN3_25125	SPU_025125		\N
GLEAN3_25825	SPU_025825		\N
GLEAN3_25959	SPU_025959		\N
GLEAN3_26292	SPU_026292		\N
GLEAN3_26385	SPU_026385		\N
GLEAN3_26390	SPU_026390		\N
GLEAN3_26510	SPU_026510		\N
GLEAN3_27472	SPU_027472		\N
GLEAN3_27491	SPU_027491		\N
GLEAN3_27918	SPU_027918		\N
GLEAN3_28045	SPU_028045		\N
GLEAN3_28301	SPU_028301		\N
GLEAN3_28583	SPU_028583		\N
GLEAN3_28679	SPU_028679		\N
GLEAN3_00483	SPU_000483		\N
GLEAN3_01648	SPU_001648		\N
GLEAN3_01937	SPU_001937		\N
GLEAN3_02090	SPU_002090		\N
GLEAN3_02140	SPU_002140		\N
GLEAN3_02309	SPU_002309		\N
GLEAN3_02554	SPU_002554		\N
GLEAN3_02707	SPU_002707		\N
GLEAN3_02988	SPU_002988		\N
GLEAN3_03239	SPU_003239		\N
GLEAN3_03378	SPU_003378		\N
GLEAN3_03641	SPU_003641		\N
GLEAN3_03685	SPU_003685		\N
GLEAN3_05035	SPU_005035		\N
GLEAN3_05071	SPU_005071		\N
GLEAN3_05122	SPU_005122		\N
GLEAN3_05559	SPU_005559		\N
GLEAN3_06063	SPU_006063		\N
GLEAN3_06064	SPU_006064		\N
GLEAN3_06420	SPU_006420		\N
GLEAN3_06421	SPU_006421		\N
GLEAN3_10197	SPU_010197		\N
GLEAN3_12549	SPU_012549		\N
GLEAN3_13669	SPU_013669		\N
GLEAN3_10699	SPU_010699		\N
GLEAN3_27328	SPU_027328		\N
GLEAN3_05385	SPU_005385		\N
GLEAN3_19462	SPU_019462		\N
GLEAN3_28749	SPU_028749		\N
GLEAN3_10854	SPU_010854		\N
GLEAN3_04717	SPU_004717		\N
GLEAN3_12482	SPU_012482		\N
GLEAN3_09759	SPU_009759		\N
GLEAN3_24461	SPU_024461		\N
GLEAN3_03868	SPU_003868		\N
GLEAN3_16404	SPU_016404		\N
GLEAN3_09415	SPU_009415		\N
GLEAN3_08897	SPU_008897		\N
GLEAN3_04412	SPU_004412		\N
GLEAN3_08895	SPU_008895		\N
GLEAN3_04224	SPU_004224		\N
GLEAN3_22851	SPU_022851		\N
GLEAN3_22570	SPU_022570		\N
GLEAN3_26159	SPU_026159		\N
GLEAN3_19580	SPU_019580		\N
GLEAN3_00627	SPU_000627		\N
GLEAN3_23957	SPU_023957		\N
GLEAN3_25159	SPU_025159		\N
GLEAN3_25893	SPU_025893		\N
GLEAN3_10799	SPU_010799		\N
GLEAN3_04789	SPU_004789		\N
GLEAN3_06069	SPU_006069		\N
GLEAN3_26319	SPU_026319		\N
GLEAN3_27634	SPU_027634		\N
GLEAN3_13023	SPU_013023		\N
GLEAN3_10318	SPU_010318		\N
GLEAN3_07013	SPU_007013		\N
GLEAN3_02655	SPU_002655		\N
GLEAN3_25504	SPU_025504		\N
GLEAN3_23344	SPU_023344		\N
GLEAN3_01712	SPU_001712		\N
GLEAN3_19657	SPU_019657		\N
GLEAN3_08779	SPU_008779		\N
GLEAN3_20465	SPU_020465		\N
GLEAN3_24060	SPU_024060		\N
GLEAN3_02681	SPU_002681		\N
GLEAN3_22734	SPU_022734		\N
GLEAN3_04249	SPU_004249		\N
GLEAN3_26064	SPU_026064		\N
GLEAN3_28775	SPU_028775		\N
GLEAN3_21986	SPU_021986		\N
GLEAN3_20125	SPU_020125		\N
GLEAN3_27592	SPU_027592		\N
GLEAN3_12051	SPU_012051		\N
GLEAN3_06580	SPU_006580		\N
GLEAN3_22770	SPU_022770	After reviewing the data and performing a BLAST search, it appears that this is the best match for this particular GLEAN model. When comparing the BLAST results with the excel data it is apparent that the sequence has an orderly arrangement without any gaps or internal repeats present. However the sequence is truncated at 756 on this scaffold. The other BLAST results displayed part of the rest of this sequence, but there were numerous gaps and repeats present. There was Est support available from GBrowse V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	\N
GLEAN3_26497	SPU_026497		\N
GLEAN3_24704	SPU_024704		\N
GLEAN3_14832	SPU_014832		\N
GLEAN3_24256	SPU_024256		\N
GLEAN3_03109	SPU_003109		\N
GLEAN3_11442	SPU_011442		\N
GLEAN3_06181	SPU_006181		\N
GLEAN3_17830	SPU_017830		\N
GLEAN3_04639	SPU_004639		\N
GLEAN3_28770	SPU_028770		\N
GLEAN3_27219	SPU_027219		\N
GLEAN3_19423	SPU_019423		\N
GLEAN3_04901	SPU_004901		\N
GLEAN3_01307	SPU_001307		\N
GLEAN3_24485	SPU_024485		\N
GLEAN3_28312	SPU_028312		\N
GLEAN3_11335	SPU_011335		\N
GLEAN3_08011	SPU_008011		\N
GLEAN3_03703	SPU_003703		\N
GLEAN3_06737	SPU_006737		\N
GLEAN3_13670	SPU_013670		\N
GLEAN3_01911	SPU_001911		\N
GLEAN3_09964	SPU_009964		\N
GLEAN3_17527	SPU_017527		\N
GLEAN3_23327	SPU_023327		\N
GLEAN3_23529	SPU_023529		\N
GLEAN3_03102	SPU_003102		\N
GLEAN3_23107	SPU_023107		\N
GLEAN3_20722	SPU_020722		\N
GLEAN3_04287	SPU_004287		\N
GLEAN3_28061	SPU_028061		\N
GLEAN3_21192	SPU_021192		\N
GLEAN3_25482	SPU_025482		\N
GLEAN3_07147	SPU_007147		\N
GLEAN3_26601	SPU_026601		\N
GLEAN3_08196	SPU_008196		\N
GLEAN3_00756	SPU_000756		\N
GLEAN3_23366	SPU_023366		\N
GLEAN3_12008	SPU_012008		\N
GLEAN3_12699	SPU_012699		\N
GLEAN3_21123	SPU_021123		\N
GLEAN3_17287	SPU_017287		\N
GLEAN3_04526	SPU_004526		\N
GLEAN3_06808	SPU_006808		\N
GLEAN3_08117	SPU_008117		\N
GLEAN3_16445	SPU_016445		\N
GLEAN3_26465	SPU_026465		\N
GLEAN3_14332	SPU_014332		\N
GLEAN3_10351	SPU_010351		\N
GLEAN3_14249	SPU_014249		\N
GLEAN3_23037	SPU_023037		\N
GLEAN3_18423	SPU_018423		\N
GLEAN3_06803	SPU_006803		\N
GLEAN3_27218	SPU_027218		\N
GLEAN3_13134	SPU_013134		\N
GLEAN3_02627	SPU_002627		\N
GLEAN3_05572	SPU_005572		\N
GLEAN3_04217	SPU_004217		\N
GLEAN3_02816	SPU_002816		\N
GLEAN3_19290	SPU_019290		\N
GLEAN3_07389	SPU_007389		\N
GLEAN3_03705	SPU_003705		\N
GLEAN3_09465	SPU_009465		\N
GLEAN3_14836	SPU_014836		\N
GLEAN3_06312	SPU_006312		\N
GLEAN3_23177	SPU_023177		\N
GLEAN3_22573	SPU_022573		\N
GLEAN3_26633	SPU_026633	After reviewing the data it appears that the sequence is distributed onto 2 different scaffolds. There doesn't appear to be very good sequence coverage and there are several internal repeats and sequence overlaps on both scaffolds. There was Est information available on GBrowse assembly V0.5 and the transcriptome score intensities appear to be strong. This was an un-annotated gene so no additional information was available on the Baylor (comments). 	\N
GLEAN3_03595	SPU_003595		\N
GLEAN3_03179	SPU_003179		\N
GLEAN3_24307	SPU_024307		\N
GLEAN3_27774	SPU_027774		\N
GLEAN3_15984	SPU_015984		\N
GLEAN3_08175	SPU_008175		\N
GLEAN3_25492	SPU_025492		\N
GLEAN3_21608	SPU_021608		\N
GLEAN3_02614	SPU_002614		\N
GLEAN3_14361	SPU_014361		\N
GLEAN3_16793	SPU_016793		\N
GLEAN3_24623	SPU_024623		\N
GLEAN3_25689	SPU_025689		\N
GLEAN3_24764	SPU_024764		\N
GLEAN3_20885	SPU_020885		\N
GLEAN3_02219	SPU_002219		\N
GLEAN3_28460	SPU_028460		\N
GLEAN3_23425	SPU_023425		\N
GLEAN3_15359	SPU_015359		\N
GLEAN3_12674	SPU_012674		\N
GLEAN3_23486	SPU_023486		\N
GLEAN3_03403	SPU_003403		\N
GLEAN3_09681	SPU_009681		\N
GLEAN3_08180	SPU_008180		\N
GLEAN3_11124	SPU_011124		\N
GLEAN3_08678	SPU_008678		\N
GLEAN3_00513	SPU_000513		\N
GLEAN3_08802	SPU_008802		\N
GLEAN3_05305	SPU_005305		\N
GLEAN3_27423	SPU_027423		\N
GLEAN3_21430	SPU_021430		\N
GLEAN3_19117	SPU_019117		\N
GLEAN3_08889	SPU_008889		\N
GLEAN3_03451	SPU_003451		\N
GLEAN3_12503	SPU_012503		\N
GLEAN3_20398	SPU_020398		\N
GLEAN3_20069	SPU_020069		\N
GLEAN3_23401	SPU_023401		\N
GLEAN3_06549	SPU_006549		\N
GLEAN3_14289	SPU_014289		\N
GLEAN3_16352	SPU_016352		\N
GLEAN3_28649	SPU_028649		\N
GLEAN3_27833	SPU_027833		\N
GLEAN3_17816	SPU_017816		\N
GLEAN3_10280	SPU_010280		\N
GLEAN3_02105	SPU_002105		\N
GLEAN3_00468	SPU_000468		\N
GLEAN3_16547	SPU_016547		\N
GLEAN3_06914	SPU_006914		\N
GLEAN3_04975	SPU_004975		\N
GLEAN3_28073	SPU_028073		\N
GLEAN3_22561	SPU_022561		\N
GLEAN3_12785	SPU_012785		\N
GLEAN3_00961	SPU_000961		\N
GLEAN3_02432	SPU_002432		\N
GLEAN3_27524	SPU_027524		\N
GLEAN3_09470	SPU_009470		\N
GLEAN3_20249	SPU_020249		\N
GLEAN3_08426	SPU_008426		\N
GLEAN3_23188	SPU_023188		\N
GLEAN3_16918	SPU_016918		\N
GLEAN3_27245	SPU_027245		\N
GLEAN3_00642	SPU_000642		\N
GLEAN3_02617	SPU_002617		\N
GLEAN3_00943	SPU_000943		\N
GLEAN3_16505	SPU_016505		\N
GLEAN3_23948	SPU_023948		\N
GLEAN3_22495	SPU_022495		\N
GLEAN3_19686	SPU_019686	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If these two scaffolds were combined, the sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was Est information available and the transcriptome intensity scores appeared to be weak with most of the values being less 5 excluding 2 outliers at about 45. 	\N
GLEAN3_07944	SPU_007944		\N
GLEAN3_26605	SPU_026605		\N
GLEAN3_23309	SPU_023309		\N
GLEAN3_01927	SPU_001927		\N
GLEAN3_06908	SPU_006908		\N
GLEAN3_12203	SPU_012203		\N
GLEAN3_15908	SPU_015908		\N
GLEAN3_09165	SPU_009165		\N
GLEAN3_05808	SPU_005808		\N
GLEAN3_09477	SPU_009477		\N
GLEAN3_09164	SPU_009164		\N
GLEAN3_13289	SPU_013289		\N
GLEAN3_22158	SPU_022158		\N
GLEAN3_11395	SPU_011395		\N
GLEAN3_03213	SPU_003213		\N
GLEAN3_22504	SPU_022504		\N
GLEAN3_26987	SPU_026987		\N
GLEAN3_23789	SPU_023789		\N
GLEAN3_01296	SPU_001296		\N
GLEAN3_23990	SPU_023990		\N
GLEAN3_23197	SPU_023197		\N
GLEAN3_06932	SPU_006932		\N
GLEAN3_15652	SPU_015652		\N
GLEAN3_08331	SPU_008331		\N
GLEAN3_14226	SPU_014226		\N
GLEAN3_06444	SPU_006444		\N
GLEAN3_08560	SPU_008560		\N
GLEAN3_16074	SPU_016074		\N
GLEAN3_27527	SPU_027527		\N
GLEAN3_22940	SPU_022940		\N
GLEAN3_05628	SPU_005628		\N
GLEAN3_18219	SPU_018219		\N
GLEAN3_22923	SPU_022923		\N
GLEAN3_22922	SPU_022922		\N
GLEAN3_21631	SPU_021631		\N
GLEAN3_10629	SPU_010629		\N
GLEAN3_26521	SPU_026521		\N
GLEAN3_00357	SPU_000357		\N
GLEAN3_00872	SPU_000872		\N
GLEAN3_02402	SPU_002402		\N
GLEAN3_03904	SPU_003904		\N
GLEAN3_04832	SPU_004832		\N
GLEAN3_05288	SPU_005288		\N
GLEAN3_05325	SPU_005325		\N
GLEAN3_06037	SPU_006037		\N
GLEAN3_06297	SPU_006297		\N
GLEAN3_16018	SPU_016018		\N
GLEAN3_27560	SPU_027560		\N
GLEAN3_03486	SPU_003486		\N
GLEAN3_07495	SPU_007495		\N
GLEAN3_18904	SPU_018904		\N
GLEAN3_20799	SPU_020799		\N
GLEAN3_25565	SPU_025565		\N
GLEAN3_00884	SPU_000884		\N
GLEAN3_01195	SPU_001195		\N
GLEAN3_01778	SPU_001778		\N
GLEAN3_07231	SPU_007231		\N
GLEAN3_07995	SPU_007995		\N
GLEAN3_08996	SPU_008996		\N
GLEAN3_09311	SPU_009311		\N
GLEAN3_11073	SPU_011073		\N
GLEAN3_11529	SPU_011529		\N
GLEAN3_12109	SPU_012109		\N
GLEAN3_12385	SPU_012385		\N
GLEAN3_13184	SPU_013184		\N
GLEAN3_14140	SPU_014140		\N
GLEAN3_14755	SPU_014755		\N
GLEAN3_16043	SPU_016043		\N
GLEAN3_17536	SPU_017536		\N
GLEAN3_20374	SPU_020374	For this particular GLEAN model it appears that this is the best match. There are several gaps within the sequence and poor coverage (591/1077). There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be week with all of the values ranging less than 5. 	\N
GLEAN3_22411	SPU_022411		\N
GLEAN3_24362	SPU_024362	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. The overall sequence coverage is poor and the first scaffold doesn't begin until about 318. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be somewhat strong with most of the values being greater than 5.	\N
GLEAN3_25068	SPU_025068		\N
GLEAN3_25363	SPU_025363		\N
GLEAN3_26306	SPU_026306		\N
GLEAN3_27747	SPU_027747		\N
GLEAN3_28257	SPU_028257		\N
GLEAN3_28723	SPU_028723		\N
GLEAN3_04954	SPU_004954		\N
GLEAN3_15595	SPU_015595		\N
GLEAN3_05708	SPU_005708		\N
GLEAN3_02872	SPU_002872		\N
GLEAN3_04850	SPU_004850		\N
GLEAN3_07470	SPU_007470		\N
GLEAN3_07471	SPU_007471	From the BLAST results as well as the excel data, it appears that this is the best results for this particular GLEAN model. When examining the excel data, it is apparent that there are several sequence overlaps present as well as internal repeats. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 10 (excluding one outlier at about 130.)	\N
GLEAN3_07714	SPU_007714		\N
GLEAN3_10104	SPU_010104		\N
GLEAN3_13626	SPU_013626		\N
GLEAN3_14151	SPU_014151		\N
GLEAN3_15802	SPU_015802		\N
GLEAN3_15803	SPU_015803		\N
GLEAN3_18570	SPU_018570		\N
GLEAN3_19581	SPU_019581		\N
GLEAN3_19719	SPU_019719		\N
GLEAN3_20430	SPU_020430		\N
GLEAN3_20712	SPU_020712		\N
GLEAN3_22602	SPU_022602		\N
GLEAN3_28927	SPU_028927		\N
GLEAN3_18735	SPU_018735		\N
GLEAN3_19450	SPU_019450		\N
GLEAN3_27341	SPU_027341		\N
GLEAN3_27342	SPU_027342		\N
GLEAN3_27343	SPU_027343		\N
GLEAN3_01529	SPU_001529		\N
GLEAN3_09848	SPU_009848		\N
GLEAN3_26217	SPU_026217		\N
GLEAN3_05868	SPU_005868		\N
GLEAN3_08264	SPU_008264		\N
GLEAN3_15508	SPU_015508		\N
GLEAN3_04019	SPU_004019		\N
GLEAN3_24349	SPU_024349		\N
GLEAN3_17346	SPU_017346		\N
GLEAN3_02788	SPU_002788		\N
GLEAN3_00061	SPU_000061		\N
GLEAN3_00062	SPU_000062		\N
GLEAN3_13273	SPU_013273		\N
GLEAN3_03894	SPU_003894		\N
GLEAN3_07425	SPU_007425		\N
GLEAN3_24912	SPU_024912		\N
GLEAN3_19673	SPU_019673		\N
GLEAN3_05375	SPU_005375		\N
GLEAN3_19557	SPU_019557		\N
GLEAN3_20244	SPU_020244		\N
GLEAN3_07484	SPU_007484		\N
GLEAN3_17989	SPU_017989		\N
GLEAN3_09901	SPU_009901		\N
GLEAN3_04603	SPU_004603		\N
GLEAN3_15957	SPU_015957		\N
GLEAN3_00047	SPU_000047		\N
GLEAN3_20376	SPU_020376		\N
GLEAN3_25465	SPU_025465		\N
GLEAN3_17827	SPU_017827		\N
GLEAN3_16058	SPU_016058		\N
GLEAN3_19568	SPU_019568		\N
GLEAN3_02046	SPU_002046		\N
GLEAN3_12158	SPU_012158		\N
GLEAN3_24507	SPU_024507		\N
GLEAN3_05737	SPU_005737		\N
GLEAN3_23315	SPU_023315		\N
GLEAN3_27375	SPU_027375		\N
GLEAN3_00995	SPU_000995		\N
GLEAN3_04389	SPU_004389		\N
GLEAN3_04587	SPU_004587		\N
GLEAN3_07526	SPU_007526		\N
GLEAN3_19951	SPU_019951		\N
GLEAN3_01498	SPU_001498		\N
GLEAN3_21119	SPU_021119		\N
GLEAN3_16881	SPU_016881		\N
GLEAN3_26763	SPU_026763		\N
GLEAN3_16385	SPU_016385		\N
GLEAN3_01690	SPU_001690		\N
GLEAN3_26376	SPU_026376		\N
GLEAN3_15740	SPU_015740		\N
Sp-IGF2	SPU_030139		\N
GLEAN3_13076	SPU_013076		\N
GLEAN3_18314	SPU_018314		\N
GLEAN3_06172	SPU_006172	For this particular GLEAN model there was no Cds information available from Baylor annotations or from SpBase. However, there was mRNA information available from SpBase. When examining the excel data it appears that this is a very short sequence (489 base pairs in length) and there is a small gap between 91-124. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed, with several values that were high (110-120) and some that were low (<5.)	\N
GLEAN3_17574	SPU_017574		\N
GLEAN3_07023	SPU_007023		\N
GLEAN3_14397	SPU_014397		\N
GLEAN3_06850	SPU_006850		\N
GLEAN3_13161	SPU_013161		\N
GLEAN3_22040	SPU_022040		\N
GLEAN3_02723	SPU_002723		\N
GLEAN3_11954	SPU_011954		\N
GLEAN3_20406	SPU_020406		\N
GLEAN3_03901	SPU_003901		\N
GLEAN3_22076	SPU_022076		\N
GLEAN3_08698	SPU_008698		\N
GLEAN3_25579	SPU_025579		\N
GLEAN3_07721	SPU_007721		\N
GLEAN3_13822	SPU_013822		\N
GLEAN3_04605	SPU_004605		\N
GLEAN3_14496	SPU_014496		\N
GLEAN3_14334	SPU_014334		\N
GLEAN3_15763	SPU_015763		\N
GLEAN3_14403	SPU_014403		\N
GLEAN3_23941	SPU_023941		\N
GLEAN3_14885	SPU_014885	After reviewing the data and performing a BLAST search, it appears that there is no good fit for this particular GLEAN model due to numerous gaps within the sequence and poor coverage. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with the majority of scores being greater than 10.	\N
GLEAN3_14611	SPU_014611		\N
GLEAN3_13990	SPU_013990		\N
GLEAN3_13737	SPU_013737		\N
GLEAN3_13638	SPU_013638		\N
GLEAN3_13179	SPU_013179		\N
GLEAN3_19391	SPU_019391		\N
GLEAN3_21838	SPU_021838		\N
GLEAN3_18512	SPU_018512		\N
GLEAN3_26594	SPU_026594		\N
GLEAN3_01613	SPU_001613		\N
GLEAN3_06755	SPU_006755		\N
GLEAN3_26614	SPU_026614		\N
GLEAN3_11106	SPU_011106		\N
GLEAN3_15374	SPU_015374		\N
GLEAN3_12605	SPU_012605		\N
GLEAN3_09954	SPU_009954		\N
GLEAN3_12247	SPU_012247		\N
GLEAN3_07973	SPU_007973		\N
GLEAN3_16006	SPU_016006		\N
GLEAN3_16763	SPU_016763		\N
GLEAN3_25547	SPU_025547		\N
GLEAN3_00758	SPU_000758		\N
GLEAN3_00910	SPU_000910		\N
GLEAN3_02215	SPU_002215		\N
GLEAN3_02944	SPU_002944		\N
GLEAN3_02945	SPU_002945		\N
GLEAN3_03039	SPU_003039		\N
GLEAN3_04500	SPU_004500		\N
GLEAN3_04590	SPU_004590		\N
GLEAN3_05836	SPU_005836		\N
GLEAN3_06955	SPU_006955		\N
GLEAN3_06970	SPU_006970		\N
GLEAN3_07068	SPU_007068		\N
GLEAN3_07270	SPU_007270		\N
GLEAN3_07381	SPU_007381		\N
GLEAN3_07603	SPU_007603		\N
GLEAN3_07676	SPU_007676		\N
GLEAN3_07811	SPU_007811		\N
GLEAN3_07827	SPU_007827		\N
GLEAN3_07876	SPU_007876		\N
GLEAN3_07914	SPU_007914		\N
GLEAN3_07985	SPU_007985		\N
GLEAN3_09135	SPU_009135		\N
GLEAN3_10730	SPU_010730		\N
GLEAN3_15097	SPU_015097		\N
GLEAN3_15099	SPU_015099		\N
GLEAN3_15290	SPU_015290		\N
GLEAN3_16273	SPU_016273		\N
GLEAN3_16339	SPU_016339		\N
GLEAN3_16703	SPU_016703		\N
GLEAN3_16924	SPU_016924		\N
GLEAN3_17148	SPU_017148		\N
GLEAN3_18331	SPU_018331		\N
GLEAN3_18492	SPU_018492		\N
GLEAN3_18974	SPU_018974		\N
GLEAN3_19723	SPU_019723		\N
GLEAN3_21067	SPU_021067		\N
GLEAN3_21804	SPU_021804		\N
GLEAN3_23349	SPU_023349		\N
GLEAN3_23442	SPU_023442		\N
GLEAN3_24455	SPU_024455		\N
GLEAN3_24773	SPU_024773		\N
GLEAN3_25368	SPU_025368		\N
GLEAN3_25401	SPU_025401		\N
GLEAN3_25881	SPU_025881		\N
GLEAN3_26488	SPU_026488		\N
GLEAN3_26492	SPU_026492		\N
GLEAN3_26577	SPU_026577		\N
GLEAN3_26696	SPU_026696		\N
GLEAN3_27876	SPU_027876		\N
GLEAN3_27889	SPU_027889	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. Both scaffolds have an orderly and continuous arrangement and if they were to be combined, the overall sequence would have good coverage. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely dispersed but overall strong, with most of the values being greater than 5. 	\N
GLEAN3_28708	SPU_028708		\N
GLEAN3_28800	SPU_028800		\N
GLEAN3_03435	SPU_003435		\N
GLEAN3_07397	SPU_007397		\N
GLEAN3_16247	SPU_016247		\N
GLEAN3_06604	SPU_006604		\N
GLEAN3_15939	SPU_015939		\N
GLEAN3_17032	SPU_017032		\N
GLEAN3_19090	SPU_019090		\N
GLEAN3_03260	SPU_003260		\N
GLEAN3_21580	SPU_021580		\N
GLEAN3_03257	SPU_003257		\N
GLEAN3_00091	SPU_000091		\N
GLEAN3_00296	SPU_000296		\N
GLEAN3_21293	SPU_021293		\N
GLEAN3_03045	SPU_003045		\N
GLEAN3_22874	SPU_022874		\N
GLEAN3_11032	SPU_011032		\N
GLEAN3_11031	SPU_011031		\N
GLEAN3_11722	SPU_011722		\N
GLEAN3_24957	SPU_024957		\N
GLEAN3_16790	SPU_016790		\N
GLEAN3_10545	SPU_010545		\N
GLEAN3_25378	SPU_025378		\N
GLEAN3_18101	SPU_018101		\N
GLEAN3_23217	SPU_023217		\N
GLEAN3_22834	SPU_022834		\N
GLEAN3_12292	SPU_012292		\N
GLEAN3_23369	SPU_023369		\N
GLEAN3_02509	SPU_002509		\N
GLEAN3_06396	SPU_006396		\N
GLEAN3_26573	SPU_026573	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When reviewing the excel data, it was apparent that were several internal repeats present as well as gaps within both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_19184	SPU_019184		\N
GLEAN3_22280	SPU_022280		\N
GLEAN3_17972	SPU_017972	For this particular GLEAN model, there was no Cds information available from both Baylor annotations and SpBase. There was however mRNA information available from SpBase. When examining the excel data there are several repeats that are apparent within one of the scaffolds that are apart of the sequence. Other than this repeat, if the 3 scaffolds were combined, the sequence would have an orderly arrangement. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	\N
GLEAN3_14840	SPU_014840		\N
GLEAN3_20197	SPU_020197		\N
GLEAN3_21211	SPU_021211		\N
GLEAN3_23593	SPU_023593		\N
GLEAN3_24674	SPU_024674		\N
GLEAN3_19392	SPU_019392		\N
GLEAN3_11404	SPU_011404		\N
GLEAN3_07679	SPU_007679		\N
GLEAN3_07019	SPU_007019		\N
GLEAN3_25168	SPU_025168		\N
GLEAN3_11617	SPU_011617	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that there were no large gaps present or internal repeats. Both scaffolds had an orderly arrangement and if the two were combined, the overall sequence would have good coverage. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	\N
GLEAN3_02223	SPU_002223		\N
GLEAN3_20836	SPU_020836		\N
GLEAN3_07720	SPU_007720		\N
GLEAN3_16100	SPU_016100		\N
GLEAN3_28904	SPU_028904		\N
GLEAN3_11095	SPU_011095		\N
GLEAN3_17981	SPU_017981		\N
GLEAN3_00241	SPU_000241		\N
GLEAN3_25511	SPU_025511		\N
GLEAN3_26990	SPU_026990		\N
GLEAN3_28169	SPU_028169		\N
GLEAN3_01822	SPU_001822		\N
GLEAN3_23098	SPU_023098		\N
GLEAN3_11620	SPU_011620		\N
GLEAN3_26031	SPU_026031		\N
GLEAN3_27778	SPU_027778		\N
GLEAN3_00046	SPU_000046		\N
GLEAN3_07764	SPU_007764		\N
GLEAN3_22207	SPU_022207		\N
GLEAN3_03609	SPU_003609		\N
GLEAN3_15765	SPU_015765		\N
GLEAN3_03067	SPU_003067		\N
GLEAN3_19980	SPU_019980		\N
GLEAN3_04357	SPU_004357		\N
GLEAN3_02366	SPU_002366		\N
GLEAN3_02304	SPU_002304		\N
GLEAN3_03854	SPU_003854		\N
GLEAN3_05867	SPU_005867		\N
GLEAN3_08019	SPU_008019		\N
GLEAN3_09279	SPU_009279		\N
GLEAN3_09797	SPU_009797		\N
GLEAN3_13171	SPU_013171		\N
GLEAN3_14413	SPU_014413		\N
GLEAN3_15473	SPU_015473	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that there were no large gaps present or internal repeats. Both scaffolds had an orderly arrangement and if the two were combined, the overall sequence would have good coverage. There was some Est support available from GBRowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	\N
GLEAN3_16586	SPU_016586		\N
GLEAN3_18354	SPU_018354		\N
GLEAN3_22614	SPU_022614		\N
GLEAN3_07611	SPU_007611		\N
GLEAN3_18567	SPU_018567		\N
GLEAN3_25272	SPU_025272		\N
GLEAN3_01888	SPU_001888		\N
GLEAN3_25555	SPU_025555		\N
GLEAN3_18979	SPU_018979		\N
GLEAN3_18980	SPU_018980		\N
GLEAN3_18981	SPU_018981		\N
GLEAN3_05519	SPU_005519	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed on to 2 different scaffolds. The first scaffold (v2.1_scaffold56386) doesn't begin until the 4th base pair and ends at 152 where the rest of the sequence is continued on it the second scaffold (v2.1_scaffold57984). If the 2 scaffolds were combined the sequence would have an orderly and continuous arrangement.  There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	\N
GLEAN3_05271	SPU_005271		\N
GLEAN3_06826	SPU_006826		\N
GLEAN3_06797	SPU_006797		\N
GLEAN3_06463	SPU_006463		\N
GLEAN3_16382	SPU_016382		\N
GLEAN3_06767	SPU_006767		\N
GLEAN3_19558	SPU_019558		\N
GLEAN3_09481	SPU_009481		\N
GLEAN3_09482	SPU_009482		\N
GLEAN3_09483	SPU_009483		\N
GLEAN3_09433	SPU_009433		\N
GLEAN3_12282	SPU_012282		\N
GLEAN3_07887	SPU_007887		\N
GLEAN3_25223	SPU_025223		\N
GLEAN3_10631	SPU_010631		\N
GLEAN3_20333	SPU_020333		\N
GLEAN3_11301	SPU_011301		\N
GLEAN3_07479	SPU_007479		\N
GLEAN3_13394	SPU_013394		\N
GLEAN3_27580	SPU_027580		\N
GLEAN3_03496	SPU_003496		\N
GLEAN3_27390	SPU_027390		\N
GLEAN3_22680	SPU_022680		\N
GLEAN3_01605	SPU_001605		\N
GLEAN3_25082	SPU_025082		\N
GLEAN3_08662	SPU_008662		\N
GLEAN3_06532	SPU_006532		\N
GLEAN3_01489	SPU_001489		\N
GLEAN3_05798	SPU_005798		\N
GLEAN3_07604	SPU_007604		\N
GLEAN3_14120	SPU_014120		\N
GLEAN3_17146	SPU_017146		\N
GLEAN3_19549	SPU_019549		\N
GLEAN3_10685	SPU_010685		\N
GLEAN3_24513	SPU_024513		\N
GLEAN3_09742	SPU_009742		\N
GLEAN3_23508	SPU_023508		\N
GLEAN3_07350	SPU_007350		\N
GLEAN3_08919	SPU_008919		\N
GLEAN3_13929	SPU_013929		\N
GLEAN3_01518	SPU_001518		\N
GLEAN3_24840	SPU_024840		\N
GLEAN3_17834	SPU_017834		\N
GLEAN3_17138	SPU_017138		\N
GLEAN3_05813	SPU_005813		\N
GLEAN3_03534	SPU_003534		\N
GLEAN3_20852	SPU_020852		\N
GLEAN3_19387	SPU_019387		\N
GLEAN3_13337	SPU_013337	From the BLAST results and the excel data, it appears that the sequence is distributed onto two different scaffolds. There is a sequence overlap between the two scaffolds that occurs between 373-505 and the second scaffold appears to be cut short at the end of the sequence. 	\N
GLEAN3_14525	SPU_014525		\N
GLEAN3_14497	SPU_014497		\N
GLEAN3_24601	SPU_024601		\N
GLEAN3_14037	SPU_014037		\N
GLEAN3_21236	SPU_021236		\N
GLEAN3_28003	SPU_028003		\N
GLEAN3_27336	SPU_027336		\N
GLEAN3_02123	SPU_002123		\N
GLEAN3_28205	SPU_028205	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it appears that if the 2 scaffolds were to be combined the sequence would have an orderly and continuous arrangement without and gaps or repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong as well. 	\N
GLEAN3_00552	SPU_000552		\N
GLEAN3_26944	SPU_026944		\N
GLEAN3_12003	SPU_012003		\N
GLEAN3_11257	SPU_011257		\N
GLEAN3_23982	SPU_023982		\N
GLEAN3_07845	SPU_007845		\N
GLEAN3_10762	SPU_010762		\N
GLEAN3_28618	SPU_028618		\N
GLEAN3_19458	SPU_019458		\N
GLEAN3_14913	SPU_014913		\N
GLEAN3_22154	SPU_022154		\N
GLEAN3_27060	SPU_027060		\N
GLEAN3_12112	SPU_012112		\N
GLEAN3_28209	SPU_028209		\N
GLEAN3_19087	SPU_019087		\N
GLEAN3_24325	SPU_024325		\N
GLEAN3_17072	SPU_017072		\N
GLEAN3_01312	SPU_001312		\N
GLEAN3_10029	SPU_010029		\N
GLEAN3_14711	SPU_014711		\N
GLEAN3_25086	SPU_025086		\N
GLEAN3_12529	SPU_012529		\N
GLEAN3_03654	SPU_003654		\N
GLEAN3_09810	SPU_009810		\N
GLEAN3_15909	SPU_015909		\N
GLEAN3_00054	SPU_000054		\N
GLEAN3_23782	SPU_023782		\N
GLEAN3_14084	SPU_014084		\N
GLEAN3_08154	SPU_008154		\N
GLEAN3_05045	SPU_005045		\N
GLEAN3_05172	SPU_005172	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it appears that there are numerous repeats and gaps present within the sequence resulting in poor sequence coverage overall. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 10.	\N
GLEAN3_23108	SPU_023108		\N
GLEAN3_13471	SPU_013471		\N
GLEAN3_15544	SPU_015544		\N
GLEAN3_12639	SPU_012639		\N
GLEAN3_24725	SPU_024725		\N
GLEAN3_08046	SPU_008046		\N
GLEAN3_17832	SPU_017832		\N
GLEAN3_03162	SPU_003162		\N
GLEAN3_10139	SPU_010139		\N
GLEAN3_13324	SPU_013324		\N
GLEAN3_18195	SPU_018195		\N
GLEAN3_08892	SPU_008892		\N
GLEAN3_15312	SPU_015312		\N
GLEAN3_22157	SPU_022157		\N
GLEAN3_20429	SPU_020429		\N
GLEAN3_13794	SPU_013794		\N
GLEAN3_10505	SPU_010505		\N
GLEAN3_10648	SPU_010648		\N
GLEAN3_14114	SPU_014114		\N
GLEAN3_28002	SPU_028002		\N
GLEAN3_12597	SPU_012597		\N
GLEAN3_22701	SPU_022701		\N
GLEAN3_21141	SPU_021141		\N
GLEAN3_21847	SPU_021847		\N
GLEAN3_14689	SPU_014689		\N
GLEAN3_28181	SPU_028181		\N
GLEAN3_19332	SPU_019332		\N
GLEAN3_05878	SPU_005878		\N
GLEAN3_21289	SPU_021289		\N
GLEAN3_04156	SPU_004156		\N
GLEAN3_28255	SPU_028255		\N
GLEAN3_06378	SPU_006378		\N
GLEAN3_23974	SPU_023974		\N
GLEAN3_00667	SPU_000667		\N
GLEAN3_04747	SPU_004747		\N
GLEAN3_08411	SPU_008411		\N
GLEAN3_06900	SPU_006900		\N
GLEAN3_12394	SPU_012394		\N
GLEAN3_04969	SPU_004969		\N
GLEAN3_08905	SPU_008905		\N
GLEAN3_28774	SPU_028774		\N
GLEAN3_25185	SPU_025185		\N
GLEAN3_18845	SPU_018845		\N
GLEAN3_27340	SPU_027340		\N
GLEAN3_13662	SPU_013662		\N
GLEAN3_25679	SPU_025679	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. There is a large gap present that separates the two scaffolds that spans from 314-454. Other than this missing sequence information that separates the two scaffolds, the BLAST results indicate that both scaffolds contain an orderly arrangement without any internal repeats present.  There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the values being less than 8.	\N
GLEAN3_15889	SPU_015889		\N
GLEAN3_17491	SPU_017491		\N
GLEAN3_05252	SPU_005252		\N
GLEAN3_04786	SPU_004786		\N
GLEAN3_04403	SPU_004403		\N
GLEAN3_27743	SPU_027743		\N
GLEAN3_21558	SPU_021558		\N
GLEAN3_16791	SPU_016791		\N
GLEAN3_02443	SPU_002443		\N
GLEAN3_13320	SPU_013320		\N
GLEAN3_18243	SPU_018243		\N
GLEAN3_05962	SPU_005962		\N
GLEAN3_15825	SPU_015825		\N
GLEAN3_20461	SPU_020461		\N
GLEAN3_18081	SPU_018081		\N
GLEAN3_28180	SPU_028180		\N
GLEAN3_28772	SPU_028772		\N
GLEAN3_25732	SPU_025732		\N
GLEAN3_09590	SPU_009590		\N
GLEAN3_22221	SPU_022221		\N
GLEAN3_12807	SPU_012807		\N
GLEAN3_04840	SPU_004840		\N
GLEAN3_06132	SPU_006132		\N
GLEAN3_11351	SPU_011351		\N
GLEAN3_25951	SPU_025951		\N
GLEAN3_02130	SPU_002130		\N
GLEAN3_01125	SPU_001125		\N
GLEAN3_16617	SPU_016617		\N
GLEAN3_01280	SPU_001280		\N
GLEAN3_08793	SPU_008793		\N
GLEAN3_21412	SPU_021412		\N
GLEAN3_00914	SPU_000914		\N
GLEAN3_10626	SPU_010626		\N
GLEAN3_11075	SPU_011075		\N
GLEAN3_17082	SPU_017082		\N
GLEAN3_18146	SPU_018146		\N
GLEAN3_20745	SPU_020745		\N
GLEAN3_06813	SPU_006813		\N
GLEAN3_06186	SPU_006186		\N
GLEAN3_10416	SPU_010416		\N
GLEAN3_04317	SPU_004317		\N
GLEAN3_03249	SPU_003249		\N
GLEAN3_04735	SPU_004735		\N
GLEAN3_07773	SPU_007773		\N
GLEAN3_07783	SPU_007783		\N
GLEAN3_14974	SPU_014974		\N
GLEAN3_17977	SPU_017977		\N
GLEAN3_18335	SPU_018335		\N
GLEAN3_20664	SPU_020664		\N
GLEAN3_20764	SPU_020764		\N
GLEAN3_23296	SPU_023296		\N
GLEAN3_24896	SPU_024896		\N
GLEAN3_26076	SPU_026076		\N
GLEAN3_25389	SPU_025389		\N
GLEAN3_25940	SPU_025940		\N
GLEAN3_25909	SPU_025909		\N
GLEAN3_15322	SPU_015322		\N
GLEAN3_17998	SPU_017998		\N
GLEAN3_05618	SPU_005618		\N
GLEAN3_12770	SPU_012770		\N
GLEAN3_04626	SPU_004626		\N
GLEAN3_00637	SPU_000637		\N
GLEAN3_15088	SPU_015088		\N
GLEAN3_08305	SPU_008305		\N
GLEAN3_28896	SPU_028896		\N
GLEAN3_08587	SPU_008587	From the BLAST results and the excel data, it is evident that this sequence is distributed onto 2 different scaffolds for this particular GLEAN model. When reviewing the excel data it is appears that both scaffolds have an orderly arrangement. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being about 5 and greater.	\N
GLEAN3_00922	SPU_000922		\N
GLEAN3_00208	SPU_000208		\N
GLEAN3_09982	SPU_009982		\N
GLEAN3_17899	SPU_017899		\N
GLEAN3_04260	SPU_004260		\N
GLEAN3_19735	SPU_019735		\N
GLEAN3_23152	SPU_023152		\N
GLEAN3_03308	SPU_003308		\N
GLEAN3_21517	SPU_021517		\N
GLEAN3_09036	SPU_009036		\N
GLEAN3_05070	SPU_005070		\N
GLEAN3_23462	SPU_023462		\N
GLEAN3_28563	SPU_028563		\N
GLEAN3_22799	SPU_022799		\N
GLEAN3_11498	SPU_011498	After reviewing the data and performing a BLAST search, it is apparent that there is no sufficient fit for this GLEAN model. The sequence is dispersed onto 3 different scaffolds, however, there are numerous repeats and gaps present within each scaffold and poor coverage overall. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak (less than 5.)	\N
GLEAN3_17019	SPU_017019		\N
GLEAN3_00210	SPU_000210		\N
GLEAN3_03651	SPU_003651		\N
GLEAN3_23359	SPU_023359		\N
GLEAN3_25217	SPU_025217		\N
GLEAN3_25749	SPU_025749		\N
GLEAN3_08429	SPU_008429		\N
GLEAN3_13167	SPU_013167		\N
GLEAN3_08173	SPU_008173		\N
GLEAN3_02280	SPU_002280		\N
GLEAN3_19839	SPU_019839		\N
GLEAN3_02473	SPU_002473		\N
GLEAN3_07312	SPU_007312		\N
GLEAN3_03967	SPU_003967		\N
GLEAN3_09985	SPU_009985		\N
GLEAN3_23343	SPU_023343		\N
GLEAN3_03262	SPU_003262		\N
GLEAN3_11531	SPU_011531		\N
GLEAN3_23461	SPU_023461		\N
GLEAN3_11816	SPU_011816		\N
GLEAN3_14350	SPU_014350		\N
GLEAN3_24565	SPU_024565		\N
GLEAN3_04363	SPU_004363		\N
GLEAN3_05378	SPU_005378		\N
GLEAN3_11466	SPU_011466		\N
GLEAN3_16720	SPU_016720		\N
GLEAN3_06033	SPU_006033		\N
GLEAN3_11690	SPU_011690		\N
GLEAN3_23839	SPU_023839		\N
GLEAN3_28511	SPU_028511		\N
GLEAN3_14691	SPU_014691		\N
GLEAN3_16660	SPU_016660		\N
GLEAN3_03521	SPU_003521		\N
GLEAN3_10892	SPU_010892		\N
GLEAN3_07410	SPU_007410		\N
GLEAN3_26479	SPU_026479		\N
GLEAN3_17916	SPU_017916		\N
GLEAN3_17135	SPU_017135		\N
GLEAN3_08866	SPU_008866		\N
GLEAN3_12041	SPU_012041		\N
GLEAN3_25533	SPU_025533		\N
GLEAN3_19161	SPU_019161		\N
GLEAN3_02839	SPU_002839		\N
GLEAN3_28462	SPU_028462		\N
GLEAN3_08863	SPU_008863		\N
GLEAN3_01836	SPU_001836		\N
GLEAN3_02360	SPU_002360		\N
GLEAN3_08916	SPU_008916		\N
GLEAN3_11367	SPU_011367		\N
GLEAN3_15675	SPU_015675		\N
GLEAN3_25701	SPU_025701		\N
GLEAN3_27523	SPU_027523		\N
GLEAN3_28606	SPU_028606		\N
GLEAN3_16585	SPU_016585		\N
GLEAN3_26471	SPU_026471		\N
GLEAN3_27843	SPU_027843		\N
GLEAN3_09446	SPU_009446		\N
GLEAN3_14655	SPU_014655		\N
GLEAN3_17781	SPU_017781		\N
GLEAN3_28263	SPU_028263		\N
GLEAN3_16844	SPU_016844		\N
GLEAN3_19888	SPU_019888		\N
GLEAN3_07929	SPU_007929		\N
GLEAN3_07930	SPU_007930		\N
GLEAN3_09196	SPU_009196		\N
GLEAN3_12341	SPU_012341		\N
GLEAN3_08865	SPU_008865		\N
GLEAN3_08861	SPU_008861		\N
GLEAN3_08859	SPU_008859		\N
GLEAN3_08858	SPU_008858		\N
GLEAN3_08857	SPU_008857		\N
GLEAN3_14824	SPU_014824		\N
GLEAN3_15301	SPU_015301		\N
GLEAN3_22587	SPU_022587		\N
GLEAN3_22588	SPU_022588		\N
GLEAN3_21859	SPU_021859		\N
GLEAN3_21566	SPU_021566		\N
GLEAN3_28781	SPU_028781		\N
GLEAN3_03194	SPU_003194		\N
GLEAN3_10763	SPU_010763		\N
GLEAN3_22629	SPU_022629		\N
GLEAN3_28739	SPU_028739		\N
GLEAN3_20040	SPU_020040		\N
GLEAN3_00035	SPU_000035		\N
GLEAN3_22529	SPU_022529		\N
GLEAN3_14293	SPU_014293		\N
GLEAN3_04805	SPU_004805		\N
GLEAN3_00330	SPU_000330		\N
GLEAN3_13409	SPU_013409		\N
GLEAN3_19252	SPU_019252		\N
GLEAN3_03101	SPU_003101		\N
GLEAN3_19251	SPU_019251		\N
GLEAN3_13024	SPU_013024		\N
GLEAN3_11318	SPU_011318		\N
GLEAN3_13019	SPU_013019		\N
GLEAN3_13020	SPU_013020		\N
GLEAN3_13022	SPU_013022		\N
GLEAN3_13021	SPU_013021		\N
GLEAN3_11319	SPU_011319		\N
GLEAN3_06819	SPU_006819		\N
GLEAN3_13018	SPU_013018		\N
GLEAN3_01760	SPU_001760		\N
GLEAN3_23619	SPU_023619		\N
GLEAN3_06821	SPU_006821		\N
GLEAN3_13028	SPU_013028		\N
GLEAN3_23621	SPU_023621		\N
GLEAN3_13025	SPU_013025		\N
GLEAN3_13026	SPU_013026		\N
GLEAN3_13027	SPU_013027		\N
GLEAN3_06822	SPU_006822		\N
GLEAN3_06820	SPU_006820		\N
GLEAN3_13029	SPU_013029		\N
GLEAN3_23620	SPU_023620		\N
GLEAN3_09580	SPU_009580		\N
GLEAN3_10319	SPU_010319		\N
GLEAN3_07061	SPU_007061		\N
GLEAN3_17091	SPU_017091		\N
GLEAN3_07064	SPU_007064		\N
GLEAN3_17095	SPU_017095		\N
GLEAN3_17093	SPU_017093		\N
GLEAN3_23017	SPU_023017		\N
GLEAN3_07491	SPU_007491		\N
GLEAN3_07063	SPU_007063		\N
GLEAN3_08884	SPU_008884		\N
GLEAN3_06834	SPU_006834		\N
GLEAN3_07490	SPU_007490		\N
GLEAN3_17092	SPU_017092		\N
GLEAN3_07062	SPU_007062		\N
GLEAN3_26059	SPU_026059		\N
GLEAN3_07795	SPU_007795		\N
GLEAN3_07796	SPU_007796		\N
GLEAN3_06833	SPU_006833		\N
GLEAN3_27584	SPU_027584		\N
GLEAN3_21683	SPU_021683		\N
GLEAN3_17096	SPU_017096		\N
GLEAN3_05498	SPU_005498		\N
GLEAN3_07359	SPU_007359		\N
GLEAN3_01020	SPU_001020		\N
GLEAN3_14826	SPU_014826		\N
GLEAN3_07274	SPU_007274		\N
GLEAN3_20295	SPU_020295		\N
GLEAN3_20296	SPU_020296		\N
GLEAN3_12069	SPU_012069		\N
GLEAN3_12325	SPU_012325		\N
GLEAN3_17429	SPU_017429		\N
GLEAN3_14747	SPU_014747		\N
GLEAN3_17094	SPU_017094		\N
GLEAN3_07794	SPU_007794		\N
GLEAN3_24812	SPU_024812		\N
GLEAN3_24817	SPU_024817		\N
GLEAN3_24811	SPU_024811		\N
GLEAN3_23882	SPU_023882		\N
GLEAN3_07204	SPU_007204		\N
GLEAN3_19722	SPU_019722		\N
GLEAN3_07376	SPU_007376		\N
GLEAN3_25160	SPU_025160		\N
GLEAN3_01515	SPU_001515		\N
GLEAN3_03425	SPU_003425		\N
GLEAN3_09912	SPU_009912		\N
GLEAN3_19157	SPU_019157		\N
GLEAN3_22445	SPU_022445		\N
GLEAN3_22778	SPU_022778		\N
GLEAN3_26634	SPU_026634		\N
GLEAN3_27652	SPU_027652		\N
GLEAN3_28016	SPU_028016		\N
GLEAN3_01508	SPU_001508		\N
GLEAN3_03083	SPU_003083		\N
GLEAN3_03847	SPU_003847		\N
GLEAN3_07670	SPU_007670		\N
GLEAN3_10906	SPU_010906		\N
GLEAN3_06065	SPU_006065		\N
GLEAN3_14966	SPU_014966		\N
GLEAN3_17158	SPU_017158		\N
GLEAN3_19427	SPU_019427		\N
GLEAN3_23019	SPU_023019	From the BLAST results and the excel data, it is evident that the sequence is distributed onto two different scaffolds. The first scaffold contains a small gap that occurs from 92-147. However, there are no internal repeats or other gaps present within either or the two remaining scaffolds. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_24308	SPU_024308		\N
GLEAN3_24670	SPU_024670		\N
GLEAN3_01244	SPU_001244	After reviewing the data and performing a BLAST search, it appears that there is no good fit for this GLEAN model. The best fit was v2.1_scaffold35442 which has a low bit score and a high e-value. When reviewing the excel data it appears that there are numerous repeats and gaps present within the sequence. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong with all of the values being greater than 10.	\N
GLEAN3_06405	SPU_006405		\N
GLEAN3_00055	SPU_000055		\N
GLEAN3_13773	SPU_013773		\N
GLEAN3_05192	SPU_005192		\N
GLEAN3_22762	SPU_022762		\N
GLEAN3_18022	SPU_018022		\N
GLEAN3_10382	SPU_010382		\N
GLEAN3_12599	SPU_012599		\N
GLEAN3_10056	SPU_010056		\N
GLEAN3_09086	SPU_009086		\N
GLEAN3_04545	SPU_004545		\N
GLEAN3_24323	SPU_024323		\N
GLEAN3_15952	SPU_015952		\N
GLEAN3_00096	SPU_000096		\N
GLEAN3_28557	SPU_028557		\N
GLEAN3_01154	SPU_001154		\N
GLEAN3_22687	SPU_022687		\N
GLEAN3_12337	SPU_012337	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto3 different scaffolds. When examining the excel data, it is evident that there are several gaps within the first scaffold as well as internal repeats. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of the values ranging <5	\N
GLEAN3_28226	SPU_028226		\N
GLEAN3_20766	SPU_020766		\N
GLEAN3_05061	SPU_005061	After reviewing the data and performing a BLAST search it appears that the sequences is distributed onto 2 different scaffolds. The BLAST search revealed an additional set of information that was not available on the excel spreadsheet (query 287-312). This additional query overlaps with some of the sequence provided from the excel data.  It was also initially thought that the best results would have been from subject gb|DS002183| instead of subject gb|DS003004| as the BLAST search indicated. Subject gb|DS003004| sequence only covered 490/1182 base pairs while subject gb|DS002183| sequence covered 704/1182. It appears that the BLAST search indicated the subject gb|DS003004 was the best match based on high bit score and low e-value. There was also one internal repeat, but it was only repeated twice so the repeat may have been apart of the sequence. There was Est information available from GBrowse assembly V0.5 and the transcriptome information appeared to be somewh at strong (values <5). This is an un-annotated gene so no additional comments were available from Baylor annotation (gene information).	\N
GLEAN3_09277	SPU_009277		\N
GLEAN3_21322	SPU_021322		\N
GLEAN3_02012	SPU_002012		\N
GLEAN3_16061	SPU_016061		\N
GLEAN3_02092	SPU_002092		\N
GLEAN3_04596	SPU_004596		\N
GLEAN3_28892	SPU_028892		\N
GLEAN3_27402	SPU_027402		\N
GLEAN3_25489	SPU_025489		\N
GLEAN3_22447	SPU_022447		\N
GLEAN3_02214	SPU_002214		\N
GLEAN3_13617	SPU_013617		\N
GLEAN3_12470	SPU_012470		\N
GLEAN3_00084	SPU_000084		\N
GLEAN3_00098	SPU_000098		\N
GLEAN3_00146	SPU_000146		\N
GLEAN3_00842	SPU_000842		\N
GLEAN3_01309	SPU_001309		\N
GLEAN3_01446	SPU_001446		\N
GLEAN3_03070	SPU_003070	After reviewing the data and performing a BLAST search it appears that there is no good GLEAN model fit for GLEAN3_03070. The sequence is distributed on to 3 different scaffolds. There are several gaps within v2.1_scaffold9764 and an overlap in the sequence if scaffolds >v2.1_scaffold30607 and v2.1_scaffold9764 were combined. If the three scaffolds were combined, there would still be several gaps and repeats within the sequence. There was Est information available from GBrowse V0.5 and the transcriptome intensity scores appeared to be strong. This is an un-annotated gene so no additional gene information was available from Baylor annotations (comments).	\N
GLEAN3_03132	SPU_003132		\N
GLEAN3_06471	SPU_006471		\N
GLEAN3_07733	SPU_007733		\N
GLEAN3_08258	SPU_008258	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 scaffolds. The first scaffold doesn't begin until the 31st base pair. There also is a large gap between the two scaffolds that spans from 357-627. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_04104	SPU_004104		\N
GLEAN3_08206	SPU_008206		\N
GLEAN3_04610	SPU_004610		\N
GLEAN3_09319	SPU_009319		\N
GLEAN3_09817	SPU_009817		\N
GLEAN3_10264	SPU_010264		\N
GLEAN3_23794	SPU_023794		\N
GLEAN3_25796	SPU_025796		\N
GLEAN3_27178	SPU_027178		\N
GLEAN3_28501	SPU_028501		\N
GLEAN3_00057	SPU_000057		\N
GLEAN3_00441	SPU_000441		\N
GLEAN3_00469	SPU_000469		\N
GLEAN3_00577	SPU_000577		\N
GLEAN3_01107	SPU_001107		\N
GLEAN3_08505	SPU_008505		\N
GLEAN3_09922	SPU_009922		\N
GLEAN3_18130	SPU_018130		\N
GLEAN3_19782	SPU_019782		\N
GLEAN3_26996	SPU_026996		\N
GLEAN3_01030	SPU_001030		\N
GLEAN3_06788	SPU_006788		\N
GLEAN3_00052	SPU_000052		\N
GLEAN3_04721	SPU_004721		\N
GLEAN3_21727	SPU_021727		\N
GLEAN3_25572	SPU_025572		\N
GLEAN3_00270	SPU_000270		\N
GLEAN3_22357	SPU_022357		\N
GLEAN3_22398	SPU_022398		\N
GLEAN3_02148	SPU_002148		\N
GLEAN3_05174	SPU_005174		\N
GLEAN3_12786	SPU_012786		\N
GLEAN3_04659	SPU_004659		\N
GLEAN3_08300	SPU_008300		\N
GLEAN3_08818	SPU_008818		\N
GLEAN3_14395	SPU_014395		\N
GLEAN3_17452	SPU_017452		\N
GLEAN3_00908	SPU_000908		\N
GLEAN3_04827	SPU_004827		\N
GLEAN3_22225	SPU_022225		\N
GLEAN3_09708	SPU_009708		\N
GLEAN3_04485	SPU_004485		\N
GLEAN3_28920	SPU_028920		\N
GLEAN3_24984	SPU_024984		\N
GLEAN3_21014	SPU_021014		\N
GLEAN3_25814	SPU_025814		\N
GLEAN3_15429	SPU_015429		\N
GLEAN3_19794	SPU_019794		\N
GLEAN3_19555	SPU_019555		\N
GLEAN3_19550	SPU_019550		\N
GLEAN3_03923	SPU_003923		\N
GLEAN3_16371	SPU_016371		\N
GLEAN3_02478	SPU_002478		\N
GLEAN3_27205	SPU_027205		\N
GLEAN3_26897	SPU_026897		\N
GLEAN3_11886	SPU_011886		\N
GLEAN3_19393	SPU_019393		\N
GLEAN3_16653	SPU_016653		\N
GLEAN3_14778	SPU_014778		\N
GLEAN3_13869	SPU_013869		\N
GLEAN3_11296	SPU_011296		\N
GLEAN3_07941	SPU_007941		\N
GLEAN3_04849	SPU_004849		\N
GLEAN3_01721	SPU_001721		\N
GLEAN3_03269	SPU_003269		\N
GLEAN3_03719	SPU_003719		\N
SPU-ASH3-like	SPU_030149		\N
SPU-atonal7-like	SPU_030150		\N
GLEAN3_24371	SPU_024371		\N
GLEAN3_20738	SPU_020738		\N
GLEAN3_27695	SPU_027695		\N
GLEAN3_22298	SPU_022298		\N
GLEAN3_22300	SPU_022300		\N
GLEAN3_23121	SPU_023121		\N
GLEAN3_23651	SPU_023651		\N
GLEAN3_26129	SPU_026129		\N
GLEAN3_26778	SPU_026778		\N
GLEAN3_28336	SPU_028336		\N
GLEAN3_28428	SPU_028428		\N
GLEAN3_28603	SPU_028603		\N
GLEAN3_00172	SPU_000172		\N
GLEAN3_00301	SPU_000301		\N
GLEAN3_18254	SPU_018254		\N
GLEAN3_09305	SPU_009305		\N
GLEAN3_03664	SPU_003664		\N
GLEAN3_12209	SPU_012209		\N
GLEAN3_01333	SPU_001333	From the excel data and the BLAST results it is evident that the sequence is distributed onto two different scaffolds. There is a gap present between the two scaffolds that occurs between 288-314. Other than this missing part of the scaffold, the overall sequence has an orderly arrangement without any repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values that were greater than 5. 	\N
GLEAN3_01659	SPU_001659		\N
GLEAN3_05302	SPU_005302		\N
GLEAN3_06040	SPU_006040		\N
GLEAN3_08160	SPU_008160		\N
GLEAN3_00148	SPU_000148		\N
GLEAN3_02061	SPU_002061		\N
GLEAN3_00149	SPU_000149		\N
GLEAN3_09052	SPU_009052		\N
GLEAN3_14839	SPU_014839	For this particular GLEAN model there wasn't any Cds information available from either Baylor annotations or SpBase. However, there was mRNA information available from SpBase. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores were very low with values lower than 5.	\N
GLEAN3_06729	SPU_006729		\N
GLEAN3_16335	SPU_016335		\N
GLEAN3_04718	SPU_004718	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence (v2.1_scaffold46046) has an orderly and continuous arrangement without any gaps or repeats present until 823 is reached. From there, the rest of the sequence is continued on v2.1_scaffold26351 however, this scaffold contains several internal repeats present but the sequence remains orderly. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	\N
GLEAN3_24297	SPU_024297		\N
GLEAN3_21601	SPU_021601		\N
GLEAN3_13281	SPU_013281		\N
GLEAN3_03691	SPU_003691		\N
GLEAN3_17991	SPU_017991		\N
GLEAN3_18490	SPU_018490		\N
GLEAN3_09095	SPU_009095		\N
GLEAN3_27920	SPU_027920		\N
GLEAN3_27655	SPU_027655		\N
GLEAN3_21543	SPU_021543		\N
GLEAN3_27890	SPU_027890		\N
GLEAN3_09150	SPU_009150		\N
GLEAN3_18897	SPU_018897		\N
GLEAN3_20412	SPU_020412		\N
GLEAN3_20329	SPU_020329		\N
GLEAN3_21308	SPU_021308		\N
GLEAN3_02079	SPU_002079		\N
GLEAN3_24348	SPU_024348		\N
GLEAN3_01889	SPU_001889		\N
GLEAN3_00303	SPU_000303		\N
GLEAN3_23895	SPU_023895		\N
GLEAN3_07386	SPU_007386		\N
GLEAN3_08532	SPU_008532		\N
GLEAN3_08860	SPU_008860		\N
GLEAN3_10691	SPU_010691		\N
GLEAN3_08864	SPU_008864		\N
GLEAN3_04419	SPU_004419		\N
GLEAN3_04420	SPU_004420		\N
GLEAN3_04421	SPU_004421		\N
GLEAN3_12768	SPU_012768		\N
GLEAN3_15441	SPU_015441		\N
GLEAN3_16467	SPU_016467		\N
GLEAN3_00016	SPU_000016		\N
GLEAN3_09225	SPU_009225		\N
GLEAN3_06865	SPU_006865		\N
GLEAN3_00739	SPU_000739		\N
GLEAN3_17971	SPU_017971		\N
GLEAN3_18246	SPU_018246		\N
GLEAN3_16671	SPU_016671		\N
GLEAN3_17028	SPU_017028		\N
GLEAN3_17029	SPU_017029		\N
GLEAN3_18841	SPU_018841		\N
GLEAN3_12004	SPU_012004		\N
GLEAN3_17606	SPU_017606		\N
GLEAN3_20251	SPU_020251		\N
GLEAN3_18964	SPU_018964		\N
GLEAN3_01569	SPU_001569		\N
GLEAN3_05667	SPU_005667		\N
GLEAN3_03633	SPU_003633		\N
GLEAN3_23926	SPU_023926		\N
GLEAN3_11699	SPU_011699		\N
GLEAN3_18586	SPU_018586		\N
GLEAN3_08374	SPU_008374		\N
GLEAN3_06107	SPU_006107		\N
GLEAN3_08684	SPU_008684		\N
GLEAN3_10033	SPU_010033		\N
GLEAN3_11279	SPU_011279		\N
GLEAN3_12414	SPU_012414		\N
GLEAN3_14784	SPU_014784		\N
GLEAN3_14884	SPU_014884		\N
GLEAN3_14908	SPU_014908		\N
GLEAN3_15642	SPU_015642		\N
GLEAN3_16644	SPU_016644		\N
GLEAN3_24046	SPU_024046		\N
GLEAN3_18752	SPU_018752		\N
GLEAN3_19253	SPU_019253	From the BLAST results and the excel data it appears that the sequence is distributed onto 2 different scaffolds. There is a small sequence overlap between the 2 scaffolds that occurs from 2485-2610. When revewing the excel data, it is apparent that there are no internal repeats or gaps present within the two scaffolds. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 6.	\N
GLEAN3_25129	SPU_025129		\N
GLEAN3_25554	SPU_025554		\N
GLEAN3_26073	SPU_026073		\N
GLEAN3_20022	SPU_020022		\N
GLEAN3_23141	SPU_023141		\N
GLEAN3_03585	SPU_003585		\N
GLEAN3_03586	SPU_003586		\N
GLEAN3_01198	SPU_001198		\N
GLEAN3_18649	SPU_018649		\N
GLEAN3_14763	SPU_014763		\N
GLEAN3_01859	SPU_001859		\N
GLEAN3_19224	SPU_019224	After reviewing the data and performing a BLAST search it appears that GLEAN3_19224 is on two different scaffolds. The sequence would be continuous and orderly if the two scaffolds were combined. The BLAST search revealed its adjacency and there is an internal overlap in the second scaffold. There was also no EST support found on GBrowse V0.5 and there as well as low transcriptome intensity.\nAdditional information from blastp:\ngb|AAS01046.1|  Src family kinase [Asterina miniata]     380      5e-104	\N
GLEAN3_02254	SPU_002254		\N
GLEAN3_16351	SPU_016351		\N
GLEAN3_20243	SPU_020243		\N
GLEAN3_22256	SPU_022256		\N
GLEAN3_01208	SPU_001208		\N
GLEAN3_24311	SPU_024311	After reviewing the data and performing a BLAST search it appears there is no sufficient GLEAN model fit for GLEAN3_24311. The sequence appears to be distributed onto 2 different scaffolds.  One the first scaffold the sequence doesn't start until aboutHowever, there appears to be an internal overlap between two of the sequences within the query. If one of the overlaps is discarded (805-902), there would be an orderly arrangement of the sequence. There was also a long string of repeats that were on different scaffolds that were discarded. \nAdditional information found:\n                                                          Score     E\nSequences producing significant alignments:               (Bits)  Value\nref|XP_001189237.1|PREDICTED: similar to DEP domain containi.645    0.0 	\N
GLEAN3_09283	SPU_009283	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data and comparing it to the BLAST results it is evident that if the 2 scaffolds were combined, the sequence would have an orderly continuous arrangement without any gaps or repeats present. There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5.	\N
GLEAN3_08398	SPU_008398		\N
GLEAN3_13862	SPU_013862		\N
GLEAN3_28249	SPU_028249		\N
GLEAN3_22004	SPU_022004		\N
GLEAN3_16976	SPU_016976		\N
GLEAN3_28094	SPU_028094		\N
GLEAN3_02659	SPU_002659		\N
GLEAN3_04395	SPU_004395		\N
GLEAN3_05875	SPU_005875		\N
GLEAN3_06866	SPU_006866		\N
GLEAN3_11872	SPU_011872		\N
GLEAN3_12722	SPU_012722		\N
GLEAN3_02819	SPU_002819		\N
GLEAN3_02363	SPU_002363		\N
GLEAN3_06787	SPU_006787		\N
GLEAN3_18991	SPU_018991		\N
GLEAN3_06721	SPU_006721		\N
GLEAN3_20054	SPU_020054		\N
GLEAN3_00105	SPU_000105		\N
GLEAN3_00244	SPU_000244		\N
GLEAN3_00514	SPU_000514		\N
GLEAN3_00824	SPU_000824		\N
GLEAN3_17249	SPU_017249		\N
GLEAN3_25961	SPU_025961		\N
GLEAN3_02851	SPU_002851		\N
GLEAN3_02926	SPU_002926		\N
GLEAN3_03710	SPU_003710		\N
GLEAN3_09638	SPU_009638		\N
GLEAN3_04037	SPU_004037		\N
GLEAN3_04272	SPU_004272		\N
GLEAN3_13608	SPU_013608		\N
GLEAN3_04518	SPU_004518		\N
GLEAN3_05055	SPU_005055		\N
GLEAN3_26724	SPU_026724		\N
GLEAN3_05215	SPU_005215		\N
GLEAN3_05610	SPU_005610		\N
GLEAN3_05882	SPU_005882		\N
GLEAN3_06008	SPU_006008		\N
GLEAN3_06026	SPU_006026		\N
GLEAN3_06276	SPU_006276	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 3 different scaffolds. After reviewing the excel data in comparison with the BLAST results it is clear that there are numerous gaps and internal repeats present within the 3 different scaffolds, resulting in poor overall coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values ranging >6. 	\N
GLEAN3_06402	SPU_006402		\N
GLEAN3_06475	SPU_006475		\N
GLEAN3_09270	SPU_009270		\N
GLEAN3_06869	SPU_006869		\N
GLEAN3_06901	SPU_006901		\N
GLEAN3_02952	SPU_002952		\N
GLEAN3_07078	SPU_007078		\N
GLEAN3_07861	SPU_007861	After reviewing the data and performing a BLAST search, it appears that this is the best match for this particular GLEAN model. When reviewing the excel data, it appears that the sequence is distributed onto 2 different scaffolds, between subject gb|DS010076 and gb|DS001479|. The first part of the sequence doesn't begin until the 37th base pair and continues until the end of the scaffold, 592. The rest of the sequence begins on 635 resulting in a small gap between the two scaffolds. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	\N
GLEAN3_08263	SPU_008263		\N
GLEAN3_08280	SPU_008280		\N
GLEAN3_08347	SPU_008347		\N
GLEAN3_08602	SPU_008602		\N
GLEAN3_08621	SPU_008621		\N
GLEAN3_27570	SPU_027570		\N
GLEAN3_02428	SPU_002428		\N
GLEAN3_08874	SPU_008874		\N
GLEAN3_24794	SPU_024794		\N
GLEAN3_08966	SPU_008966		\N
GLEAN3_28159	SPU_028159		\N
GLEAN3_09079	SPU_009079		\N
GLEAN3_09301	SPU_009301		\N
GLEAN3_09432	SPU_009432		\N
GLEAN3_09434	SPU_009434		\N
GLEAN3_11246	SPU_011246		\N
GLEAN3_09842	SPU_009842		\N
GLEAN3_25507	SPU_025507		\N
GLEAN3_21207	SPU_021207		\N
GLEAN3_06353	SPU_006353		\N
GLEAN3_01885	SPU_001885		\N
GLEAN3_25244	SPU_025244		\N
GLEAN3_03842	SPU_003842		\N
GLEAN3_04173	SPU_004173		\N
GLEAN3_21689	SPU_021689		\N
GLEAN3_10743	SPU_010743		\N
GLEAN3_08651	SPU_008651		\N
GLEAN3_09990	SPU_009990		\N
GLEAN3_10156	SPU_010156		\N
GLEAN3_10385	SPU_010385		\N
GLEAN3_18644	SPU_018644		\N
GLEAN3_10530	SPU_010530		\N
GLEAN3_10590	SPU_010590		\N
GLEAN3_10872	SPU_010872		\N
GLEAN3_10957	SPU_010957		\N
GLEAN3_10964	SPU_010964		\N
GLEAN3_11264	SPU_011264		\N
GLEAN3_11275	SPU_011275		\N
GLEAN3_11470	SPU_011470		\N
GLEAN3_11509	SPU_011509		\N
GLEAN3_11519	SPU_011519		\N
GLEAN3_11521	SPU_011521		\N
GLEAN3_11838	SPU_011838		\N
GLEAN3_12619	SPU_012619		\N
GLEAN3_12848	SPU_012848	After reviewing the excel data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When comparing the BLAST results to that of the excel data, it is evident that if these 2 scaffolds were combined the sequence would have an orderly continuous arrangement without any repeats or gaps present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be some what weak with most of the vlause being <5, but there were several values that were <5 as well	\N
GLEAN3_13349	SPU_013349		\N
GLEAN3_13494	SPU_013494		\N
GLEAN3_14253	SPU_014253		\N
GLEAN3_14587	SPU_014587		\N
GLEAN3_14726	SPU_014726		\N
GLEAN3_15541	SPU_015541		\N
GLEAN3_15674	SPU_015674		\N
GLEAN3_16150	SPU_016150	The BLAST results and the excel data it appears that the sequence is distributed onto 2 different scaffolds. However, there is a large gap between the two scaffolds from 995-1095. There are also several internal repeats within v2.1_scaffold84508. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores were somewhat strong with most of the values being greater than 5	\N
GLEAN3_15587	SPU_015587		\N
GLEAN3_16375	SPU_016375		\N
GLEAN3_16455	SPU_016455		\N
GLEAN3_16533	SPU_016533		\N
GLEAN3_16736	SPU_016736		\N
GLEAN3_16852	SPU_016852		\N
GLEAN3_17316	SPU_017316		\N
GLEAN3_17455	SPU_017455		\N
GLEAN3_17493	SPU_017493		\N
GLEAN3_17886	SPU_017886		\N
GLEAN3_17927	SPU_017927		\N
GLEAN3_17980	SPU_017980		\N
GLEAN3_18191	SPU_018191		\N
GLEAN3_18724	SPU_018724		\N
GLEAN3_19053	SPU_019053		\N
GLEAN3_05358	SPU_005358		\N
GLEAN3_19413	SPU_019413		\N
GLEAN3_19620	SPU_019620		\N
GLEAN3_19799	SPU_019799		\N
GLEAN3_19837	SPU_019837		\N
GLEAN3_21173	SPU_021173		\N
GLEAN3_11446	SPU_011446		\N
GLEAN3_00964	SPU_000964	From the excel data and the BLAST results, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it appears that the two scaffolds have an orderly arrangement without any gaps or repeats present. If the two scaffolds were to be combined, the entire sequence would have an orderly and continuous arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be clustered in two groups on opposite sides of the graph. Both clusters appeared to have some what strong intensities with most of the values being greater than 5. 	\N
GLEAN3_18697	SPU_018697		\N
GLEAN3_18028	SPU_018028		\N
GLEAN3_05312	SPU_005312		\N
GLEAN3_23783	SPU_023783		\N
GLEAN3_12560	SPU_012560		\N
GLEAN3_00493	SPU_000493		\N
GLEAN3_15443	SPU_015443		\N
GLEAN3_20395	SPU_020395		\N
GLEAN3_21071	SPU_021071		\N
GLEAN3_07148	SPU_007148		\N
GLEAN3_20413	SPU_020413		\N
GLEAN3_20532	SPU_020532		\N
GLEAN3_09044	SPU_009044		\N
GLEAN3_27556	SPU_027556		\N
GLEAN3_28497	SPU_028497		\N
GLEAN3_19339	SPU_019339		\N
GLEAN3_21552	SPU_021552		\N
GLEAN3_21858	SPU_021858		\N
GLEAN3_22200	SPU_022200		\N
GLEAN3_22809	SPU_022809		\N
GLEAN3_22935	SPU_022935		\N
GLEAN3_23136	SPU_023136		\N
GLEAN3_23583	SPU_023583		\N
GLEAN3_23601	SPU_023601		\N
GLEAN3_08733	SPU_008733		\N
GLEAN3_22767	SPU_022767		\N
GLEAN3_23791	SPU_023791		\N
GLEAN3_24035	SPU_024035	After reviewing the data and performing a BLAST search, it appears that there is no good fit for this particular GLEAN model. There are numerous gaps and repeats within the sequence resulting in poor coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong with most of the values being greater than 10.	\N
GLEAN3_24332	SPU_024332		\N
GLEAN3_24883	SPU_024883		\N
GLEAN3_25219	SPU_025219		\N
GLEAN3_25674	SPU_025674		\N
GLEAN3_25872	SPU_025872		\N
GLEAN3_25946	SPU_025946		\N
GLEAN3_26079	SPU_026079		\N
GLEAN3_27230	SPU_027230		\N
GLEAN3_27331	SPU_027331		\N
GLEAN3_27765	SPU_027765		\N
GLEAN3_28366	SPU_028366		\N
GLEAN3_17707	SPU_017707		\N
GLEAN3_03984	SPU_003984		\N
GLEAN3_00755	SPU_000755		\N
GLEAN3_26646	SPU_026646		\N
GLEAN3_26647	SPU_026647		\N
GLEAN3_04089	SPU_004089		\N
GLEAN3_08842	SPU_008842		\N
GLEAN3_09508	SPU_009508	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. There are several gaps within each scaffold and several internal repeats as well. Scaffold >v2.1_scaffold73429 was unique in that the sequence covered the first 100 base pairs. There was Est information available from Growse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong as well with values ranging from 2-22. This is an un-annotated gene so no additional information was available from Baylor annotations (gene informational comments).	\N
GLEAN3_06005	SPU_006005		\N
GLEAN3_24278	SPU_024278		\N
GLEAN3_18777	SPU_018777	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data in comparison with the BLAST results it was determined that both scaffolds contain an orderly arrangement, without any gaps or repeats present. If the two scaffolds were to be combined, the overall sequence would have good coverage.  There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall, with most of the values being less than 5. 	\N
GLEAN3_07765	SPU_007765		\N
GLEAN3_12763	SPU_012763		\N
GLEAN3_28694	SPU_028694		\N
GLEAN3_05329	SPU_005329		\N
GLEAN3_08838	SPU_008838		\N
GLEAN3_03467	SPU_003467		\N
GLEAN3_14250	SPU_014250		\N
GLEAN3_06279	SPU_006279		\N
GLEAN3_23516	SPU_023516		\N
GLEAN3_02961	SPU_002961	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 4 different scaffolds. When examining the excel data, it is apparent that between the 4 different scaffolds there are numerous gaps and repeats present. The overall sequence ends at 1869 but the fourth scaffold (v2.1_scaffold56607) is truncated at 1471. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong. 	\N
GLEAN3_18802	SPU_018802		\N
GLEAN3_08764	SPU_008764		\N
GLEAN3_07142	SPU_007142		\N
GLEAN3_16743	SPU_016743		\N
GLEAN3_11773	SPU_011773		\N
GLEAN3_07413	SPU_007413		\N
GLEAN3_11634	SPU_011634		\N
GLEAN3_09191	SPU_009191		\N
GLEAN3_25733	SPU_025733		\N
GLEAN3_21200	SPU_021200		\N
GLEAN3_27308	SPU_027308	When a BLAST search was done, this was the only scaffold that was found. The sequence coverage isn't very good since only 383/849 base pairs match. 	\N
GLEAN3_24854	SPU_024854		\N
GLEAN3_21724	SPU_021724		\N
GLEAN3_10659	SPU_010659		\N
GLEAN3_23776	SPU_023776		\N
GLEAN3_01062	SPU_001062		\N
GLEAN3_16792	SPU_016792		\N
GLEAN3_12696	SPU_012696		\N
GLEAN3_25744	SPU_025744		\N
GLEAN3_21548	SPU_021548		\N
GLEAN3_05973	SPU_005973		\N
GLEAN3_25460	SPU_025460		\N
GLEAN3_16208	SPU_016208		\N
GLEAN3_00371	SPU_000371		\N
GLEAN3_08862	SPU_008862		\N
GLEAN3_14579	SPU_014579		\N
GLEAN3_26863	SPU_026863		\N
GLEAN3_06773	SPU_006773		\N
GLEAN3_12216	SPU_012216		\N
GLEAN3_13398	SPU_013398		\N
GLEAN3_10303	SPU_010303		\N
GLEAN3_26084	SPU_026084		\N
GLEAN3_09151	SPU_009151		\N
GLEAN3_04492	SPU_004492		\N
GLEAN3_13312	SPU_013312		\N
GLEAN3_24299	SPU_024299	For this particular GLEAN model there was no CDS information or gene features available on either the SpBase search engine or the Baylor annotations page. Only the mRNA sequence was available on the Baylor page. When the Excel data was analyzed, subject gb|DS006122| appeared to have an orderly arrangement and no internal repeats.  	\N
GLEAN3_11724	SPU_011724		\N
GLEAN3_19995	SPU_019995		\N
GLEAN3_01702	SPU_001702		\N
GLEAN3_17515	SPU_017515	For this particular GLEAN model, there was no Cds information available from either SpBase or Baylor annotations. There was however mRNA information provided by SpBase. When examining the excel data, it is believed that this sequence may be distributed onto two different scaffolds. The first portion of the sequence appears to be distributed onto AAGJ02126498 and the remaining portion of the sequence onto. gb|DS009869|. Both scaffolds contain an orderly arrangement without any internal repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_08215	SPU_008215		\N
GLEAN3_20816	SPU_020816		\N
GLEAN3_00793	SPU_000793		\N
GLEAN3_12489	SPU_012489		\N
GLEAN3_28036	SPU_028036		\N
GLEAN3_28038	SPU_028038		\N
GLEAN3_25292	SPU_025292		\N
GLEAN3_03387	SPU_003387		\N
GLEAN3_10287	SPU_010287		\N
GLEAN3_28430	SPU_028430		\N
GLEAN3_01315	SPU_001315		\N
GLEAN3_27111	SPU_027111		\N
GLEAN3_17502	SPU_017502		\N
GLEAN3_18369	SPU_018369		\N
GLEAN3_12228	SPU_012228		\N
GLEAN3_04143	SPU_004143		\N
GLEAN3_07216	SPU_007216		\N
GLEAN3_14869	SPU_014869		\N
GLEAN3_25524	SPU_025524		\N
GLEAN3_04195	SPU_004195		\N
GLEAN3_14365	SPU_014365		\N
GLEAN3_08120	SPU_008120		\N
GLEAN3_12910	SPU_012910		\N
GLEAN3_26096	SPU_026096		\N
GLEAN3_08562	SPU_008562		\N
GLEAN3_25772	SPU_025772		\N
GLEAN3_20452	SPU_020452		\N
GLEAN3_22636	SPU_022636		\N
GLEAN3_17644	SPU_017644		\N
GLEAN3_12683	SPU_012683		\N
GLEAN3_09736	SPU_009736		\N
GLEAN3_13842	SPU_013842		\N
GLEAN3_10664	SPU_010664		\N
GLEAN3_01550	SPU_001550		\N
GLEAN3_10722	SPU_010722		\N
GLEAN3_24366	SPU_024366		\N
GLEAN3_16214	SPU_016214		\N
GLEAN3_04247	SPU_004247		\N
GLEAN3_11145	SPU_011145		\N
GLEAN3_03137	SPU_003137		\N
GLEAN3_04425	SPU_004425		\N
GLEAN3_19856	SPU_019856		\N
GLEAN3_26961	SPU_026961	After reviewing the data and performing a BLAST search, it appears that the data is distributed onto 2 different scaffolds. The BLAST results did not display these 2 scaffolds as the best fit, probably due to a slightly higher bit score and lowered e-value when compared to the other scaffold results. When examining the excel results, it is evident that if the 2 scaffolds were combined, the sequence would have an orderly continuous arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall	\N
GLEAN3_22431	SPU_022431		\N
GLEAN3_15565	SPU_015565		\N
GLEAN3_00308	SPU_000308		\N
GLEAN3_15215	SPU_015215		\N
GLEAN3_15706	SPU_015706		\N
GLEAN3_22928	SPU_022928		\N
GLEAN3_02118	SPU_002118		\N
GLEAN3_01939	SPU_001939		\N
GLEAN3_00259	SPU_000259		\N
GLEAN3_27628	SPU_027628		\N
GLEAN3_09850	SPU_009850		\N
GLEAN3_06241	SPU_006241		\N
GLEAN3_05298	SPU_005298		\N
GLEAN3_02673	SPU_002673		\N
GLEAN3_10336	SPU_010336		\N
GLEAN3_05355	SPU_005355		\N
GLEAN3_14527	SPU_014527		\N
GLEAN3_00202	SPU_000202		\N
GLEAN3_14071	SPU_014071		\N
GLEAN3_18209	SPU_018209		\N
GLEAN3_06801	SPU_006801		\N
GLEAN3_10165	SPU_010165		\N
GLEAN3_00206	SPU_000206		\N
GLEAN3_16606	SPU_016606		\N
GLEAN3_22723	SPU_022723		\N
GLEAN3_26219	SPU_026219		\N
GLEAN3_11538	SPU_011538		\N
GLEAN3_12563	SPU_012563		\N
GLEAN3_23336	SPU_023336		\N
GLEAN3_21433	SPU_021433		\N
GLEAN3_25771	SPU_025771		\N
GLEAN3_06720	SPU_006720		\N
GLEAN3_14821	SPU_014821		\N
GLEAN3_21704	SPU_021704		\N
GLEAN3_27616	SPU_027616	After reviewing the data from the excel file and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The sequence doesn't begin until about 25, but from there if the two scaffolds were combined the sequence would have an orderly continuous arrangement, without any repeats or gaps present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values beign greater than 10. 	\N
GLEAN3_05032	SPU_005032		\N
GLEAN3_06533	SPU_006533		\N
GLEAN3_03032	SPU_003032		\N
GLEAN3_00784	SPU_000784		\N
GLEAN3_04814	SPU_004814		\N
GLEAN3_08717	SPU_008717		\N
GLEAN3_05063	SPU_005063		\N
GLEAN3_01401	SPU_001401		\N
GLEAN3_00865	SPU_000865		\N
GLEAN3_08024	SPU_008024		\N
GLEAN3_20168	SPU_020168		\N
GLEAN3_14746	SPU_014746		\N
GLEAN3_28344	SPU_028344		\N
GLEAN3_08000	SPU_008000		\N
GLEAN3_17814	SPU_017814		\N
GLEAN3_08699	SPU_008699		\N
GLEAN3_13509	SPU_013509		\N
GLEAN3_04762	SPU_004762		\N
GLEAN3_12944	SPU_012944		\N
GLEAN3_03865	SPU_003865		\N
GLEAN3_08994	SPU_008994		\N
GLEAN3_15445	SPU_015445		\N
GLEAN3_26049	SPU_026049		\N
GLEAN3_04444	SPU_004444		\N
GLEAN3_01327	SPU_001327		\N
GLEAN3_28416	SPU_028416		\N
GLEAN3_10377	SPU_010377		\N
GLEAN3_22493	SPU_022493		\N
GLEAN3_23132	SPU_023132		\N
GLEAN3_04431	SPU_004431		\N
GLEAN3_20491	SPU_020491		\N
GLEAN3_26470	SPU_026470		\N
GLEAN3_25537	SPU_025537		\N
GLEAN3_21869	SPU_021869		\N
GLEAN3_09289	SPU_009289		\N
GLEAN3_24508	SPU_024508		\N
GLEAN3_01903	SPU_001903		\N
GLEAN3_12975	SPU_012975		\N
GLEAN3_13163	SPU_013163		\N
GLEAN3_07763	SPU_007763		\N
GLEAN3_15403	SPU_015403		\N
GLEAN3_13716	SPU_013716		\N
GLEAN3_24449	SPU_024449		\N
GLEAN3_15613	SPU_015613		\N
GLEAN3_15614	SPU_015614		\N
GLEAN3_10347	SPU_010347		\N
GLEAN3_17052	SPU_017052		\N
GLEAN3_22187	SPU_022187		\N
GLEAN3_10531	SPU_010531		\N
GLEAN3_11204	SPU_011204		\N
GLEAN3_27633	SPU_027633		\N
GLEAN3_06211	SPU_006211		\N
GLEAN3_04687	SPU_004687		\N
GLEAN3_07056	SPU_007056		\N
GLEAN3_10456	SPU_010456		\N
GLEAN3_10517	SPU_010517		\N
GLEAN3_11168	SPU_011168		\N
GLEAN3_11952	SPU_011952		\N
GLEAN3_14245	SPU_014245		\N
GLEAN3_20864	SPU_020864		\N
GLEAN3_23236	SPU_023236		\N
GLEAN3_09852	SPU_009852		\N
GLEAN3_28120	SPU_028120	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it was apparent that both scaffolds contained an orderly arrangement without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with the majority of values being <10	\N
GLEAN3_20476	SPU_020476		\N
GLEAN3_24515	SPU_024515		\N
GLEAN3_27116	SPU_027116		\N
GLEAN3_10606	SPU_010606		\N
GLEAN3_08003	SPU_008003		\N
GLEAN3_11360	SPU_011360		\N
GLEAN3_21160	SPU_021160		\N
GLEAN3_10167	SPU_010167		\N
GLEAN3_16898	SPU_016898		\N
GLEAN3_21219	SPU_021219		\N
GLEAN3_07422	SPU_007422		\N
GLEAN3_05162	SPU_005162		\N
GLEAN3_17773	SPU_017773		\N
GLEAN3_25833	SPU_025833		\N
GLEAN3_15246	SPU_015246		\N
GLEAN3_10784	SPU_010784		\N
GLEAN3_08850	SPU_008850		\N
GLEAN3_19273	SPU_019273		\N
GLEAN3_01160	SPU_001160		\N
GLEAN3_26648	SPU_026648		\N
GLEAN3_21621	SPU_021621		\N
GLEAN3_26002	SPU_026002		\N
GLEAN3_13384	SPU_013384		\N
GLEAN3_03881	SPU_003881		\N
GLEAN3_25226	SPU_025226		\N
GLEAN3_22864	SPU_022864		\N
GLEAN3_01890	SPU_001890		\N
GLEAN3_06886	SPU_006886		\N
GLEAN3_11259	SPU_011259		\N
GLEAN3_24752	SPU_024752		\N
GLEAN3_01780	SPU_001780		\N
GLEAN3_01929	SPU_001929		\N
GLEAN3_02810	SPU_002810		\N
GLEAN3_02895	SPU_002895		\N
GLEAN3_04172	SPU_004172		\N
GLEAN3_05666	SPU_005666		\N
GLEAN3_05483	SPU_005483		\N
GLEAN3_09841	SPU_009841		\N
GLEAN3_10831	SPU_010831		\N
GLEAN3_17354	SPU_017354		\N
GLEAN3_19708	SPU_019708		\N
GLEAN3_20324	SPU_020324		\N
GLEAN3_21037	SPU_021037		\N
GLEAN3_21638	SPU_021638		\N
GLEAN3_21817	SPU_021817		\N
GLEAN3_25192	SPU_025192		\N
GLEAN3_27832	SPU_027832		\N
GLEAN3_27926	SPU_027926		\N
GLEAN3_28395	SPU_028395		\N
GLEAN3_20294	SPU_020294		\N
GLEAN3_16562	SPU_016562		\N
GLEAN3_22236	SPU_022236		\N
GLEAN3_26705	SPU_026705		\N
GLEAN3_08656	SPU_008656		\N
GLEAN3_28919	SPU_028919		\N
GLEAN3_21154	SPU_021154		\N
GLEAN3_25424	SPU_025424		\N
GLEAN3_06012	SPU_006012		\N
GLEAN3_10188	SPU_010188		\N
GLEAN3_04443	SPU_004443		\N
GLEAN3_01156	SPU_001156		\N
GLEAN3_01931	SPU_001931		\N
GLEAN3_08121	SPU_008121	After reviewing the data for model_GLEAN3_08121 it appears that this is the best fit for this particular GLEAN model based on bit score and e-value. The sequence is distributed onto 2 different scaffolds but there is a wide gap from 540-670. The arrangement of the sequence is orderly; however, there are several internal repeats within the sequence. There is Est information available on GBrowse assembly V0.5 and the transcriptome intensity information appears to be widely distributed and somewhat strong.	\N
GLEAN3_28714	SPU_028714		\N
GLEAN3_07633	SPU_007633		\N
GLEAN3_13200	SPU_013200		\N
GLEAN3_15625	SPU_015625		\N
GLEAN3_09478	SPU_009478		\N
GLEAN3_09480	SPU_009480		\N
GLEAN3_21458	SPU_021458		\N
GLEAN3_24082	SPU_024082		\N
GLEAN3_26860	SPU_026860		\N
GLEAN3_11819	SPU_011819		\N
GLEAN3_26643	SPU_026643		\N
GLEAN3_12618	SPU_012618		\N
GLEAN3_01991	SPU_001991		\N
GLEAN3_20821	SPU_020821		\N
GLEAN3_28903	SPU_028903		\N
GLEAN3_16007	SPU_016007		\N
GLEAN3_07424	SPU_007424		\N
GLEAN3_12207	SPU_012207		\N
GLEAN3_19667	SPU_019667		\N
GLEAN3_20350	SPU_020350		\N
GLEAN3_02297	SPU_002297		\N
GLEAN3_08728	SPU_008728	After reviewing the data and performing a BLAST search it appear that the data is distributed onto 2 different scaffolds. If the two scaffolds were combined, it appears that the sequence would have a continuous, orderly arrangement without any repeats or gaps present. There was some Est information available and the transcriptome intensity scores appeared to be some what strong with most of the values being greater than 5.	\N
GLEAN3_00114	SPU_000114		\N
GLEAN3_01045	SPU_001045		\N
GLEAN3_12858	SPU_012858		\N
GLEAN3_06333	SPU_006333		\N
GLEAN3_24555	SPU_024555		\N
GLEAN3_08953	SPU_008953		\N
GLEAN3_15276	SPU_015276		\N
GLEAN3_22810	SPU_022810		\N
GLEAN3_13835	SPU_013835		\N
GLEAN3_17837	SPU_017837		\N
GLEAN3_22957	SPU_022957		\N
GLEAN3_24985	SPU_024985		\N
GLEAN3_15819	SPU_015819		\N
GLEAN3_08211	SPU_008211		\N
GLEAN3_00904	SPU_000904		\N
GLEAN3_02070	SPU_002070		\N
GLEAN3_06623	SPU_006623		\N
GLEAN3_12838	SPU_012838		\N
GLEAN3_08703	SPU_008703		\N
GLEAN3_12396	SPU_012396		\N
GLEAN3_21491	SPU_021491		\N
GLEAN3_20565	SPU_020565		\N
GLEAN3_17635	SPU_017635		\N
GLEAN3_08112	SPU_008112		\N
GLEAN3_24093	SPU_024093		\N
GLEAN3_00638	SPU_000638		\N
GLEAN3_07942	SPU_007942		\N
GLEAN3_11931	SPU_011931		\N
GLEAN3_15012	SPU_015012		\N
GLEAN3_20540	SPU_020540		\N
GLEAN3_18829	SPU_018829		\N
GLEAN3_28555	SPU_028555		\N
GLEAN3_27786	SPU_027786		\N
GLEAN3_09126	SPU_009126		\N
GLEAN3_09558	SPU_009558	After reviewing the data and performing a BLAST search, it appears that there is no good GLEAN model fit for model_GLEAN3_09558. There are several internal repeats within the scaffolds, several gaps within the sequence and poor sequence coverage overall. >v2.1_scaffold57986 was the best results indicated by the BLAST search results based on a low e-value and high bit score. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	\N
GLEAN3_19287	SPU_019287		\N
GLEAN3_10737	SPU_010737		\N
GLEAN3_23198	SPU_023198		\N
GLEAN3_13856	SPU_013856		\N
GLEAN3_15126	SPU_015126	After reviewing the data it appears that there is no sufficient GLEAN model that fits GLEAN3_15126 due to the poor sequence coverage, gaps present and internal repeats within the sequence. The sequence appears to be distributed across 4 different scaffolds.  There was Est. information available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be clumped together and strong. This is an un-annotated gene so no additional comments were available from Baylor annotations (comments).	\N
GLEAN3_16699	SPU_016699		\N
GLEAN3_27001	SPU_027001		\N
GLEAN3_23450	SPU_023450		\N
GLEAN3_04323	SPU_004323		\N
GLEAN3_08845	SPU_008845		\N
GLEAN3_09231	SPU_009231		\N
GLEAN3_05787	SPU_005787		\N
GLEAN3_06232	SPU_006232		\N
GLEAN3_15712	SPU_015712		\N
GLEAN3_15983	SPU_015983		\N
GLEAN3_22163	SPU_022163		\N
GLEAN3_26205	SPU_026205		\N
GLEAN3_00990	SPU_000990		\N
GLEAN3_03681	SPU_003681		\N
GLEAN3_06583	SPU_006583		\N
GLEAN3_22554	SPU_022554		\N
GLEAN3_23339	SPU_023339	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the two scaffolds were combined, the sequence would have an orderly, continuous arrangement without any gaps or repeats present. This is an un-annotated gene so no additional gene information (comments) was available from the Baylor webpage. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall with most of the values being <8	\N
GLEAN3_17484	SPU_017484		\N
GLEAN3_08673	SPU_008673		\N
GLEAN3_18452	SPU_018452		\N
GLEAN3_02727	SPU_002727		\N
GLEAN3_07962	SPU_007962		\N
GLEAN3_18804	SPU_018804		\N
GLEAN3_09523	SPU_009523	For this particular GLEAN model it appears that this is the best fit based on the BLAST results in comparison with the excel data. From the excel data it was evident that there were no large gaps present and the sequence had overall good coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10	\N
GLEAN3_25640	SPU_025640		\N
GLEAN3_09892	SPU_009892		\N
GLEAN3_00707	SPU_000707		\N
GLEAN3_23517	SPU_023517		\N
GLEAN3_28837	SPU_028837		\N
GLEAN3_06569	SPU_006569		\N
GLEAN3_08313	SPU_008313		\N
GLEAN3_15551	SPU_015551		\N
GLEAN3_18933	SPU_018933		\N
GLEAN3_13432	SPU_013432		\N
GLEAN3_02464	SPU_002464		\N
GLEAN3_19788	SPU_019788		\N
GLEAN3_28155	SPU_028155		\N
GLEAN3_10528	SPU_010528		\N
GLEAN3_08558	SPU_008558		\N
GLEAN3_18884	SPU_018884		\N
GLEAN3_03485	SPU_003485		\N
GLEAN3_06213	SPU_006213		\N
GLEAN3_22322	SPU_022322		\N
GLEAN3_02193	SPU_002193		\N
GLEAN3_21195	SPU_021195		\N
GLEAN3_11921	SPU_011921		\N
GLEAN3_09240	SPU_009240		\N
GLEAN3_21695	SPU_021695		\N
GLEAN3_24546	SPU_024546		\N
GLEAN3_12376	SPU_012376		\N
GLEAN3_06072	SPU_006072		\N
GLEAN3_17142	SPU_017142		\N
GLEAN3_25569	SPU_025569		\N
GLEAN3_25318	SPU_025318		\N
GLEAN3_20627	SPU_020627		\N
GLEAN3_14005	SPU_014005		\N
GLEAN3_18674	SPU_018674		\N
GLEAN3_02880	SPU_002880		\N
GLEAN3_14056	SPU_014056		\N
GLEAN3_14057	SPU_014057		\N
GLEAN3_07057	SPU_007057		\N
GLEAN3_15818	SPU_015818		\N
GLEAN3_00379	SPU_000379		\N
GLEAN3_18801	SPU_018801		\N
GLEAN3_01772	SPU_001772		\N
GLEAN3_08778	SPU_008778		\N
GLEAN3_05669	SPU_005669		\N
GLEAN3_17238	SPU_017238		\N
GLEAN3_24993	SPU_024993		\N
GLEAN3_04609	SPU_004609		\N
GLEAN3_03849	SPU_003849		\N
GLEAN3_21924	SPU_021924		\N
GLEAN3_27892	SPU_027892		\N
GLEAN3_28247	SPU_028247		\N
GLEAN3_15666	SPU_015666		\N
GLEAN3_27149	SPU_027149		\N
GLEAN3_08296	SPU_008296		\N
GLEAN3_15294	SPU_015294		\N
GLEAN3_03905	SPU_003905		\N
GLEAN3_00395	SPU_000395		\N
GLEAN3_24214	SPU_024214		\N
GLEAN3_11572	SPU_011572		\N
GLEAN3_25574	SPU_025574		\N
GLEAN3_26541	SPU_026541		\N
GLEAN3_06459	SPU_006459		\N
GLEAN3_22161	SPU_022161		\N
GLEAN3_16200	SPU_016200		\N
GLEAN3_02438	SPU_002438		\N
GLEAN3_06034	SPU_006034		\N
GLEAN3_04996	SPU_004996		\N
GLEAN3_24553	SPU_024553		\N
SpMOSb	SPU_030242		\N
SpMELK2	SPU_030243		\N
GLEAN3_02724	SPU_002724		\N
GLEAN3_03718	SPU_003718		\N
GLEAN3_07327	SPU_007327		\N
GLEAN3_26671	SPU_026671		\N
GLEAN3_05699	SPU_005699		\N
GLEAN3_01585	SPU_001585	The BLAST results along with the excel data indicate that this sequence is distributed onto 2 different scaffolds.  Both scaffolds contain an orderly arrangement without any gaps or repeats present. If the two scaffolds were to be combined, the entire sequence would have a continuous arrangement with good overall coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	\N
GLEAN3_10941	SPU_010941		\N
GLEAN3_26362	SPU_026362		\N
GLEAN3_22450	SPU_022450		\N
GLEAN3_03702	SPU_003702		\N
GLEAN3_22575	SPU_022575		\N
GLEAN3_06136	SPU_006136		\N
GLEAN3_27481	SPU_027481		\N
GLEAN3_20693	SPU_020693	For this particular GLEAN model there was no CDS information available or gene features in either the SpBase search engine or the Baylor annotations page. There was also no mRNA information available and the gene was not annotated.  After examining the excel data, it appears that the sequence did not have an orderly arrangement due to several gaps present. Several sequences between different subjects were the same as well and if the data could have been mapped onto V2.1 it would have been difficult to distinguish between the different scaffolds. 	\N
GLEAN3_20966	SPU_020966		\N
GLEAN3_25790	SPU_025790		\N
GLEAN3_21942	SPU_021942		\N
GLEAN3_26907	SPU_026907		\N
GLEAN3_11365	SPU_011365		\N
GLEAN3_05743	SPU_005743	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. Within the first scaffold, there is one internal repeat present that is evident from both the BLAST results and the excel data. Other than this repeat, if the two scaffolds were combined the sequence would have an orderly and continuous arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with many of the values being greater than 10.	\N
GLEAN3_07129	SPU_007129		\N
GLEAN3_00256	SPU_000256		\N
GLEAN3_23886	SPU_023886		\N
GLEAN3_25487	SPU_025487		\N
GLEAN3_20929	SPU_020929		\N
GLEAN3_24532	SPU_024532		\N
GLEAN3_14066	SPU_014066		\N
GLEAN3_15833	SPU_015833		\N
GLEAN3_24235	SPU_024235		\N
GLEAN3_27404	SPU_027404		\N
GLEAN3_14553	SPU_014553		\N
GLEAN3_17333	SPU_017333		\N
GLEAN3_23318	SPU_023318		\N
GLEAN3_12299	SPU_012299		\N
GLEAN3_24260	SPU_024260		\N
GLEAN3_03709	SPU_003709		\N
GLEAN3_12395	SPU_012395		\N
GLEAN3_04887	SPU_004887		\N
GLEAN3_05999	SPU_005999		\N
GLEAN3_04522	SPU_004522		\N
GLEAN3_04863	SPU_004863		\N
GLEAN3_03596	SPU_003596		\N
GLEAN3_12087	SPU_012087		\N
GLEAN3_26864	SPU_026864		\N
GLEAN3_24241	SPU_024241		\N
GLEAN3_26740	SPU_026740		\N
GLEAN3_16514	SPU_016514		\N
GLEAN3_01255	SPU_001255		\N
GLEAN3_04255	SPU_004255		\N
GLEAN3_21051	SPU_021051		\N
GLEAN3_02060	SPU_002060		\N
GLEAN3_11986	SPU_011986		\N
GLEAN3_16542	SPU_016542		\N
GLEAN3_25817	SPU_025817		\N
GLEAN3_21798	SPU_021798		\N
GLEAN3_06042	SPU_006042		\N
GLEAN3_09012	SPU_009012		\N
GLEAN3_27394	SPU_027394		\N
GLEAN3_23148	SPU_023148		\N
GLEAN3_07455	SPU_007455		\N
GLEAN3_23647	SPU_023647		\N
GLEAN3_26199	SPU_026199		\N
GLEAN3_26297	SPU_026297		\N
GLEAN3_19012	SPU_019012		\N
GLEAN3_27603	SPU_027603		\N
GLEAN3_13719	SPU_013719		\N
GLEAN3_10030	SPU_010030		\N
GLEAN3_05314	SPU_005314		\N
GLEAN3_13953	SPU_013953		\N
GLEAN3_26382	SPU_026382		\N
GLEAN3_27752	SPU_027752		\N
GLEAN3_09241	SPU_009241		\N
GLEAN3_08315	SPU_008315		\N
GLEAN3_18716	SPU_018716		\N
GLEAN3_15238	SPU_015238		\N
GLEAN3_17414	SPU_017414		\N
GLEAN3_21118	SPU_021118		\N
GLEAN3_22947	SPU_022947		\N
GLEAN3_28545	SPU_028545		\N
GLEAN3_21259	SPU_021259		\N
GLEAN3_28773	SPU_028773		\N
GLEAN3_20214	SPU_020214		\N
GLEAN3_12918	SPU_012918		\N
GLEAN3_00456	SPU_000456		\N
GLEAN3_27730	SPU_027730	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. There is an overlap between the 2 scaffolds from 536-745. Besides this sequence overlap, it appears that if the 2 scaffolds were combined the sequence would have an orderly and continuous arrangement. The was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10	\N
GLEAN3_07750	SPU_007750		\N
GLEAN3_11535	SPU_011535		\N
GLEAN3_09643	SPU_009643		\N
GLEAN3_27766	SPU_027766	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. Based on the BLAST results and the excel data, it appears that if these 2 scaffolds were combined the sequence would have a continuous and orderly arrangement. There was Est. support from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the scores being below 8. This is an un-annotated gene so no additional gene information (comments) was available from Baylor annotations. 	\N
GLEAN3_16390	SPU_016390		\N
GLEAN3_06163	SPU_006163		\N
GLEAN3_04622	SPU_004622		\N
GLEAN3_07736	SPU_007736		\N
GLEAN3_21774	SPU_021774		\N
GLEAN3_11725	SPU_011725		\N
GLEAN3_04379	SPU_004379		\N
GLEAN3_15701	SPU_015701	After reviewing the data and performing a BLAST search, it appears that this is the best match for this GLEAN model. There is poor sequence coverage as well as several repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely dispersed with most of the values being less than 10 excluding 2 outliers at 34 and 26. This is an un-annotated gene so no additional gene information was available.	\N
GLEAN3_27970	SPU_027970		\N
GLEAN3_18434	SPU_018434		\N
GLEAN3_27888	SPU_027888		\N
GLEAN3_03110	SPU_003110		\N
GLEAN3_12206	SPU_012206		\N
GLEAN3_01006	SPU_001006		\N
GLEAN3_11175	SPU_011175		\N
GLEAN3_12240	SPU_012240		\N
GLEAN3_20441	SPU_020441		\N
GLEAN3_10254	SPU_010254		\N
GLEAN3_03906	SPU_003906		\N
GLEAN3_14806	SPU_014806		\N
GLEAN3_27022	SPU_027022		\N
GLEAN3_27548	SPU_027548		\N
GLEAN3_21688	SPU_021688		\N
GLEAN3_25979	SPU_025979		\N
GLEAN3_12386	SPU_012386		\N
GLEAN3_02476	SPU_002476		\N
GLEAN3_24344	SPU_024344		\N
GLEAN3_05944	SPU_005944		\N
GLEAN3_07405	SPU_007405		\N
GLEAN3_12818	SPU_012818		\N
GLEAN3_14020	SPU_014020		\N
GLEAN3_15375	SPU_015375		\N
GLEAN3_13346	SPU_013346		\N
GLEAN3_06379	SPU_006379		\N
GLEAN3_28375	SPU_028375		\N
GLEAN3_23525	SPU_023525		\N
GLEAN3_17954	SPU_017954		\N
GLEAN3_10628	SPU_010628		\N
GLEAN3_14580	SPU_014580		\N
GLEAN3_26781	SPU_026781		\N
GLEAN3_24055	SPU_024055		\N
GLEAN3_03900	SPU_003900		\N
GLEAN3_01758	SPU_001758		\N
GLEAN3_10866	SPU_010866		\N
GLEAN3_00286	SPU_000286		\N
GLEAN3_04662	SPU_004662		\N
GLEAN3_07243	SPU_007243		\N
GLEAN3_10418	SPU_010418		\N
GLEAN3_18330	SPU_018330		\N
GLEAN3_18439	SPU_018439		\N
GLEAN3_20804	SPU_020804		\N
GLEAN3_20061	SPU_020061		\N
GLEAN3_03518	SPU_003518		\N
GLEAN3_08042	SPU_008042		\N
GLEAN3_13282	SPU_013282		\N
GLEAN3_28601	SPU_028601		\N
GLEAN3_22879	SPU_022879		\N
GLEAN3_14489	SPU_014489		\N
GLEAN3_08294	SPU_008294		\N
GLEAN3_23423	SPU_023423		\N
GLEAN3_21716	SPU_021716		\N
GLEAN3_07219	SPU_007219		\N
GLEAN3_11547	SPU_011547		\N
GLEAN3_16588	SPU_016588		\N
GLEAN3_01416	SPU_001416		\N
GLEAN3_03773	SPU_003773		\N
GLEAN3_05229	SPU_005229		\N
GLEAN3_07190	SPU_007190		\N
GLEAN3_09198	SPU_009198		\N
GLEAN3_10458	SPU_010458		\N
GLEAN3_10634	SPU_010634		\N
GLEAN3_10914	SPU_010914		\N
GLEAN3_11760	SPU_011760		\N
GLEAN3_12443	SPU_012443		\N
GLEAN3_12955	SPU_012955		\N
GLEAN3_14696	SPU_014696		\N
GLEAN3_15199	SPU_015199		\N
GLEAN3_15495	SPU_015495		\N
GLEAN3_15800	SPU_015800		\N
GLEAN3_18960	SPU_018960		\N
GLEAN3_19508	SPU_019508		\N
GLEAN3_19857	SPU_019857		\N
GLEAN3_20110	SPU_020110		\N
GLEAN3_20846	SPU_020846		\N
GLEAN3_21164	SPU_021164		\N
GLEAN3_22428	SPU_022428		\N
GLEAN3_24143	SPU_024143		\N
GLEAN3_26110	SPU_026110		\N
GLEAN3_22912	SPU_022912		\N
GLEAN3_25000	SPU_025000		\N
GLEAN3_15932	SPU_015932		\N
GLEAN3_19875	SPU_019875		\N
GLEAN3_05386	SPU_005386		\N
GLEAN3_07265	SPU_007265		\N
GLEAN3_23685	SPU_023685		\N
GLEAN3_17509	SPU_017509		\N
GLEAN3_02511	SPU_002511		\N
GLEAN3_05443	SPU_005443		\N
GLEAN3_28234	SPU_028234		\N
GLEAN3_25484	SPU_025484		\N
GLEAN3_08914	SPU_008914		\N
GLEAN3_23012	SPU_023012		\N
GLEAN3_22332	SPU_022332		\N
GLEAN3_18987	SPU_018987		\N
GLEAN3_09075	SPU_009075		\N
GLEAN3_13799	SPU_013799		\N
GLEAN3_00966	SPU_000966		\N
GLEAN3_05788	SPU_005788		\N
GLEAN3_12988	SPU_012988		\N
GLEAN3_27242	SPU_027242		\N
GLEAN3_04398	SPU_004398		\N
GLEAN3_16961	SPU_016961		\N
GLEAN3_26982	SPU_026982		\N
GLEAN3_18321	SPU_018321		\N
GLEAN3_24606	SPU_024606		\N
GLEAN3_07185	SPU_007185		\N
GLEAN3_09109	SPU_009109		\N
GLEAN3_20924	SPU_020924		\N
GLEAN3_16139	SPU_016139		\N
GLEAN3_27461	SPU_027461		\N
GLEAN3_08407	SPU_008407		\N
GLEAN3_16665	SPU_016665		\N
GLEAN3_09592	SPU_009592		\N
GLEAN3_27827	SPU_027827		\N
GLEAN3_00354	SPU_000354		\N
GLEAN3_20624	SPU_020624		\N
GLEAN3_11523	SPU_011523		\N
GLEAN3_15351	SPU_015351		\N
GLEAN3_04704	SPU_004704		\N
GLEAN3_21175	SPU_021175		\N
GLEAN3_11984	SPU_011984		\N
GLEAN3_13092	SPU_013092		\N
GLEAN3_00850	SPU_000850		\N
GLEAN3_22410	SPU_022410		\N
GLEAN3_26946	SPU_026946		\N
GLEAN3_00277	SPU_000277		\N
GLEAN3_12163	SPU_012163		\N
GLEAN3_28833	SPU_028833		\N
GLEAN3_09141	SPU_009141		\N
GLEAN3_14381	SPU_014381		\N
GLEAN3_23507	SPU_023507		\N
GLEAN3_12712	SPU_012712		\N
GLEAN3_27378	SPU_027378		\N
GLEAN3_02613	SPU_002613		\N
GLEAN3_14654	SPU_014654		\N
GLEAN3_04381	SPU_004381		\N
GLEAN3_03369	SPU_003369	share strong homologies with mammalian counterparts	\N
GLEAN3_25042	SPU_025042	numerous paralogs and orthologs	\N
GLEAN3_25041	SPU_025041	numerous paralogs and orthologs	\N
GLEAN3_13932	SPU_013932	Very similar to Strongylocentrotus franciscanus epidermal growth factor II precursor	\N
GLEAN3_12045	SPU_012045	none	\N
GLEAN3_02971	SPU_002971	none	\N
GLEAN3_02973	SPU_002973	none	\N
GLEAN3_02972	SPU_002972	none	\N
GLEAN3_22099	SPU_022099	ArgRS core-containing	\N
GLEAN3_00005	SPU_000005	none	\N
GLEAN3_00008	SPU_000008	none	\N
GLEAN3_00010	SPU_000010	none	\N
GLEAN3_00014	SPU_000014	RAD6 homolog in Pan troglodytes	\N
GLEAN3_00018	SPU_000018	none	\N
GLEAN3_00019	SPU_000019	none	\N
GLEAN3_00021	SPU_000021	contains both DnaJ heat shock N-terminus motif and BSCB_C superfamily domain	\N
GLEAN3_00023	SPU_000023	none	\N
GLEAN3_00024	SPU_000024	none	\N
GLEAN3_00027	SPU_000027	none	\N
GLEAN3_00028	SPU_000028	none	\N
GLEAN3_00029	SPU_000029	contains RT_nLTR like motif	\N
GLEAN3_00030	SPU_000030	none	\N
GLEAN3_00032	SPU_000032	contains Rieske ISP(Iron-Sulfur, Fe-S) protein domain	\N
GLEAN3_00033	SPU_000033	none	\N
GLEAN3_00034	SPU_000034	contains COG3264 motif	\N
GLEAN3_00037	SPU_000037	PH-like domain near the C-terminus	\N
GLEAN3_00038	SPU_000038	Major Facilitator superfamily	\N
GLEAN3_00042	SPU_000042	also contains COG3055 conserved motif.	\N
GLEAN3_00043	SPU_000043	none	\N
GLEAN3_00044	SPU_000044	none	\N
GLEAN3_00050	SPU_000050	ankyrin repeat protein	\N
GLEAN3_00056	SPU_000056	none	\N
GLEAN3_00058	SPU_000058	none	\N
GLEAN3_00059	SPU_000059	DDX55 DEAD box polypeptide 55	\N
GLEAN3_00060	SPU_000060	none	\N
GLEAN3_00066	SPU_000066	none	\N
GLEAN3_00067	SPU_000067	none	\N
GLEAN3_00069	SPU_000069	none	\N
GLEAN3_00071	SPU_000071	none	\N
GLEAN3_00074	SPU_000074	none	\N
GLEAN3_00080	SPU_000080	none	\N
GLEAN3_00085	SPU_000085	none	\N
GLEAN3_00088	SPU_000088	homozygously deleted in human neuroblastoma	\N
GLEAN3_00089	SPU_000089	none	\N
GLEAN3_00090	SPU_000090	none	\N
GLEAN3_00095	SPU_000095	none	\N
GLEAN3_00099	SPU_000099	none	\N
GLEAN3_00103	SPU_000103	none	\N
GLEAN3_00104	SPU_000104	none	\N
GLEAN3_00107	SPU_000107	none	\N
GLEAN3_00109	SPU_000109	none	\N
GLEAN3_00110	SPU_000110	contains PRK08262 domain, hypothetical protein domain	\N
GLEAN3_00112	SPU_000112	none	\N
GLEAN3_00115	SPU_000115	none	\N
GLEAN3_00119	SPU_000119	none	\N
GLEAN3_00122	SPU_000122	none	\N
GLEAN3_00123	SPU_000123	contains SH3BP5 domain	\N
GLEAN3_00126	SPU_000126	contains IG CAM domain	\N
GLEAN3_00132	SPU_000132	none	\N
GLEAN3_00134	SPU_000134	none	\N
GLEAN3_00135	SPU_000135	3 ANK domains	\N
GLEAN3_00136	SPU_000136	contains Hf1C domain, also similar to stomatin isoforms 2,3 and band 7.2b stomatin and other forms of stomatin	\N
GLEAN3_00137	SPU_000137	also similar to a wide variety of lysozymes	\N
GLEAN3_00138	SPU_000138	RING superfamily motif towards C-terminus	\N
GLEAN3_00144	SPU_000144	contains NOP5 and MOSIC domains and NOP5NT towards N-terminus	\N
GLEAN3_00145	SPU_000145	contains Pyr_r and Ndh domains towards N-terminus	\N
GLEAN3_00147	SPU_000147	none	\N
GLEAN3_00155	SPU_000155	none	\N
GLEAN3_00156	SPU_000156	none	\N
GLEAN3_00157	SPU_000157	homologous to Drosophila dumpy gene CG33196-PB	\N
GLEAN3_00162	SPU_000162	none	\N
GLEAN3_00163	SPU_000163	none	\N
GLEAN3_00166	SPU_000166	none	\N
GLEAN3_00168	SPU_000168	contains LRR_RI superfamily motif in the N-terminal half, and Ran_GAP1_C superfamily motif in the C-terminal half	\N
GLEAN3_00169	SPU_000169	contains TLD superfamily motif near the C-terminus	\N
GLEAN3_00179	SPU_000179	none	\N
GLEAN3_00193	SPU_000193	none	\N
GLEAN3_00194	SPU_000194	contains RVP superfamily motif and RT_LTR domain motif	\N
GLEAN3_00195	SPU_000195	contains 2 C2 superfamily motifs, one in N-terminal half, the other towards C-terminus	\N
GLEAN3_00196	SPU_000196	none	\N
GLEAN3_00197	SPU_000197	also contains SpoVK domain	\N
GLEAN3_00198	SPU_000198	none	\N
GLEAN3_00201	SPU_000201	none	\N
GLEAN3_00204	SPU_000204	none	\N
GLEAN3_00207	SPU_000207	also similar to peroxisome membrane protein 3 (PXMP3), and contains 2 Pex2_Pex2 motifs	\N
GLEAN3_00209	SPU_000209	none	\N
GLEAN3_00211	SPU_000211	none	\N
GLEAN3_00212	SPU_000212	contains ATP-binding site and helicase motif near the N-terminus	\N
GLEAN3_00213	SPU_000213	none	\N
GLEAN3_00217	SPU_000217	none	\N
GLEAN3_00220	SPU_000220	homologous to various subtypes of annexin	\N
GLEAN3_00221	SPU_000221	homologous to various isoforms	\N
GLEAN3_00223	SPU_000223	none	\N
GLEAN3_00224	SPU_000224	none	\N
GLEAN3_00225	SPU_000225	none	\N
GLEAN3_00227	SPU_000227	none	\N
GLEAN3_00233	SPU_000233	also homologous to endonuclease/reverse transcriptase and many hypothetical proteins	\N
GLEAN3_00240	SPU_000240	none	\N
GLEAN3_00243	SPU_000243	contains a WD40 motif at the C-terminus, and a COG2319 domain towards C-terminus	\N
GLEAN3_00247	SPU_000247	none	\N
GLEAN3_00249	SPU_000249	none	\N
GLEAN3_00251	SPU_000251	contains DedA domain	\N
GLEAN3_00252	SPU_000252	similar to peroxisome proliferator-activated receptor binding protein (PBP) (PPAR binding protein) (thyroid hormone receptor associated protein complex 220 kDa component) (TRAP220) (thyroid receptor interacting protein 2) (TRIP2) (p52 regulatory protein ...)	\N
GLEAN3_00254	SPU_000254	contains a RING superfamly motif at the N-terminus, and a PRK02224 domain	\N
GLEAN3_00257	SPU_000257	none	\N
GLEAN3_00260	SPU_000260	homologous to numerous hypothetical proteins in a variety of species	\N
GLEAN3_00262	SPU_000262	contains Mab-21 superfamily motif at the N-terminus. also homologous to several hypothetical proteins.	\N
GLEAN3_00263	SPU_000263	also homologous to several hypothetical proteins in many species	\N
GLEAN3_00264	SPU_000264	none	\N
GLEAN3_00268	SPU_000268	none	\N
GLEAN3_00273	SPU_000273	also homologous to many different subtypes of annexin	\N
GLEAN3_00280	SPU_000280	none	\N
GLEAN3_00284	SPU_000284	contains Ins145_P3_rec domain. also homologous to various subtypes of ryanodine receptor (1, 2, 3) and isoforms.	\N
GLEAN3_00291	SPU_000291	also called PDGFA-associated protein 2, Fus-1 protein, or Fusion 1 protein.	\N
GLEAN3_00292	SPU_000292	none	\N
GLEAN3_00293	SPU_000293	contains TPR superfamily motif in N-terminal half, and Asp_Arg_Hydroxylase superfmily motif in C-terminal half.	\N
GLEAN3_00295	SPU_000295	contains two SH3 motifs and one WD40 motif in between. also orthologous to numerous hypothetical proteins.	\N
GLEAN3_00297	SPU_000297	none	\N
GLEAN3_00299	SPU_000299	none	\N
GLEAN3_00304	SPU_000304	orthologous to numerous hypothetical proteins	\N
GLEAN3_00305	SPU_000305	none	\N
GLEAN3_00309	SPU_000309	none	\N
GLEAN3_00311	SPU_000311	contains 3 WD40 motifs in the N-terminal half	\N
GLEAN3_00312	SPU_000312	contains F-box motif near the N-terminus	\N
GLEAN3_00316	SPU_000316	contains a PH-like superfamily motif at the N-terminus. also orthologous to many hypothetical proteins.	\N
GLEAN3_00317	SPU_000317	none	\N
GLEAN3_00319	SPU_000319	contains p23_NUDCD2-like motif	\N
GLEAN3_00321	SPU_000321	contains 2 UMP1 superfamily motifs	\N
GLEAN3_00313	SPU_000313	Mod_r superfamily motif at C-terminus	\N
GLEAN3_03362	SPU_003362	a large number of homologous gene models from Strongylocentrotus purpuratus indicating species-specific gene family	\N
GLEAN3_25044	SPU_025044	typical member of rhodopsin superfamily	\N
GLEAN3_00051	SPU_000051	none	\N
GLEAN3_00234	SPU_000234	also homologous to ankyrin-2 and ankyrin-3. contains 5 ANK motifs and 2 Arp domains.	\N
GLEAN3_13937	SPU_013937	large group of homologs in Strongylocentrotus purpuratus	\N
GLEAN3_00140	SPU_000140	none	\N
GLEAN3_00161	SPU_000161	none	\N
GLEAN3_00175	SPU_000175	contains two FA58C superfamily motifs: one near the N-terminus and the other in the C-terminal half	\N
GLEAN3_00177	SPU_000177	contains a Subtilisin_N superfamily motif near the N-terminus, and two PA superfamily motifs	\N
GLEAN3_00269	SPU_000269	also homologous to plexin A1, B1, A4, and other subtypes and isoforms	\N
GLEAN3_17137	SPU_017137	homologs are found in fungi, fish, and mouse, but not in plants	\N
GLEAN3_17033	SPU_017033	none	\N
GLEAN3_03367	SPU_003367	reversion-inducing-cysteine-rich protein in Gallus gallus	\N
GLEAN3_25047	SPU_025047	telomere-associated protein	\N
GLEAN3_26532	SPU_026532	TBC domain containing putative GTPase activating protein	\N
GLEAN3_26531	SPU_026531	none	\N
GLEAN3_26530	SPU_026530	none	\N
GLEAN3_12048	SPU_012048	superfamily of hypothetical proteins with unknown functions	\N
GLEAN3_12044	SPU_012044	most orthologous proteins are classified as hypothetical gene products	\N
GLEAN3_12047	SPU_012047	superfamily of proteins with known function	\N
GLEAN3_00180	SPU_000180	HEC1 domain containing; most homologs/orthologs are classified as hypothetical	\N
GLEAN3_00181	SPU_000181	none	\N
GLEAN3_00184	SPU_000184	none	\N
GLEAN3_02974	SPU_002974	none	\N
GLEAN3_22090	SPU_022090	none	\N
GLEAN3_22094	SPU_022094	none	\N
GLEAN3_00246	SPU_000246	none	\N
GLEAN3_00082	SPU_000082	contains Herpes_, OTUs, P-loop, and Exo_c domains	\N
GLEAN3_00324	SPU_000324	contains Asn_Synthase_B_C motif	\N
GLEAN3_00326	SPU_000326	contains 3 regions with WD40 motifs	\N
GLEAN3_00342	SPU_000342	none	\N
GLEAN3_00402	SPU_000402	none	\N
GLEAN3_00417	SPU_000417	contains PRK13388 domain. also orthologous to numerous hypothetical proteins.	\N
GLEAN3_00430	SPU_000430	also orthologous to numerous hypothetical proteins	\N
GLEAN3_00446	SPU_000446	also orthologous to numerous hypothetical proteins	\N
GLEAN3_00450	SPU_000450	also orthologous to numerous hypothetical proteins	\N
GLEAN3_00478	SPU_000478	also orthologous to numerous putative proteins	\N
GLEAN3_00497	SPU_000497	none	\N
GLEAN3_00510	SPU_000510	contains Nop53 domain in the N-terminal half, and small zf-C4 motif in the middle	\N
GLEAN3_00531	SPU_000531	none	\N
GLEAN3_00545	SPU_000545	contains BTB and BACK superfamily motifs at N-terminus. also orthologous to putative and miscellaneous proteins.	\N
GLEAN3_00548	SPU_000548	also orthologous to numerous putative and miscellaneous proteins	\N
GLEAN3_00560	SPU_000560	hypothetical protein. similar to SPU_000625.	\N
GLEAN3_00602	SPU_000602	also orthologous to numerous putative and miscellaneous proteins	\N
GLEAN3_00631	SPU_000631	none	\N
GLEAN3_00648	SPU_000648	nucleolar preribosomal-associated protein 1	\N
GLEAN3_00650	SPU_000650	contains FHA superfamily motif near C-terminus	\N
GLEAN3_00735	SPU_000735	none	\N
GLEAN3_00759	SPU_000759	none	\N
GLEAN3_00760	SPU_000760	also orthologous to exonuclease/endonuclease/phosphatase family proteins and other miscellaneous putative proteins	\N
GLEAN3_00779	SPU_000779	none	\N
GLEAN3_00790	SPU_000790	contains RPA2_OBF_family motif at the N-terminus, tRNA-synthetase 2 motif, and AsxRS_core domain	\N
GLEAN3_00821	SPU_000821	contains 2 AdoHCyase superfamily motifs, and a ProC domain	\N
GLEAN3_00847	SPU_000847	contains MFS_1 domain in addition to Sugar_tr superfamily motif	\N
GLEAN3_00886	SPU_000886	also orthologous to mitochondrial carrier homolog 1	\N
GLEAN3_00890	SPU_000890	none	\N
GLEAN3_00900	SPU_000900	contains Occludin_ELL motif at C-terminal region	\N
GLEAN3_00912	SPU_000912	contains prefoldin superfamily motif at N-terminus	\N
GLEAN3_00924	SPU_000924	none	\N
GLEAN3_00948	SPU_000948	none	\N
GLEAN3_00952	SPU_000952	contains CCL_1 domain at the C-terminus	\N
GLEAN3_01012	SPU_001012	none	\N
GLEAN3_01060	SPU_001060	none	\N
GLEAN3_01081	SPU_001081	none	\N
GLEAN3_01088	SPU_001088	none	\N
GLEAN3_01091	SPU_001091	none	\N
GLEAN3_01132	SPU_001132	none	\N
GLEAN3_01137	SPU_001137	none	\N
GLEAN3_01146	SPU_001146	none	\N
GLEAN3_01161	SPU_001161	none	\N
GLEAN3_01165	SPU_001165	contains 2 Arrestin_N superfamily motifs. also orthologous to numerous hypothetical proteins and other gene products.	\N
GLEAN3_01185	SPU_001185	also orthologous to numerous hypothetical proteins and other gene products.	\N
GLEAN3_01192	SPU_001192	contains S_TKc domain	\N
GLEAN3_01204	SPU_001204	also orthologous to numerous hypothetical proteins	\N
GLEAN3_01235	SPU_001235	contains a DUF1162 motif at N-terminus, and 3 MRS6 domains	\N
GLEAN3_01240	SPU_001240	none	\N
GLEAN3_01277	SPU_001277	none	\N
GLEAN3_01281	SPU_001281	contains ZnF_TTF motif towards N-terminus, and hATC motif towards C-terminus. also orthologous to putative hAT family dimerization domain proteins, and transposases.	\N
GLEAN3_01291	SPU_001291	none	\N
GLEAN3_01303	SPU_001303	also orthologous to hexuronate transport proteins in MFS superfamily	\N
GLEAN3_01308	SPU_001308	none	\N
GLEAN3_01317	SPU_001317	none	\N
GLEAN3_01320	SPU_001320	none	\N
GLEAN3_01341	SPU_001341	none	\N
GLEAN3_01352	SPU_001352	none	\N
GLEAN3_01377	SPU_001377	contains 2 Autophagy_C motifs	\N
GLEAN3_01404	SPU_001404	contains 2 AdoMet_MTase superfamily motifs	\N
GLEAN3_01441	SPU_001441	also orthologous to pyridoxin phosphatase, phosphoglycolate phosphatase, and several other unnamed hypothetical proteins	\N
GLEAN3_01450	SPU_001450	also orthologous to goliath E3 ubiquitin ligase, ring finger 130, and numerous unnamed hypothetical proteins	\N
GLEAN3_01457	SPU_001457	none	\N
GLEAN3_01493	SPU_001493	none	\N
GLEAN3_01496	SPU_001496	contains DnaJ superfamily motif near N-terminus, and SANT superfamily motif near C-terminus	\N
GLEAN3_01513	SPU_001513	none	\N
GLEAN3_01542	SPU_001542	also orthologous to numerous hypothetical proteins	\N
GLEAN3_01551	SPU_001551	none	\N
GLEAN3_01570	SPU_001570	none	\N
GLEAN3_01577	SPU_001577	none	\N
GLEAN3_01595	SPU_001595	contains SUL1 domain. also orthologous to member 6 (SLC26A6) or member 4 (SLC26A4) depending on species.	\N
GLEAN3_01600	SPU_001600	none	\N
GLEAN3_01612	SPU_001612	none	\N
GLEAN3_01623	SPU_001623	none	\N
GLEAN3_01657	SPU_001657	contains bZIP_1 superfamily motif at C-terminus	\N
GLEAN3_01681	SPU_001681	contains 2 DUF926 motifs in the C-terminal half	\N
GLEAN3_01696	SPU_001696	contains FYVE motif at C-terminus	\N
GLEAN3_01718	SPU_001718	contains BCNT motif towards the C-terminus	\N
GLEAN3_01722	SPU_001722	contains Prefoldin superfamily motif towards N-terminus	\N
GLEAN3_01736	SPU_001736	none	\N
GLEAN3_01771	SPU_001771	contains Sun domain	\N
GLEAN3_01776	SPU_001776	none	\N
GLEAN3_01786	SPU_001786	contains 2 Nup133_N superfmaily motifs	\N
GLEAN3_01799	SPU_001799	contains 3 C2 superfamily motifs and a FerI superfamily motif	\N
GLEAN3_01804	SPU_001804	none	\N
GLEAN3_01829	SPU_001829	none	\N
GLEAN3_01909	SPU_001909	none	\N
GLEAN3_01935	SPU_001935	contains vinculin domain	\N
GLEAN3_01954	SPU_001954	none	\N
GLEAN3_02002	SPU_002002	also similar to C18orf55 protein	\N
GLEAN3_02013	SPU_002013	similar to NADPH-dependent FMN reductase	\N
GLEAN3_02054	SPU_002054	contains 2 PRK0415 (replication factor C large subunit) domains	\N
GLEAN3_02069	SPU_002069	also similar to sarcoglycan delta in some species	\N
GLEAN3_02074	SPU_002074	also similar to mandelate racemase	\N
GLEAN3_02122	SPU_002122	contains SH3 superfamily motif at C-terminus	\N
GLEAN3_02132	SPU_002132	contains RRM superfamily motif at N-terminus	\N
GLEAN3_02150	SPU_002150	contains PRP38 superfamily motif at N-terminus	\N
GLEAN3_02153	SPU_002153	none	\N
GLEAN3_02189	SPU_002189	also similar to IP4/PIP3 binding-protein-like protein	\N
GLEAN3_02191	SPU_002191	none	\N
GLEAN3_02225	SPU_002225	none	\N
GLEAN3_02241	SPU_002241	none	\N
GLEAN3_02256	SPU_002256	none	\N
GLEAN3_02260	SPU_002260	also similar to membrane associated protein gex-3	\N
GLEAN3_02284	SPU_002284	also orthologous to 3-hydroxybutyrate dehydrogenase, 2-hydroxy-3-oxopropionate reductase, and several hypothetical proteins	\N
GLEAN3_02285	SPU_002285	none	\N
GLEAN3_02294	SPU_002294	contains PRK09198 domain. also orthologous to pre-B-cell colony enhancing factor 1	\N
GLEAN3_02295	SPU_002295	none	\N
GLEAN3_02299	SPU_002299	none	\N
GLEAN3_02302	SPU_002302	contains tRNA_Me_trans domain	\N
GLEAN3_02333	SPU_002333	none	\N
GLEAN3_02339	SPU_002339	similar to bacterial acetyltransferases	\N
GLEAN3_02374	SPU_002374	contains 2 FERM_M motifs and B41 domain	\N
GLEAN3_02389	SPU_002389	none	\N
GLEAN3_02425	SPU_002425	none	\N
GLEAN3_02455	SPU_002455	contains 4 repeats of SRCR superfamily domain	\N
GLEAN3_02480	SPU_002480	none	\N
GLEAN3_02514	SPU_002514	similar to bacterial proteins. also homologous to endonuclease/exonuclease/phosphatase, and aspartyl-tRNA synthetase, and several other bacterial proteins.	\N
GLEAN3_02535	SPU_002535	none	\N
GLEAN3_02536	SPU_002536	none	\N
GLEAN3_02572	SPU_002572	contains PLA2_bee_venom_like domain	\N
GLEAN3_02585	SPU_002585	none	\N
GLEAN3_02648	SPU_002648	contains dozens of PTB (protein binding) sites	\N
GLEAN3_02691	SPU_002691	none	\N
GLEAN3_02715	SPU_002715	none	\N
GLEAN3_02717	SPU_002717	none	\N
GLEAN3_02738	SPU_002738	none	\N
GLEAN3_02774	SPU_002774	none	\N
GLEAN3_02780	SPU_002780	none	\N
GLEAN3_02785	SPU_002785	none	\N
GLEAN3_02817	SPU_002817	none	\N
GLEAN3_02825	SPU_002825	none	\N
GLEAN3_02830	SPU_002830	none	\N
GLEAN3_02843	SPU_002843	none	\N
GLEAN3_02889	SPU_002889	none	\N
GLEAN3_02936	SPU_002936	none	\N
GLEAN3_02937	SPU_002937	none	\N
GLEAN3_02938	SPU_002938	also similar to uterferrin, and other protein phosphatases	\N
GLEAN3_02942	SPU_002942	none	\N
GLEAN3_02968	SPU_002968	contains 2 EFh superfamil motifs	\N
GLEAN3_02981	SPU_002981	also orthologous to alpha tectorin	\N
GLEAN3_02984	SPU_002984	contains 2 WD40 superfamily motifs, each at N- and C-terminus	\N
GLEAN3_03008	SPU_003008	none	\N
GLEAN3_03009	SPU_003009	contains Pyr_redox_2 domain	\N
GLEAN3_03048	SPU_003048	none	\N
GLEAN3_03065	SPU_003065	none	\N
GLEAN3_03078	SPU_003078	contains 2 Glyco_hydro_39 superfamily motifs	\N
GLEAN3_03087	SPU_003087	contains YSH1 domain	\N
GLEAN3_03099	SPU_003099	contains G-patch superfamily motif at C-terminus	\N
GLEAN3_03115	SPU_003115	none	\N
GLEAN3_03123	SPU_003123	contains 2 WD40 superfamily motifs and COG2319 domain	\N
GLEAN3_03153	SPU_003153	contains MFS_1 domain	\N
GLEAN3_03172	SPU_003172	none	\N
GLEAN3_03181	SPU_003181	contains 2 PH_like superfamily motifs	\N
GLEAN3_03190	SPU_003190	contains UAA domain	\N
GLEAN3_03214	SPU_003214	none	\N
GLEAN3_03219	SPU_003219	also orthologous to many endonuclease reverse transcriptases in Bos taurus	\N
GLEAN3_03223	SPU_003223	none	\N
GLEAN3_03234	SPU_003234	contains 2 ZnF_CSH1 superfmaily motifs	\N
GLEAN3_03235	SPU_003235	none	\N
GLEAN3_01914	SPU_001914	contains 2 7tm_1 superfamily motifs	\N
GLEAN3_02033	SPU_002033	also similar to oncoprotein-induced transcript 3 (OIT3)	\N
GLEAN3_02154	SPU_002154	none	\N
GLEAN3_02114	SPU_002114	contains two clusters of tandemn repeats (5x and 11x, respectively) of EGF_CA, calcium-binding EGF-like domains. also similar to fibropellin I isoforms.	\N
GLEAN3_01830	SPU_001830	contains COG1041 domain	\N
GLEAN3_17510	SPU_017510	contains MmsB domain	none
GLEAN3_15296	SPU_015296	contains 2 NIPSNAP superfamily motifs	none
GLEAN3_14269	SPU_014269	contains 2 7tm_1 superfamily motifs	none
GLEAN3_17738	SPU_017738	contains COG0429 domain	none
GLEAN3_07127	SPU_007127	contains COG0429 domain	none
GLEAN3_06443	SPU_006443	contains LMWPc domain	none
GLEAN3_19259	SPU_019259	contains ACTIN domain	none
GLEAN3_10800	SPU_010800	contains Actin domain	none
GLEAN3_04923	SPU_004923	contains CaiA domain	none
GLEAN3_16362	SPU_016362	contains PRK12583 domain	none
GLEAN3_16078	SPU_016078	contains 4 TSP_1 superfamily motifs	none
GLEAN3_04963	SPU_004963	contains 2 Alpha_adaptinC2 superfamily motifs	none
GLEAN3_23262	SPU_023262	contains 2 ArfGap superfamily motifs	none
GLEAN3_20019	SPU_020019	contains COG5580 domain	none
GLEAN3_05567	SPU_005567	contains HutI domain	none
GLEAN3_11369	SPU_011369	orthologous to leucyl aminopeptidases in various species of invertebrates	none
GLEAN3_21203	SPU_021203	contains COG0436 domain	none
GLEAN3_21721	SPU_021721	contains PRK06872 domain	none
GLEAN3_10443	SPU_010443	contains 6 ARM superfamily motifs	none
GLEAN3_09698	SPU_009698	contains 2 BTB superfamily motifs	none
GLEAN3_20434	SPU_020434	contains 2 ANK superfamily motifs	none
GLEAN3_23368	SPU_023368	contains 2 ANK superfamily motifs	none
GLEAN3_09189	SPU_009189	contains 2 ANK superfamily motifs	none
GLEAN3_19054	SPU_019054	contains 6 ANK superfamily motifs	none
GLEAN3_07842	SPU_007842	contains 10 ANK superfamily motifs	none
GLEAN3_04324	SPU_004324	contains 2 ANK superfamily motifs	none
GLEAN3_04095	SPU_004095	contains ANK superfamily motif near N-terminus	none
GLEAN3_22268	SPU_022268	contains 3 ANK superfamily motifs	none
GLEAN3_04356	SPU_004356	contains 5 ANK superfamily motifs	none
GLEAN3_19535	SPU_019535	contains 2 ANK superfamily motifs	none
GLEAN3_19139	SPU_019139	contains 4 Annexin superfamily motifs	none
GLEAN3_22630	SPU_022630	contains 2 IBR superfamily motifs	none
GLEAN3_04121	SPU_004121	contains 5 ARM superfamily motifs	none
GLEAN3_10281	SPU_010281	contains DUF634 domain	none
GLEAN3_21810	SPU_021810	contains 2 ACTIN superfamily motifs	none
GLEAN3_17462	SPU_017462	contains TyrB domain	none
GLEAN3_07950	SPU_007950	contains 2 Asp_Arg_Hydrox superfamily motifs	none
GLEAN3_05845	SPU_005845	orthologous to plant, bacterial, and archeal proteins	none
GLEAN3_04689	SPU_004689	contains XPG superfamily motif at N-terminus	none
GLEAN3_03416	SPU_003416	contains 2 EGF_Lam superfamily motifs	none
GLEAN3_13739	SPU_013739	contains PRK07883 domain	none
GLEAN3_05458	SPU_005458	contains 2 BPI superfamily motifs	none
GLEAN3_14242	SPU_014242	contains Seipin superfamily notif at N-terminus	none
GLEAN3_21602	SPU_021602	also homologous to cellulases in bacteria and other mollusc spp.	none
GLEAN3_09320	SPU_009320	also orthologous to bacterial D-aminoacylase proteins	none
GLEAN3_06840	SPU_006840	contains 3 PHD superfamily motifs	none
GLEAN3_15092	SPU_015092	contains MDN1 domain	none
GLEAN3_19793	SPU_019793	orthologs are found only in Xenopus, Ciona, and fish.	none
GLEAN3_12637	SPU_012637	contains Arp domain and COG4886 domain	none
GLEAN3_04434	SPU_004434	contains Smc domain	none
GLEAN3_07302	SPU_007302	contains 3 Vinculin domains	none
GLEAN3_07613	SPU_007613	contains NOT2_3_5 superfamily motif at C-terminus and Not3 domain at N-terminus	none
GLEAN3_22867	SPU_022867	contains MiaB domain	none
GLEAN3_20615	SPU_020615	contains RAD18 domain	none
GLEAN3_07275	SPU_007275	contains Smc domain	none
GLEAN3_03772	SPU_003772	contains Cupin_2 superfamily motid at C-terminus	none
GLEAN3_23274	SPU_023274	contains SMC domain	none
GLEAN3_05543	SPU_005543	contains Smc domain	none
GLEAN3_19304	SPU_019304	contains Smc domain	none
GLEAN3_07071	SPU_007071	contains SMC domain	none
GLEAN3_08071	SPU_008071	contains DUF572 superfamily motif at N-terminus	none
GLEAN3_05730	SPU_005730	contains RPN6 domain	none
GLEAN3_18438	SPU_018438	contains 2 WD40 superfamily motifs	none
GLEAN3_09619	SPU_009619	also orthologous to numerous bacterial cryptochrome proteins	none
GLEAN3_12898	SPU_012898	contains CBS domain	none
GLEAN3_21060	SPU_021060	contains COG0436 domain	none
GLEAN3_19401	SPU_019401	contains 3 VWC superfamily motifs	none
GLEAN3_06853	SPU_006853	contains 2 nt_trans superfamily motifs, and argS domain, and CysS domain	none
GLEAN3_09707	SPU_009707	contains CysS domain	none
GLEAN3_14819	SPU_014819	contains COG1520 domain	none
GLEAN3_17219	SPU_017219	contains D1tE domain	none
GLEAN3_08465	SPU_008465	contains COG1413 domain	none
GLEAN3_07044	SPU_007044	contains 2 FMO-like domains	none
GLEAN3_22175	SPU_022175	contains PRK0115 domain	none
GLEAN3_04188	SPU_004188	contains 2 DEP superfamily motifs	none
GLEAN3_16403	SPU_016403	contains WD40 superfamily motif at C-terminus	none
GLEAN3_06134	SPU_006134	contains 2 CDC27 superfamily motifs	none
GLEAN3_05958	SPU_005958	contains DnaJ domain	none
GLEAN3_07592	SPU_007592	contains DnaJ superfamily motif and CbpA domain at N-terminus	none
GLEAN3_17259	SPU_017259	contains 4 Thioredoxin-like superfamily motifs	none
GLEAN3_05701	SPU_005701	contains ZUO1 domain	none
GLEAN3_16958	SPU_016958	contains CbpA domain	none
GLEAN3_07324	SPU_007324	contains COG5273 domain	none
GLEAN3_10151	SPU_010151	contains 2 C2 superfamily motifs	none
GLEAN3_13604	SPU_013604	contains 2 UBQ superfamily motifs	none
GLEAN3_18012	SPU_018012	contains Dtyr_deacylase superfamily motif at N-terminus	none
GLEAN3_10886	SPU_010886	contains DYN1 domain and Dynein_heavy domain	none
GLEAN3_21217	SPU_021217	contains 3 C2 superfamily motifs. homologous to C. elegans myoferlin.	none
GLEAN3_16190	SPU_016190	contains PRK05431 domain	none
GLEAN3_07213	SPU_007213	contains FRQ1 domain	none
GLEAN3_16291	SPU_016291	contains PRK00055 domain	none
GLEAN3_18807	SPU_018807	contains Cast domain and Smc domain	none
GLEAN3_11358	SPU_011358	contains Elp3 domain	none
GLEAN3_10189	SPU_010189	contains AP_endonuc_2 domain	none
GLEAN3_05544	SPU_005544	orthologous to mammalian proteins bearing homology to E. coli Eral11 protein	none
GLEAN3_10543	SPU_010543	contains 2 ANK superfamily motifs	none
GLEAN3_16439	SPU_016439	contains Gst domain	none
GLEAN3_07869	SPU_007869	contains IF-2B domain	none
GLEAN3_21449	SPU_021449	contains Paf67 domain	none
GLEAN3_22386	SPU_022386	contains 2 PCI superfamily motifs	none
GLEAN3_08561	SPU_008561	contains RRP4 domain	none
GLEAN3_18481	SPU_018481	contains PRK04282 domain	none
GLEAN3_13995	SPU_013995	contains COG2123 domain	none
GLEAN3_14762	SPU_014762	contains RRP4 domain	none
GLEAN3_12202	SPU_012202	contains CRM1 domain	none
GLEAN3_10172	SPU_010172	contains DadA domain	none
GLEAN3_11159	SPU_011159	contains DadA domain	none
GLEAN3_20945	SPU_020945	contains 2 TrkA domains and PRK06116 domain	none
GLEAN3_03971	SPU_003971	protein function unknown	none
GLEAN3_18968	SPU_018968	contains 2 FEZ superfamily motifs	none
GLEAN3_03480	SPU_003480	contains NosD domain	none
GLEAN3_07523	SPU_007523	contains MFS_1 domain	none
GLEAN3_05896	SPU_005896	also orthologous to formin2 (Drosophila)	none
GLEAN3_15811	SPU_015811	contains FKBP_C superfamily motif at C-terminus	none
GLEAN3_07008	SPU_007008	contains 3 Gelsolin superfamily motifs and COG4886 domain	none
GLEAN3_05489	SPU_005489	contains FH2 superfamily motif at C-terminus	none
GLEAN3_20870	SPU_020870	contains RPN7 domain	none
GLEAN3_05418	SPU_005418	contains 2 GST_N superfamily motifs	none
GLEAN3_07979	SPU_007979	contains MDN1 domain	none
GLEAN3_17876	SPU_017876	contains MoCF_BD superfamily motif at N-terminus	none
GLEAN3_17961	SPU_017961	contains ERG12 domain	none
GLEAN3_08907	SPU_008907	contains Prc domain	none
GLEAN3_05241	SPU_005241	contains NagC domain	none
GLEAN3_20762	SPU_020762	contains BetA domain and PhaC domain	none
GLEAN3_05338	SPU_005338	contains 2 nt_trans superfamily motifs	none
GLEAN3_09518	SPU_009518	contains ECM4 domain	none
GLEAN3_12540	SPU_012540	contains Amidinotransf domain	none
GLEAN3_07935	SPU_007935	contains GDPD domain	none
GLEAN3_11336	SPU_011336	contains COG1215 domain	none
GLEAN3_19011	SPU_019011	contains ATP_bind_1 domain	none
GLEAN3_21732	SPU_021732	contains ATP_bind_1 domain	none
GLEAN3_08412	SPU_008412	contains ATP_bind_1 domain	none
GLEAN3_23160	SPU_023160	contains COG1413 domain	none
GLEAN3_16418	SPU_016418	contains 2 RCC1 superfamily motifs and 2 AST1 domains	none
GLEAN3_06582	SPU_006582	contains HECTc domain	none
GLEAN3_22710	SPU_022710	contains COG1112 domain	none
GLEAN3_07096	SPU_007096	contains 2 HMG-box superfamily motifs	none
GLEAN3_22283	SPU_022283	contains MFS_1 domain	none
GLEAN3_03428	SPU_003428	homologous to bacterial proteins	none
GLEAN3_14437	SPU_014437	contains DUF1740 domain	none
GLEAN3_14008	SPU_014008	also orthologous to bacterial flagella/basal body proteins	none
GLEAN3_16446	SPU_016446	contains MFS_1 domain	none
GLEAN3_21737	SPU_021737	contains Ion_trans domain	none
GLEAN3_19493	SPU_019493	contains fabG domain	none
GLEAN3_05529	SPU_005529	contains HECTc superfamily motif at C-terminus	none
GLEAN3_18019	SPU_018019	contains COG1112 domain	none
GLEAN3_05117	SPU_005117	contains SXM1 domain	none
GLEAN3_08930	SPU_008930	orthologous to numerous Drosophila, C. elegans, plant, and bacterial proteins with unknown function	none
GLEAN3_06666	SPU_006666	contains TNG2 domain	none
GLEAN3_04902	SPU_004902	contains Ptr domain	none
GLEAN3_16108	SPU_016108	contains 4 ANK superfamily motifs	none
GLEAN3_13295	SPU_013295	contains SpoVK domain	none
GLEAN3_21302	SPU_021302	contains ileS domain	none
GLEAN3_22372	SPU_022372	contains JmjC superfamily motif at C-terminus	none
GLEAN3_04203	SPU_004203	contains 4 Kelch_1 superfamily motifs	none
GLEAN3_07977	SPU_007977	contains AAA domain	none
GLEAN3_10222	SPU_010222	contains C2 superfamily motif at N-terminus	none
GLEAN3_21937	SPU_021937	contains Smc domain	none
GLEAN3_07532	SPU_007532	contains 4 IG superfamily motifs	none
GLEAN3_19649	SPU_019649	contains ANK superfamily motif at N-terminus	none
GLEAN3_14368	SPU_014368	contains CDC37_N superfamily motif and Smc domain at N-terminus	none
GLEAN3_17024	SPU_017024	contains WSC domain	none
GLEAN3_22784	SPU_022784	contains 2 Beta-lactamase superfamily motifs	none
GLEAN3_15975	SPU_015975	contains COG2433 domain	none
GLEAN3_06748	SPU_006748	contains COG4886 domain	none
GLEAN3_17780	SPU_017780	contains COG4886 domain	none
GLEAN3_16726	SPU_016726	contains leuS domain	none
GLEAN3_10496	SPU_010496	contains LigT domain	none
GLEAN3_09124	SPU_009124	contains LIM superfamily motif at N-terminus and SH3 superfamily motif at C-terminus. also orthologous to nubelette, non-muscle isoform.	none
GLEAN3_22771	SPU_022771	contains 2 LIM superfamily motifs	none
GLEAN3_22482	SPU_022482	contains 12 PLAT superfamily motifs	none
GLEAN3_22354	SPU_022354	contains PH-like superfamily motif at N-terminus	none
GLEAN3_12372	SPU_012372	contains 6 LY superfamily motifs and 4 Ld1_re superfamily motifs	none
GLEAN3_03918	SPU_003918	contains 4 copies of LY superfamily motif. also orthologous to vitellogenin receptors and lipophorin receptors in various species.	none
GLEAN3_11347	SPU_011347	contains DadA domain	none
GLEAN3_20157	SPU_020157	contains SMC_N domain	none
GLEAN3_20317	SPU_020317	contains MFS_1 domain	none
GLEAN3_09003	SPU_009003	contains YTH1 domain	none
GLEAN3_18902	SPU_018902	contains Man-6P_recep domain	none
GLEAN3_21484	SPU_021484	contains RING superfamily motif near N-terminus, and SSM4 domain	none
GLEAN3_12725	SPU_012725	contains 2 RHOD superfamily motifs and SseA domain	none
GLEAN3_10842	SPU_010842	also orthologous to bacterial cystathione gamma-synthases	none
GLEAN3_19030	SPU_019030	contains MBD superfamily motif at N-terminus	none
GLEAN3_20642	SPU_020642	contains PRK09426 domain	none
GLEAN3_03840	SPU_003840	homologous to bacterial proteins	none
GLEAN3_18039	SPU_018039	contains COG1041 domain	none
GLEAN3_03476	SPU_003476	homologous to bacterial proteins	none
GLEAN3_09743	SPU_009743	contains 2 Tubulin_FtsZ superfamily motifs	none
GLEAN3_11691	SPU_011691	contains SpoU domain	none
GLEAN3_18187	SPU_018187	contains COG3217 domain	none
GLEAN3_03748	SPU_003748	contains COG1112 domain	none
GLEAN3_20695	SPU_020695	contains COG4642 domain	none
GLEAN3_07308	SPU_007308	contains 3 vWF superfamily motifs	none
GLEAN3_18218	SPU_018218	contains 3 RRM superfamily motifs	none
GLEAN3_17256	SPU_017256	contains B41 domain	none
GLEAN3_11495	SPU_011495	contains HemK domain	none
GLEAN3_05469	SPU_005469	contains 2 DAHP_Synth_1 superfamily motifs	none
GLEAN3_15311	SPU_015311	contains cyclophilin superfamily motif at N-terminus	none
GLEAN3_09268	SPU_009268	contains 2 E1_enzyme_family superfamily motifs	none
GLEAN3_10789	SPU_010789	contains Beach domain	none
GLEAN3_10402	SPU_010402	contains csdA domain	none
GLEAN3_04365	SPU_004365	contains 2 NIF3 superfamily motifs	none
GLEAN3_16077	SPU_016077	contains COG4886 domain	none
GLEAN3_13093	SPU_013093	contains NMD3 domain	none
GLEAN3_08638	SPU_008638	contains 3 TPR superfamily motifs	none
GLEAN3_18518	SPU_018518	contains NmrA domain	none
GLEAN3_18234	SPU_018234	contains Sun domain	none
GLEAN3_06180	SPU_006180	contains Sun domain	none
GLEAN3_08013	SPU_008013	contains Sun domain	none
GLEAN3_09226	SPU_009226	contains UMPH-1 domain	none
GLEAN3_18617	SPU_018617	contains NPL4 domain	none
GLEAN3_10857	SPU_010857	contains BBC superfamily motif near N-terminus	none
GLEAN3_10282	SPU_010282	contains Nol1_Nop2_Fmu domain. also orthologous to NOP2 nucleolar protein.	none
GLEAN3_10119	SPU_010119	contains COG1341 domain	none
GLEAN3_09115	SPU_009115	contains 2 Nup84 superfamily motifs	none
GLEAN3_03282	SPU_003282	contains Nsp1_C superfamily motif at N-terminus	none
GLEAN3_21736	SPU_021736	contains 2 Thioredoxin-like superfamily motifs	none
GLEAN3_14324	SPU_014324	contains f1hG domain	none
GLEAN3_23092	SPU_023092	contains NPY domain	none
GLEAN3_22052	SPU_022052	contains RhsA domain. also Sp-Ten3.	none
GLEAN3_18021	SPU_018021	contains UBQ superfamily motif at N-terminus	none
GLEAN3_21209	SPU_021209	contains 2 CH superfamily motifs. also homologous to parvin, alpha.	none
GLEAN3_11154	SPU_011154	contains 5 CAP_ED superfamily motifs	none
GLEAN3_05687	SPU_005687	contains COG5600 domain	none
GLEAN3_17133	SPU_017133	contains 3 EFhand superfamily motifs	none
GLEAN3_21078	SPU_021078	contains PqqL domain	none
GLEAN3_20031	SPU_020031	contains 2 NHL superfamily motifs and COG3391 domain	none
GLEAN3_10976	SPU_010976	contains Mpv17_PMP22 superfamily motif at C-terminus	none
GLEAN3_11242	SPU_011242	contains 2 TPR superfamily motifs	none
GLEAN3_07443	SPU_007443	contains 2 LNS2 superfamily motifs	none
GLEAN3_05415	SPU_005415	contains 3 GAF superfamily motifs	none
GLEAN3_14444	SPU_014444	contains PhnD domain	none
GLEAN3_12693	SPU_012693	contains 2 C2 superfamily motifs	none
GLEAN3_19706	SPU_019706	contains COG1741 domain	none
GLEAN3_03501	SPU_003501	contains 2 PH-like superfamily motifs, each near N-terminus and C-terminus, respectively	none
GLEAN3_04859	SPU_004859	contains PH-like superfamily motif at N-terminus	none
GLEAN3_14211	SPU_014211	contains 2 Esterase_lipase superfamily motifs	none
GLEAN3_21703	SPU_021703	contains 3 RNA_pol_B_RPB2 superfamily motifs and PRK08565 domain	none
GLEAN3_06372	SPU_006372	contains RPB7 domain	none
GLEAN3_11964	SPU_011964	contains 2 PQ_loop superfamily motifs	none
GLEAN3_06239	SPU_006239	contains RPR domain	none
GLEAN3_16596	SPU_016596	contains Suf domain. also homologous to rRNA processing proteins in fungi.	none
GLEAN3_05402	SPU_005402	contains Pro_racemase domain	none
GLEAN3_18082	SPU_018082	contains PtrB domain	none
GLEAN3_04206	SPU_004206	contains 3 Peptidase_S28 superfamily motifs	none
GLEAN3_03540	SPU_003540	contains Prominin domain	none
GLEAN3_12590	SPU_012590	contains 2 VSP domains	none
GLEAN3_23096	SPU_023096	contains COG2130 domain	none
GLEAN3_04318	SPU_004318	orthologous to numerous proteins	none
GLEAN3_15013	SPU_015013	contains RPN1 domain	none
GLEAN3_18597	SPU_018597	contains RPN7 domain	none
GLEAN3_21307	SPU_021307	contains 4 CBS_pair superfamily motifs and 2 CBS domains	none
GLEAN3_18419	SPU_018419	contains 2 CAP_ED superfamily motifs	none
GLEAN3_14790	SPU_014790	contains COG4886 domain	none
GLEAN3_07665	SPU_007665	contains PPX1 domain at N-terminus	none
GLEAN3_19552	SPU_019552	contains truA domain	none
GLEAN3_18860	SPU_018860	contains 2 WD40 superfamily motifs	none
GLEAN3_15873	SPU_015873	contains PRK09057 domain	none
GLEAN3_10961	SPU_010961	contains 2 PH-like superfamily motifs, 2 zf-RING superfamily motifs and PRK07003 domain	none
GLEAN3_12920	SPU_012920	contains TB2_DP1_HVA superfamily motif at N-terminus	none
GLEAN3_11638	SPU_011638	contains 2 MtN3_slv superfamily motifs. also orthologous to numerous hypothetical proteins in Drosophila and C. elegans.	none
GLEAN3_20264	SPU_020264	contains rfc domain	none
GLEAN3_15041	SPU_015041	contains COG5109 domain	none
GLEAN3_04118	SPU_004118	contains 2 Rgp1 domains	none
GLEAN3_22469	SPU_022469	also homologous to ring finger protein 115	none
GLEAN3_06149	SPU_006149	also orthologous to ring finger protein 111 in some species	none
GLEAN3_09739	SPU_009739	contains RING superfamily motif at C-terminus	none
GLEAN3_22878	SPU_022878	contains RRM superfamily motif at N-terminus	none
GLEAN3_12932	SPU_012932	contains SpoU domain	none
GLEAN3_20498	SPU_020498	contains RTC domain	none
GLEAN3_09772	SPU_009772	contains PRK05431 domain	none
GLEAN3_17156	SPU_017156	contains 2 ANK superfamily motifs	none
GLEAN3_12288	SPU_012288	contains fabG domain	none
GLEAN3_18638	SPU_018638	contains AtoC domain	none
GLEAN3_15271	SPU_015271	contains PRK10929 domain	none
GLEAN3_12704	SPU_012704	contains MFS_1 domain	none
GLEAN3_10939	SPU_010939	contains 3 Mito_carr superfamily motifs	none
GLEAN3_07195	SPU_007195	contains 3 Mito_carr superfamily motifs	none
GLEAN3_03240	SPU_003240	contains 2 sodium phosphate cotrasporter superfamily motifs	none
GLEAN3_17567	SPU_017567	contains 2 DUF6 superfamily motifs	none
GLEAN3_16333	SPU_016333	contains MFS_1 domain	none
GLEAN3_16213	SPU_016213	contains MFS_1 domain	none
GLEAN3_18720	SPU_018720	contains PRK08581 domain	none
GLEAN3_03450	SPU_003450	contains 3 SH3 superfamily motifs	none
GLEAN3_05740	SPU_005740	contains VPS10 domain. also orthologous to lipoprotein receptor relative with 11 ligand-binding repeats (Mus musculus).	none
GLEAN3_16849	SPU_016849	contains Vps5 domain	none
GLEAN3_15644	SPU_015644	contains COG5391 domain	none
GLEAN3_20802	SPU_020802	contains B41 domain	none
GLEAN3_04584	SPU_004584	contains 3 MIR superfamily motifs	none
GLEAN3_09761	SPU_009761	contains 2 E1_enzyme_family superfamily motifs	none
GLEAN3_12457	SPU_012457	contains Bromo_TP superfamily motif at N-terminus, and PHD superfamily motif toward C-terminus	none
GLEAN3_10113	SPU_010113	contains Metallo-dependent_hydrolases superfamily motif at C-terminus	none
GLEAN3_15235	SPU_015235	contains CALCOCO1 domain	none
GLEAN3_22609	SPU_022609	contains Taxilin domain	none
GLEAN3_19748	SPU_019748	contains COG5210 domain	none
GLEAN3_18192	SPU_018192	contains 4 TPR superfamily motifs	none
GLEAN3_04064	SPU_004064	contains 3 TRP superfamily motifs	none
GLEAN3_20261	SPU_020261	contains Deme6 domain	none
GLEAN3_04719	SPU_004719	contains 2 TPR superfamily motifs near C-terminus	none
GLEAN3_13871	SPU_013871	contains THAP superfamily motif at N-terminus	none
GLEAN3_18272	SPU_018272	contains 2 Thioredoxin-like superfamily motifs	none
GLEAN3_09534	SPU_009534	contains PRK06116 domain	none
GLEAN3_20617	SPU_020617	contains PRK06078 domain	none
GLEAN3_03806	SPU_003806	contains IF-2B domain	none
GLEAN3_08625	SPU_008625	contains COG2319 domain	none
GLEAN3_09640	SPU_009640	SbcC domain	none
GLEAN3_08054	SPU_008054	contains 2 Sec63 superfamily motifs	none
GLEAN3_17676	SPU_017676	contains 4 TPR superfamily motifs	none
GLEAN3_09409	SPU_009409	also orthologous to numerous Drosphila proteins	none
GLEAN3_05636	SPU_005636	contains 2 DUF6 superfamily motifs and RhaT domain	none
GLEAN3_19214	SPU_019214	contains DUF1183 domain	none
GLEAN3_14995	SPU_014995	contains KAP95 domain	none
GLEAN3_20463	SPU_020463	contains PcnB domain	none
GLEAN3_12116	SPU_012116	contains tsf domain	none
GLEAN3_12328	SPU_012328	contains Ion_trans domain	none
GLEAN3_18207	SPU_018207	contains HECTc domain	none
GLEAN3_21701	SPU_021701	contains 3 E1_enzyme_family superfamily motifs	none
GLEAN3_04630	SPU_004630	contains 2 E1_enzyme_family superfamily motifs	none
GLEAN3_03887	SPU_003887	contains 2 copies of glycos_transf_2 superfamily motifs and RICIN superfamily motifs	none
GLEAN3_16095	SPU_016095	contains 2 Glycos_transf_2 superfamily motifs	none
GLEAN3_04079	SPU_004079	contains TRP superfamily motif at N-terminus, and ARM superfamily motif at C-terminus	none
GLEAN3_15226	SPU_015226	contains PRK08125 domain	none
GLEAN3_19032	SPU_019032	contains DUF1162 superfamily motif at N-terminus. contains MRS6 domain and COG2872 domain	none
GLEAN3_19426	SPU_019426	contains AAA domain	none
GLEAN3_11317	SPU_011317	contains COG2319 domain	none
GLEAN3_17013	SPU_017013	contains MAD domain	none
GLEAN3_10131	SPU_010131	contains COG2319 domain and PRK12704 domain	none
GLEAN3_03791	SPU_003791	contains ATS1 domain	none
GLEAN3_03297	SPU_003297	contains WSC superfamily motif near N-terminus	none
GLEAN3_05771	SPU_005771	also orthologous to numerous miscellaneous proteins	none
GLEAN3_11396	SPU_011396	contains COG5219 domain	none
GLEAN3_20126	SPU_020126	contains AIR1 domain	none
GLEAN3_17358	SPU_017358	contains 2 AIR1 domains	none
GLEAN3_14362	SPU_014362	contains COG5273 domain	none
GLEAN3_09595	SPU_009595	contains COG5273 domain	none
GLEAN3_16930	SPU_016930	contains HRD1 domain	none
GLEAN3_05501	SPU_005501	also orthologous to other proteins and numerous hypothetical proteins	none
GLEAN3_05313	SPU_005313		none
GLEAN3_10020	SPU_010020	contains hATC superfamily motif near C-terminus	none
GLEAN3_17578	SPU_017578	also homologous to GH3.3; indole-3-acetic acid amido synthetase (A. thaliana)	none
GLEAN3_05580	SPU_005580	contains 2 WD40 superfamily motifs near C-terminus and COG2319 domain	none
GLEAN3_15223	SPU_015223	contains MRS6 domain at N-terminus	none
GLEAN3_18003	SPU_018003	contains Ndh domain	none
GLEAN3_23085	SPU_023085	contains PRK02106 odmain	none
GLEAN3_12514	SPU_012514	contains Atrophin-1 domain	none
GLEAN3_19967	SPU_019967	contains 4 FA58C superfamily motifs	none
GLEAN3_20558	SPU_020558	homologous to numerous sea urchin and worm proteins	none
GLEAN3_12851	SPU_012851	contains PRK03983 domain	none
GLEAN3_12655	SPU_012655	contains 14 EGF_CA superfamily motifs in tandemn. also similar to Notch protein (Drosophila).	none
GLEAN3_20607	SPU_020607	contains 4 Gelsolin superfamily motifs	none
GLEAN3_12273	SPU_012273	contains 2 7tm_1 superfamily motifs	none
GLEAN3_15727	SPU_015727	contains 7 MAM superfamily motifs	none
GLEAN3_10871	SPU_010871	orthologous to G protein-coupled receptors	none
GLEAN3_11830	SPU_011830	contains 3 Cu-oxidase superfamily motifs. also orthologous to laccase proteins in many insects.	none
GLEAN3_17701	SPU_017701	contains 4 WD40 superfamily motifs and COG2319 domain. also homologous to transmembrane protein 38a in some mammalian spp.	none
GLEAN3_17293	SPU_017293	contains f1hF domain, PR domain, and COG1112 domain	none
GLEAN3_22421	SPU_022421	contains 2 COG1112 domains and PRK12678 domain	none
GLEAN3_17466	SPU_017466	contains COG2016 domain	none
GLEAN3_09716	SPU_009716	contains B41 domain	none
GLEAN3_26309	SPU_026309	contains Pyr_redox_2 domain	none
GLEAN3_28425	SPU_028425	contains Smc domain	none
GLEAN3_28888	SPU_028888	contains 2 WD40 superfamily motifs	none
GLEAN3_26154	SPU_026154	contains Creatinase domain	none
GLEAN3_28183	SPU_028183	contains CbpA domain	none
GLEAN3_26432	SPU_026432	contains 2 7tm_1 superfamily motifs	none
GLEAN3_27289	SPU_027289	contains FRQ1 domain	none
GLEAN3_27916	SPU_027916	contains Uvr domain	none
GLEAN3_27982	SPU_027982	contains COG2123 domain	none
GLEAN3_26635	SPU_026635	contains 2 CH superfamily motifs	none
GLEAN3_27805	SPU_027805	contains RasGAP domain	none
GLEAN3_28921	SPU_028921	contains SNF2_N domain	none
GLEAN3_26993	SPU_026993	contains 3 EFh superfamily motifs	none
GLEAN3_26146	SPU_026146	also orthologous to mammalian phospholipase A2	none
GLEAN3_28147	SPU_028147	orthologous to bacterial proteins	none
GLEAN3_28848	SPU_028848	contains PHD superfamily motif at C-terminus and RecN domain	none
GLEAN3_28373	SPU_028373	contains 7 PDZ superfamily motifs. also orthologous to numerous PDZ domain containing hypothetical proteins.	none
GLEAN3_27893	SPU_027893	contains COG4942 domain	none
GLEAN3_26210	SPU_026210	contains 2 Kelch_1 superfamily motifs and COG3055 domain	none
GLEAN3_28150	SPU_028150	contains PRK12678 domain	none
GLEAN3_27314	SPU_027314	contains Lipin_N superfamily motif and LNS superfamily motif near N-terminus and C-terminus, respectively	none
GLEAN3_26716	SPU_026716	contains Lon domain	none
GLEAN3_27844	SPU_027844	contains MAD domain	none
GLEAN3_28458	SPU_028458	contains Hyaluronidase_2 superfamily motif at N-terminus	none
GLEAN3_26683	SPU_026683	contains 2 MIT superfamily motifs	none
GLEAN3_28214	SPU_028214	contains Smc domain. also orthologous to KIAA0774-like proteins and other hypothetical proteins.	none
GLEAN3_28926	SPU_028926	contains B41 domain	none
GLEAN3_26774	SPU_026774	contains 2 COG5098 domains and PRK09894 domain	none
GLEAN3_28807	SPU_028807	contains 2 ArsB_NhaD_permease superfamily motifs	none
GLEAN3_27903	SPU_027903	contains Smc domain	none
GLEAN3_26124	SPU_026124	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_26933	SPU_026933	contains TruA domain	none
GLEAN3_28354	SPU_028354	contains PRK09212 domain, PRK11892 domain, AcoB domain, and Dxs domain	none
GLEAN3_26695	SPU_026695	contains VHS_ENTH_ANTH superfamily motif at N-terminus	none
GLEAN3_28579	SPU_028579	contains PRK04195 domain	none
GLEAN3_28894	SPU_028894	contains 2 UBQ superfamily motifs	none
GLEAN3_26170	SPU_026170	contains 2 SH3 superfamily motifs	none
GLEAN3_26814	SPU_026814	contains RhaT domain	none
GLEAN3_28069	SPU_028069	contains SAM superfamily motif near N-terminus	none
GLEAN3_28110	SPU_028110	contains SANT superfamily motif near C-terminus	none
GLEAN3_26042	SPU_026042	contains TSP_1 superfamily and PLAC superfamily motifs near C-terminus	none
GLEAN3_28461	SPU_028461	contains Smc domain	none
GLEAN3_28217	SPU_028217	contains Icc domain	none
GLEAN3_26467	SPU_026467	contains 2 WD40 superfamily motifs	none
GLEAN3_26244	SPU_026244	contains 2 HYR superfamily motifs	none
GLEAN3_28567	SPU_028567	contains 6 vWA/vWFA superfamily motifs. also orthologous to numerous hypothetical proteins.	none
GLEAN3_28754	SPU_028754	contains 2 AIG2 superfamily motifs	none
GLEAN3_26317	SPU_026317	contains THAP superfamily motif at N-terminus	none
GLEAN3_27433	SPU_027433	also orthologous to various subtypes of nicotinic cholinergic receptors	none
GLEAN3_26649	SPU_026649	S. purpuratus protein family containing EGF domain	none
GLEAN3_14248	SPU_014248	contains 2 SPEC superfamily motifs	none
GLEAN3_05938	SPU_005938	contains 2 TSP_1 superfamily motifs	none
GLEAN3_15342	SPU_015342	contains COG1112 domain	none
GLEAN3_04005	SPU_004005	contains 2 PH-like superfamily motifs	none
GLEAN3_12504	SPU_012504	contains 7 COG5022 domain motifs and AF-4 domain	none
GLEAN3_10024	SPU_010024	contains 8 COG5022 domain motifs	none
GLEAN3_04375	SPU_004375	huge protein with 44,492 amino acids. only 550 amino acids from N-terminus have been determined.	none
GLEAN3_06457	SPU_006457	contains Smc domain	none
GLEAN3_04989	SPU_004989	contains 2 CUB superfamily motifs	none
GLEAN3_14369	SPU_014369	contains MDN1 domain	none
GLEAN3_02057	SPU_002057	contains MDN1 domain	none
GLEAN3_08503	SPU_008503	contains 3 SMC_N domain motifs	none
GLEAN3_16359	SPU_016359	contains Smc domain and SMC_N domain	none
GLEAN3_07254	SPU_007254	contains SMC_N domain and Smc domain	none
GLEAN3_16258	SPU_016258	contains Smc domain and SMC_N domain	none
GLEAN3_10736	SPU_010736	contains TolA domain motif near C-terminus	none
GLEAN3_15350	SPU_015350	contains 3 ATP_gua_Ptrans superfamily C-terminal motifs and 3 ATP_gua_Ptrans superfamily N-terminal motifs and PRK01059 domain and COG3869 domain	none
GLEAN3_10441	SPU_010441	contains 5 CCP superfamily motifs and 3 HYR superfamily motifs	none
GLEAN3_09854	SPU_009854	contains 12 CUB superfamily motifs in tandem over the entire length of the protein	none
GLEAN3_02016	SPU_002016	contains 6 CUB superfamily motifs	none
GLEAN3_10596	SPU_010596	contains 2 MIF4G superfamily motifs	none
GLEAN3_05335	SPU_005335	contains MPH1 domain	none
GLEAN3_09877	SPU_009877	contains 2 DEXDc superfamily motifs	none
GLEAN3_02015	SPU_002015	contains DENN superfamily motif at N-terminus and WD40 superfamily motif at C-terminus	none
GLEAN3_15671	SPU_015671	contains 2 PRK05850 domain motifs	none
GLEAN3_13590	SPU_013590	homologous only to 4 Drosophila proteins	none
GLEAN3_07997	SPU_007997	contains MDN1 domain and DYN1 domain	none
GLEAN3_12416	SPU_012416	contains DHC_N1 domain	none
GLEAN3_02733	SPU_002733	contains DHC_N1 domain	none
GLEAN3_04934	SPU_004934	contains 2 P_loop_NTPase superfamily motifs and DYN1 domain	none
GLEAN3_02110	SPU_002110	contains AST1 domain and DHC_N1 domain and DYN1 domain	none
GLEAN3_03404	SPU_003404	contains DHC_N1 domain	none
GLEAN3_03216	SPU_003216	contains 3 P_loop_NTPase superfamily motifs and DYN1 domain	none
GLEAN3_14702	SPU_014702	contains 2 P_loop_NTPase superfamily motifs and 2 DYN1 domain motifs and Dynein_heavy domain	none
GLEAN3_10423	SPU_010423	contains PRK00409 domain and COG3264 domain	none
GLEAN3_15263	SPU_015263	contains SMC_N domain	none
GLEAN3_01069	SPU_001069	contains SMC_N domain	none
GLEAN3_13828	SPU_013828	contains 2 SH3 superfamily motifs	none
GLEAN3_01796	SPU_001796	contains 2 Calx-beta superfamily repeats at N-terminus	none
GLEAN3_11173	SPU_011173	contains Smc domain and SMC_N domain	none
GLEAN3_15240	SPU_015240	contains COG1413 domain	none
GLEAN3_09932	SPU_009932	contains RecD domain	none
GLEAN3_09776	SPU_009776	contains 7 TSP_1 superfamily motifs	none
GLEAN3_11935	SPU_011935	contains PRK09039 domain	none
GLEAN3_04147	SPU_004147	contains MDN1 domain	none
GLEAN3_04993	SPU_004993	contains COG1413 domain	none
GLEAN3_14663	SPU_014663	contains SMC_N domain and Smc domain	none
GLEAN3_05893	SPU_005893	contains EGF_CA superfamily motif at N-terminus and MdoB somain	none
GLEAN3_14426	SPU_014426	contains SMC_N domain and PRK11281 domain. Sp-specific protein.	none
GLEAN3_14543	SPU_014543	homologous to several mosquito proteins	none
GLEAN3_14981	SPU_014981	Sp-specific	none
GLEAN3_15018	SPU_015018	homologous to mosquito proteins	none
GLEAN3_15209	SPU_015209	contains TraB_pillus domain. Sp-specific protein.	none
GLEAN3_05894	SPU_005894	contains 6 EGF_CA superfamily motifs	none
GLEAN3_15216	SPU_015216	Sp-specific protein or bogus.	none
GLEAN3_15286	SPU_015286	contains 2 IG superfamily motifs. Sp-specific protein with N-terminal region weakly homologous to proteins in other species.	none
GLEAN3_15308	SPU_014308	contains MDN1 domain. Sp-specific protein with N-terminal region weakly homologous to 2 Branchiostoma floridae hypothetical proteins.	none
GLEAN3_05924	SPU_005924	contains ANK superfamily motif at N-terminus. sea urchin-specific protein.	none
GLEAN3_06284	SPU_006284	contains COG1305 domain	none
GLEAN3_07346	SPU_007346	contains RING superfamily motif at C-terminus	none
GLEAN3_08291	SPU_008291	contains CUB superfamily motifs at N-terminus and 7tm_2 superfamily motif near C-terminus	none
GLEAN3_08687	SPU_008687	contains Sec7 superfamily motif at C-terminus	none
GLEAN3_09081	SPU_009081	contains RecD domain	none
GLEAN3_09154	SPU_009154	appears to be S. purpuratus-specific	none
GLEAN3_09259	SPU_009259	appears to be S. purpuratus-specific	none
GLEAN3_09463	SPU_009463	contains Smc (chromosome segregation ATPases) domain	none
GLEAN3_09464	SPU_009464	contains Smc (chromosome segregation ATPases) domain and SMC_N domain and Herpes_BLLF1 domain	none
GLEAN3_09667	SPU_009667	contains Smc domain	none
GLEAN3_09777	SPU_009777	contains TSP_1 (thrombospondin) superfamily motifs near C-terminus	none
GLEAN3_09921	SPU_009921	appears to be S. purpuratus-specific, unique protein	none
GLEAN3_09928	SPU_009928	appears to be S. purpuratus-specific, unique protein	none
GLEAN3_10074	SPU_010074	contains SMC_N domain	none
GLEAN3_10200	SPU_010200	no homolog found	none
GLEAN3_10202	SPU_010202	contains PH-like superfamily motif at N-terminus. S. purpuratus-specific.	none
GLEAN3_10225	SPU_010225	contains Smc domain and 2 SMC_N domain motifs	none
GLEAN3_11508	SPU_011508	contains 6 EGF_CA superfamily motifs. very large protein (ca 6,000 amino acids). appears to be S. purpuratus-specific.	none
GLEAN3_11581	SPU_011581	homologous to conserved putative proteins	none
GLEAN3_12466	SPU_012466	contains RING superfamily motif at C-terminus and Myosin_tail domain and Smc domain	none
GLEAN3_12612	SPU_012612	contains 2 PAT1 domain motifs	none
GLEAN3_12692	SPU_012692	contains PDZ superfamily motif near C-terminus	none
GLEAN3_12879	SPU_012879	S. purpuratus-specific protein	none
GLEAN3_12981	SPU_012981	S. purpuratus-specific protein	none
GLEAN3_13286	SPU_013286	appears to be S. purpuratus-specific	none
GLEAN3_13333	SPU_013333	S. purpuratus-specific protein	none
GLEAN3_13439	SPU_013439	S. purpuratus-specific	none
GLEAN3_13603	SPU_013603	large (> 4,000 amino acids) S. purpuratus-specific protein	none
GLEAN3_13643	SPU_013643	S. purpuratus-specific protein	none
GLEAN3_13982	SPU_013982	contains 3 NDPk superfamily motifs. S. purpuratus-specific or echinoderm-specific.	none
GLEAN3_14072	SPU_014072	Sp-specific	none
GLEAN3_14335	SPU_014335	contains SMC_N domain. appears to be Branchiostoma floridae-specific.	none
GLEAN3_14414	SPU_014414	contains COG4886 domain. appears to be Branchiostoma floridae-specific.	none
GLEAN3_15797	SPU_015797	Sp-specific protein	none
GLEAN3_15820	SPU_015820	contains 2 EGF_CA superfamily motifs	none
GLEAN3_15906	SPU_015906	Sp-specific protein	none
GLEAN3_15914	SPU_015914	Sp-specific protein	none
GLEAN3_15951	SPU_015951	contains PRK03918 domain and PAT1 domain. Sp-specific protein.	none
GLEAN3_16132	SPU_016132	Sp-specific protein	none
GLEAN3_16223	SPU_016223	contains PAT1 domain. Sp-specific protein.	none
GLEAN3_16285	SPU_016285	contains LIM superfamily motif at N-terminus. Sp-specific.	none
GLEAN3_16485	SPU_016485	Sp-specific protein	none
GLEAN3_08282	SPU_008282	contains 2 BTB superfamily motifs and ATS1 domain	none
GLEAN3_15416	SPU_015416	contains INCEN superfamily motif near C-terminus	none
GLEAN3_05776	SPU_005776	contains 2 WD40 superfamily motifs	none
GLEAN3_11304	SPU_011304	contains 4 Kelch-like superfamily motifs	none
GLEAN3_08893	SPU_008893	contains 3 E_set superfamily motifs	none
GLEAN3_16478	SPU_016478	contains 3 Hyd_WA superfamily motifs	none
GLEAN3_09103	SPU_009103	contains 4 P_loop_NTPase superfamily motifs	none
GLEAN3_01139	SPU_001139	contains 2 Glyco_hydro_31 superfamily motifs	none
GLEAN3_08933	SPU_008933	contains CYK3 domain	none
GLEAN3_03926	SPU_003926	contains 5 LamG superfamily motifs and PRK02224 domain	none
GLEAN3_00376	SPU_000376	contains LRR_RI superfamily motif near N-terminus and PDZ superfamily motif at C-terminus	none
GLEAN3_05309	SPU_005309	contains COG4886 domain	none
GLEAN3_02862	SPU_002862	contains COG4886 domain	none
GLEAN3_02072	SPU_002072	contains 3 MFS superfamily motifs and MFS_1 domain	none
GLEAN3_11525	SPU_011525	contains 8 MAM superfamily motifs	none
GLEAN3_01261	SPU_001261	contains MDN1 domain and Smc domain	none
GLEAN3_08413	SPU_008413	contains 2 DNA_pol_phi superfamily motifs	none
GLEAN3_04269	SPU_004269	contains 2 PHR domain motifs	none
GLEAN3_08030	SPU_008030	contains PHR domain	none
GLEAN3_15420	SPU_015420	contains 3 PHD superfamily motifs	none
GLEAN3_13478	SPU_013478	contains 3 WD40 superfamily motifs and COG2319 domain	none
GLEAN3_14279	SPU_014279	contains CH superfamily motif near N-terminus	none
GLEAN3_01232	SPU_001232	contains SMC_N domain	none
GLEAN3_15653	SPU_015653	contains SMC_N domain and mukB domain	none
GLEAN3_02313	SPU_002313	contains Chaperonin-like superfamily motif near N-terminus	none
GLEAN3_16227	SPU_016227	contains 2 Smc domain motifs	none
GLEAN3_04772	SPU_004772	contains MIP-T3 domain and RecD domain	none
GLEAN3_01338	SPU_001338	homologous to numerous Danio rerio genes	none
GLEAN3_00495	SPU_000495	contains PAT1 domain	none
GLEAN3_13890	SPU_013890	contains Smc domain	none
GLEAN3_12581	SPU_012581	contains COG1112 domain	none
GLEAN3_04588	SPU_004588	contains COG2940 domain	none
GLEAN3_02571	SPU_002571	contains HepA domain. also homologous to Drosophila helicase domino.	none
GLEAN3_10617	SPU_010617	contains OATP domain and MFS_1 domain motifs	none
GLEAN3_14441	SPU_014441	contains 5 SPEC superfamily motifs	none
GLEAN3_13237	SPU_013237	contains 2 CH superfamily motifs and 8 SPEC superfamily motifs and 2 SbcC domain motifs and 3 Smc domain motifs. very large protein (11475 amino acids).	none
GLEAN3_05151	SPU_005151	contains 14 SPEC superfamily motifs	none
GLEAN3_08368	SPU_008368	contains 4 SPEC superfamily motifs	none
GLEAN3_02664	SPU_002664	contains 3 TPR superfamily motifs	none
GLEAN3_08214	SPU_008214	contains 2 P_loop_NTPase superfamily motifs and Smc domain	none
GLEAN3_07052	SPU_007052	contains 4 LDLa superfamily motifs	none
GLEAN3_14258	SPU_014258	contains 4 CCP superfamily motifs	none
GLEAN3_11098	SPU_011098	contains 5 CCP superfamily motifs	none
GLEAN3_11309	SPU_011309	contains 4 CCP superfamily motifs and 4 HYR superfamily motifs and 3 GCC2_GCC3 superfamily motifs	none
GLEAN3_12778	SPU_012778	contains Metallo-dependent_hydrolases superfamily motif at C-terminus	none
GLEAN3_13785	SPU_013785	contains MDN1 domain	none
GLEAN3_04998	SPU_004998	contains 3 TPR superfamily motifs	none
GLEAN3_02498	SPU_002498	contains 10 TPR superfamily motifs and NrfG domain	none
GLEAN3_08179	SPU_008179	contains 3 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_08241	SPU_008241	contains 2 Smc domain motifs and PRK02224 domain	none
GLEAN3_10006	SPU_010006	contains 6 IG superfamily motifs	none
GLEAN3_06656	SPU_006656	contains 2 TPR superfamily motifs	none
GLEAN3_06258	SPU_006258	contains HECTc domain	none
GLEAN3_11281	SPU_011281	contains Peptidase_C19 superfamily motif at N-terminus	none
GLEAN3_08938	SPU_008938	contains UBP14 domain	none
GLEAN3_09889	SPU_009889	contains MRS6 domain	none
GLEAN3_10637	SPU_010637	contains WD40 superfamily motif near N-terminus	none
GLEAN3_09683	SPU_009683	contains WD40 superfamily motif at C-terminus	none
GLEAN3_11144	SPU_011144	contains 2 WD40 superfamily motifs and COG2319 domain	none
GLEAN3_04246	SPU_004246	contains 2 WD40 superfamily motifs	none
GLEAN3_10350	SPU_010350	contains 2 WD40 superfamily motifs	none
GLEAN3_15882	SPU_015882	contains SFP domain	none
GLEAN3_04455	SPU_004455	contains 2 ANK superfamily motifs and 2 Smc domain motifs and Arp domain and PRK09039 domain	none
GLEAN3_00582	SPU_000582	contains HECTc domain	none
GLEAN3_04314	SPU_004314	contains 2 Herpes_LMP superfamily motifs	none
GLEAN3_15197	SPU_015197	contains 3 HYR superfamily motifs. Sp-specific protein except for N-terminal 100 amino acids that share some homologies with other metazoans.	none
GLEAN3_09431	SPU_009431	contains DNA_pol_B_2 domain and POLBc domain	none
GLEAN3_09485	SPU_009485	appears to be S. purpuratus-specific	none
GLEAN3_09808	SPU_009808	contains SET superfamily motif at N-terminus and MDN1 domain	none
GLEAN3_13317	SPU_013317	contains Smc domain and SMC_N domain	none
GLEAN3_15950	SPU_015950	specific to S. purpuratus and Branchiostoma floridae	none
GLEAN3_12900	SPU_012900	contains Nucleoporin2 superfamily motif near N-terminus	none
GLEAN3_15412	SPU_015412	contains 3 PDZ superfamily motifs	none
GLEAN3_01249	SPU_001249	contains 15 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_14679	SPU_014679	contains 6 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14779	SPU_014779	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14866	SPU_014866	contains 7 ANK superfamily motifs and 2 ZU5 superfamily motifs and Arp domain motifs	none
GLEAN3_14883	SPU_014883	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14953	SPU_014953	contains 6 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_15024	SPU_015024	contains 6 ANK superfamily motifs and Arp domain motifs and Herpes_BLLF1 domain	none
GLEAN3_15058	SPU_015058	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_15270	SPU_015270	contains 15 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_15431	SPU_015431	contains 20 ANK superfamily motifs and 2 ZU5 superfamily motifs and Arp domain motifs	none
GLEAN3_15436	SPU_015436	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01383	SPU_001383	contains 26 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_15498	SPU_015498	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_15875	SPU_015875	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_16034	SPU_016034	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_16479	SPU_016479	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_16616	SPU_016616	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01385	SPU_001385	contains 20 ANK superfamily motifs and Arp domain	none
GLEAN3_01430	SPU_001430	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_01432	SPU_001432	contains 6 ANK superfamily motifs and Arp domain	none
GLEAN3_01433	SPU_001433	contains 14 ANK superfamily motifs	none
GLEAN3_01611	SPU_001611	contains 27 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_01685	SPU_001685	contains 3 ANK superfamily motifs	none
GLEAN3_01820	SPU_001820	contains 10 ANK superfamily motifs	none
GLEAN3_01871	SPU_001871	contains 9 ANK superfamily motifs and Arp domain	none
GLEAN3_00327	SPU_000327	contains 3 ANK superfamily motifs	none
GLEAN3_01988	SPU_001988	contains 10 ANK superfamily motifs and Arp domain	none
GLEAN3_02006	SPU_002006	contains 14 ANK superfamily motifs and Arp domain	none
GLEAN3_02247	SPU_002247	contains 8 ANK superfamily motifs and Arp domain	none
GLEAN3_02415	SPU_002415	contains 20 ANK superfamily motifs	none
GLEAN3_02468	SPU_002468	contains 13 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_02778	SPU_002778	contains 4 ANK superfamily motifs and 2 ZU5 superfamily motifs and 2 Arp domain motifs	none
GLEAN3_02848	SPU_002848	contains 12 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_02900	SPU_002900	contains 15 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_02943	SPU_002943	contains 15 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_00476	SPU_000476	contains 8 ANK superfamily motifs and Arp domain	none
GLEAN3_03370	SPU_003370	contains 11 ANK superfamily motifs and 3 Arp domain motifs	none
GLEAN3_03600	SPU_003600	contains 4 ANK superfamily motifs	none
GLEAN3_03809	SPU_003809	contains 19 ANK superfamily motifs and 4 Arp domain motifs	none
GLEAN3_03834	SPU_003834	contains 15 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_03935	SPU_003935	contains 9 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_04426	SPU_004426	contains 12 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_04512	SPU_004512	contains 8 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_05023	SPU_005023	contains 9 ANK superfamily motifs and Arp domain	none
GLEAN3_05024	SPU_005024	contains 17 ANK superfamily motifs and many Arp domain motifs	none
GLEAN3_05027	SPU_005027	contains 9 ANK superfamily motifs and many Arp domain motifs	none
GLEAN3_00499	SPU_000499	contains 12 ANK superfamily motifs and 2 ZU5 superfamily motifs	none
GLEAN3_05066	SPU_005066	contains 16 ANK superfamily motifs and many Arp domain motifs	none
GLEAN3_05094	SPU_005094	contains 14 ANK superfamily motifs and many Arp domain motifs	none
GLEAN3_05235	SPU_005235	contains 3 ANK superfamily motifs	none
GLEAN3_05465	SPU_005465	contains 12 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_05619	SPU_005619	contains 6 ANK superfamily motifs and many Arp domain motifs	none
GLEAN3_05858	SPU_005858	contains 3 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_05928	SPU_005928	contains 6 ANK superfamily motifs	none
GLEAN3_05970	SPU_005970	contains 4 ANK superfamily motifs	none
GLEAN3_05975	SPU_005975	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_05997	SPU_005997	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_00697	SPU_000697	contains 9 ANK superfamily motifs and 2 ZU5 superfamily motifs	none
GLEAN3_06051	SPU_006051	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_06131	SPU_006131	contains 12 ANK superfamily motifs	none
GLEAN3_06327	SPU_006327	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_06518	SPU_006518	contains 18 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_06615	SPU_006615	contains 18 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_06832	SPU_006832	contains 13 ANK superfamily motifs and 2 ZU5 superfamily motifs and Arp domain motifs	none
GLEAN3_07123	SPU_007123	contains 4 ANK superfamily motifs and 2 ZU5 superfamily motifs and 2 Death superfamily motifs	none
GLEAN3_07298	SPU_007298	contains 5 ANK superfamily motifs	none
GLEAN3_07332	SPU_007332	contains 11 ANK superfamily motifs	none
GLEAN3_08014	SPU_008014	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01017	SPU_001017	contains 5 ANK superfamily motifs and 2 Arp domain motifs	none
GLEAN3_08437	SPU_008437	contains 10 ANK superfamily motifs and Arp domains	none
GLEAN3_08887	SPU_008887	contains 12 ANK superfamily motifs and Arp domains	none
GLEAN3_09034	SPU_009034	contains 12 ANK superfamily motifs and Arp domains	none
GLEAN3_09792	SPU_009792	contains 9 ANK superfamily motifs and Arp domains	none
GLEAN3_09823	SPU_009823	contains 12 ANK superfamily motifs and Arp domains	none
GLEAN3_09997	SPU_009997	contains 12 ANK superfamily motifs and Arp domains	none
GLEAN3_10676	SPU_010676	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11004	SPU_011004	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11006	SPU_011006	contains 3 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01018	SPU_001018	contains 10 ANK superfamily motifs and Arp domain	none
GLEAN3_11021	SPU_011021	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11341	SPU_011341	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11359	SPU_011359	contains 17 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11500	SPU_011500	contains 17 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11598	SPU_011598	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11599	SPU_011599	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11748	SPU_011748	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11898	SPU_011898	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_11950	SPU_011950	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12058	SPU_012058	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01163	SPU_001163	contains 11 ANK superfamily motifs and Arp domain	none
GLEAN3_12059	SPU_012059	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12060	SPU_012060	contains 25 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12153	SPU_012153	contains 31 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12267	SPU_012267	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12392	SPU_012392	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_12587	SPU_012587	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13050	SPU_013050	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13210	SPU_013210	contains 19 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13213	SPU_013213	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13221	SPU_013221	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01205	SPU_001205	contains 10 ANK superfamily motifs and Arp domain	none
GLEAN3_13239	SPU_013239	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13418	SPU_013418	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13421	SPU_013421	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_13776	SPU_013776	contains 6 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14018	SPU_014018	contains 15 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14096	SPU_014096	contains 17 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14159	SPU_014159	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14484	SPU_014484	contains 23 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14601	SPU_014601	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_14661	SPU_014661	contains 18 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_10011	SPU_010011	contains 5 ANK superfamily motifs	none
GLEAN3_01237	SPU_001237	contains MDN1 domain	none
GLEAN3_01449	SPU_001449	contains 22 EGF-like superfamily motifs in tandem and 10 SRCR superfamily motifs in tandem. also homologous to Drosophila notch.	none
GLEAN3_12322	SPU_012322	contains 4 FA58C superfamily motifs	none
GLEAN3_06881	SPU_006881	contains 4 EGF_CA superfamily motifs and Trypan_PARP domain	none
GLEAN3_11308	SPU_011308	contains 3 GCC2_GCC3 superfamily motifs and 4 EGF-like superfamily motifs and 3 CCP superfamily motifs	none
GLEAN3_00918	SPU_000918	contains 10 EGF-like superfamily motifs and 3 HYR superfamily motifs and 2 LY (low density lipoprotein receptor) superfamily motifs	none
GLEAN3_00635	SPU_000635	the protein consists of 3 domains each of which contains RT-like superfamily motif, RNase superfamily motif, and rve superfamily motif.	none
GLEAN3_01918	SPU_001918	contains 2 7tm_2 superfamily motifs	none
GLEAN3_01921	SPU_001921	contains SET superfamily motif toward N-terminus	none
GLEAN3_08222	SPU_008222	contains 12 MAM superfamily motifs and 5 LDLa superfamily motifs	none
GLEAN3_14415	SPU_014415	contains 14 EGF_CA superfamily motifs. member of very large Sp-specific protein families.	none
GLEAN3_14530	SPU_014530	Sp-specific protein with some homologies to Branchiostoma floridae proteins	none
GLEAN3_15146	SPU_015146	contains MDN1 domain. appears to be a member of Sp-specific protein family.	none
GLEAN3_06619	SPU_006619	contains COG5635 domain and PAT1 domain. belongs to a large group of proteins specific to S. purpuratus.	none
GLEAN3_06858	SPU_006858	contains RT-like superfamily motif at C-terminus. appears to belong to a large S. purpuratus-specific group of proteins.	none
GLEAN3_07677	SPU_007677	contains 5 CCP superfamily motifs and 4 HYR superfamily motifs	none
GLEAN3_05200	SPU_005200	contains 2 MAM superfamily motifs at N-terminus	none
GLEAN3_08888	SPU_008888	contains 9 HYR superfamily motifs. member of very large S. purpuratus-specific protein family.	none
GLEAN3_08967	SPU_008967	contains SMC_N domain. member of a large S. purpuratus-specific protein family.	none
GLEAN3_09536	SPU_009536	member of very large S. purpuratus-specific nuclease/reverse transcriptase-like proteins	none
GLEAN3_09591	SPU_009591	member of very large S. purpuratus-specific nuclease/reverse transcriptase-like proteins	none
GLEAN3_09608	SPU_009608	contains 4 IG superfamily motifs and V-set domain	none
GLEAN3_09794	SPU_009794	member of large glycoprotein fucotransferase-like, S. purpuratus-specific protein family	none
GLEAN3_05388	SPU_005388	contains 2 RING superfamily motifs and PRK03918 domain	none
GLEAN3_10134	SPU_010134	member of very large S. purpuratus-specific nuclease/reverse transcriptase-like proteins	none
GLEAN3_10153	SPU_010153	member of very large S. purpuratus-specific protein family	none
GLEAN3_10684	SPU_010684	contains Herpes_BLLF1 domain	none
GLEAN3_11771	SPU_011771	contains 4 CUB superfamily motifs and 7 EGF_CA superfamily motifs	none
GLEAN3_11782	SPU_011782	contains 5 CCP superfamily motifs and 10 HYR superfamily motifs	none
GLEAN3_13077	SPU_013077	contains 8 EGF_CA superfamily motifs and 3 CCP superfamily motifs	none
GLEAN3_15813	SPU_015813	contains 11 HYR superfamily motifs	none
GLEAN3_15879	SPU_015879	contains 2 MAM superfamily motifs	none
GLEAN3_16322	SPU_016322	contains SET superfamily motif near N-terminus	none
GLEAN3_16424	SPU_016424	contains Smc domain and SMC_N domain. member of Sp-specific protein family.	none
GLEAN3_16448	SPU_016448	contains SET superfamily motif at N-terminus	none
GLEAN3_16473	SPU_016473	specific to Nematostella vectensis, Branchiostoma floridae, and Strongylocentrotus purpuratus	none
GLEAN3_16179	SPU_016179	contains 8 LDLb superfamily motifs and 2 IG superfamily motifs and COG3391 domain	none
GLEAN3_16181	SPU_016181	contains 8 LDLb superfamily motifs and COG3386 domain	none
GLEAN3_03671	SPU_003671	contains 7 LDLa superfamily motifs and 3 CUB superfamily motifs	none
GLEAN3_00737	SPU_000737	contains Macoilin domain	none
GLEAN3_01336	SPU_001336	contains 6 MAM superfamily motifs	none
GLEAN3_14016	SPU_014016	contains 5 MAM superfamily motifs and PRK08404 domain	none
GLEAN3_00725	SPU_000725	contains 6 MAM superfamily motifs in tandem	none
GLEAN3_01753	SPU_001753	contains 8 MAM superfamily motifs	none
GLEAN3_02133	SPU_002133	contains 29 MAM superfamily motifs and 18 LDLa superfamily motifs	none
GLEAN3_14087	SPU_014087	contains 6 MAM superfamily motifs	none
GLEAN3_03589	SPU_003589	contains 5 ANK superfamily motifs. also similar to ankyrin2,3/unc44.	none
GLEAN3_03258	SPU_003258	contains 12 SPEC superfamily motifs	none
GLEAN3_03261	SPU_003261	contains 22 SPEC superfamily motifs	none
GLEAN3_02173	SPU_002173	contains COG1112 domain and Keratin_B2 domain	none
GLEAN3_14734	SPU_014734	contains Gag_spuma domain and COG1112 domain	none
GLEAN3_15792	SPU_015792	contains 20 EGF_CA superfamily motifs	none
GLEAN3_07264	SPU_007264	contains 2 EGF_CA superfamily motifs	none
GLEAN3_08945	SPU_008945	contains 2 EGF_CA superfamily motifs and PRK03427 cell division protein motif	none
GLEAN3_13491	SPU_013491	contains 4 EGF_CA superfamily motifs	none
GLEAN3_15009	SPU_015009	contains 2 CUB superfamily motifs and 10 EGF_CA superfamily motifs	none
GLEAN3_00675	SPU_000675	contains 15 EFG superfamily motifs in tandem and 2 CUB superfamily motifs in tandem	none
GLEAN3_06225	SPU_006225	contains 3 GCC2_GCC3 superfamily motifs	none
GLEAN3_14412	SPU_014412	contains 7 EGF_CA superfamily motifs	none
GLEAN3_02783	SPU_002783	contains 2 MAM superfamily motifs and 7 EGF_C superfamily motifs	none
GLEAN3_13808	SPU_013808	contains rne domain	none
GLEAN3_02127	SPU_002127	contains PRK12678 domain	none
GLEAN3_03319	SPU_003319	contains 2 motifs of RT-like superfamily, Peptidase_A17 superfamily, and rve superfamily. appears to represent a duplication of the region containing the 3 superfamily motifs.	none
GLEAN3_04278	SPU_004278	contains 2 RVP superfamily motifs and 2 RT-like superfamily motifs that appears to be a duplication of RVP-RT-like structure	none
GLEAN3_05695	SPU_005695	contains AIR1 domain	none
GLEAN3_07074	SPU_007074	contains 2 SRCR superfamily motifs at N-terminus	none
GLEAN3_10235	SPU_010235	contains PRK12678 domain	none
GLEAN3_11612	SPU_011612	contains 2 motifs of RT-like superfamily and Peptidase_A17 superfamily, respectively	none
GLEAN3_12595	SPU_012595	contains AIR1 domain	none
GLEAN3_13702	SPU_013702	contains 3 HYR superfamily motifs and 2 GCC2_GCC3 superfamily motifs	none
GLEAN3_02021	SPU_002021	contains 2 CCP superfamily motifs	none
GLEAN3_04333	SPU_004333	contains f1hF domain and COG1112 domain	none
GLEAN3_13438	SPU_013438	contains COG1112 domain	none
GLEAN3_02564	SPU_002564	contains COG1112 domain	none
GLEAN3_02278	SPU_002278	contains 2 rne domain motifs and PRK07003 domain	none
GLEAN3_09015	SPU_009015	contains 10 LDLa superfamily motifs and 4 CUB superfamily motifs	none
GLEAN3_14758	SPU_014758	contains 3 zf-RING superfamily motifs and 2 PH-like superfamily motifs	none
GLEAN3_00834	SPU_000834	contains 4 CUB superfamily motifs and 2 EGF-like superfamily motifs	none
GLEAN3_06000	SPU_006000	contains 4 EGF_CA superfamily motifs	none
GLEAN3_06001	SPU_006001	contains 6 EGF_CA superfamily motifs	none
GLEAN3_03055	SPU_003055	also homologous to pol polyprotein and various proteins	none
GLEAN3_03056	SPU_003056	also homologous to pol polyprotein and various proteins	none
GLEAN3_03057	SPU_003057	also homologous to pol polyprotein and various proteins	none
GLEAN3_03059	SPU_003059	also homologous to pol polyprotein and various proteins	none
GLEAN3_03060	SPU_003060	also homologous to pol polyprotein and various proteins	none
GLEAN3_03061	SPU_003061	contains 2 rve superfamily motifs and 2 RT-like motifs. appears to be a duplication of rve and RT-like motif protein. also homologous to pol polyprotein and various proteins.	none
GLEAN3_03062	SPU_003062	also homologous to pol polyprotein and various proteins	none
GLEAN3_03063	SPU_003063	also homologous to pol polyprotein and various proteins	none
GLEAN3_16187	SPU_016187	contains COG1112 domain	none
GLEAN3_02163	SPU_002163	contains COG1112 domain and Furlin-like_cysteine-rich domain	none
GLEAN3_02705	SPU_002705	contains PRK12678 domain and SSL2 domain and COG1112 domain	none
GLEAN3_02706	SPU_002706	contains COG1112 domain	none
GLEAN3_06468	SPU_006468	contains COG1112 domain	none
GLEAN3_11697	SPU_011697	contains COG1112 domain	none
GLEAN3_05472	SPU_005472		\N
GLEAN3_00757	SPU_000757		\N
GLEAN3_17031	SPU_017031	strong DNA polymerase motifs shared by diverse groups of insects and intestinal parasites	\N
GLEAN3_27935	SPU_027935	After reviewing the data and performing the BLAST search, it appears that there is no good GLEAN model that fits GLEAN3_27935 sufficiently. According to the BLAST results and the excel data, there are large sequence gaps resulting in poor sequence coverage an un-orderly arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values slightly greater than 5. 	\N
GLEAN3_26731	SPU_026731		\N
GLEAN3_18170	SPU_018170		\N
GLEAN3_09413	SPU_009413		\N
GLEAN3_13935	SPU_013935		\N
GLEAN3_11034	SPU_011034		\N
GLEAN3_00128	SPU_000128		\N
GLEAN3_11451	SPU_011451		\N
GLEAN3_04405	SPU_004405		\N
GLEAN3_05842	SPU_005842		\N
GLEAN3_18826	SPU_018826		\N
GLEAN3_25315	SPU_025315		\N
GLEAN3_23252	SPU_023252		\N
GLEAN3_27899	SPU_027899		\N
GLEAN3_08245	SPU_008245		\N
GLEAN3_10786	SPU_010786		\N
GLEAN3_25436	SPU_025436		\N
GLEAN3_21643	SPU_021643		\N
GLEAN3_24416	SPU_024416		\N
GLEAN3_22382	SPU_022382		\N
GLEAN3_14028	SPU_014028		\N
GLEAN3_06161	SPU_006161		\N
GLEAN3_22857	SPU_022857		\N
GLEAN3_07537	SPU_007537		\N
GLEAN3_02638	SPU_002638		\N
GLEAN3_00153	SPU_000153		\N
GLEAN3_19228	SPU_019228		\N
GLEAN3_09218	SPU_009218		\N
GLEAN3_06481	SPU_006481		\N
GLEAN3_03136	SPU_003136		\N
GLEAN3_13373	SPU_013373		\N
GLEAN3_13765	SPU_013765		\N
GLEAN3_02787	SPU_002787		\N
GLEAN3_06572	SPU_006572		\N
GLEAN3_06123	SPU_006123		\N
GLEAN3_05097	SPU_005097		\N
GLEAN3_00927	SPU_000927		\N
GLEAN3_05779	SPU_005779		\N
GLEAN3_04238	SPU_004238		\N
GLEAN3_09134	SPU_009134		\N
GLEAN3_11320	SPU_011320		\N
GLEAN3_00242	SPU_000242		\N
GLEAN3_11220	SPU_011220		\N
GLEAN3_15921	SPU_015921		\N
GLEAN3_02857	SPU_002857		\N
GLEAN3_12977	SPU_012977		\N
GLEAN3_16866	SPU_016866		\N
GLEAN3_17918	SPU_017918		\N
GLEAN3_02521	SPU_002521		\N
GLEAN3_17211	SPU_017211		\N
GLEAN3_27573	SPU_027573		\N
GLEAN3_21159	SPU_021159		\N
GLEAN3_10776	SPU_010776		\N
GLEAN3_05272	SPU_005272		\N
GLEAN3_09027	SPU_009027		\N
GLEAN3_09192	SPU_009192		\N
GLEAN3_02115	SPU_002115		\N
GLEAN3_08472	SPU_008472		\N
GLEAN3_00836	SPU_000836		\N
GLEAN3_02875	SPU_002875		\N
GLEAN3_04442	SPU_004442		\N
GLEAN3_00340	SPU_000340		\N
GLEAN3_25699	SPU_025699		\N
GLEAN3_16177	SPU_016177		\N
GLEAN3_05670	SPU_005670		\N
GLEAN3_14876	SPU_014876		\N
GLEAN3_03832	SPU_003832		\N
GLEAN3_02252	SPU_002252		\N
GLEAN3_25212	SPU_025212		\N
GLEAN3_10221	SPU_010221		\N
GLEAN3_06561	SPU_006561		\N
GLEAN3_02316	SPU_002316		\N
GLEAN3_12186	SPU_012186		\N
GLEAN3_21319	SPU_021319		\N
GLEAN3_16283	SPU_016283		\N
GLEAN3_24249	SPU_024249		\N
GLEAN3_08527	SPU_008527		\N
GLEAN3_01774	SPU_001774		\N
GLEAN3_06124	SPU_006124		\N
GLEAN3_28455	SPU_028455		\N
GLEAN3_19710	SPU_019710		\N
GLEAN3_14316	SPU_014316		\N
GLEAN3_21158	SPU_021158		\N
GLEAN3_22837	SPU_022837		\N
GLEAN3_06814	SPU_006814		\N
GLEAN3_10203	SPU_010203		Also found in scaffolds 113729, 14987, and 2038\n
GLEAN3_20330	SPU_020330		Only the last exon of GLEAN3_20330 belongs to SpGrm, and a big part of the predicted ORF is DNA mismatch repair protein Mlh3, which has a duplicated seq in genome\n
GLEAN3_02737	SPU_002737		See GLEAN3_01774, _09709. \n
GLEAN3_27023	SPU_027023		CDS sequences are part of rendezvin.  See GLEAN3_19369 for which exons they encode.\n
GLEAN3_27987	SPU_027987		In addition to duplicated copy, there is a third copy not in the Glean3 list on scaffold 134677\n
GLEAN3_15027	SPU_015027		This prediction covers partial CDS that is missing C-terminal sequences, which are probably on another scaffold.  On the gene duplications page, I have listed another copy also lacking C-terminal sequences.  This second copy is on a scaffold, 52540, which contains multiple ACE-like genes, some complete and some partial.  Finally, there is another copy in the genome that is not on the GLEAN3 list on  Scaffold 123895.\n
GLEAN3_23329	SPU_023329		Adjacent to this model is a very closely related gene, GLEAN3_23330.\n
GLEAN3_23330	SPU_023330		This gene model is adjacent to GLEAN3_23329, which encodes a very similar protein.\n
GLEAN3_18409	SPU_018409		240 bp intron was accepted as coding region by comparison to the corresponding FgenesAB prediction.\n
GLEAN3_25719	SPU_025719		507 bp and 549bp introns and 426bp of the 5'UTR were accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_28940	SPU_028940		Partial CDS.  The model may lack N-terminal half sequences. \n \nThis model of one of a cluster of 4 closely related NAALAD2 genes on scaffold 496.\n
GLEAN3_15185	SPU_015185		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 93% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_11837	SPU_011837		RNA editing function, see also Glean3_11875, for identical gene (duplication? or redundant scaffold?) \n
GLEAN3_22057	SPU_022057		This prediction lacks 5'-terminus.  5'-terminus sequence is in GLEAN3_10703 (scaffold 1972). \n
Sp-ADAM12-like	SPU_030025		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-STE24p-like	SPU_030050		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_10805	SPU_010805		duplicate GLEAN3_17694\n
GLEAN3_00154	SPU_000154		Lacking N-terminus.  See also GLEAN3_06123. \n
GLEAN3_03744	SPU_003744		Likely assembly error. Identical over frpm AA 180 onwards to GLEAN3_03743. Recommend deletion?\n
GLEAN3_16163	SPU_016163		This model is possibly incomplete.\n
GLEAN3_17197	SPU_017197		GLEAN3_15064 has the first part of the gene. GLEAN3_17197 should have the latter half. GLEAN3_15899 is likely a haplotype of GLEAN3_17197.\n
GLEAN3_19208	SPU_019208		expressed histone gene\n
GLEAN3_00009	SPU_000009		 partial, missing C-terminus\n
GLEAN3_00447	SPU_000447		 missing C-terminus\n
GLEAN3_07150	SPU_007150		 partial, missing C-terminus\n
GLEAN3_09099	SPU_009099		 missing stretch in middle, missing C-terminus\n
GLEAN3_14007	SPU_014007		 partial, missing N-terminus\n
GLEAN3_20808	SPU_020808		 missing C-terminus\n
GLEAN3_20881	SPU_020881		 missing N-terminus\n
GLEAN3_01956	SPU_001956		 extra C-terminus\n
GLEAN3_14214	SPU_014214		 partial, missing N- and C-terminus\n
GLEAN3_10918	SPU_010918		 missing central regions\n
GLEAN3_14874	SPU_014874		 partial, missing N-terminus\n
GLEAN3_24639	SPU_024639		 partial, missing C-terminus\n
GLEAN3_20002	SPU_020002		 missing C-terminus\n
GLEAN3_20176	SPU_020176		 missing N-terminus\n
GLEAN3_21295	SPU_021295		 partial, missing C-terminus\n
GLEAN3_17950	SPU_017950		short protein 0nly 99aa. \nOriginally called H2bk, changed to pseudo-gene on 28 July 2006.\n
GLEAN3_01225	SPU_001225		Homologue to GGDEF and Neurotrophin. Am working to find out what it is exactly. \nSimilar to genescan predicted peptide: Supertig12270|GENSCAN_predicted_peptide_1|302_aa \nCHIMERIC PROTEIN:  \nSupertig12270|GENSCAN_predicted_peptide_1|302_aa CONTAINS A PIECE OF SEQUENCE WHICH IS IN Supertig153688|GENSCAN_predicted_peptide_1|85_aa and \n
GLEAN3_08358	SPU_008358		Inspection of the tiling array suggests that glean may have missed the following exons: GFGYQREEVLFQIWDKRGGEKVAFLLDFVREAPKVEVQSLQKEGTWGNYKDSSQPPPTSPLPETPSLAKDTQTPPTSPLPETPSLAKDNQTPPTSPLPETQSLAKDTQTPPTSPLPETPFLAKDTQTPPTSPLPETPSLAKDTQTPPTSPLPETQSLAKDTQTPPTSPLPETQSLAKDTQTPPTSPLPETPSLAKDTQTPPTSPLPETLSPAKDTQTPPTSPLLEIPPHHDDDGISINTNPVDGWYEYTNDTGTQTSDPEDHIGKLMDRCRISIKQNPDSKRTEACSETQSYFHTLKSRES,QNRGKMVQRHGNQTEDGERKCLEDRVWNGWKEQVVYSPFTSVASSDCENFELTELKRQLQELFERQPTKLVMTPERKIFDGKAAEIEDLVISVKRSFSRYGIREEGKRSPFSWILSEKLQR,RFNLSRRKEPGGTTKTLPNLHQLHLSQKLRPWLKTPKHHQLPLSQKLRPWLKTTKHHQLPLSQKPSPLLKTPKHHQLPLSQKPRSWLKTLKHHQLPLSQKPRPWLKTPKHHQLPLSQKLSPWLKTPKHHQLPLSQKLSPWLKTPKHHQLPLSQKPHPWLKTPKHHQLPLSQKPCPRLKTPKHHQLPLS,FMNIVYHHRTMANKRKAWSLQGIIAVMQEYLSCHDVYKRMKRNGTTKGFHQTVADKLSYQENATENLLKSLGREYRDILQ,WGSDVLRTRSITGKGSNMEKGKFDTIMSQQKNSSFVKYLAVAIWGSDVLRTRSITGKGSNVDKGKFDTIMSQQKNSSFVKYLALAIWGSDVLRTRSIT,XXXXXXXGYLGFGCSENALDHRKKGKSICEPLELGQKRRHCVVMNLPKFPSSSEPALHLSDHEAVSIVVGLAIFSGTAGSKVHLCFLRKGV\n
GLEAN3_12772	SPU_012772		Inspection of the tiling array suggests that glean may have missed the following exons: YWVHARAQWLEFDVTSHHLATTGLVYIMKLVCVFVSAKTSLLNGKYTSEDKVHHKTQPVLMPIRNAALLDMLQVTKRLQLGRITKKVSIISSFVRLK\n
GLEAN3_14793	SPU_014793		Inspection of the tiling array suggests that glean may have missed the following exons: RRVDNCGPVCGAALSILIIMRIHDFISSLCGSLSLKLSSPFLMRNYLSALTLQSSFDDMAEAFTIFLFCHIYWYKFPYIVDCKNVPT\n
GLEAN3_17348	SPU_017348		Inspection of the tiling array suggests that glean may have missed the following exons: CNRNHSLDGSFPLALSLALSPSSPSPAPPPLLPLPSTSLSLFISLIPPPYLSFPSPCLFLSSSLSPVLPPSPLSLSLSLSPTYLSRTS\n
GLEAN3_19651	SPU_019651		Inspection of the tiling array suggests that glean may have missed the following exons: QTFVNTEDIGRYPSLRGPPAHDTSSGCTDGTSCRNGDTCKVADDQSKRSAFQAYQREMPKLLLLAYLKLSPFRLHPLPYQRL,RSFVRESFVAEGAEVAQLSSLMHPAFMSTYPFADTESLPTDLTLERFEACMFPLVIGEGAIRRKGSWAERAAVETVLLVTLVMLKEVAFFHEDTRAQVTLE,EHQPMVFACIVNAAILVLLKNCYKSQLIMAQLIITPAHSSFILHDEPLFISDISCHQLYIFTIPCLNVHLSLYCSPWHVSHNCDCLISVDSISL\n
GLEAN3_19653	SPU_019653		Inspection of the tiling array suggests that glean may have missed the following exons: YFNLNKSHTSISSSLCLFTGFTLLLLLWAAVQIFCPYTVRSGAIASGLMFCPFDVLSLTIASVLVFCVPMVLFVTVAIPLFLTLSLFDP,EAGGNFVRPGSLDSVSGLLLDSMILPLALPSSFPRCFVISSMIPEDKMRTVSPLLLSGLVCPKPFDAPELRILEESCGRPSLVLAVL,PKMTVTSTPHPLPPVHLPAVHTVYQQGSPPQRSLQRSSGLHHDEMRKTCTRIPPVMVEHPSLLLLHPPSPSRHHPTPDSYHPHQRPHRSPLALPPPHNHYYNLHLSLTHLYLTLDCHHHHHHRRHCHPPPPLPSPPCLFFPSPLPPPHHTTLHPPPIHLHPPAVMRHNGTSPQHLSGNNNNKI\n
GLEAN3_22473	SPU_022473		Inspection of the tiling array suggests that glean may have missed the following exons: PVLEGISPVSQDSSQVLPSMENSSQSERELDESINDPTNSPPTHDEDSTGKQGFDCTKCKKRFSVESDLGSHVMMCCGNLSTQCPVCKKIFASKSYIGKHMRLHTGEKPFQCGECGMRFTRKHHLVHHQRTHTGEKPFKCTE\n
GLEAN3_26418	SPU_026418		Inspection of the tiling array suggests that glean may have missed the following exons: TRNELKNPRHLPRPRPRTFMYKSLVDIDRYRDDVTPNRTRRASLYTIEPSMVFKEIKDSQQIGYTRVNRVSENVLQRGSIVNEQVSEWLMWITTRNAFDIIMPHL\n
GLEAN3_10248	SPU_010248		Incomplete Protein kinase domain\n
GLEAN3_15668	SPU_015668		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 11" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like6" one of its synonyms. \n \nNB: The structure of this model is supported by the fact that other gene prediction protocols generated almost identical models and by the genome-wide tiling array hybridization data.\n
GLEAN3_08227	SPU_008227		This gene model may represent a pseudogene or contain a sequence error. 1200bp of 3'UTR was accepted to a coding region that is encoding TIR domain.  \n
GLEAN3_07337	SPU_007337		#\nIt's missing 5' and 3' end of the gene.  It overlaps with Glean3_18583.\n
GLEAN3_05376	SPU_005376		Position on contig and alignments with best hit suggest the gene is missing 3' exons.\n
GLEAN3_26916	SPU_026916		thanks to Charlie W.\n
GLEAN3_22078	SPU_022078		EGF-LAMG-LAMG-EGF-LAMG \n \nthese two domains (or other EGF variants) occur in intermingled fashion in quite a few known proteins - neurexins, perlecans, agrin, crumbs, CASPR, some cadherins \n \nurchins appear also to have novel EGF/LAMG proteins \n
GLEAN3_02947	SPU_002947		Only domains it contains is a portion of a metalloprotease domain.\n
GLEAN3_00580	SPU_000580		2 SRCR domains. Probably partial.\n
GLEAN3_09471	SPU_009471		 fragment\n
GLEAN3_14306	SPU_014306		 small fragment\n
GLEAN3_18083	SPU_018083		 partial, missing C-terminus half\n
GLEAN3_17872	SPU_017872		possible relative of col1a1\n
GLEAN3_17657	SPU_017657		SEA and EGFs\n
GLEAN3_21262	SPU_021262		FN3-10- one LH\n
GLEAN3_00876	SPU_000876		SRCR(2).  Probably partial. \n
GLEAN3_02041	SPU_002041		SigPep-SRCR(3)-TM. (DMBT1)\n
GLEAN3_11039	SPU_011039		Strongylocentrotus purpuratus calcium-binding protein (Endo16), mRNA. \n
GLEAN3_06045	SPU_006045		SRCR(2). Probably incomplete.\n
GLEAN3_08642	SPU_008642		SigPep-SRCR(8). Possibly incomplete.\n
GLEAN3_15011	SPU_015011		 fragment\n
GLEAN3_00745	SPU_000745		contains only part of the reprolysin domain\n
GLEAN3_00216	SPU_000216		5 EGF repeats, 13 HYR\n
GLEAN3_09224	SPU_009224		Could be haplotype pair of GLEAN3_06396\n
GLEAN3_14509	SPU_014509		Related to At1g79380/T8K14_20\n
GLEAN3_10140	SPU_010140		very weak signal.  no est.  may be pseudogene\n
GLEAN3_20754	SPU_020754		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 40.14% over 436 BLAST alignment positions. 183 of 666 Muscle alignment positions masked (27.400 %; 483 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_24836	SPU_024836	When reviewing the data it was apparent that the data provided by the excel spreadsheet did not come up as 	possible non-coding exon included in glean annotation\n
GLEAN3_13447	SPU_013447		PSI/IPT x3/TM\n
GLEAN3_20597	SPU_020597		SRCR(8)-Sushi(3)-HYR-Sushi_HYR(2)-Sushi(2). Possibly incomplete.\n
GLEAN3_12085	SPU_012085		IDENTICAL TO 11970\n
GLEAN3_02220	SPU_002220		PREDICTED: Strongylocentrotus purpuratus similar to survival of motor neuron 1, telomeric isoform b (LOC575759)\n
GLEAN3_16039	SPU_016039		Groups with the caspase 8 subfamily by neighbor joining of multiple sequence alignment.  Sister of caspase8-like/caspase8-like-2a/b.\n
Sp-IL17r-like	SPU_030141		This gene was annotated based on a manual inspection of multiple protein alignments, reciprocal BLASTing and domain structure analyses. \n \nGiven the position of this gene at the end of a scaffold, and based on alignments to other IL17 receptor genes, it is highly likely that there is N-ter sequence missing in this model.\n
GLEAN3_01101	SPU_001101		GLEAN3_01101 matches the first half of RBM19. GLEAN3_06990 matches the latter half.\n
GLEAN3_25671	SPU_025671		First exons more difficult to detect on expression microarray.  Similarity to known TAF1A proteins (mouse, rat, and human) show little similarity towards the carboxyl end, suggesting possible misprediciton of early exons (2 and 3).\n
GLEAN3_14788	SPU_014788		Partial duplication of GLEAN3_00838.\n
GLEAN3_20923	SPU_020923		Added 5'UTRs to the gene model based on the est data\n
GLEAN3_04533	SPU_004533		Partial sequence similar to DUSP23.\n
GLEAN3_11785	SPU_011785		The terminal part of this GLEAN is actually the first exon of a GST protein. Modified seqs are below: \nDNA: \nATGGCTTTAGCTCAGGAGGAGCTGACCATGATGAAGGGGAAGATCAACAGCCAGAAAGAGATGGGTAAACAC AAACTGGAAGGAGAACTTACCCAGCTGAAAGAGGTTCCAAGTTACCATTCAGATCTGTGCACGATTCCCAGT GTTGTTGCTGATGTCTGCAGTGACACAAGAGACCTCTCAGGGTTGGACATTGCTAAAAAGCGTGTGGCCCTG ATGGAACTCCTGCAGAAGCAGTCACAGCGTAAGGCACTAGCTGCCCTGGAAAGGCGACAGGCAACCCTTGAG GTGCAAGAAAGAACTGCTCAATTGATTCTGAAACATGAGGAGCAAAAACTTCAGGATTTTGAAACTGATCCA CGTTTGCTTGAGGGAGAGTTCGGTTCAGAGCAGTCACCTGAAAAAGAGCTTGGAGATCGTGACATTGAGAAC ACAGTGGATAGTTCTACAGAAAATGTGAGCACAGAATTGCCAGCATCAGGTCGAGTCAGTCCTGTAGGAAAC AGTGACCCTGAAACAATGCATCACAACAATGTTCCTTCTCCGAAGAAAGAGACTAAGCGCATCCAAAAGCAG GAAAACAAAGTTAAGACTATGGATCAAAAGAAGGACCAAACAAACAGTTCAAAGAAAAATTCCAGAAAAACT GATGCTAGACCTACTAGAAAGGTTGAATTAGGTCCTTCAAAAACTAAACCCAGGGAAGAACAAATAAAGAGG GAAAGTGATGGTCTATCTGCATCACCTTCAAGGTCTCCATCTCCTCCCAAAAGTTCAAGGTCAAGCTCGCCA AGTAAATCCTCGAGGTCCAGCTCACCCACCAAGTCACTAAGATCAGATTCACCAACAAAATCCTCAAGGTCC AACTCACCCACCAAGTCACTAAGATCAGATTCACCAACAAAATCCTCAAGGTCCAACTCACCCACCAAGTCA CTAAGATCAGATTCACCAACAAAATCCTCAAGGTCCAACTCACCCACCAAGTCATTAAGATCAGATTCACCA ACAAAATCTTCAAGGTCTGCATCTCCAACCAAATCTGTAAGGTCTGAATCACCGACCAAGTCATCAAGATCA TCTTCTCCTGCAAGTACCTCCTCGAGGCAGTCCAGGCAAACTTCAGGAAATGTTTTTTCTCGCCTGTATCCC CAGCAAGAAACAAAATTTAATTTTCTCAGGAAGAAGAGTCCTCCAAGATATGGTGAATATGATCCTCACAGA GAGAGAGACAGAGACTCTCCACACTTGATCAGGAGAGAACTGTCTCCTGTAGAGCATGACCCTATCAAGTTA AAGGACAAAAAACGTGAACATGTTCAGGACCATAAGCATGTGAGAACCAGCAGTGTGCCTTCAAGCAATCTT CCAGACAGAACAAAGCAAAATCTTCATTCTAGAAGTAGGAGTGCTAGTCCATGTGTAAATAATCCTGTAAAA CCAGCAACAAGGCCAAAGAAATCTCCAGTATCACAGTCCTCACAAGAAACTGAAAAGGCGGTTAGGAGAACT AAAGTGAAATCAAAAGACACAGCAAGTGCTCCACTTGATAGAACTCAAACTGATCCTGGTAATGAAAAACAA AATAATAAGTGCCAAACCAAAACCAAGCAAGCCCAACGCACATCTCCAAGTGAATCAACCCACTCCAGTCCA ACTAAAAGGAAATTAAATAAACCAGAAAACACTTCAAATGCAAAGACCCCTAAAGGCTCATCTGCCTCTCCT CGGTCAAGACCAATCATTGGGAAAGCAGTTGATAGAAGTCCATCTCCAAGGAAGAAATCAGAACAAACTCCA TCTCACCGGAAGACTGTAAAGAGGCCCCTTTCTAGGAGTCGCAACGATAAGGATGGTGAAAACACAAATTCT CCTAGTCCTAGGAGGAAATTTGCAAGAACTCCCGTTGTCAAACCAAAGAACTGGCCGTCCATAGAGAACTTA CCAAAGAGTACTCCCAAATCTGAAGCTTTCTATGTACCTCTCACTAGCGAACAGTTGAGGCTTGCACTTCAG AAGCACGCCAGTGAGCAGGACAGTGGACCACATCTGGAAGGTGCAGATTTGGGGGCTTCAGGATGTGAACAG CACTGGAGTCCCAGTCGTAAGAGGTCTATTGGAAAGCAGCCCCAGCAAACACAAACAGATCCAATTCAGGCC AGCATAGGAGGAGAGATATTTGGTTTGGAAATGGATGGAGTAAACAATTTAGAAAATGAGGAAGATCACTAC AGCTCCTCAGAA \nProtein: \nMALAQEELTMMKGKINSQKEMGKHKLEGELTQLKEVPSYHSDLCTIPSVVADVCSDTRDLSGLDIAKKRVAL MELLQKQSQRKALAALERRQATLEVQERTAQLILKHEEQKLQDFETDPRLLEGEFGSEQSPEKELGDRDIEN TVDSSTENVSTELPASGRVSPVGNSDPETMHHNNVPSPKKETKRIQKQENKVKTMDQKKDQTNSSKKNSRKT DARPTRKVELGPSKTKPREEQIKRESDGLSASPSRSPSPPKSSRSSSPSKSSRSSSPTKSLRSDSPTKSSRS NSPTKSLRSDSPTKSSRSNSPTKSLRSDSPTKSSRSNSPTKSLRSDSPTKSSRSASPTKSVRSESPTKSSRS SSPASTSSRQSRQTSGNVFSRLYPQQETKFNFLRKKSPPRYGEYDPHRERDRDSPHLIRRELSPVEHDPIKL KDKKREHVQDHKHVRTSSVPSSNLPDRTKQNLHSRSRSASPCVNNPVKPATRPKKSPVSQSSQETEKAVRRT KVKSKDTASAPLDRTQTDPGNEKQNNKCQTKTKQAQRTSPSESTHSSPTKRKLNKPENTSNAKTPKGSSASP RSRPIIGKAVDRSPSPRKKSEQTPSHRKTVKRPLSRSRNDKDGENTNSPSPRRKFARTPVVKPKNWPSIENL PKSTPKSEAFYVPLTSEQLRLALQKHASEQDSGPHLEGADLGASGCEQHWSPSRKRSIGKQPQQTQTDPIQA SIGGEIFGLEMDGVNNLENEEDHYSSSE\n
GLEAN3_27503	SPU_027503		SigPep-SRCR(3). Probably incomplete.\n
GLEAN3_23126	SPU_023126		e val for NP_001804 = 2e-101, and for Q02224=e-139; CENPE_HUMAN [Homo sapiens].   \nKinesin-7 family member.   \nSee also Glean3_17809 which also hits Q02224. \nCENPE_HUMAN data obtained from UniProtKB/Swiss-Prot entry Q02224.    \nAnnotation by RA Obar, RL Morris, SA Tower, KM Judkins\n
GLEAN3_12137	SPU_012137		GLEAN3_12137 coverage is limited to residues 441 (M) to 606 of 614 residue sequence used as a Query.\n
GLEAN3_01781	SPU_001781		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_08382	SPU_008382		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_09488	SPU_009488		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_27639	SPU_027639		This model was fused to a modified version of GLEAN3_27640. Please see GLEAN3_27640 for details.\n
GLEAN3_05799	SPU_005799		 fragment\n
GLEAN3_14112	SPU_014112		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_05343	SPU_005343		An incomplete ORC6 sequence, missing exon 3 is also present in GLEAN_09374 (see comments on this Glean) \n
GLEAN3_11491	SPU_011491		Partial MCM2 sequence likely due to inappropriate contig fusion. View GLEAN_06096 for full length sequence.\n
GLEAN3_25077	SPU_025077		Genscan model may be more accurate. \nDomains: DEATH, NACHT, LRRs \n
GLEAN3_02921	SPU_002921		Very similar to GLEAN3_02923; looks like local duplication.\n
GLEAN3_13952	SPU_013952		#\nDomains: DEATH, NACHT, LRRs.\n
GLEAN3_02272	SPU_002272		Domains: NACHT, LRRs.  \nThis gene model could be incomplete, missing the DEATH domain(s).\n
GLEAN3_11088	SPU_011088		Domains: NACHT, LRRs. \nThe Genscan model has additional exons that code for a DEATH domain. This gene model is on a short scaffold and could be incomplete, missing LRRs.\n
GLEAN3_26320	SPU_026320		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_06815	SPU_006815		NOTE: Based on sequence, alignment and domain analysis--This look like the 3' end of a single OPA gene that is composed of GLEANS 09807 (5'end)  and 06815 (3'end), with no gaps between.\n
GLEAN3_09673	SPU_009673		NOTE: No signal in tiling array and no EST.  May be pseudogene or adult expressed.\n
GLEAN3_12239	SPU_012239		homolog of Cernunnos (protein) and Xlf (gene)\n
GLEAN3_20692	SPU_020692		This is the 5'end end of the gene.  the rest is located in GLEAN3_07086\n
GLEAN3_16113	SPU_016113		 missing some N-terminus residues\n
GLEAN3_25888	SPU_025888		The gene model C-term is likely incorrect.\n
GLEAN3_24010	SPU_024010		this may be a gene fragment\n
GLEAN3_24199	SPU_024199		#\nMay have an extra exon at the end.\n
GLEAN3_21755	SPU_021755		Duplicate prediction for GLEAN3_00408\n
GLEAN3_12849	SPU_012849		GLEAN3_12849 is a partial duplicate prediction for GLEAN3_09023.\n
GLEAN3_03696	SPU_003696		Missing ~150 AA at beginning.\n
GLEAN3_26065	SPU_026065		GLEAN3_26065 is a partial duplicate prediction for GLEAN3_10328.\n
GLEAN3_12417	SPU_012417		This Glean is part of the annotated full length Sp_DNAH2 gene (SPU_030224)\n
GLEAN3_14179	SPU_014179		Likely missing an exon.\n
GLEAN3_14882	SPU_014882		Incorrect gene model. Extra exon in middle?\n
GLEAN3_17682	SPU_017682		Inspection of the tiling array suggests that glean may have missed the following exons: LSCLVFFKLKDDPPDMPDMGFPLSSHDSADQSPLDTALSVSAMLVETGSDHNSDSDFMTGDGVIPGDMVSGFRESTINLQDLE,PPPLPPPPPKTLRRMPSHRLQLHPSPAHPSSRLSPCSLPAPGDYPYPLPWPRPSSPSSHRIHPPTPPHPPSSSPHLHWYHRHRR,DGCHPTGYSCTRLQRIPPLVFLRAHCQLRGTIHTPFLGHARPLHPPTEFILQPLRTLHHPPLIFTGIIVIVGEGVATSICSPYHHHHGHALDHAQGVPGEQEWLGKDRLSGGNCHTSAASRCSGALEEDR,INLTTTTTTTTTKDSETDAIPPVTAAPVSSASLLSSFSVLTASSGGLSIPPSLATPVLSILPPNSSSNPSAPSIILPSSSLVSSSSSVKESLPVSAPPTTTTTVTPSTTPKVYLENRSGSARTDYRAVTATHPLPLDALERLKKTGNY\n
GLEAN3_01634	SPU_001634		motor domain\n
GLEAN3_10009	SPU_010009		Incorrect gene model. May be a mix of two separate genes.\n
GLEAN3_18683	SPU_018683		gal_lec, gal_lec\n
GLEAN3_14701	SPU_014701		Dynein heavy chain, N-terminal region 1. Dynein heavy chains interact with other heavy chains to form dimers, and with intermediate chain-light chain complexes to form a basal cargo binding unit. The region featured in this family includes the sequences implicated in mediating these interactions. It is thought to be flexible and not to adopt a rigid conformation.\n
GLEAN3_10593	SPU_010593	none	This genes spans three GLEAN predictions: GLEAN3_10593 + GLEAN3_13086 + GLEAN3_03874 \nThe GLEAN3_13086 and GLEAN3_03874 predictions are overlapping (exons 3 and 4); gene duplication probably due to assembly \n3'UTR of this gene is missing \n \nExon \tStart \tStop \tScaffold \n1\t24676\t24229\t23709 \n1?\t2159\t1712\t12273 \n2\t13236\t13078\t23709 \n3\t95271\t95968\t1679 \n3?\t5804\t5107\t38156 \n4\t96341\t96594\t1679 \n4?\t4742\t4489\t38156 \n5\t106938\t107067\t1679 \n6\t109648\t109828\t1679 \n7\t110254\t110486? 1679\t3'UTR missing \nDatabase version 2005/07/18\n
GLEAN3_05979	SPU_005979	none	This gene spans two GLEAN predictions: GLEAN3_05979 + GLEAN3_03084 \n \nExon \tStart \tStop \tScaffold \n1\t58051\t57408\t1179 \n1?\t54625\t55052?  21642\tincomplete \n2\t19833\t19675\t1179 \n3\t16229\t16386\t1201 \n4\t17471\t17710\t1201 \n5\t18192\t18357\t1201 \n6\t31949\t32079\t1201 \n7\t40928\t41191\t1201 \n8\t41532\t41731\t1201 \n9\t42426\t42601\t1201 \n10\t43112\t44008\t1201 \nDatabase version 2005/07/18\n
GLEAN3_03084	SPU_003084		This gene spans two GLEAN predictions: GLEAN3_05979 + GLEAN3_03084 \n \nExon \tStart \tStop \tScaffold \n1\t58051\t57408\t1179 \n1?\t54625\t55052?  21642\tincomplete \n2\t19833\t19675\t1179 \n3\t16229\t16386\t1201 \n4\t17471\t17710\t1201 \n5\t18192\t18357\t1201 \n6\t31949\t32079\t1201 \n7\t40928\t41191\t1201 \n8\t41532\t41731\t1201 \n9\t42426\t42601\t1201 \n10\t43112\t44008\t1201 \nDatabase version 2005/07/18\n
GLEAN3_13015	SPU_013015		this is one haplotype the other is GLEAN3_20451\n
GLEAN3_23976	SPU_023976		Looks to be an allele of Glean3_04037.\n
GLEAN3_28275	SPU_028275		This gene spans two GLEAN predictions: GLEAN3_02473 + GLEAN3_28275 \nThe GLEAN3_02473 prediction appears to include two different genes, a helicase/ zinc finger protein and the first two Sp-osteonectin exons \n \nExon \tStart \tStop \tScaffold \n1\t27772\t27363\t56445 \n2       17533   17392   56445 \n3       19999   20241   2764 \n4       21405   21615   2764 \n5       23776   23924   2764 \n6       26489   26684   2764 \nDatabase version 2005/07/18\n
GLEAN3_01893	SPU_001893		This model was identified in part on the basis of PFAM Lectin_C domains.  It has alternating Lectin_c and Fn3 domains.\n
GLEAN3_09885	SPU_009885		THis gene was identified partially on the basis of the PFAM domains included in the model\n
GLEAN3_03825	SPU_003825		added 3'UTR based on EST evidence\n
GLEAN3_14594	SPU_014594		5' and 3' UTRs added based on EST evidence \n \nThis and paralog GLEAN3_22537 are urchin-specific homologs of metazoan 14-3-3 proteins. It is likely that the ancestral metazoan had 2 14-3-3 proteins. One epsilon ortholog (GLEAN3_03825) and one other. This other has undergone differential expansion in vertebrates, nematodes, insects and echinoderms.\n
GLEAN3_22537	SPU_022537		5' and 3' UTRs added based on EST evidence \n \nThis and paralog GLEAN3_22537 are urchin-specific homologs of metazoan 14-3-3 proteins. It is likely that the ancestral metazoan had 2 14-3-3 proteins. One epsilon ortholog (GLEAN3_03825) and one other. This other has undergone differential expansion in vertebrates, nematodes, insects and echinoderms.\n
GLEAN3_06917	SPU_006917		Note that the 3' end of this gene (encoding the C-terminus of the protein) is contained on Scaffold118064, and that there is some overlap between the sequence at the 5' end of that scaffold and the 3' end of Scaffold874, which contains the 5' end of the gene (encoding the N-terminus).\n
GLEAN3_07852	SPU_007852		The existence of this second Runx gene was predicted based on  Southern genomic blot analysis (Coffman et al., Dev. Biol. 174 (1), 43-54 (1996)).  No evidence for expression in the embryo.\n
GLEAN3_23469	SPU_023469		Partial gene model. Single exon in the scaffold matches part of GLEAN3_01638. Possible assembly problem\n
GLEAN3_10876	SPU_010876		Based on comparison with the cloned orthologue from Lytechinus pictus (LpPKC1, acc. no. U02967), this Glean3 prediction encodes the C-terminal half of the protein (beginning with exon 8), corresponding to nucleotides 1108-2177 of the LpPKC1 cDNA (note that the full length CDS entered here is that of the Sp homologue, which was cloned by RT-PCR in our lab using primers based on manual assembly of the gene using EST sequences, genomic trace sequences, and LpPKC1 as a scaffold; this sequence has not yet been deposited at NCBI).  There appears to be missing sequence corresponding to at least one exon, between exon 9 and the C-terminal two exons; some of this falls on a very short scaffold (Scaffold131377). The sequence encoding the N-terminal half (exons 1-7) is contained on Scaffold161 (in gene models Glean3_01048 and Glean3_01049), corresponding to nucleotides 1-1107 of LpPKC1.\n
GLEAN3_13522	SPU_013522		There is a missing chunk of amino acids 1-125 that is probably most likely on another Scaffod and 312-382 That is probably lost in sequence of  \nNNNNNNN\n
GLEAN3_07485	SPU_007485		3' utr extended information added from the spline est data\n
GLEAN3_13414	SPU_013414		the C-terminus of this known gene has its own Glean3 number and prediction (Glean3_14515); combined predictions were annotated in this entry\n
GLEAN3_14515	SPU_014515		this glean model is a c-terminus of another glean-defined model (gleas3_13414); all information will be entered with that glean annotation\n
GLEAN3_25428	SPU_025428		EST data from cleavage-blastula\n
GLEAN3_14131	SPU_014131		this is clearly an ortholog of LvNotch along its entire length.  Therefore the expression patterns which are known for that species are shown in the embryo expression series\n
GLEAN3_03898	SPU_003898	For this particular GLEAN model, there was no Cds information available from both Baylor annotations and SpBase. However, after reviewing the data from the excel file, it appears that the sequence is distributed onto 3 different scaffolds. If the three different scaffolds were combined, the sequence would have an orderly continuous arrangement without any repeats or gaps present.\nAdditional gene information from Baylor annotations (comments):\nPROBLEM: in the scaffold200, where this prediction resides, regions containing exons 4 and 5 are duplicated: potential assembly problem!!! this messed up the original prediction quite badly.	PROBLEM: in the scaffold200, where this prediction resides, regions containing exons 4 and 5 are duplicated: potential assembly problem!!! this messed up the original prediction quite badly.\n
GLEAN3_26277	SPU_026277		Reference: \nFerkowicz,M.J., Stander,M.C. and Raff,R.A. \nPhylogenetic relationships and developmental expression of three sea urchin Wnt genes \nMol. Biol. Evol. 15 (7), 809-819 (1998)\n
GLEAN3_11756	SPU_011756		Reference: \nFerkowicz,M.J., Stander,M.C. and Raff,R.A. \nPhylogenetic relationships and developmental expression of three sea urchin Wnt genes \nMol. Biol. Evol. 15 (7), 809-819 (1998)\n
GLEAN3_26371	SPU_026371		Exon 9-13 of this gene are located on another scaffold (scaffold 73533 with GLEAN_09526 gene model). \n
GLEAN3_24792	SPU_024792		potential duplication of the scaffold region, Glean3_24793 prediction has exactly 100% identical exons\n
GLEAN3_24793	SPU_024793		the prediction is incomplete; another prediction on the same scaffold is longer and contains the regions within this glean as if duplicated\n
GLEAN3_10040	SPU_010040		the Glean only described the C-terminal part of the gene; \nhuman ortholog Accession Number is NP_892023\n
GLEAN3_20371	SPU_020371		Reference: \nWikramanayake,A.H., Peterson,R., Chen,J., Huang,L., Bince,J.M., McClay,D.R. and Klein,W.H. \nNuclear beta-catenin-dependent Wnt8 signaling in vegetal cells of the early sea urchin embryo regulates gastrulation and differentiation of endoderm and mesodermal cell lineages \nGenesis 39 (3), 194-205 (2004)\n
GLEAN3_23952	SPU_023952		The 3' end of the Guisti et al. mRNA sequence is not part of any known Abl protein.  It is probably UTR, but it has specific hits on two Scaffolds, # 41 and 1211. \n
GLEAN3_10021	SPU_010021		Histidine Decarboxylase\n
GLEAN3_03345	SPU_003345		See putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137536970-15530-214731484586.BLASTQ1\n
GLEAN3_19261	SPU_019261		This is a partial sequence.  It appears to be the N-terminal region of Fmi2.  Another glean sequence, Glean_09215, contains this region plus the C-terminal sequence either of this gene or a duplicate form of this gene.  \n
GLEAN3_09526	SPU_009526		Exon 1-8 of this gene are located on another scaffold (scaffold 72753 with GLEAN3_26371 gene model). \nPlease refer to GLEAN3_26371 for complete modified gene model.\n
GLEAN3_12374	SPU_012374		Missing 5' end, no start site \n
GLEAN3_18391	SPU_018391		5' partial  \nGLEAN_18392 belongs to 3' end of SpTbx20\n
GLEAN3_20345	SPU_020345		5' partial \nGLEAN_20346 belongs to 3' end of SpTbx6/16 \n
GLEAN3_24946	SPU_024946		5' partial  \nGLEAN3_24947 belongs to 3' end of SpWntA\n
GLEAN3_15341	SPU_015341		POSSIBLE DUPLICATES glean3_12295, 03996.\n
GLEAN3_12294	SPU_012294		contains part of a peptidiase M16 inactive domain.  looks like part of an insulin degrading enzyme\n
GLEAN3_23463	SPU_023463		5' partial \nGLEAN3_23065& 24669 (identical duplicated seq) belongs to 3' end of SpWnt4  \n \nReference: \nFerkowicz,M.J., Stander,M.C. and Raff,R.A. \nPhylogenetic relationships and developmental expression of threesea urchin Wnt genes \nMol. Biol. Evol. 15 (7), 809-819 (1998)\n
GLEAN3_26099	SPU_026099		Identical to cDNA cloned in our lab\n
GLEAN3_17534	SPU_017534		This gene is on two scaffolds (441 and 84105). On scaffold441, two GLEAN models are predicted for this gene (GLEAN_17533 for exon 1-8 and GLEAN_17534 for exon 9-16). On scafold 84105, there is one GLEAN model (GLEAN3_17719 for exon 17-22) prediceted for this gene. \nPlease refer to GLEAN_17533 for refined gene features.\n
GLEAN3_17533	SPU_017533		This gene is on two scaffolds (441 and 84105). On scaffold441, two GLEAN models are predicted for this gene (GLEAN_17533 for exon 1-8 and GLEAN_17534 for exon 9-16). On scafold 84105, there is one GLEAN model (GLEAN3_17719 for exon 17-22) prediceted for this gene.\n
GLEAN3_19715	SPU_019715		Amino Acids 46-369 of Sp-Mapk match with part of glean3_19715 prediction. Amino acides 1-45 of Sp-Mapk match with supertig69362_2 on scaffold 69362\n
GLEAN3_17719	SPU_017719		This gene is on two scaffolds (441 and 84105). On scaffold441, two GLEAN models are predicted for this gene (GLEAN_17533 for exon 1-8 and GLEAN_17534 for exon 9-16). On scafold 84105, there is one GLEAN model (GLEAN3_17719 for exon 17-22) prediceted for this gene. \nPlease refer to GLEAN_17533 for refined gene features.\n
GLEAN3_17790	SPU_017790		Highly homologous to a cDNA cloned in Sphaerechinus granularis (AJ841701)\n
GLEAN3_19369	SPU_019369		Near-complete annotation for Rendezvin.  Also includes sequences from Scaffolds 102292 and 92747 (GLEAN3_27023).\n
GLEAN3_00668	SPU_000668		the predicted protein sequence bears 2 AA substitution (107H->N - 203L->V) compared to S.Purp_Univin previously published (AAA57553). \nThis might reflect polymorphism. \n
GLEAN3_24859	SPU_024859		#\npartial prediction corresponding to the N-terminal end of eIF4G. In this scaffold, exon 21 stops at position 39027 (instead of  39057), and  exon 22 (not predicted) spans nt 39579 to the end of the scaffold. GLEAN3_19064 corresponds to the C-terminus counterpart,there is a stretch of Ns between exon 1 and 2 of this prediction, these exons correspond respectively to exons 23 and 25, the missing exon (#24) is in scaffold155969 (=glean3_12286). Exon 17 is duplicated (scaffold137842, glean3_06085).  \nThe modified gene model is entered in the gene features and is highly homologous to a Sphaerechinus granularis cDNA. \n
GLEAN3_20678	SPU_020678		I do not have any experimental evidence, but based on alignments with human and mouse Nanos1 orthologs I feel this GLEAN prediction may not be correct. I predict that the gene is only one exon and has the following sequence: \n \nATGGAGACATCTTCTTGGGATCTTTTCATGGGGAAAGGGTTGAACCTCAGTGAGATCATTTCTTCGACAAGCTGGAAAACTCCTCCAACCATGGCCATGCCACAACATTCACCAGCGATGTGGCCATCATCTCCGTGCCCATCGCCGCCTATGTCTCCATGGCCAGCTTTATCTCCCCCTATGTCTCCATGGCCAGCTCTATCTCCCTCAAGCACCGTACCACCATCAGCTTCACCACCACCATCAGCATCATCATCGCCGCATGAAGATGAGTTGATATTTCGATCCAGCTTTACCGACACCCTATCTGTCTCTTATGAGAAGAAGCGATACCTCAACACTTACTGCGTGTTCTGTAAGAACAACAAAGAAACTCTTTGCTTCTACAGCTCTCATGTCCTGAAGGATGATTTGGGGAACGTTCAATGTCCTGTTCTTAGGGCTTACAAGTGTCCTATTTGTGGGGCGAAGGGTGATAATGCGCACACCGTCAAGTATTGTCCTCAAAATTCCAGTTCATCAAAAGCCGAGAAGCTGACCAAATCATCAGGTTGCTGGTCGGATTACCCATCACCCCCGGGATTTTTTTAA\n
GLEAN3_26559	SPU_026559		I don't have any experimental evidence, but this gene may not be annotated correctly. There is a large insertion in the middle of the protein that is not found in the human, mouse, fly, or worm orthologs.\n
GLEAN3_27826	SPU_027826		Tiling microarray data predicts additional expressed tag between exons 3 and 4, but it's not present in the est for this same gene. This gene is homologous to the isoform 5 of mammalian Gbeta, but since urchin has only 1 other isoform (homologous to Gbeta1-4, and named "a"), this one is "b".\n
GLEAN3_16157	SPU_016157		This gene spans two GLEAN predictions: GLEAN3_18353 + GLEAN3_16157 \nThe GLEAN3_16157 prediction is contained within GLEAN3_03874 (exons 3 and 4); gene duplication probably due to assembly and/or haplotype \n \nExon \tStart \tStop \tScaffold \n1\t48765\t48989\t773 \n2       51585   51683   773 \n3       61327   61444   773 \n3?      41672   41789   85877 \n4       62157   62392   773 \n4?      42315   42550   85877 \n5       63630   63768   773 \n6       64653   64771   773 \n7       65224   65337   773 \n8       65903   66014   773 \n9       66514   66719   773 \n10      67327   67588   773 \n11      68104   68246   773 \n12      68855   68980   773 \n13\t71135   71325   773\t3'UTR missing \nDatabase version 2005/07/18\n
GLEAN3_15335	SPU_015335		Deleted 5' exon (16,057-16,235), not present in known cDNAs.\n
GLEAN3_03874	SPU_003874		This genes spans three GLEAN predictions: GLEAN3_10593 + GLEAN3_13086 + GLEAN3_03874 \nThe GLEAN3_13086 and GLEAN3_03874 predictions are overlapping (exons 3 and 4); gene duplication probably due to assembly \n3'UTR of this gene is missing \n \nExon \tStart \tStop \tScaffold \n1\t24676\t24229\t23709 \n1?\t2159\t1712\t12273 \n2\t13236\t13078\t23709 \n3\t95271\t95968\t1679 \n3?\t5804\t5107\t38156 \n4\t96341\t96594\t1679 \n4?\t4742\t4489\t38156 \n5\t106938\t107067\t1679 \n6\t109648\t109828\t1679 \n7\t110254\t110486? 1679\t3'UTR missing \nDatabase version 2005/07/18\n
GLEAN3_13086	SPU_013086		This genes spans three GLEAN predictions: GLEAN3_10593 + GLEAN3_13086 + GLEAN3_03874 \nThe GLEAN3_13086 and GLEAN3_03874 predictions are overlapping (exons 3 and 4); gene duplication probably due to assembly \n3'UTR of this gene is missing \n \nExon \tStart \tStop \tScaffold \n1\t24676\t24229\t23709 \n1?\t2159\t1712\t12273 \n2\t13236\t13078\t23709 \n3\t95271\t95968\t1679 \n3?\t5804\t5107\t38156 \n4\t96341\t96594\t1679 \n4?\t4742\t4489\t38156 \n5\t106938\t107067\t1679 \n6\t109648\t109828\t1679 \n7\t110254\t110486? 1679\t3'UTR missing \nDatabase version 2005/07/18\n
GLEAN3_18505	SPU_018505		This is just a C-terminus of expected Gbeta cDNA. Since it looks like mammalian betas 1-4, it is designated A. Number 3 refers to the fact that it is a C-terminal piece of 3 pieces found. 3'UTR might extend until nt 8,000 by chip data.\n
GLEAN3_00199	SPU_000199		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_21816	SPU_021816		Added 3' exon (7107-7167) and modified next exon (8251-8350)to agree with known cDNA.\n
GLEAN3_08526	SPU_008526		Middle part of the urchin GbetaA (hence A2). Further sequence located on scaffold59878, and "predicted" as Glean3_18505. Two N-terminal exons exactly match sequences from another scaffold27704, which contains more N-terminal regions of what I think is the same gene. Prediction is modified to match est data.\n
GLEAN3_17106	SPU_017106		Part of the cDNA sequence (AY130972 ) could not be found anywhere in the genome. Therefore, accept gene model prediction for exon 2. \n \nExon 1-3 of this gene are on Scaffold 87957 (GLEAN3_17106). Exon 3-7 are on Scaffold 107218 (GLEAN3_25601). The two scaffolds have exon 3 and flanking sequences overlap and can be aligned quite well (e=0). Therefore, the two scaffold might be assembled as one scaffold. The modified gene model is composed of Exon 1-3 from Scaffold 87957 and exon 4-7 from scaffold 107218. 3' UTR is extended based on the EST data. \n \nExon 5-7 are also on Scaffold 55268 (GLEAN3_05718). This might be another allele of this gene.\n
GLEAN3_08194	SPU_008194		daz homolog\n
GLEAN3_18353	SPU_018353		This gene spans two GLEAN predictions: GLEAN3_18353 + GLEAN3_16157 \nThe GLEAN3_16157 prediction is contained within GLEAN3_03874 (exons 3 and 4); gene duplication probably due to assembly and/or haplotype \n \nExon \tStart \tStop \tScaffold \n1\t48765\t48989\t773 \n2       51585   51683   773 \n3       61327   61444   773 \n3?      41672   41789   85877 \n4       62157   62392   773 \n4?      42315   42550   85877 \n5       63630   63768   773 \n6       64653   64771   773 \n7       65224   65337   773 \n8       65903   66014   773 \n9       66514   66719   773 \n10      67327   67588   773 \n11      68104   68246   773 \n12      68855   68980   773 \n13\t71135   71325   773\t3'UTR missing \nDatabase version 2005/07/18\n
GLEAN3_14573	SPU_014573		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 43.8% (aa level).\n
GLEAN3_12253	SPU_012253		5' and 3' UTR are extended based on EST and expression data.\n
GLEAN3_00436	SPU_000436		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 40.8% (aa level).\n
GLEAN3_21555	SPU_021555		This gene encodes a precursor for seven putative SALMFamide neuropeptides (see Elphick & Thorndyke, 2005, J. Exp. Biol. 208, 4273-4282.) \nGLEAN predicts 3 exons. However, the second exon encodes a signal peptide (see above paper), which is always located at the N-terminus of neuropeptide precursors. Moreover, the tiling data do not show up a signal for exon 1 of the GLEAN prediction. Therefore, I think it is likely that the GLEAN prediction is wrong and should be changed so that the predicted CDS is derived only from the 2nd and 3rd exons of the GLEAN prediction. The tiling data also show signals for sequences located 5' and 3' to the two CDS exons; these may correspond to UTRs but this needs to be confirmed by EST/cDNA sequencing.\n
GLEAN3_15450	SPU_015450		orb-like, similar to cytoplasmic polyadenylation binding protein 1\n
GLEAN3_07822	SPU_007822		most of the N-terminal part of the glean prediction seems inaccurate.\n
GLEAN3_00615	SPU_000615		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(15 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_00911	SPU_000911		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IA. \n
GLEAN3_02442	SPU_002442		Intronless Toll-like receptor with predicted secretory signal peptide, LRR(22), LRR-CT, TM and TIR. This is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_02538	SPU_002538		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_04139	SPU_004139		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_04150	SPU_004150		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IA. \n
GLEAN3_04311	SPU_004311		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_04360	SPU_004360		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_05088	SPU_005088		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_27278	SPU_027278		Amino Acid number 41 predicted from cloned cDNA is an alanine. Glean3 model predicts a T. \n \nGlean3_12529 predicts the same ORF\n
GLEAN3_05950	SPU_005950		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IA.  \n
GLEAN3_06164	SPU_006164		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(23), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IB.  \n
GLEAN3_06458	SPU_006458		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_07790	SPU_007790		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(10 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_08278	SPU_008278		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(13 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_08396	SPU_008396		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(10 to 21), LRR-CT, TM and TIR.  \n
GLEAN3_08456	SPU_008456		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(16 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_04599	SPU_004599		In the absence of ESTs or cDNA sequences from purpuratus, the 5 and 3' UTR regions have not been described here. \n
GLEAN3_08962	SPU_008962		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IB.  \n
GLEAN3_08963	SPU_008963		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_09037	SPU_009037		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_20677	SPU_020677		the first exons are wrong predicted. starting with the third exon the prediction is correct. \nuse the accession number NP_999702.1 for the full lenght cDNA\n
GLEAN3_09173	SPU_009173		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_09435	SPU_009435		#\nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(13 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_10575	SPU_010575		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_10695	SPU_010695		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IA. \n
GLEAN3_11537	SPU_011537		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_06988	SPU_006988		This gene was annotated based on a manual revision of multiple protein sequence alignments. \nThe predicted N-terminal SH2 domains align best with vertebrate Syk genes, whereas the predicted C-terminal tyrosine kinase domain aligns best with various vertebrate ZAP-70 tyrosine kinases. \n \nThere is some extra exons present in the NCBI prediction (XM_793943.1), however Glean3_06988 shows a better pairwise alignment to murine Syk. There is still a few of gaps in such alignment, which suggest there might be excess sequence in this glean model. \n
GLEAN3_12257	SPU_012257		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(14 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_13470	SPU_013470		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(17 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_13824	SPU_013824		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(16 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_14041	SPU_014041		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IB.   \n
GLEAN3_14073	SPU_014073		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(15 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_14266	SPU_014266		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_15066	SPU_015066		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_05171	SPU_005171		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_02634	SPU_002634		This GLEAN prediction only corresponds to the N-terminus of SpHox7. The homeodomain and C-terminus of this protein are predicted as GLEAN3_05170. \nIn fact Glean3_05170 contains the 2nd exon of the gene plus a misspreddicted miniexon.\n
GLEAN3_15303	SPU_015303		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(8 to 21), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_16457	SPU_016457		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(10 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_27685	SPU_027685	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. When comparing the BLAST results with the excel data it appears that there are different base pair results in the BLAST sequence than was provided in the excel data. The excel data begins at 39 and ends at 443, while the BLAST sequence begins at 1 and ends at 1329. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with scores less than 5.	This gene shows significant similarity to vertebrate SH2-B family members. However, pairwise alignments also suggest there is additional sequence missing from this model. A better alignment to vertebrate SH2-B proteins can be reconstructed if two Fgenesh++ models located in two separate scaffolds are joined. The sequence continuity of these scaffolds is supported by genomic sequence alignments (scaffold83718-to-scaffold53583-to-scaffold1921). A putative N-ter region of this gene is still missing.\n
GLEAN3_19309	SPU_019309		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IA. \n
GLEAN3_21420	SPU_021420		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(15 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_22911	SPU_022911		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_00388	SPU_000388		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_23035	SPU_023035		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(14 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_23321	SPU_023321		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n \n
GLEAN3_27600	SPU_027600		cDNA sequence from another individual has been submitted to genebank (DQ082723).  The sequence differs from FgenesH prediction in simple sequence repeat region.    \n
GLEAN3_24062	SPU_024062		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_24204	SPU_024204		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_24205	SPU_024205		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IC.  \n
GLEAN3_00669	SPU_000669		extra exon on 5'end... \nunclear duplication GLEAN_21497\n
GLEAN3_24208	SPU_024208		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_24385	SPU_024385		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_24429	SPU_024429		#\nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_24731	SPU_024731		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_24733	SPU_024733		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(13 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_24868	SPU_024868		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(24), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IB.  \n
GLEAN3_26200	SPU_026200		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(16 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_27162	SPU_027162		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(15 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_27164	SPU_027164		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(12 to 21), LRR-CT, TM and TIR.  \n
GLEAN3_28639	SPU_028639		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 21), LRR-CT, TM and TIR.  \n
GLEAN3_28893	SPU_028893		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group ID.  \n
GLEAN3_25646	SPU_025646		Partial sequence.  DIX domain missing and some deletions (in comparison to the Lytechinus variegatus form).  There appears to be a mistake in sequencing.  Individuals who have sequenced this gene in S. purpuratus did not note these deletions.\n
GLEAN3_13536	SPU_013536		Initial Glean3 model is missing exons 681-782 and 2024-2126 and has an incorrect, low complexity exon 2154-2787 as exon 1.  The 5' end of this cDNA is in GLEAN3_09788(Scaffold 1841)\n
GLEAN3_13810	SPU_013810		This gene was annotated based on a manual revision of multiple protein sequence alignments. \n \nNote: there are slight differences between the Glean and other predictions. None of them shows a significantly improved alignment to Pptn11.\n
GLEAN3_25612	SPU_025612		Encodes the C-terminus of Glean3_06917 (see annotation to that gene).\n
GLEAN3_01048	SPU_001048		This is a gene that falls on two different scaffolds.  See Glean3_10876 for full annotation.\n
GLEAN3_01049	SPU_001049	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. There was however mRNA information available from SpBase. There was Est support from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	This is part of a gene that is contained on multiple scaffolds.  See Glean3_10876 for full annotation.\n
GLEAN3_09788	SPU_009788		This is the 5' end of the Sp-Alpha P subunit the completed annotation is on GLEAN3_13536 \n
GLEAN3_21303	SPU_021303		#\nActual Exon number 6 was missing in GLEAN3 prediction.\n
GLEAN3_03911	SPU_003911		This gene was fused to an adjacent glean model (GLEAN3_03912) to obtain a full sequence that best aligns with vertebrate IL1AP (the last exon in the original version of this model was removed from the modified model). The DNA and protein sequences corresponding to the modified model are provided.\n
GLEAN3_00428	SPU_000428		56 nucleotides encoding a predicted signal peptide were added at the 5'end of the GLEAN3 model by comparison to the corresponding FgeneshAB prediction.  \n \nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(19 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_01877	SPU_001877		1381 nucleotides encoding a predicted signal peptide, LRRNT, LRRs were added at the 5'end of the GLEAN3 model by comparison to the corresonding FgeneshAB prediction.  \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_03912	SPU_003912		A modified version of this model was fused to its adjacent glean model (GLEAN3_03911) based on manually revised sequence alignments to vertebrate IL1AP sequences. \n \nSee GLEAN3_03911 for more details, features and modified sequence.\n
GLEAN3_21516	SPU_021516		Deleted 5' exon (6960-6973) not represented in known cDNAs. GLEAN3_21516 is missing N-terminal half of amino acid coding sequence reported in AF061750, AF036902.\n
GLEAN3_10245	SPU_010245		This annotation is based on a manual revision of multiple protein sequence alignments. \n \nNote that the protein encoded by this gene is shorter than that of its vertebrate homologs, which indicates there might be some N-terminal sequence missing from this model. \nSimilar predictions do not add a significant amount of sequence, nor do they improve the protein alignment to vertebrate SOCS7.\n
GLEAN3_11298	SPU_011298		In a neighbor-joining tree based on multiple sequence alignments with vertebrate and fruit fly SOCS-related sequences, this gene does not co-group with any distinct vertebrate homolog. Because its sister group includes hSOCS6 and hSOCS7, and because the most significant Blast hit is to hSOCS6 is that we named this gene Sp-SOCS6-like; but it should be noted that this name only reflects its closest similarity to vertebrate SOCS6, and should not be taken as to reflect true orthology. \n \nThere are slightly different predictions for this gene, but none of them significantly improved its protein  alignment to vertebrate SOCS genes.\n
GLEAN3_23261	SPU_023261		There is a missing Amino Acid 61-159 \nThat is probably lost in sequence of  \nNNNNNNN\n
GLEAN3_02792	SPU_002792		Even though the best Blast hit for this gene is to hSOCS2, we named GLEAN3_02792 Sp-Socs2/3 because it seems similarly related to both as indicated by neighbor joining trees made from various SOCS-related sequences (it typically co-distributes with a sister group that includes both SOCS2 and SOCS3 and no other vertebrate SOCS genes). \n \nThe embryonic expression of this gene is supported by tiling array data.\n
GLEAN3_20879	SPU_020879		Glean_20879 sequence incomplete in 5', completed with Glean_12710\n
GLEAN3_02224	SPU_002224		75 nucleotides encoding a predicted signal peptide were added at the 5'end of the GLEAN3 model by comparison to the corresponding FgeneshAB prediction.  \n \nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(14 to 24), LRR-CT, TM and TIR.  \n
GLEAN3_26496	SPU_026496		Even though the best Blast hit for this gene is to vertebrate SOCS5, we named GLEAN3_02792 Sp-Socs4/5 because it seems similarly related to both as indicated by neighbor joining trees made from various SOCS-related sequences (it typically co-distributes with a sister group that includes both SOCS2 and SOCS3 and no other vertebrate SOCS genes). \n \nBetter Blast hits to this gene were found, but they all correspond to predicted sequences in various genomes. The accession number provided is to murine SOCS5. \n \nNote that the protein encoded for this gene has a longer N-ter region than that encoded by its vertebrate counterparts. However, the embryonic tiling array data correlate well with the entire predicted coding exon, which argues against possible annotation mistakes.\n
GLEAN3_19743	SPU_019743		located in an intron of predicted GLEAN3_19742 (a cysteine protease)\n
GLEAN3_12710	SPU_012710		5'region of Glean_20879\n
GLEAN3_03578	SPU_003578		54 nucleotides encoding a predicted signal peptide were added at the 5'end of the GLEAN3 model by comparison to the corresponding FgeneshAB prediction.  \n \nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(12 to 23), LRR-CT, TM and TIR.  \n
GLEAN3_03579	SPU_003579		#\n54 nucleotides encoding a predicted signal peptide and 138 nucleotides encoding a part of TIR domain were added at the 5'end of the GLEAN3 model by comparison to the corresponding FgeneshAB prediction and BLASTN.  \nThis gene model may be a pseudogene. \n
GLEAN3_14243	SPU_014243		Predicted amino acid sequence from exon 1 (N-terminus) is not present in mammalian homologs.\n
GLEAN3_11539	SPU_011539		96 nucleotides encoding a predicted signal peptide were added at the 5'end of the GLEAN3 model by comparison to the corresonding FgeneshAB prediction.  \n \nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_22112	SPU_022112		Sp-CSK spans two scaffolds 40333 and begins on the very small 127652.  Somehow the N-terminus half of the predicted sequence has a 7-transmembrane domain that DOES NOT BELONG!!!!  The origional scaffolds, above, contain the correct sequence predicted.\n
GLEAN3_15533	SPU_015533		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. There are 50 unknown nucleotides (NNN) in the TIR domain. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_07343	SPU_007343		The alignment between this protein and murine MyD88 is very strong, except for the N-terminus of GLEAN_07343, which is notably longer that that of vertebrate MyD88 genes. There are several internal Methionines in this glean model, which could account for this seemingly "extra" N-terminus sequence. \n \nA partial duplication of this model was found in GLEAN3_07342.\n
GLEAN3_18394	SPU_018394		1) alignment with best blast hits suggest glean model contains complete coding sequence. \n \n2) There is an excellent, but short, match on scaffold76149_1, 1230 to 1344, which is not on the glean3 list.\n
GLEAN3_07342	SPU_007342		The protein coded by this sequence is identical to part of the sequence coded by an adjacent Glean model (GLEAN3_07343), and might represent either a haplotype or assembly-originated duplication.\n
GLEAN3_07004	SPU_007004		Actually two GLEAN3 predictions (GLEAN3_07004 & GLEAN3_17646) give the same "best genebank hit" (BMP11/GDF11_DanioRerio)  \nbut the 2 sequence divergence seems too high to be due to duplication. \nGLEAN3_17646 was then annotated as "Sp-BMP11b"\n
GLEAN3_17647	SPU_017647		Same best genbank hit than GLEAN3_07004 (GDF11/BMP11_DanioRerio). \nbut most likely not a duplication. \nArbitrarily called Sp-BMP11b \nto differenciate from Sp-BMP11 (GLEAN3_07004) \n \n
GLEAN3_17352	SPU_017352		#\nSee also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_17469	SPU_017469		A cap in this contig contains at least one exon for this gene. \n
GLEAN3_02874	SPU_002874		Amino Acids 1-97 of Sp-Ets1-2 match with part of glean3_04409 prediction on Scaffold1036. Amino acides  \n464-555 of Sp-Ets1-2 match with part of glean3_27053 prediction on Scaffold28371.\n
GLEAN3_05170	SPU_005170		-This GLEAN prediction covers only the C-terminal part (second exon) of the known protein. The N-terminal part (first exon) is in GLEAN3_02634 \n-One GLEAN? predicted miniexon has been deleted. It doesn't appear in the known cDNA sequence. \n \n-See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_02680	SPU_002680		ESTs cover nearly the whole gene. 5' and 3' UTRs not annotated.\n
GLEAN3_15378	SPU_015378		See GLEAN3_15377\n
GLEAN3_15379	SPU_015379		See GLEAN3_15377\n
GLEAN3_00547	SPU_000547		See GLEAN3_15377\n
GLEAN3_15377	SPU_015377		GLEAN3_15377, 15378, 15379, and 106060 combine to form one gene.    15377, part of 15378, and 15379 are contiguous and all blast as alpha integrins and GLEAN3_00547 is an overlapping sequence on a small scaffold that includes the last 4 exons of the gene.  The start of translation and first approximately 50 aa not included.\n
GLEAN3_09335	SPU_009335		Supported by 3 ESTs\n
GLEAN3_03514	SPU_003514	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. The first scaffold covers the first 400 bases or so, and then the second scaffold continues the rest of the sequence. If the two scaffolds were combined the sequence would have an orderly continuous arrangement (after discarding the other repeated scaffolds.) There was Est information available from GBrowse assembly V0.5 and the transcriptome score intensity appeared to be weak with values ranging from1.5-5.5. 	The CDS is much longer than RhoA orthologs.  It is possible that the first exon(s) are not real CDS and that the real start site begins at bp 439 (based on sequence homology)\n
GLEAN3_13815	SPU_013815		1st 3 exons are in GLEAN3_13814.  Junction between them is not correct.\n
GLEAN3_13814	SPU_013814		This is the first 3 exons of the gene.  The rest is on GLEAN3_13815.  That sequence was annotated to have the correct sequence, except that an exon joining the two GLEANs seems to be missing.\n
GLEAN3_11307	SPU_011307		Removed two incorrectly added exons.  One EST covers about 1 kb of the gene.\n
GLEAN3_12694	SPU_012694		Also could be considered an ortholog of ABCD2.  The 3' end may be wrong.  The NCBI prediction gives a different C-terminus, but there are no ESTs to verify the correct one.\n
GLEAN3_07530	SPU_007530		We have cloned Chk1 from S. pupruratus eggs and are in the process of modifying the model with the updated sequence. There appears to be another exon not identified in the model.\n
GLEAN3_12805	SPU_012805		This SPU is part of the same gene as SPU_019224 (SpFrk), by fusing the two SPUs and aligning with HsFrk.  The two SPU sequences have been cloned out of egg cDNA (see mRNA sequence).\n
GLEAN3_14051	SPU_014051		Only exons 4 and 5 are present on this scaffold1351. There is a huge gap of Ns (>100kb) where exons 1-3 could be located. However, exons 1-3 are present on scaffold21 GLEAN3_27234 but exons 4-5 are missing. \n
GLEAN3_24525	SPU_024525		cloned from egg cDNA \nGlean sequence accepted, however further validation needed for sequence not matching the cloned mRNA sequence.\n
GLEAN3_05957	SPU_005957		GLEAN3_05957 prediction contains 2 out of the 3 exons for Sp-4EBP. Exon 1 is located on Scaffold4246 (no GLEAN prediction). Coordinates 52018 to 54824 in GLEAN3_05957 prediction do not match 4EBP.  \nNew gene model as follows: \nscaffold 4246 strand +  start 54530 stop 54673 \nscaffold 111348 strand -  start 51046 stop 51210 \nscaffold 111348 strand -  start 49781 stop 49815 \nThe new gene model matches also a Sphaerechinus granularis partial cDNA (accession # AM161045).  \n
GLEAN3_00826	SPU_000826		#\nGlean3_00826 and glean3_00827 are very similar to previously cloned sea urchin SM30 genes.  A previously isolated genomic clone (Akasaka et al 1994, JBC 269: 20592-20598) indicates that glean3_00826 is most similar to the SM30-alpha gene. This is because of glean3_00826's intron size and its distance to the next downstream SM30 gene. Glean_00825, glean__00826, glean_00827, and glean_00828 encode SM30 like proteins and they are tandemly arranged on Scaffold25604. \n \nMatches c-type lectin domain (cd00037).\n
GLEAN3_00508	SPU_000508		Full gene of the GbetaA is annotated here. Further C-terminal sequences found in Glean3_08526 and Glean3_18505 are/will be added in. 2 (last) exons contained in this Glean match exactly with 2 (first) exons contained in Glean3_08526, so there might be a problem with contigs assembly. \nExon 5 might be alternatively spliced: it's present in some but not all ests.\n
GLEAN3_00827	SPU_000827		Glean3_00826 and glean3_00827 are very similar to cloned sea urchin SM30 genes.  A previously isolated genomic clone (Akasaka et al 1994, JBC 269: 20592-20598) indicates that glean3_00827 is most similar to the partially cloned SM30-beta gene. Glean3_00827's intron size and its distance to glean3_00826 supports this conclusion. Glean_00825, glean__00826, glean_00827, and glean_00828 encode SM30-like proteins and they are tandemly arranged on Scaffold25604. \n \nMatches c-type lectin domain (cd00037).\n
GLEAN3_00828	SPU_000828		Glean3_00828 is similar to previously cloned S. purpuratus SM30-alpha but to a lesser extent than glean3-00825,glean3-00826, and glean3_00827.  Transcriptome data indicates that this gene is expressed. Glean3_00825, glean3__00826, glean3_00827, and glean3_00828 encode SM30-like proteins that are tandemly arranged on Scaffold25604. \n \nMatches c-type lectin domain (cd00037). \n \nHad to alter gene sequence.  Now has two exons instead of four.\n
GLEAN3_18204	SPU_018204		See GLEAN3_23216. \n
GLEAN3_18500	SPU_018500		Pulling up same GLEAN3 for dynactin isoform 2 (p50).\n
GLEAN3_26748	SPU_026748		partial sequence \nThis incomplete model has been incorporated in Glean_3 24044 by Mariano Loza Coll (Toronto).\n
GLEAN3_19711	SPU_019711		See GLEAN3_19710. \n
GLEAN3_04021	SPU_004021		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nNote that the best Blast hit to this model corresponds to a vertebrate Lim-Hox2 gene. However, there is no detectable homeodomain in GLEAN3_04021. An overlapping, larger Fgenesh++ model was inspected for a homeodomain prediction, albeit unsuccessfully. Note, however, that this model is located on a scaffold with several gaps between contigs; therefore, a full LIM-Hox gene may exist in this region and it simply was not picked up by the predictions. \n \nUntil better evidence becomes available, we have decided to name this gene Sp-Lim-containing1.\n
GLEAN3_13569	SPU_013569		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThe best Blast hit for this model is vertebrate LMO2, which contains two LIM domains. This model, however, contains a single LIM domain, and is located in a scaffold that contains various gaps between contigs, for which it could as well represent an incomplete model. There are Fgenesh and Genscan models that, though slightly longer, are otherwise identical to GLEAN3_13569. The Genscan model codes for a protein that is almost the same length as LMO2, but that does not include additional LIM domains. For this reason, and until additional evidence becomes available, we have decided to name this gene Sp-Lmo2t (for "truncated").\n
GLEAN3_02633	SPU_002633		See also the paper, for the latest tree on Hox affinities:  \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_02631	SPU_002631		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_02632	SPU_002632		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_05169	SPU_005169		-THE GLEAN3 PREDICTION COVERS TWO FUSED GENES:SpHox5 and an Acetylcholinesterase. It is a known fact that these genes are very close (or fused?) in the sea urchin genome. \nI have seen the exon 2 of Hox genes (indicated in the gene model). The other exons are, most probably (given BLAST values), fragments of a Cholinesterase gene. \n \n-See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_21309	SPU_021309		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_27568	SPU_027568		See also the latest tree of Hox affinities: \n \nCameron, R.A., Rowen, L., Nesbitt, R., Bloom, S., Rast, J.P., Berney, K., Arenas-Mena, C., Martinez, P., Lucas, S., Richardson, P.M., Davidson, E.H., Peterson, K. J. and Hood, L. (2005) Unusual gene order and organization of the sea urchin HOX cluster. J. Exp. Zoology. In press.\n
GLEAN3_19586	SPU_019586		This model was annotated based on a manual inspection of multiple protein sequence alignments.\n
GLEAN3_24044	SPU_024044		This model was annotated and modified based on full length cDNA sequences from Courtney Smith and Peggy Stevens.  The gene was first isolated from a coelomocyte library (Smith et al. 1996. J. Immunol. 156:593).  QPCR analyses indicates embryonic expression in gastrula.  In situ indicates expression in SMC. \n \nThe original version of GLEAN3_24044 was incomplete (C-terminus missing). The rest of its sequence was found on a partly duplicated model (GLEAN3_26748) and annotated by Christian Gache (Villefranche-sur-Mer, France). Both models have now been fused in a modified version of GLEAN3_24044 that is supported by cDNA sequence.\n
GLEAN3_09178	SPU_009178		The same gene is found in three different glean3 models, 2 are haplotypes, and 1 contains another part of the gene. \n
GLEAN3_28698	SPU_028698		The same gene is found in three different glean3 models, 2 are haplotypes, and 1 contains another part of the gene. \n
GLEAN3_18813	SPU_018813		Aligns with S. purpuatus SM37. Lee et al (1999, Develop. Growth Differ 41: 303-312 PUB MED:10400392). Is on the same scaffold as SM50 which is consistant with Lee et al.'s findings that SM37 and SM50 are linked. \n \nMatches c-type lectin domain (cd00037). \n \n
GLEAN3_12979	SPU_012979		See GLEAN3_02875. \n
GLEAN3_26899	SPU_026899		This Glean has 74% identies on the nt. level with Mus Musculus DYRK2 mRNA (from 614 to 1687 out of 2165).\n
GLEAN3_00870	SPU_000870		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains frame shift.\n
GLEAN3_24900	SPU_024900		See GLEAN3_23252. \n
GLEAN3_22703	SPU_022703		See GLEAN3_08370.  N- and C-termini are shorter than those of other organism.  \n
GLEAN3_27462	SPU_027462		This is the full sequence of PLCg.  It is missing sequences for approx. 40 amino acids in the reigon of 176-217. \n \nThe other scaffolds are located on 464 (Glean 10275) and 801 (glean 06056), However these have been added to this annotation and contain only inclomplete modles.\n
GLEAN3_08370	SPU_008370		See GLEAN3_22703.  N- and C-termini of the prediction is shorter than those of other organism.  \n
GLEAN3_20904	SPU_020904		We pulled this partical cDNA from an Sp egg library.  It aligns with the first half of this Glean prediction.  Glean3_05654 is also a good BLAST hit.  \n
GLEAN3_07964	SPU_007964		This gene was annotated based on a curated analysis of alignments to known vertebrate genes. \n \nThe protein inhibitor of activated STAT (PIAS) family of proteins has been proposed to regulate the activity of many transcription factors, including STATs, and recent genetic studies support an in vivo function for PIAS proteins in the regulation of innate immune responses. \n \nThis gene model was modified at the time of annotation. One of its exons was removed from its original version, since its expression was not supported by the embryonic tiling array data, which otherwise strongly support the expression of every other exon in this model. \n \nOnce the exon was removed, the alignment of this model to its vertebrate counterpart was significantly improved. \n \nAlso note that unaccounted exons for this model might exist, also based on the tiling array data.\n
GLEAN3_05339	SPU_005339		The intron of this GLEAN3 model was modified to a coding region by comparison to the corresonding FgeneshAB prediction.  \n \nIntronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(16 to 23), LRR-CT, TM and TIR.    \n
GLEAN3_27144	SPU_027144		one of two, duplication. The other is _02836. GLEAN3_22717 is overlapping and mostly non-identical. This and _02836 are the internal sequences, while _06197 is the N-terminal sequence \n
GLEAN3_02836	SPU_002836		#\none of two, duplicate. Other is _27144. GLEAN3_22717 is a mostly non-identical overlapping duplicate. This and _27144 are internal sequence, while the N terminal part of this gene is in _06197\n
GLEAN3_11536	SPU_011536		Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 22), LRR-CT and TIR. Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete.\n
GLEAN3_09762	SPU_009762	After reviewing the data and performing a BLAST search, it appears that there is no good fit for this particular GLEAN model. There is poor sequence coverage and numerous gaps within the sequence. There was some Est support available from GBrowse V0.5. This is an un-annotated gene so no additional information (comments) was available from Baylor annotations. 	Proteins structure suggests that Strabismus proteins may be members of the Ltap protein family (Kibar, 2001).\n
GLEAN3_12078	SPU_012078		This annotation was done with alignments using A. pectinifera mRNA and peptide sequence.  There are a total of 10 scaffolds and 43 exons (predicted).  Of note, in Glean3_24561 (Scaffold101715) there are two predicted exons which are out of sequencial order on the scaffold, but do not overlap in their nt. sequence of the protein.  In Glean3_27674 (Scaffold102204) there are two predicted exons which are in sequential order, but have the same nt. sequence.  One was omitted.  Changes have been made to other Glean3 predictions in this annotation, but not in the origional glean3 predictions.  Other glean3 predictions of this protein were accepted, usually with little or no change.  They also all refer to this Glean3 prediction in the comments.  Glean3_12078 aligns with of ApIP3R from AAs 316-903.\n
GLEAN3_10424	SPU_010424		Alternate transcripts, beta-1 and beta-3: \nbeta-1 NCBI accession #: NM_001032368 \nbeta-3 NCBI accession #: NM_001032369 \n \nThere are 4 basepairs from mRNA missed between exon 1 and exon 2.\n
GLEAN3_22820	SPU_022820		A small fragment (168921 - 169128, 211bp) is highly identical to Sp-Soxb2 (320520 - 320730) on Scaffold467, GLEAN3_25113.\n
GLEAN3_28103	SPU_028103		Could be a continuation of the Sp-proteoliaisin (GLEAN3_PLN), based on high sequence identity to the CDS across the repetitive low-density lipoprotein repeat.  NOT CERTAIN, however, given the absence of the complete proteoliaisin cDNA.\n
GLEAN3_00985	SPU_000985		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \n
GLEAN3_24336	SPU_024336	For this particular GLEAN model there is no CDS information available in either the SpBase search engine or on the Baylor annotations page. mRNA information was unavailable as well.\nAdditional information: Joins with model_GLEAN3_24337.	Joins with GLEAN3_24337.\n
GLEAN3_24337	SPU_024337		Joins with other scaffolds.  Also note:  \n- exon 7 start is not defined because of unknown sequence (NNN...) \n- exon 14 is missing 1 bp after 29361 \n- exon 15 is missing 12 bp after 29180\n
GLEAN3_21062	SPU_021062		Encodes the final exons of the gene on GLEAN3_24337.\n
GLEAN3_25113	SPU_025113		A small fragment (320520 - 320730, 208bp) is highly identical to Sp-Soxb1 (168921 - 169128) on Scaffold732, GLEAN3_22820.\n
GLEAN3_01970	SPU_001970		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(6 to 10), LRR-CT, TM and TIR.\n
GLEAN3_01971	SPU_001971		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(7 to 10), LRR-CT, TM and TIR.\n
GLEAN3_03684	SPU_003684		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group I(orphan).\n
GLEAN3_04791	SPU_004791		605 bp intron was modified to a coding region by comparison to the corresponding FgenesAB prediction.\n
GLEAN3_26687	SPU_026687		>Spec2a protein sequence: \nMAVQLLFTEEEKALFKSSFKSEDTDGDGKITSEELRAAFKSIEIDLTQEKIDEMMGMVDK \nDGSKDMDFSEFLMRKAEQWRGREVQLTKAFVDLDKDHNGSLSPQELRTAMSACTDPPMTE \nKEIDAIIEKADCNGDGKICLEEFMKLIHSS \n \n>CDS of Spec2a gene \natggctgtcc aattattatt taccgaagag gaaaaagctt tattcaaaag ctccttcaaa        60 \ntcagaagaca cggatggcga tggcaaaatc acttctgaag agttgagagc agcgtttaaa       120 \ntcaattgaaa tagacttgac tcaggaaaag attgacgaaa tgatgggaat ggttgataaa       180 \ngatggtagca aagatatgga cttttctgag tttttgatga ggaaggcaga acagtggcgc       240 \nggaagagaag tacaattaac taaagctttc gtcgacttgg acaaggatca caacggatcc       300 \nctcagtcctc aagagctgcg tacagcgatg tcagcatgca ccgatccacc gatgacggag       360 \naaggaaatcg atgcaatcat cgagaaagcc gactgcaatg gggacggtaa aatctgcctt       420 \ngaagaattca tgaaattgat tcactcgtct taa       453 \n
GLEAN3_21354	SPU_021354		GLEAN3_21354 should be linked to GLEAN3_21353\n
GLEAN3_21351	SPU_021351		This prediction is one of many ACE genes on scaffold 52540. \n \nAlignment with best blast sequence and transcriptome signals suggest that there may be a missing exon 3' of GLEAN3_21351|Scaffold52540|76503|76746| and 2 exons may be incorrect (see below). \nGLEAN3_21351|Scaffold52540|77756|77824|  \n>GLEAN3_21351|Scaffold52540|77756|77824| DNA_SRC: Scaffold52540 START: 77756 STOP: 77824 STRAND: +  \nGATCTTCCTCGCTTCGTCTTTCATCTCTAAGCCATAGTTTCCCGACAGCATGGACACATTAGACAAAAA \nGLEAN3_21351|Scaffold52540|83924|84128 \n>GLEAN3_21351|Scaffold52540|83924|84128| DNA_SRC: Scaffold52540 START: 83924 STOP: 84128 STRAND: +  \nCGTCCAATGCATATTGGACACAGTTCGTGCACAACGGACAGAGTTCCGCATTGCTATCCAAGCATGCCAT \nATGTCCTAGAACTCCCTTCGTCTTTTCGACAAACTGATCAAGGTTGTCGAGCTTGAGCTGAAGATAACAC \nGTCACGTTGGTGATCAGGAGTACTGTAGCGAGAACCCCGTATCCGGTCCATCGGAGGGACGCCAT \n
GLEAN3_21357	SPU_021357		#\nThis prediction is one of many ACE genes or parts of genes on scaffold52540.\n
GLEAN3_26751	SPU_026751		Two comments: \n \n1)Exon 3 of this prediction encodes reverse transcriptase domain.  Transcriptome data suggests that this exon is represented in embryo RNA, but cross-reaction cannot be excluded. \n>GLEAN3_26751|Scaffold1575|15835|16444| DNA_SRC: Scaffold1575 START: 15835 STOP: 16444 STRAND: +  \nAATGAATTCCGTCTAGGAAGATCTACAGTAGCGCGAATTCTAACTTTGCGGAGACTGGTGGAAGGTACTA \nAAGCAAAGCATCTGACAGCAGTACTTACGTTCGTGGGTTTTAAGAAGGCCTTCGATTCAATCAATAGGAA \nGAAGATGTTAGAGATCTTAAGAGCCTACGGAATACCATACACAATAGTCACAGCAGTAGGGTTGCTGGAC \nAAAGTTACTACAGCTCAAGTGCGTTCACCAAATGGAGAGACTGACTACTTTACCATCTTAGCAGGAGTGC \nTCCAAGGCAATACTTTAGCACCATACCTATTCATCGTAGCATTGAATTATGCTCTAAGAATGGCTACTGA \nATGGTTCGAGGATCTGGGCTTTACCCTAGAGGAAAGAGAAAGTAGCAGATATTCTGCTGTAATGATCACA \nGATACTGACTTTGCTGATGATATTGCACTAATTTCAGACAATGTGGAAAAGGCACAGAAGCTCCTAAAAC \nAACTAAAGTCTGCAGCAAGTCAAATCGGTCTACAAATAAACAGTACTAAGACAGAATTCAAGATGTACAA \nCCTTCAGCCTATATTTCACACATATCGTCATTTGCCTGACGTAGGAATGC \n \nIn addition to duplicated copies on the glean3 list, there are many not on the glean3 list with e values>100.  Scaffolds 55516, 93134, 115888, 20, 302, 807, 102465, 95798, 1755, 11431, 52209, 431, 87717, 1081, 53542, 138714, 51779, 86354, 119129, 1241, 1111, 28699, 58694, 18304.  The ACE family cannot be described from the current assembly many fragments are scattered both on the glean3 list and outside of it. \n
GLEAN3_28021	SPU_028021		Alignment with best blast sequence suggests that the model may lack N-terminal sequence.\n
GLEAN3_12611	SPU_012611		Prediction covers partial CDS as inferred from alignments with best blast hit.\n
GLEAN3_27362	SPU_027362		TWO COMMENTS: \n \n1)Last  three exons are unlikely to be part of this gene and should be deleted from the model. \n \nExon 8 is COG5048, COG5048, FOG: Zn-finger;  not blasting to thiamet oligopeptidease \n \n>GLEAN3_27362|Scaffold50623|88407|89245| DNA_SRC: Scaffold50623 START: 88407 STOP: 89245 STRAND: +  \nCTGTATGGATGTGTTTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCACATTGATCACATAC \nATAGGGCTTCTCACCTGTATGGGTGTGTTTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCG \nCATTGATCACATACATAGGGCTTCTCACCTGTATGGGTGTGTTTATGTCTTGTGTGATGAGTTTCTTGAT \nTAAATGTCTTACCACATTGATCACATACATAGGGCTTCTCACCTGTATGGATGTGTTTATGTCTTGTGTG \nATGAGTTTCTTGATGAATGTCTTACCACATTGATCACATACATAGGGCTTCTCACCTGTATGGATGTGTT \nTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCACATTGATCACATACATAGGGCTTCTCACC \nTGTATGGATGTGTTTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCACATTGATCACATACA \nTAGGGCTTCTCACCTGTATGGATGTGTTTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCAC \nATTGATCACATACATAAGGCTTCTCACCTGTATGGATGCGCTTATGTCTTGTGTGATGAGTTTCTTGATT \nAAATGTCTTTCCACATTGATCACATACATAAGGCTTCTCACCTGTATGGATGCGCTTATGTCTTGTGTGA \nTGAGTTTCTTGATTAAATGTCTTACCACATTGATCACATACATAGGGCTTCTCACCTGTATGGATGTGTT \nTATGTCTTGTGTGATGAGTTTCTTGATTAAATGTCTTACCACATTGATCACATACATAGGGCTTCTCAC \n \nExons 9 and 10 do not blast to thiamet oligopeptidease.  This regions contans a SIR1 domain:  COG0846, SIR2, NAD-dependent protein deacetylases, SIR2 family \n \n>GLEAN3_27362|Scaffold50623|92553|92742| DNA_SRC: Scaffold50623 START: 92553 STOP: 92742 STRAND: +  \nCACTCTCCACATCTGGCAATAGGCATGCATGGATATAGGAATGGATAGCGTGTTCACATCTCCACCGACC \nCCCATGGTCGATGAGAGGGCCTACTCTCCTGTATCGACGGCGGGTGATGAGGTCATCCACAAATTGCCGA \nATGATGAAATAAAGACGGCTAGCCATGGGTTCGTCTTCACCCACACGCCG \n>GLEAN3_27362|Scaffold50623|94169|94282| DNA_SRC: Scaffold50623 START: 94169 STOP: 94282 STRAND: +  \nCTGGTAGATGTCCAGTGACCCCTGGATGACCTTCTCGAGGGGGAAGTACTCCTTGAGCTTGTTGTGGTCC \nACAGAGTAGCGCTTCTCTTCGGACATGTTCATGAAGTAGCGCAT \n\t\t \n2) In additions to multiple copies on the glean3 list, there are excellent matches on scaffolds 71773 and 89507. \n \n
GLEAN3_26549	SPU_026549		#\nAlignment with best blast sequence suggests that the model is correct.\n
GLEAN3_02117	SPU_002117		Exon sequences below contain chromo and ChSh domains characteristic of some nuclear proteins, and are unlikely to be part of this gene and should be removed from the gene model. \n>GLEAN3_02117|Scaffold310|75820|75949| DNA_SRC: Scaffold310 START: 75820 STOP: 75949 STRAND: +  \nTTATCTTTTGTCCTCCTCGTCGTCTTCACTGTGCCACGTCAAACGCTCTTCGTAGAACTGGATGACAATC \nTGAGGGCACCTGTGGTTGGCTTCCTTGGCTCGCACAAGGTCTGCTTCGTTGTTGTTTTTC \n>GLEAN3_02117|Scaffold310|77625|77755| DNA_SRC: Scaffold310 START: 77625 STOP: 77755 STRAND: +  \nCACTTCATGAGGAATAGGAGTTCATTGTTGGATTCTGTGGCACCAATGATTCTCTCTGGATCCAGCCCTC \nTGTCAAATCCTCTGTACTTTTTCTCATCTTGTTTCGTCTGCTTGCCATCTTTGGAAGGCCC \n>GLEAN3_02117|Scaffold310|78216|78405| DNA_SRC: Scaffold310 START: 78216 STOP: 78405 STRAND: +  \nAGATGAGTCTTTCCTCTTCTCGGTTGCAGCAGCTTCTTCTTTCCTTCTTTTCTTTGCTGCTACATCTCCA \nGCAGCAACATTTTGAGCTGATTTTCTCTTTAAGGCCTCCTTTTCTCGGATCTTCTTTTCATACGCCTCAA \nTTAAATCAGGGCACTCTAGATTGTCCTGGGGTTCCCATGTCGATTCATCA \n>GLEAN3_02117|Scaffold310|80007|80191| DNA_SRC: Scaffold310 START: 80007 STOP: 80191 STRAND: +  \nTCTCCATAGCCCTTCCACTTGAGGAGGTATTCTACTCTTCCTTTGTGTATCCTCTTATCGACAACCTTCT \nCCACTTGGTAGACCTCTTCTTCTTCCTCCTCACTTTCTCCCTCGGTTTTCTTTTCTTCATTTTCTTCACC \nTTCTGGTTCAGGCCCATCTTCAGGTTTCCTCTGCTTTTTGCCCAT \n
GLEAN3_10808	SPU_010808		Some exons may be not belong in this gene:  The first three at one end of the gene blast to nothing in the nr database; the second set of two are in the middle of the gene and also do not blast to anything. \n \n>GLEAN3_10808|Scaffold1433|165949|166138| DNA_SRC: Scaffold1433 START: 165949 STOP: 166138 STRAND: +  \nATGACAATGGATACAAATAAGAGGAACACCATGATAAATCTCAAATTAATTCTGACTGTGATGATCATCA \nTTTTTCTACAATGTTGGGAAGCTACTTCTCTGTCGTCTGCTCCAGCTCCTAGCCGTTGCATATTTGATGA \nAGTTCAAAAGCATCAAAACGTAGAAAGAACACTTATAAAATACCATCCAG \n>GLEAN3_10808|Scaffold1433|166548|166721| DNA_SRC: Scaffold1433 START: 166548 STOP: 166721 STRAND: +  \nGTGATGTAAGCGCAAAATCAAAGAGGTCAGTAGAAGAAGAAGCAAATGCCTACCAGCCAATCAGAGTGAA \nGACGTTTGTCCAGAATGAGGAGCATCTGATGGACTCCGTGCAGGTTGAAAAACTAGAGACCATCATGGCT \nGGTGCAACATCTGTTGTTCAAAAACTTCTGTCAG \n>GLEAN3_10808|Scaffold1433|167584|167668| DNA_SRC: Scaffold1433 START: 167584 STOP: 167668 STRAND: + \n \n>GLEAN3_10808|Scaffold1433|170097|170175| DNA_SRC: Scaffold1433 START: 170097 STOP: 170175 STRAND: +  \nCTTGCCCTGCATGAAGCGTTTCATGTTCTTGGATTTTCTACAAGTCTTTTTGACCAGTTTCAAGATTGTA \nGTGTATGTG \n>GLEAN3_10808|Scaffold1433|170631|170782| DNA_SRC: Scaffold1433 START: 170631 STOP: 170782 STRAND: +  \nAAGATGGACTCGAGTGCGAGACAAGAGAGGATGTTGTGAGAGTGGATGCCGGTGGGCAGTCTAGACTCCA \nCACCCCAGCAGTCGTGGCTGCATCTCAGATTCATTTTGGCTGCACTGAAGAAGAAGAAATGGGTGTTCCT \nCTGGAAAATCTG \n
GLEAN3_11364	SPU_011364		#\nIn addition to many copies of these gene on the glean3 list, there are several scaffolds that contain excellent matches:  Scaffolds 25161 and 85005.\n
GLEAN3_26072	SPU_026072		Exons 1 and 2 cannot be confirmed because they do not blast to endothelin converting enzyme. \n \n>GLEAN3_26072|Scaffold692|19600|19738| DNA_SRC: Scaffold692 START: 19600 STOP: 19738 STRAND: +  \nATGACGAGTAGTCAGGCTAAACTCGCCGTCGATGAGGGTGTCGTTGTCAGACGAAAAGCCCCCAAGGTCA \nTTACCAGGAATCTGGTCGTCATCGTTGTCGTCTTGGCACTCCTCACCGTGTCACTTATAGTAGCTACCG \n>GLEAN3_26072|Scaffold692|21211|21327| DNA_SRC: Scaffold692 START: 21211 STOP: 21327 STRAND: +  \nTTGTAATCGCGTCAGACCGGGATAACCTTTCTTCAAGATTACGATCATATACCGGCCACCAAACCTCACC \nATGCCCTGAACCGAAGCAATGTCTCACGCCCTCTTGTGTTAAAGCAG \n
GLEAN3_04765	SPU_004765		Partial cds inferred from alignment with best blast hit.\n
GLEAN3_11071	SPU_011071		partial cds inferred from alignment with best blast hit.\n
GLEAN3_02141	SPU_002141		Partial cds inferred from alignments with best blast hit.\n
GLEAN3_08959	SPU_008959		Alignment with best blast sequence suggests that the model may lack N- and C-terminal sequences.\n
GLEAN3_15178	SPU_015178		There appear to be several CPA2 genes tandemly repeated; the other GLEAN3_15179 is partial CDS while this appears to be complete, as inferred with alignments to best blast hits.\n
GLEAN3_15179	SPU_015179		There are 2 CPA2-like genes on this scaffold.  This is partial CDS while the other, GLEAN3_15178 appears to be complete, as inferred by alignments to best blast hits.\n
GLEAN3_01397	SPU_001397		TWO COMMENTS: \n \n1) This prediction covers partial cds, as inferred from best blast hit alignments;  2N-TERMINAL exons may not belong to this protein, since they do not blast to CPA genes \n>GLEAN3_01397|Scaffold22766|21673|21839| DNA_SRC: Scaffold22766 START: 21673 STOP: 21839 STRAND: +  \nTTATTTTGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTCCTTCTCCT \nTCTTCTTCTTCTTCTTCTTCTTCGTCTTCTTCGTCTTCATTGTCTTCTTTGTCTTCTTCTTCTTCTTCTT \nCTTCTGTTTCTTCTTCTTCTTCTTCAT \n>GLEAN3_01397|Scaffold22766|21978|22053| DNA_SRC: Scaffold22766 START: 21978 STOP: 22053 STRAND: +  \nAGATAACGTTGGCGATAGAGCCAGTCTTGTACGATCTATTACCCATGGCGGATACGGCACTGTCTGATAC \nAGATTT \n \nSimilarly, 2 C-terminal exons may belong. \n>GLEAN3_01397|Scaffold22766|28996|29077| DNA_SRC: Scaffold22766 START: 28996 STOP: 29077 STRAND: +  \nATAGTTTCCCAGTACGCTCTTGAGAACGACTAGCTTCTCCAGCTGCAGGCGATCTTTAGGGGTGACACGG \nAATACCTTGTAC \n>GLEAN3_01397|Scaffold22766|29479|29540| DNA_SRC: Scaffold22766 START: 29479 STOP: 29540 STRAND: +  \nCCATCGTAATTGACGGCTAGCGCAGTAGCTAAGAGAGCAGTGAATACCAGAAAACGCATCA \n \n2) There are 4 CPA-like genes or parts of genes on this scaffold. \n
GLEAN3_01399	SPU_001399		#\nOne of a set of 4 whole or parts of CPA-like genes\n
GLEAN3_01400	SPU_001400		TWO COMMENTS: \n \n1) exon below should be added upstream of present exon1 since it blasts to the N-terminal part of its best blast hit. \n \n>Supertig22766_5|Scaffold22766|32108|32236| DNA_SRC: Scaffold22766 START: 32108 STOP: 32236 STRAND: +  \nATGCTGATAATAATTATTCAATCTCTTTCAGCTCGACTTCTGGAGGGAGGCGACGCCGTCTTCGATCGGT \nCGTCCCGTCGACATCATGGTACCATCGAGCCTTCGGAACAACGTCCACGACATACTGAC \n \n2) This is one of 4 whole or parts of CPA-like genes on scaffold 22766.  This model covers partial CDS, as inferred from alignments with best blast hit.\n
GLEAN3_14935	SPU_014935		This is one of two CPA-like genes on scaffold 27970.\n
GLEAN3_12020	SPU_012020		This is one of two CPA-like genes on scaffold 48388.  The other is GLEAN3_12021.  There are several clusters of CPA genes that may link to form a large cluster.\n
GLEAN3_12021	SPU_012021		This is one of two CPA-like genes on scaffold 48388.  The other is GLEAN3_12020.\n
GLEAN3_04157	SPU_004157		BLAST data suggest that the following exons should be deleted from this model. \n \n>GLEAN3_04157|Scaffold48533|10538|10720| DNA_SRC: Scaffold48533 START: 10538 STOP: 10720 STRAND: +  \nCTCGTCAGCCATGCTGGTCTCTAGCTCCCTCAGCCATTGCCTGTGCTCATCCGTACCAGGGGTCACACGA \nAGCACCTGGTATCTGTAAATTAATAAGAGAGACATATAGAGGGGTATGGAGTGATGGGGTATAGAGGGAA \nGAGATGGGAGATTGANGGCACGTCTTCGTGCAATCAAGATTGC \n>GLEAN3_04157|Scaffold48533|15593|15676| DNA_SRC: Scaffold48533 START: 15593 STOP: 15676 STRAND: +  \nCTTGAGTTTGCGTCTAGCGTTGAACTTCTTCAAGCAATCAACGGTCTCCTGTCTGTGCATTGCTGAAGCA \nTAGCGATCTCGGTT \n>GLEAN3_04157|Scaffold48533|21092|21201| DNA_SRC: Scaffold48533 START: 21092 STOP: 21201 STRAND: +  \nCTGGATCCATGGATGTTTTAGAGCCTGGCAGGCAGAGATACGCTTTCCTGGGTTGACTGTCAGCATGCTA \nTCTATCAAGTTCTTTGCTTCAGGTGTCACTGTGTCCCATT \n>GLEAN3_04157|Scaffold48533|22651|22855| DNA_SRC: Scaffold48533 START: 22651 STOP: 22855 STRAND: +  \nCTGAATTAGACTCCAATCCACCATCTTGAACAATCTCCTCAAAATCCATTGAATGGCTGGCATTGAATAA \nTCCAGTCTTCGCTTTGACGATGGCATACTGCCCTTTGAAGGCAGACATATTGCCACAGGTAGCTTGACTT \nTGGATGTATCGTAGGGTCTTCCCACTGCGAGTCATGCACTTCTTTCTCCCGACGCTTGCTGCCAT \n
GLEAN3_22102	SPU_022102		This is one of two CPA-like genes on scaffold114652.  There are clusters of CPA-like genes on several scaffolds, raising the possibility that large clusters of these genes exist.\n
GLEAN3_09007	SPU_009007		partial CDS, C-terminal only, because this is a short scaffold.\n
GLEAN3_27451	SPU_027451		TWO COMMENTS: \n \n1) partial cds; note that there are repeated elements in the cds of the best blast hit. \n \n2) In addition to duplicated copy on glean3 list, there is also an excellent match on scaffold 73285.\n
GLEAN3_04957	SPU_004957		This gene model may represent a pseudogene or contain a sequence error. 5' upstream sequence matches coding sequence of other Sp-Tlr genes and contains stop codons . Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_05830	SPU_005830		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_06218	SPU_006218		This gene model may represent a pseudogene or contain a sequence error. 5' upstream sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_25601	SPU_025601		Part of the cDNA sequence (AY130972 ) could not be found anywhere in the genome. Therefore, accept gene model prediction for exon 2. \n \nExon 1-3 of this gene are on Scaffold 87957 (GLEAN3_17106). Exon 3-7 are on Scaffold 107218 (GLEAN3_25601). The two scaffolds have exon 3 and flanking sequences overlap and can be aligned quite well (e=0). Therefore, the two scaffold might be assembled as one scaffold. The modified gene model is composed of Exon 1-3 from Scaffold 87957 and exon 4-7 from scaffold 107218. \n \nExon 5-7 are also on Scaffold 55268 (GLEAN3_05718). This might be another allele of this gene. \n \nPlease refer GLEAN3_17106 for the modified gene model.\n
GLEAN3_05718	SPU_005718		Part of the cDNA sequence (AY130972 ) could not be found anywhere in the genome. Therefore, accept gene model prediction for exon 2. \n \nExon 1-3 of this gene are on Scaffold 87957 (GLEAN3_17106). Exon 3-7 are on Scaffold 107218 (GLEAN3_25601). The two scaffolds have exon 3 and flanking sequences overlap and can be aligned quite well (e=0). Therefore, the two scaffold might be assembled as one scaffold. The modified gene model is composed of Exon 1-3 from Scaffold 87957 and exon 4-7 from scaffold 107218. \n \nExon 5-7 are also on Scaffold 55268 (GLEAN3_05718). This might be another allele of this gene. \n \nPlease refer GLEAN3_17106 for the modified gene model.\n
GLEAN3_27819	SPU_027819		Possible gene duplication  \nGLEAN3_16081\n
GLEAN3_16081	SPU_016081		Possible gene duplication \nGLEAN3_27819\n
GLEAN3_23660	SPU_023660		Partial CDS.  Note there are repetitive elements in the best blast hit.\n
GLEAN3_17575	SPU_017575		partial cds; Note that there are repetitive elements in the best blast hit.\n
GLEAN3_07682	SPU_007682		There is an excellent match to part of this predicted gene on scaffold 34914_1, but it is not on the glean3 list.\n
GLEAN3_21609	SPU_021609		partial CDS, inferred from alignments with best blast hits.  N-terminal sequence is not included on this scaffold, 52285.\n
GLEAN3_03363	SPU_003363		Partial CDS, as inferred from best blast hit alignment.  One predicted and fairly long exon, listed below, is probably not included in this protein because it does not blast to carboxypeptidase E. \n>GLEAN3_03363|Scaffold112125|14529|14828| DNA_SRC: Scaffold112125 START: 14529 STOP: 14828 STRAND: +  \nTCAACCTTTACTTATTGTGTTGTTAAATAAATTCTGTCGGGTTACAATTCTCGGAGGGGGTTGGGCGTCG \nGTGAGTGTGATGAGAATAATGGTTGTGAAGATGATTTTGACAACTCTCAGATTTGATTTTATGATGACGA \nAGACACCGATGTTAACGATGGTGGTGTTGATGAAGCTGATGCTGCTGCTGCTGATGATGATGATAGTGAT \nGATGATTAACGCTGCACTTGCTATGTTGTGGCGTTTGTCGAGGTCAAGGATAATGATTATGATCATAAAA \nTCAATCATTATAGCAATGAA\n
GLEAN3_26013	SPU_026013		partial CDS, as inferred from alignment with best blast hit.  Appears to missing N-terminal sequences.\n
GLEAN3_16331	SPU_016331		partial CDS, as inferred by alignments with best blast hit. Model  appears to missing N-terminal sequences.\n
GLEAN3_20494	SPU_020494		1)This model, GLEAN3_20494 and the neigboring model, GLEAN_320393, appear to be parts of the same gene, as inferred from alignment with best blast hit. \n
GLEAN3_01638	SPU_001638		#\nSubgroup A thrombospondin with TSP type 1 repeats\n
GLEAN3_07105	SPU_007105		This gene model may represent a pseudogene or contain a sequence error. A part of intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nNew assembly doesn't show any frame shifts and stop codons. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_09755	SPU_009755		Amino terminal half of the protein is missing due to end of contig. \n \nThis region is slightly more closely related to mammalian SubgroupA TSPs than subgroupB based on blast.\n
GLEAN3_07430	SPU_007430		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_09230	SPU_009230		partial CDS;  \nTWO COMMENTS: \n1) Three exons in the model may not belong because they blast to a cub domain not normally found in this class of peptidase.  >GLEAN3_09230|Scaffold376|224783|225029| DNA_SRC: Scaffold376 START: 224783 STOP: 225029 STRAND: +  \nGAGTGGTCCATCACAGCACCCCTTCAGAGGAGGATCCTCGTCACTTTTAATGACTTAAAGTTGGAATCTC \nCTTTTGATTTTATTGTCATTCGGGACATGTACCGTGATAAAAAGCCCTACACAGGCGAGAACATGCTACT \nCCATCCATTTCTGACGTTAGGACGTACTCTTGATATTGAGTTCTCCTCATCTAGAAGTGGCAGAAGGAGA \nGGATTCAATATTTCAGTCTCATGTAGCGAACTTTCAA \n>GLEAN3_09230|Scaffold376|226152|226229| DNA_SRC: Scaffold376 START: 226152 STOP: 226229 STRAND: +  \nATACATATCGTCTGATGGATTGCGCAGCAGAGCATTCAATGTGCTTGTCTGAGAAGGGCGTCAAAATAAA \nATGTCATG \n>GLEAN3_09230|Scaffold376|227528|227685| DNA_SRC: Scaffold376 START: 227528 STOP: 227685 STRAND: +  \nCTTCTGTGGGTTTCTGTGAGGAACTAAACGACACGATTGATGGATCTTGGGATCCTAACATCACATGGTT \nTGGTTCTATCGTTCATCGTACATGTATGGATGGATACAGTCTAAAAGGCAATGGAACCCTGCAATGTGTG \nCCGGGGTATCACCATTGA \n2)Six N-terminal exons are questionable because there is no conservation with this class of peptidase based on alignments to best blast hits. \n>GLEAN3_09230|Scaffold376|53883|53961| DNA_SRC: Scaffold376 START: 53883 STOP: 53961 STRAND: +  \nATGCATGTTGATCGTTGTACAACGGTGATAACCGGTGCAACGCACTGTCCTTGGTTCAGTGCCTTTCCCA \nTTGATACCT \n>GLEAN3_09230|Scaffold376|59076|59173| DNA_SRC: Scaffold376 START: 59076 STOP: 59173 STRAND: +  \nGTGACGGTAAAGACTCTGGAATTTTACTCATTGATGAAAGGACAAAAGCAGTAATGACTGACCAACCAAG \nACATGCACAAGAAGCTTTCAAGGAACAG \n>GLEAN3_09230|Scaffold376|60698|60806| DNA_SRC: Scaffold376 START: 60698 STOP: 60806 STRAND: +  \nATTGCCAAAGTTCGTGAGTTGGTTCCTACCCGGAGTAGAGATGATATTGCACTGGTTCTTCAATGCCATG \nAGGGAAATGTGGATAAAGCAGTACAGTCATTCATAGACG \n>GLEAN3_09230|Scaffold376|66479|66525| DNA_SRC: Scaffold376 START: 66479 STOP: 66525 STRAND: +  \nATGGAGCCAAAACTGTTTTGAATGAGTGGCAGTCGCATGGCAAGAAG \n>GLEAN3_09230|Scaffold376|69825|69978| DNA_SRC: Scaffold376 START: 69825 STOP: 69978 STRAND: +  \nTCTGCAAATAAGAGAAACAAGAAAAAGAAACGAGGCCCTGATGCACCAGATGAGAAATCAAATGGTGGTG \nATGCTGCTGTAGCTAGTAAAACAGGTAAATATAACGCACTAGAGCAATTCCATGGAAAGTTGCCTAAGAC \nGGGCAACATGCAAG \n>GLEAN3_09230|Scaffold376|103311|103387| DNA_SRC: Scaffold376 START: 103311 STOP: 103387 STRAND: +  \nGTGAAGTAAATGGGTACCTGGTAGGAATTTATTCCTTGAAACGCAGCGCGCGTAACAGCTGCACTGCTAA \nAGCCAGG \n \n
GLEAN3_19059	SPU_019059		partial CDS; 3 N'terminal exons are questionable as the sequences are not conserved with other metalloprotease1 genes. \n>GLEAN3_19059|Scaffold64363|806|935| DNA_SRC: Scaffold64363 START: 806 STOP: 935 STRAND: +  \nATGCTTGGAAAGAAAGTGGAAGGATCCGGACTTGAAGATATCCTTTTGGAAGCTGGTCTGATGTCTTCTG \nGGTCTATAAAAGATGTGTCAACAACAGTGCGACAGGAGTCTGCATTGTCACAAGACAATG \n>GLEAN3_19059|Scaffold64363|2797|2824| DNA_SRC: Scaffold64363 START: 2797 STOP: 2824 STRAND: +  \nTGCTGAGTGGAGGAGAAGCTATGCTGTT \n>GLEAN3_19059|Scaffold64363|7517|7616| DNA_SRC: Scaffold64363 START: 7517 STOP: 7616 STRAND: +  \nTGCTGAGTGGAGGAGAAGCTATGCTGTTGTGAGTAAAGCCCAGGAGCGAGCAAAACAATACCAACCAGGA \nGACAGGCTCCATGGCTTCTCGGTGGAGAAA \n
GLEAN3_07850	SPU_007850		#\nThis gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some flame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_07986	SPU_007986		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_08267	SPU_008267		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(18 to 24), LRR-CT, TM and TIR. \n
GLEAN3_09129	SPU_009129		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. Modified gene model includes the long unknown sequence.\n
GLEAN3_09829	SPU_009829		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some flame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group I(orphan).\n
GLEAN3_15920	SPU_015920		5' end of the gene is missing because scaffold data is incomplete.  This gene is on same scaffold adjacent to Sp-AlphaD\n
GLEAN3_10940	SPU_010940		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_11030	SPU_011030		This is longer than most homologs/orthologs.  May be duplication of GLEAN3_11029 with actual starting codon at bp 105. \n
GLEAN3_10619	SPU_010619		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_11540	SPU_011540		Intron of this gene model was modified to a coding region by comparison to the corresponding FgenesAB, ++ and Genscan prediction. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_11949	SPU_011949		The intron of this gene model included a long unknown sequence. So the model was modified as an intronless gene by comparison to the corresponding FgenesAB prediction. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_13144	SPU_013144		partial CDS; Contains C-terminal half sequences, based on best blast alignment.  Some sequences of predicted exons are not conserved and are therefore questionable.  They are: \n>GLEAN3_13144|Scaffold9356|6814|6907| DNA_SRC: Scaffold9356 START: 6814 STOP: 6907 STRAND: +  \nTCATTCCTTGAGTATATTCCATCCCTGACTGGTCATGGTGGTGGTTGGTCCTAGTAGAGTAGTGGCATGG \nGTTCTCTTACCATCCATCAAGTAC \n>GLEAN3_13144|Scaffold9356|12446|12517| DNA_SRC: Scaffold9356 START: 12446 STOP: 12517 STRAND: +  \nCTGTAAGAGTAAAACTTGAAAGCGCCCTCGGTGCCCATGGATGCTCCACCTCCATATGCTCCTCCCTTCT \nCC \n>GLEAN3_13144|Scaffold9356|13299|13352| DNA_SRC: Scaffold9356 START: 13299 STOP: 13352 STRAND: +  \nCTGATTTCCCGATGCAGGTATTTGGCAGACATCAAACGGGCAAGGACCCTCAAC \n>GLEAN3_13144|Scaffold9356|14762|14873| DNA_SRC: Scaffold9356 START: 14762 STOP: 14873 STRAND: +  \nCTGTGTTAAATGTAGACTGTCTCCTAAGGGTGAGCCTGGTAGATTATCTAGAAACCTTGTCAGCTGATTG \nGCTGCTTGATCCACGCCTTCTGGGCTGGAGTTGACTGCACAT \n>GLEAN3_13144|Scaffold9356|15689|15789| DNA_SRC: Scaffold9356 START: 15689 STOP: 15789 STRAND: +  \nCTCATGTTGGTCTTGTTTAGTACCAAGGAGGCAATAGTCTGAAGGTGGGCCAAGACTGGGTCCAAGTTCT \nCCTTCTCGGCAAGTCCTTTTAGGAATGACAC \n
GLEAN3_25459	SPU_025459		#\npartial CDS, as inferred from alignment with best blast hit.  The following exon sequence is not conserved and may not be part of this gene. \n>GLEAN3_25459|Scaffold2341|6491|6568| DNA_SRC: Scaffold2341 START: 6491 STOP: 6568 STRAND: +  \nACTGCCATGCCTGGTATGAAGCGGGACTGCGGTGGCGCAGCAGCGATTTTGGGTGCATTCTATGCAGCCG \nTTAAAGAA \n
GLEAN3_13676	SPU_013676		Intron was modified to a coding region by comparison to the corresponding FgenesAB, ++ and Genscan prediction.\n
GLEAN3_25520	SPU_025520		This model contains to predicted exons that may not belong to this gene, because they are not conserved, while flanking exons are strongly conserved.  The first of these was also included in a haplotype, GLEAN3_25459. \n>GLEAN3_25520|Scaffold32958|7581|7658| DNA_SRC: Scaffold32958 START: 7581 STOP: 7658 STRAND: +  \nACTGCCATGCCTGGTATGAAGCGGGACTGCGGTGGCGCAGCAGCGATTTTGGGTGCATTCTATGCAGCCG \nTTAAAGAA \n>GLEAN3_25520|Scaffold32958|12680|12777| DNA_SRC: Scaffold32958 START: 12680 STOP: 12777 STRAND: +  \nGCGATGGCAAACTATCACTTCCTCGTCGATCAGAATGTATACGCTATCTTTCCTCAGCTTCGCTTCGGAA \nAGATAGCTACATCCAGATCGCCTCGGAA \n
GLEAN3_13751	SPU_013751		#\nThis gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_03749	SPU_003749		This model may encode two similar adjacent genes, but it is difficult to annotate because there are some exact amino acid duplications in different parts, raising the possibility of an assembly error. \n
GLEAN3_09159	SPU_009159		Both ends of the gene run are at scaffold boundaries and the CDS is incomplete.  I have added 2 exons in the Fgenes H prediction that blast to the gaps in alignments of the GLEAN3 prediction.  One EST\n
GLEAN3_12617	SPU_012617		Might be missing exons (from best hit alignment, position on contig).\n
GLEAN3_20050	SPU_020050		Phylogenetic analysis using the PTPc domains shows that this is an orthologue of the human PTPRs D, F, and S.  The missing extracellular portion of this gene may correspond to GLEAN3_23737.  This sequence has not been added to the GLEAN3_20050 sequence, since cloning has not yet verified this relationship.\n
GLEAN3_27234	SPU_027234		Only exons 1-3 are present on this scaffold21. Exons 4-5 are missing. Exons 4-5 are present on scaffold1351 GLEAN3_14051 but exons 1-3 are missing.\n
GLEAN3_15371	SPU_015371		The predicted exons 2, 3, 4, 14 may not belong to this gene,  \nas inferred by alignment to best blast hit. \n \nExon2 \n>GLEAN3_15371|Scaffold2|816312|816482| DNA_SRC: Scaffold2 START: 816312 STOP: 816482 STRAND: +  \nGTTGAAGCAAAGATTCGAATCACCGAGTTTGATTCTGAATCACGTCGGACTGCTCAGACCTACTACCATA \nCCTACCCTTCTCATCAGATCTCATACGATGTCAGACGTGAACTCGAAGCTATAGCCTCGACATCGGGTTC \nGCCCACCTACGTGGACGAAGTCACGCAAGAG \n \nExon3 \n \n>GLEAN3_15371|Scaffold2|818170|818272| DNA_SRC: Scaffold2 START: 818170 STOP: 818272 STRAND: +  \nCTAGAGGATGTGAAGTCTCGAATGGAGCAGAGATATCACACGGCTAAGGTATGCAGGAAGAAAGGTAGAC \nGAGCACGGAAAGAATGTCTGCGTCTAGATCCAG \n \nExon4 \n \n>GLEAN3_15371|Scaffold2|821627|821929| DNA_SRC: Scaffold2 START: 821627 STOP: 821929 STRAND: +  \nGACTAGAAGAAGACTGTGCATTGGTATTCAAGAAACGAAGATGCGTAATCATGACAATGCTACGAGATGA \nGAGATTATATGTTTACTGCATGCAGGCTGTGATGAAAGCTAGGAAAGTGGATCAAGGGTCCCTTTATTTG \nTATGTTATGATCAATTATTTGAAGGAAGGTAGTGATGACAAGATTGGTGATGATCATAATGCCATGGACT \nTTGACCGGACATCATGGTCAAGGACCATGGTTAATCTCACGAAATGGAAAGTAGCAATGAAGCGAAACGA \nTGAGGGAGGATGTACTTACGAGG \n \nExon14 \n \n>GLEAN3_15371|Scaffold2|831817|831927| DNA_SRC: Scaffold2 START: 831817 STOP: 831927 STRAND: +  \nTGATATGCGTGAGGCAAACACCATTGGTGCCGATAAGTACTTCCATGCCCGGGGCAACTTCGACGCTGCT \nCAGCGAGGATCAGGAGGTAGATTCGCTGCCGAGGTTATCAG \n
GLEAN3_16055	SPU_016055		Added 3prime UTR based on EST evidence. \nN-terminus is missing due to end of contig.\n
GLEAN3_23767	SPU_023767		Exons 6-23 are accepted on the + strand from the GLEAN3_23767 predictions. Exons 2-5 are present on the - strand between exons 7 and 8. Exon 1 is from scaffold59902 but there was no GLEAN3 prediction for it, but it was predicted by FirstEF.\n
GLEAN3_28077	SPU_028077		partial CDS at end of scaffold\n
GLEAN3_23734	SPU_023734		See GLEAN3_17211. \n
GLEAN3_01047	SPU_001047		partial CDS on short scaffold, 21370\n
GLEAN3_07431	SPU_007431		TWO COMMENTS: \n \n1)partial CDS; missing N-terminal sequence as judged by alignment with best blast hit \n2)In addition to two glean3 copies, there is an excellent match on scaffold 123301, 194-227 that is not on the glean3 list.\n
GLEAN3_02912	SPU_002912		partial CDS; scaffold 52017 is short\n
GLEAN3_27157	SPU_027157		partial CDS; scaffold 66222 is short.\n
GLEAN3_15029	SPU_015029		Intronless Toll-like receptor with predicted LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_15132	SPU_015132		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_28898	SPU_028898		This gene was annotated based on a manual inspection of multiple sequence alignments. \n \nNote that slightly different models were created by other predictions for this gene. The Glean model provides the best alignment with vertebrate TRAF6, and it was therefore accepted in its original version. \n \nAlso note there is a gap in the alignment that is introduced by extra sequence in the glean model. There is no a priori computational evidence to suggest this extra sequence is not real.\n
GLEAN3_16468	SPU_016468		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. But The intron sequence except NNN matches coding sequence of other Sp-Tlr genes.\n
GLEAN3_16501	SPU_016501		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_26495	SPU_026495		This gene was annotated based on manual inspection of multiple sequence alignments. \n \nA slightly different model was created by the NCBI prediction; however, the pairwise alignment with vertebrate TRAF3 is significantly better for the glean model. \n
GLEAN3_17180	SPU_017180		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_17529	SPU_017529		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT and TIR. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_18055	SPU_018055		#\n54 nucleotides encoding a predicted signal peptide were added at the 5'end of the GLEAN3 model by comparison to the corresponding FgeneshAB and ++ prediction. \n
GLEAN3_08332	SPU_008332		This gene was annotated based on a manual inspection of multiple sequence alignments. \n \nEven though the available information is sufficient to confidently assign this gene as Sp-Traf4, there is also strong computational evidence to suggest that there is missing sequence towards the N-terminus of this gene, which is supported by the fact that this model is located at one end of its respective scaffold. \n \nA slightly different model was created based on a Fgenesh++ prediction; however, the glean/NCBI model shows a better alignment to vertebrate TRAF4.\n
GLEAN3_03462	SPU_003462		This gene was annotated based on a manual inspection of multiple sequence alignments. \n \nIn a multiple sequence alignment that included vertebrate and Drosophila TRAF family sequences, this gene failed to cluster with any specific family member. Thus, we have decided to name this gene with an arbitrary classifier ("A"). \n \nSlightly different models were created by other predictions; however the exonic structure of the glean model is very strongly supported by the tiling array data. The array data also suggest there might be additional sequence absent from the current glean model, which coincides with gaps in the alignment with vertebrate TRAFs. However, at present we have no evidence to determine whether the model should indeed be accordingly modified. \n
GLEAN3_25346	SPU_025346		Inspection of the tiling array suggests that glean may have missed the following exons: LVKSIGLYTYGLLLLLSSIQLLTAVRSMVKTIAHGDLQTVPFHMTKSRVIVQSRGHMEAIVLTKSWRKAVNICVINRICSVNPSKRDILITYT\n
GLEAN3_23069	SPU_023069		This gene was annotated based on a manual inspection of multiple sequence alignments to family members in other animal groups. \n \nThis sequence did not cluster with any specific family member in a multiple alignment tree, and was therefore named with an arbitrary identifier ("B"). \n \nMost exons in the present model are supported by the genome-wide tiling array data. Based on the same array data, there seem to be some exons that may have been erronously left out from the model. We have no evidence to independently support such possibility, and we have therefore accepted the glean model in its present form. \n \nAlso note that the first exon in the glen model codes for some very low complexity aminoacidic sequence. Exon 1 in the corresponding NCBI model does not include such sequence. However, the transcription of this sequence is supported by the tiling array data. Thus, it is highly likely that the most upstream sequence in exon 1 of the glean model is in fact 5' UTR, and that the true CDS corresponds to that of the NCBI model's exon 1. At the present time we cannot support any additional evidence to support this possibility.\n
GLEAN3_18100	SPU_018100		The first exon was eliminated and 117 nucleotides encoding a predicted signal peptide were added at the 5'end of the second exon of the GLEAN3 model by comparison to the corresponding FgeneshAB prediction. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_18211	SPU_018211		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. The nucleotides in the intron except NNN... have highly similar to other Sp-Tlr genes, so it was modified to a coding region.\n
GLEAN3_18212	SPU_018212		Unknown sequence (NNN...) in the intron of this gene model could make this gene model incomplete. The nucleotides in the first intron except NNN... have highly similar to other Sp-Tlr genes, so it was modified to a coding region and third exon was eliminated.\n
GLEAN3_11877	SPU_011877		The 5' end of the gene is incomplete, there are probably 1 or 2 exons (140 amino acids) missing.  The 6 ESTs are up to 3.5 kb upstream and only UTR sequence seems to be available.  There is embyronic expression data for 5' sequences.\n
GLEAN3_20620	SPU_020620		Exons 8-19 are from this scaffold57107 and GLEAN3_20620 prediction. Exons 3-7 are from scaffold533 and GLEAN3_23989 predictions except for exon 5 which was only predicted by the Fgenesh++ prediction. Exon 2 is from scaffold65249 with no tracks predicting it. Exon 1 is incomplete and present on scaffold137005 with no tracks predicting it.\n
GLEAN3_25471	SPU_025471		This gene was annotated with A.pectinifera mRNA and peptide.  The complete annotation of this gene is on Glean3_12078.  This glean aligns up with ApIP3R AAs 1247-1632.\n
GLEAN3_24561	SPU_024561		The mRNA and pepetide sequence of A. pectinifera were used to check the Glean prediction.  The complete annotation of this gene is on Glean3_12078.  Of note, two exons predicted are in sequential order, but code for the exact same nts. of the protein.  Also, not all the predicted exons were used in the full annotation under Glean3_12078.  This glean aligns up with ApIP3R AAs 116-403.\n
GLEAN3_07452	SPU_007452	From the excel data and the BLAST results, it is evident that this is the best match for this particular GLEAN model. The sequence appears to have an orderly and continuous arrangement without any gaps or internal repeats present. There was some Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	The first exon of Spec1 contains 5'UTR and one aminoacid: Met.GLEAN3_07452 does not show the first exon.\n
GLEAN3_07449	SPU_007449		This predicted gene GLEAN3_07449 matches to exon 4 to 6 of Spec2c gene. The other predicted gene GLEAN3_14607 matches to exon 1 to 3 of Spec2c.And, there are 10 Amino Acid that cannot match to any predicted genes. \n
GLEAN3_14607	SPU_014607		This predicted gene matches to exon 1 to 3 of Spec2c. Another predicted gene GLEAN3_07449 matchs to exon 4 to 6 of Spec2c. For more information of Spec2c, please search GLEAN3_07449 to see details.\n
GLEAN3_03175	SPU_003175		First exon and last exon of Spec2d gene is not predicted in \nGLEAN3_03175 \n
GLEAN3_06513	SPU_006513		Duplicated piece matching nucleotide numbers 11549-12671 of this scaffold (#56396) on scaffold #33010 (nucleotide numbers 1-1123, Glean3_23676)\n
GLEAN3_07655	SPU_007655		Additional evidences of the existence of the gene have been obtained in Sphaerechinus granularis \nGLEAN3_21839 encodes a partial 3'-terminal sequence of the mRNA \nPosition of the mRNA 5'end has been deduced from Sp ESTs (i.e. CX678933.1) \nTentative assignment of the 3'UTR end position by computational methods and comparison with S. granularis mRNA\n
GLEAN3_03332	SPU_003332		GLEAN3_03332 and the neighboring prediction GLEAN3_03333 are partial CDS and may result from assembly problems.  The first two exons of 03332 are exact repeats and this sequence appears again in 03333.\n
GLEAN3_21022	SPU_021022		Alignment data suggests this model contains complete CDS.\n
GLEAN3_00075	SPU_000075		Alignment with best blast hit suggests that the model may be missing N-terminal sequences\n
GLEAN3_15452	SPU_015452		Alignment with best blast hit suggests that model may be missing N-terminal sequence.  All exons except for the first one (see below) of the model blast to methionyl aminopeptidase.  This first exon may or may not be correct. \n>GLEAN3_15452|Scaffold1614|92736|92855| DNA_SRC: Scaffold1614 START: 92736 STOP: 92855 STRAND: +  \nATGTCTTTCAACAGCTACAGAAAACCCAGACCAGAACAGCTTTCAATATCTTCCAGAAATGGGGCAAAAA \nAGCAGAGCCAACAACACCCCAGTAACTTCTCAATTGTTCAGGCAGGAAAG\n
GLEAN3_14474	SPU_014474		Partial CDS, because it is on a short scaffold. Alignment with best blast sequences suggests that each of the 4 exons of this model is similar to sequences in the same protein, methionyl aminopeptidase.\n
GLEAN3_06794	SPU_006794		Alignment with best blast hit suggests that several internal exons may be missing from the model.\n
GLEAN3_19205	SPU_019205		Alignment to best blast hit suggests that there may be two missing internal exons in this model\n
GLEAN3_00619	SPU_000619		partial CDS, missing C-terminal sequences, based on alignment with best blast hit sequence\n
GLEAN3_14274	SPU_014274		Partial CDS based on alignment with best blast hit sequence\n
GLEAN3_08606	SPU_008606		Alignment with best blast hit sequence suggests that model is complete.\n
GLEAN3_11258	SPU_011258		Partial CDS\n
GLEAN3_19102	SPU_019102		TWO COMMENTS: \n \n1) Alignment with best blast hit sequence suggests that this model may lack N-terminal sequence. \n2) This model is nearly identical to the adjacent model GlEAN3_19101 on Scaffold 112071\n
GLEAN3_10630	SPU_010630		Alignment with best blast sequence suggests that this model is complete.\n
GLEAN3_02994	SPU_002994		Alignment with best blast hit sequence suggests that C-terminal exon(s) may be missing from the model.\n
GLEAN3_18410	SPU_018410		Intronless Toll-like receptor with LRR-NT, LRR(11 to 22), LRR-CT, TM and TIR. \n
GLEAN3_23048	SPU_023048		Partial CDS as suggested by alignment to best blast hit sequence\n
GLEAN3_18519	SPU_018519		This gene model was modified as intronless Toll-like receptor by comparison to the corresponding FgenesAB and Genscan prediction. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_02598	SPU_002598		Alignment with best blast hit sequence suggests that both N- and C-terminal exons are missing from the model\n
GLEAN3_18534	SPU_018534		Unknown sequence (NNN...) in the 5'region of the current model could make this gene model incomplete.\n
GLEAN3_28736	SPU_028736		There is an excellent, although short, match on Scaffold6547 that is not on the glean3 list. \n \nAlignment with best blast hit data suggest that this model may lack N-terminal sequence.\n
GLEAN3_18838	SPU_018838		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. The fist intron is similar to a coding region of other Sp-Tlr gene, but the second is not.\n
GLEAN3_18928	SPU_018928		471 bp intron was accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction.\n
GLEAN3_19042	SPU_019042		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. The intron was accepted to a coding region.  \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_19834	SPU_019834		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_20996	SPU_020996		Intronless Toll-like receptor with predicted secretory signal peptide, LRR(5 to 10), LRR-CT, TM and TIR.\n
GLEAN3_20997	SPU_020997		Intronless Toll-like receptor with predicted secretory signal peptide, LRR(5 to 10), LRR-CT, TM and TIR.\n
GLEAN3_21162	SPU_021162		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. This gene is at the end of scaffold.\n
GLEAN3_21225	SPU_021225		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_22302	SPU_022302		5' end of the gene (200 aa) is missing from the scaffold end.  There is an allele or a paralogue predicted by GLEAN3_02273 Scaffold4501.  \n
GLEAN3_02273	SPU_002273		This appears to be 575-840 of SpAlpha-J.  Either an allele or a duplication\n
GLEAN3_26617	SPU_026617		Alignment with best blast hit sequence suggests the gene model may lack N-terminal sequence, but otherwise appears to be complete. \n \nThere are excellent matches to sequences in this model on scaffold64641_1, 766-1098 and on scaffold 32233, 2047-2418.  Neither of these is on the glean3 list.\n
GLEAN3_04913	SPU_004913		Alignment with best blast hit sequence suggests that N-terminal sequence may be missing, but otherwise model appears to be complete.\n
GLEAN3_21008	SPU_021008		This appears to be a good prediction for the half of an alphaV,5-like subunit\n
GLEAN3_21395	SPU_021395		Partial Toll-like receptor. This gene model is located at the end of the scaffold.\n
GLEAN3_21415	SPU_021415		546 bp intron and 63bp 5'UTR were accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction.\n
GLEAN3_21787	SPU_021787		Unknown sequence (NNN...) in the 5' upstream of the current model could make this gene model incomplete. 331bp of 5'UTR next to NNN... was accepted to a coding region. The modified gene model has a stop codon, but reflects best gene structure.\n
GLEAN3_15735	SPU_015735		Likely only partial gene.\n
GLEAN3_21936	SPU_021936		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has a stop codon, but reflects best gene structure. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_23090	SPU_023090		The 5' end of this protein seems to be from some other gene, but the mid-3' end corresponds to Prickle.  \n
GLEAN3_25302	SPU_025302		1) GLEAN3_22817 shows a perfect match to the N-terminal region of Sp-Alx1. The scaffold ends within a large intron in the Sp-Alx1 gene. \n2) The best Genbank hit (XP_785238) is to GLEAN3_ 22816, a closely related gene. Note that GLEAN3_22816 and GLEAN3_22817 are on the same scaffold (Scaffold 260).\n
GLEAN3_22451	SPU_022451		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_12766	SPU_012766		This Glean was checked with alignment to A. pectinifera mRNA and peptide.  The full annotation of this gene is in Glean3_12078.  This Glean aligns up with ApIP3R on AAs 1-93.\n
GLEAN3_03264	SPU_003264		haplotype=GLEAN3_08197\n
GLEAN3_02129	SPU_002129		Scaffold 1876 hit the first 352 basepairs (1-352) of Sp-Not mRNA (NM_214562 from NCBI), which might be exon 1. So modified gene feature starts from exon 2. \nFrame number is decided by assuming exon 2 is frame 0. \n
GLEAN3_23033	SPU_023033		#\n429 bp intron was accepted as coding region by comparison to the corresponding FgenesAB, ++ and Genscan prediction.\n
GLEAN3_10698	SPU_010698		Structure similar to Protein Tyrosine Kinase 7 isoform c precursor.\n
GLEAN3_22716	SPU_022716		Partial CDS, lacking N-terminal half, as inferred from alignment with best blast hit sequence,\n
GLEAN3_22909	SPU_022909		468 bp intron was accepted as coding region by comparison to the corresponding FgenesAB, ++ and Genscan prediction.\n
GLEAN3_25078	SPU_025078		Alignment with best blast hit sequence suggests that model is lacking C-terminal 2/3 of gene\n
GLEAN3_12194	SPU_012194	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When examining the excel data and comparing it to the BLAST results, it appears that the all three individual scaffolds have an orderly arrangement within their confines. If all three scaffolds were to be combined, the entire sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong containing a wide distribution of values. 	Alignment with best blast hit sequence suggests that model is lacking C-terminal half and some N-terminal sequence.\n
GLEAN3_24526	SPU_024526		See GLEAN3_26779.  Partial duplication.\n
GLEAN3_19976	SPU_019976		Alignment with best blast hit sequence suggests that model lacks N-terminal half of the gene.\n
GLEAN3_02093	SPU_002093		Alignment with best blast hit sequence suggests that the model lacks both N- and C-terminal sequences. \n \nThere is an excellent, although short, match on Scaffold 1524_1, 455-727, that is not on the glean3 list.\n
GLEAN3_23544	SPU_023544		This gene model may represent a pseudogene or contain a sequence error. Intron and 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_24032	SPU_024032		Alignments and tiling data suggest that two additional exons are possible at 3'end- position 10,010-10,144 and 13,805-13,986. \n
GLEAN3_06768	SPU_006768		Repair polymerase. Conducts "gap-filling" DNA synthesis in a stepwise distributive fashion rather than in a processive fashion as for other DNA polymerases. Has a 5'-deoxyribose-5-phosphate lyase (dRP lyase) activity (By similarity).\n
GLEAN3_24815	SPU_024815		429 bp intron was accepted as coding region by comparison to the corresponding FgenesAB, ++ and Genscan prediction.\n
GLEAN3_25076	SPU_025076		Partial Toll-like receptor. This gene model is located at the end of a small scaffold.\n
GLEAN3_25136	SPU_025136		Unknown sequence (NNN...) in the intron and the small scaffold could make this gene model incomplete. The third exon was accepted as the coding region of a partial Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_25263	SPU_025263		Partial Toll-like receptor. This gene model is located at the end of a small scaffold. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_25312	SPU_025312		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. Intron sequence except NNN is highly similar to other Sp-Tlr genes, so it was accepted as a coding region.  \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_09002	SPU_009002		Exons 8-28 are present on this scaffold772 and GLEAN3_09002 prediction. Exons 1-7 are present on scaffold772 and GLEAN3_20718 prediction\n
GLEAN3_28938	SPU_028938		Alignment to best blast sequence suggests that this model may be complete, possibly lacking sequences at the N-terminal end. \n \nThis is one of 4 very similar NAALADase genes on scaffold 496.\n
GLEAN3_28939	SPU_028939		Alignment with best blast sequence suggests that this model lacks C-terminal sequences. \n \nThis is one of 4 closely related NAALADase genes on Scaffold 496.\n
GLEAN3_28937	SPU_028937	From the excel data and the BLAST results, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it is apparent that there are several internal repeats present resulting in several sequence overlaps. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall with most of the values being less than 10.	Alignment with best blast sequence suggests that this model may be complete. \n \nThis is one of 4 closely related NALAADase genes on Scaffold496.\n
GLEAN3_26274	SPU_026274		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_09215	SPU_009215		This sequence is a composite of the original Glean_09215 plus the Glean_19261 sequences.  Although these sequences are on separate scaffolds, they appear to be complementary.  Alternatively, it is possible that they are from two different forms of Fmi.\n
GLEAN3_15929	SPU_015929		Exons 8-24 are present on this scaffold60165 and modified GLEAN3_15929 prediction.  Modified GLEAN3_15930 and GLEAN3_15931 have been merged into the GLEAN3_15929 prediction. Exons 3-7 are from scaffold41327 and GLEAN3_23590 prediction.  Exons 1-2 are missing.\n
GLEAN3_26275	SPU_026275		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_27163	SPU_027163		This gene model may represent a pseudogene or contain a sequence error. Intron and 5'UTR sequences match coding sequences of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_16406	SPU_016406		CDS of positive strand of this gene align exactly with CDS of the minus strand of Glean3_16409, same scaffold. Sequence between CDS as well as the repeat structure in the two regions is very similar as well. Glean3_16409 could be a haplotype that was put on the wrong scaffold. The contig that it is on is attached to the very end of the scaffold right next to this gene. \n
GLEAN3_18375	SPU_018375		complete gene model : GLEAN3_18375 + GLEAN3_23408\n
GLEAN3_11225	SPU_011225		Alignment with best blast sequence suggests that the model lacks the N-terminal half of the gene.\n
GLEAN3_23984	SPU_023984		Alignment with best blast sequence suggests that the model lacks the N'terminal half of the CDS.\n
GLEAN3_23985	SPU_023985	After reviewing the excel data and the BLAST results, it is evident that this is best fit for this particular GLEAN model. When reviewing the excel data, it was apparent that the sequence had an orderly arrangement without any gaps or repeats present. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with about half the values being greater than 10.	FOUR COMMENTS:  \n \n!) Alignment with best blast sequence suggests that this model may be a complete NAALAD2 gene, possibly lacking short N- and C-terminal sequences. \n \n2) It is linked to a reverse transcriptase elements which may not be part of the gene.  This element is on the following predicted exon: \n>GLEAN3_23985|Scaffold113776|19389|20291| DNA_SRC: Scaffold113776 START: 19389 STOP: 20291 STRAND: +  \nTTTCCTTTCCGACCGTCATCAGGTGGTTCGACATCAAGGCGTGACATCAAGCCCAAAGAGCCTCGCATGC \nGGAGTACCTCAGGGTACAAAGTTGGGACCAATCCTATTCCTTGCCCTTGTTAACGATGCTGCCTTAACGT \nCAACATACCGATGGAAGTATGTTGACGACTTAAGTTTGGTGGAAGTCTTGCCTAAAACCCAGCAAAGTTC \nCTTACAGGAGTACGTTGATGAGCTCGGTGAATGGTGCGCCATTAATGACGTGACGCCAAAGCCCGAAAAA \nTGTAAGGCCATGCAAGTGTCTTTCTTGAAGAATCCTCTTCCTCATTTGGACATCACCATCGCAGATGTTC \nATCTTGAACGTGTTGATTCCTTGACTCTCCTTGGTGTCGCGATCCAATCAGACCTGAAATGGGATAATCA \nGGTCCAACAGATGATCTCACGGGCCGCTCGGAGACTGTACATTCTGAGTGTTCTGAAGAAATCTGGAGTC \nAACGCGAATGATCTAGTAACCATCTACAAAGCGTATATCCGTCCCCTGATGGAATTTGGTGTCCCTGTCT \nGGGGCTCCGGCATTACTAATACGCAGAGTGATAAAATCGAACGAATCCAAAGACGTGCGCTACGTTTCAT \nTGTGTATCCAGCTGACCTCTCCTACACACAACGGCTCACTCGTTTCAACTTGCCTATGTTGTGTGAACGC \nAGGAATGATCTCCTTCTACGCTTTGGACGTGGTCTCCTCAAGTCTGAACGGCATCGTGACATGCTACCTG \nCTACTCGTCAATGTGTCTCTCACCGCAGTTCAACACTGAGAAGTGCTCATCTACTAGACCTACAGCGTTG \nTAAAACCCAACGATATAGGAACTCTGCAATCCCGTTTTTAACACGAATGCTCAATTCTTCCAA \n \n3) This model is adjacent to a closely related gene model, GLEAN3_23984, which lacks N-terminal half sequence. \n \n4) There are many excellent matches to this model in scaffolds that are not on the glean 3 list.  In addition there are 3 copies in the glean3 list.\n
GLEAN3_26984	SPU_026984	From the BLAST results and the excel data, it is evident that the 2 sequences are distributed onto 2 different scaffolds. There is an overlap between the two scaffolds that spans from 807-907. Both scaffolds contain an orderly arrangement without any gaps or internal repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	Alignment with best blast sequence suggests that this model may be missing N-terminal CDS.\n
GLEAN3_14154	SPU_014154		Alignment with best blast sequence suggests that the models is missing N-terminal CDS.\n
GLEAN3_04980	SPU_004980		Alignment with best blast sequence suggets that this model may be missing N-terminal CDS.\n
GLEAN3_14945	SPU_014945		Alignment with best blast sequence suggests that this model may be complete.\n
GLEAN3_27698	SPU_027698		Partial TLR. The locus of the gene model is at the end of the scaffold.\n
GLEAN3_27721	SPU_027721		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. The intron sequence except NNN... is highly similar to other Sp-Tlr genes, so it was accepted to a coding region. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_27798	SPU_027798		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has a stop codon, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_27815	SPU_027815		This gene model may represent a pseudogene or contain a sequence error. 5'UTR sequence doesn't match coding sequence of other Sp-Tlr genes. \nThis is a member of sea urchin-specific Tlr Group IE.\n
GLEAN3_28404	SPU_028404		Intron sequence was accepted to a coding region that could make TIR domain complete. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_21404	SPU_021404		#\nAlignment with best blast hit suggests that this model is missing C-terminal CDS.  The best blast hit is huge, >2200 aa.\n
GLEAN3_06660	SPU_006660		Alignment with best blast sequence suggests model is missing large N- and C-terminal  CDSs.\n
GLEAN3_17421	SPU_017421		There is a family of carbonic anhydrase genes in the sea urchin. These have not  been carefully compared to potential vertebrate orthologs. \n
GLEAN3_00851	SPU_000851		Alignment with best blast sequence suggests that this model lacks the N-terminal half of a huge protein, >2200 amino acids\n
GLEAN3_04135	SPU_004135		A family of carbonic anhydrase-like proteins exists in sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_02570	SPU_002570		Alignment with best blast sequence indicates that this model includes only a small central portion of the CDS.\n
GLEAN3_00871	SPU_000871		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. There are duplicated 600bp senquence in this gene model. \nThis is a member of sea urchin-specific Tlr Group IIA.\n
GLEAN3_07418	SPU_007418		Partial Toll-like receptor. The locus of this gene model is at the end of a scaffold.\n
GLEAN3_07859	SPU_007859		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(14), LRR-CT(2), TM and TIR. \n
GLEAN3_04136	SPU_004136		Annotated gene shows coordinates and sequences of the "long form" of SpP19. There is a shorter, alternatively spliced form (see AF519413).\n
GLEAN3_11823	SPU_011823		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 18), LRR-CT(2) and TIR. \n
GLEAN3_14191	SPU_014191		This gene model was modified based on Fgenesh++ prediction. Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(24), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_20259	SPU_020259		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.This is a member of sea urchin-specific Tlr Group IB.  \n
GLEAN3_24404	SPU_024404		Unkown sequence (NNN) in the intron could make the gene model incomplete. Modified model was accepted as an intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(12 to 22), LRR-CT, TM and TIR.  \n
GLEAN3_24960	SPU_024960		Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT and TIR. \nThis gene model may represent a pseudogene or contain a sequence error, but reflect best gene structure. \nThis is a member of sea urchin-specific Tlr Group ID. \n
GLEAN3_27222	SPU_027222		#\nThe gene model was accepted as a intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(11 to 20), LRR-CT, TM and TIR.\n
GLEAN3_02174	SPU_002174		Alignment with best blast sequence suggests that this model is missing C-terminal CDS.  The best blast hit is >2200 amino cds and the model is 1492.\n
GLEAN3_26537	SPU_026537		Alignment with best blast seuence suggests model may be missing a small C-terminal segment. \n \nThis model is adjacent to a short segment of a closely related gene, GLEAN3_26538, on Scaffold107330.\n
GLEAN3_26538	SPU_026538		Alignment with best bast sequence suggests that the model contains only a small N-terminal segement of the CDS. \n \nThis model is adjacent to a very similar gene, GLEAN3_26537, on Scaffold107330.\n
GLEAN3_23590	SPU_023590		Refer to GLEAN3_15929 for complete REJ4 gene features\n
GLEAN3_19049	SPU_019049		Alignment with best blast sequence suggests that this model contains only  a short segment, which is repeated in human multifunctional protein CAD.\n
GLEAN3_13209	SPU_013209		Alignment with best blast sequence suggests this model could be complete if the predicted N-terminal exon which is not conserved is correct.\n
GLEAN3_21494	SPU_021494		Alignment with best blast sequence shows that this model contains only a short conserved segment. \n \nThere is a match to sequences on Scaffold72931_1, 262-413, that is not on the GLEAN3 list.\n
GLEAN3_09429	SPU_009429		Alignment with best blast sequence suggests that this model may be missing a short N-terminal segment and a longer C-terminal segement.\n
GLEAN3_25325	SPU_025325		Alignments with best blast hits suggest that the gene model may be correct.\n
GLEAN3_28671	SPU_028671		Alignment with best blast seuence suggests that this model contains only a short conserved sequence.  The following exons are considered unlikely to be part of this model. \n>GLEAN3_28671|Scaffold36273|17615|17788| DNA_SRC: Scaffold36273 START: 17615 STOP: 17788 STRAND: +  \nTTAGATACCTGGAACCTGGGCAAGTAAGTTGACGTAAGAGCAAGTAATCTTGATCTCACAAGTAATCTTG \nATCTCACAAGTAATCTTGATCTCACAAGTAACCTTGATCTCACAAGTAATCTTGATCTCACAAGTAATCT \nTGATCTCACAAGTAATCTTGATCTCACAAGTAAC \n>GLEAN3_28671|Scaffold36273|17825|18165| DNA_SRC: Scaffold36273 START: 17825 STOP: 18165 STRAND: +  \nCTTGATCTCACAAGTAACCTTGATCTCACTAGTAATCTTGATCTCACAAGTAATCTTGATCTCACAAGTA \nACCTTGATCTCACAAGTAACCTTGATCTCACAAGTAACCTTGATCTCACAAGTAATCTTGATCTCACAAG \nTAATCTTGATCTCACAAGTAATCTTGATCTCACAAGTAATCTTGATCTCACAAGTAATCTTGATCTTACA \nACTAACCTTGATCTCACAAGTAACCTTGATCTCACAAGTAATCTTGATCTCACAAGTAACCTTGATCTCA \nCAAGTAATCTTGATCTCACAAGTAATCTTGATCTCACAAGTAACCCTTACTGTTATCCTTC \n
GLEAN3_00375	SPU_000375		Partial Toll-like receptor. The previous contig (separated by 1056 bp of NNN) contains a part of a common TLR structure. \nThis is a member of sea urchin-specific Tlr Group ID.  \n
GLEAN3_01458	SPU_001458		This gene model may be a short Toll-like recepter.  5'UTR sequence doesn't match coding sequence of other Sp-Tlr genes.\n
GLEAN3_01650	SPU_001650		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence  has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IA. \n
GLEAN3_01862	SPU_001862		#\nPartial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n \n
GLEAN3_01993	SPU_001993		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 99% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n \n
GLEAN3_19349	SPU_019349		#\nThis gene was annotated based on a manual inspection of protein alignments. \n \nTwo other adjacent models (GLEAN3_19350 and GLEAN3_19351) code for very similar sequences. It is yet to be determined to which extent this reflects true gene duplications or assembly problems. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_02803	SPU_002803		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IC. \n
GLEAN3_03419	SPU_003419		Partial Toll-like receptor. This gene model is located at the end of a short contig. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n \n
GLEAN3_03846	SPU_003846		#\nPartial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n \n
GLEAN3_04655	SPU_004655		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. This is a member of sea urchin-specific Tlr Group I(orphan). \n \n
GLEAN3_05871	SPU_005871		#\nThis gene was annotated based on a manual inspection of protein alignments. \n \nThere might be some missing N-terminus sequence in this model, as suggested by an incomplete alignment to homologous vertebrate sequences, and by signal from the tiling array upstream of the first annotated exon that do not correspond with any other models. Such potential missing sequence was searched computationally, but with no success.\n
GLEAN3_13950	SPU_013950		This gene was annotated based on a manual inspection of protein alignments. \n \nWe named this gene "Sp-Il1R-rs1" because no other significant Blast hit could be found for it, and because of its overall domain composition, which closely resembles that of a typical IL-1 receptor. In fact, among the Blast hits obtained for this sequence, the one that spans most of the query corresponded to a Gallus gallus predicted IL1RAcP.\n
GLEAN3_05148	SPU_005148		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 97% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_07991	SPU_007991		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. A part of the intron sequence is highly similar to other Sp-Tlr genes, so it was accepted a coding region and the second exon was eliminated. \n
GLEAN3_08229	SPU_008229		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_09343	SPU_009343		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_09459	SPU_009459		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_09933	SPU_009933		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_02467	SPU_002467		Same gene as GLEAN3_02025. All annotation information is documented there. \n
GLEAN3_09952	SPU_009952		 The nucleotides of the coding and 3'UTR sequence have 99% identity to those of GLEAN3_10695. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_10320	SPU_010320		This gene model may represent a pseudogene or contain a sequence error. 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure.\n
GLEAN3_10680	SPU_010680		This gene model may represent a pseudogene or contain a sequence error. 5'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene model.\n
GLEAN3_10693	SPU_010693		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group I(orphan).\n
GLEAN3_11277	SPU_011277		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 93% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_05983	SPU_005983		This gene was annotated based on a manual inspection of protein alignments. \n \nAnother glean model (GLEAN3_12845) codes for a very similar sequence (92% identity). It is yet to be determined to which extent this corresponds to a true gene duplication or a problem with the assembly. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_19350	SPU_019350		This gene was annotated based on a manual inspection of protein alignments. \n \nTwo other adjacent models (GLEAN3_19349 and GLEAN3_19351) code for very similar sequences. It is yet to be determined to which extent this reflects true gene duplications or assembly problems. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_19351	SPU_019351		This gene was annotated based on a manual inspection of protein alignments. \n \nTwo other adjacent models (GLEAN3_19349 and GLEAN3_19350) code for very similar sequences. It is yet to be determined to which extent this reflects true gene duplications or assembly problems. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_11328	SPU_011328		This gene model may represent a pseudogene or contain a sequence error. Intron and 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shifts, but reflects best gene structure. \n
GLEAN3_12844	SPU_012844		This gene was annotated based on a manual inspection of protein alignments. \n \nThis prediction contains a sequence that seems a triplication of the sequence from an adjacent model (GLEAN3_12845). It is yet to be determined if this reflects a true exon multiplication/gene duplication or problems with the assembly. In addition, another model (GLEAN3_05983) codes for a very similar sequence (92% identical). Again, it is yet to be determined whether this reflects a true gene duplication or assembly problems. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_22838	SPU_022838		This gene was annotated based on a manual inspection of sequence alignments. \n \nThere seems to be an annotation problem with this gene: the first two exons, located in one contig, are highly similar in sequence to the last three exons, which are located in a separate contig of the same scaffold. Since no other models map between these contigs, and since they lie at the end of the scaffold, this may represent a case of exon amplification, or it may be a case of erroneous assembly (haplotypes?) that led to a duplicated sequence within this model. We have not modified the present model because we have no independent evidence to support either claim. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_12845	SPU_012845		This gene was annotated based on a manual inspection of protein alignments. \n \nAn adjacent prediction (GLEAN3_12844) contains a sequence that seems a triplication of this model. It is yet to be determined if this reflects a true exon amplification/gene duplication case or problems with the assembly. In addition, another model (GLEAN3_05983) codes for a very similar sequence (92% identical). Again, it is yet to be determined whether this reflects a true gene duplication or assembly problems. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster. For consistency purposes, therefore, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F).\n
GLEAN3_11481	SPU_011481		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_27904	SPU_027904		#\nThis gene was annotated based on a manual inspection of protein alignments. \n \nSeveral Sp-IL17 genes were identified computationally. Each of them shows a best Blast-p hit to specific vertebrate IL17 family members. However, multiple sequence alignments demonstrated that almost all the Sp-IL17 sequences segregate from their vertebrate counterparts as a separate cluster, except for this model that co-distributed with vertebrate IL17E (or IL25). For consistency purposes, however, we decided to arbitrarily assign numerical identifiers to all the Sp-Il17 genes, and thus avoid the implication that they all have specific vertebrate orthologs (which are classified by the letters A-F). At this point, a more careful analysis is needed to determine whether Sp-Il17-8 is indeed an ortholog of IL17E.\n
GLEAN3_11541	SPU_011541		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. 2nd exon was eliminated based on BLASTN search. \n
GLEAN3_13111	SPU_013111		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_13162	SPU_013162		#\nPartial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 97% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_14352	SPU_014352		This gene model may represent a pseudogene or contain a sequence error. 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shifts, but reflects best gene structure.\n
GLEAN3_14548	SPU_014548		This gene model may represent a pseudogene or contain a sequence error. 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shifts, but reflects best gene structure. \n
GLEAN3_14929	SPU_014929		#\nThis gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model contains some frame shifts, but reflects best gene structure. \n
GLEAN3_15553	SPU_015553		#\nPartial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IB.  \n
GLEAN3_16388	SPU_016388		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. Intron sequence except NNN... matches a coding region of other Sp-Tlr genes. Modified gene model contains some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_22815	SPU_022815		similar to G-protein coupled receptor 64 precursor  \n            (Epididymis-specific protein 6) (He6 receptor)\n
GLEAN3_02587	SPU_002587		S.purpuratus elongation factor 1B gamma cDNA cloned (AJ973179) \n
GLEAN3_15867	SPU_015867		#\nS.purpuratus EF1B alpha cloned (AJ973180) \n
GLEAN3_15285	SPU_015285		Part of this sequence is also contained in GLEAN3_26576.  \n"Additional evidences of the existence of the gene" have been obtained in Sphaerechinus granularis\n
GLEAN3_16438	SPU_016438		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 98% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_16554	SPU_016554		#\nPartial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 97% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_17735	SPU_017735		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_17794	SPU_017794		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_18380	SPU_018380		This gene model may represent a pseudogene or contain a sequence error. 5'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shift, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group ID. \n
GLEAN3_20428	SPU_020428		Partial Toll-like receptor. This gene model is located at the end of a contig. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \n
GLEAN3_20644	SPU_020644		#\nUnknown sequence (NNN...) in the intron of the current model could make this gene model incomplete. But the second exon was eliminated based on BLASTN search. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_11299	SPU_011299		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_20652	SPU_020652		Unknown sequence (NNN...) in the 3'and 5' UTR of the current model could make modified gene model still incomplete.  \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_17901	SPU_017901		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nThis gene model was modified based on an overlapping FgeneshAB prediction which extends the protein sequence of this model and improves its alignment with related sequences from other phyla. Nonetheless, this model seems still incomplete after this modification (N-ter sequence missing). The model is situated in a small scaffold, and that likely accounts for the missing information. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_20124	SPU_020124	After reviewing the data and performing a BLAST search, it appears that there is no sufficient match for this particular GLEAN model. There is a large gap present from 319-844 that is indicated by both the BLAST results and the excel data. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the values ranging from about 5-9	Sp-Elf has two splice variants differing in the 5' region: \nSp-Elf A       GLEAN3_20124 \nSp-Elf B       GLEAN3_20123 \n
GLEAN3_12071	SPU_012071		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nThe best Blast hit for this gene corresponds to a nematode MIF-like protein, and slightly worse hits correspond to vertebrate dopachrome tautomerase genes, which are closely related to MIFs. For this reason, we have arbitrarily named this gene Sp-Mif-like1.\n
GLEAN3_19323	SPU_019323		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nThe best Blast hit for this gene corresponds to human dopachrome tautomerase (a.k.a. phenylpyruvate tautomerase II), a gene very closely related to MIF (phenylpyruvate tautomerase). For this reason, we have arbitrarily named this gene Sp-Mif-like2. \n \nNote that the genome-wide tiling array data correlate with the exon structure of this model, but that they also indicate high expression levels of a genomic region that falls in the second intron of this model. No other models cover the region, and it is unclear at this point what might account for these observations.\n
GLEAN3_20035	SPU_020035		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nThe genome-wide tiling array data correlate with the exon structure of this model. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_20036	SPU_020036		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nNote that the genome-wide tiling array data correlate with the exon structure of this model, but that they also indicate high expression levels of a genomic region that falls in only intron of this model. This region, however, is included in a Genescan model (Supertig1576_6) on the opposite strand, suggesting it may reflect the expression of an overlapping gene. This is yet to be determined experimentally. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_16226	SPU_016226		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nNote that the genome-wide tiling array data correlate with the exon structure of this model, but that they also indicate expression of a genomic region that falls in the second intron of this model. No other models cover the region, and it is unclear at this point what might explain this observation. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_01152	SPU_001152		This gene was annotated based on a manual inspection of multiple protein alignments. \n \nThe genome-wide tiling array data correlate with the exon structure of this model. Note, however, that there are overlapping Fgenesh++/AB predictions that incorporate more C-ter sequence than this glean model, which is also supported by the genome-wide tiling array data. Since we have no experimental evidence to favor either model, we have accepted the glean model in its present form. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
Mif-2	SPU_030001		This gene was created based on an NCBI prediction identified based on a manual inspection of multiple protein alignments. The NCBI prediction was accepted with no modifications. However, notice that this model is incomplete (CDS has no stop codon), which is likely due to the fact that it is located in a small scaffold. \n \nSeveral gene models related to this one were found. A multiple sequence alignment with various vertebrate and nematode related proteins failed to strongly support any homologies among them, and we have therefore arbitrarily numbered the members of this S.purpuratus family of genes.\n
GLEAN3_19408	SPU_019408		This model was annotated based on a manual inspection of multiple protein sequence alignments.\n
GLEAN3_10703	SPU_010703		This prediction includes Semaphorin 5'-terminus domain (predicted exon 1-5) and UDP-glucuronosyltransferase (predicted exon 6).  See GLEAN3_22057 (scaffold 181) for the other part of Semaphorin sequence. \n \nThis prediction maps to Scaffoldi5629 and a hand editted genewise prediction identifies a gene with the architecture  NH2, SEMA, PSI, TM, COOH.  This sequence is a class 6 Sema and the peptide sequence in Gene Sequences has been updated.(RDB 3 May06)\n
GLEAN3_21931	SPU_021931		GLEAN3_16610 (scaffold 50410) is also likely ortholog of AADC, but the prediction is missing 40 aa that is consistent with predicted exon 3 of GLEAN3_21931.  \n
GLEAN3_16610	SPU_016610		This prediction is missing 40 aa.  GLEAN3_21931 is also likely ortholog of AADC.  \n
GLEAN3_03528	SPU_003528		Part of this sequence is also contained in GLEAN3_11191. Exon 3 and 4 are missing in GLEAN3_11191. \n"Additional evidences of the existence of the gene" have been obtained in Sphaerechinus granularis\n
GLEAN3_06778	SPU_006778		This gene is on two scaffolds (59529 and 1777). On scaffold59529, one GLEAN model is predicted for this gene (GLEAN3_06778 for exon 1-5). On scafold 1777, there is one GLEAN model (GLEAN3_08535 for exon 3-24) prediceted for this gene. \nPlease refer to GLEAN_08535 for refined gene features.\n
GLEAN3_08535	SPU_008535		This gene is on two scaffolds (59529 and 1777). On scaffold59529, one GLEAN model is predicted for this gene (GLEAN3_06778 for exon 1-5). On scafold 1777, there is one GLEAN model (GLEAN3_08535 for exon 3-24) prediceted for this gene. \n
GLEAN3_15654	SPU_015654		#\nOne more exon was accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction.\n
GLEAN3_07240	SPU_007240		Alignment with best blast sequence suggests that the model is good.\n
GLEAN3_19579	SPU_019579		Blast alignments suggest that the model is good.\n
GLEAN3_19610	SPU_019610		Blast alignments suggest that the model is good. \n \nThere are two excellent matches defined  by Genscan that are not on the glean3 list.  The first on scaffold 2001_1, 1128-31566, is likely to be a haplotype sequence.  The second is on scaffold 102441_1, 594-10744.\n
GLEAN3_22407	SPU_022407		partial CDS;  \n \nThere are two excellent matches defined by Genscan that are not on the glean3 list.  Scaffold2--1_1, 1128-31455 and Scaffold102411_1, 591-10744\n
GLEAN3_02939	SPU_002939		Long sequence presenting homologie with Neurofilament Heavy Polypeptide in N-term and acin1 in C-terminal \nFor SpAcin1 see duplication gene Glean_02578\n
GLEAN3_10966	SPU_010966		Blast alignment suggests that the model is good.\n
GLEAN3_07208	SPU_007208		THREE COMMENTS: \n \n1) Alignment with best blast sequence suggests that N-terminal sequences are missing in the model.   \n2) They also suggest that an internal exon is missing between the following two exons. \n>GLEAN3_07208|Scaffold38521|27220|27355| DNA_SRC: Scaffold38521 START: 27220 STOP: 27355 STRAND: +  \nTGAGCTGAGTAAAAGCATTCATCCCTCCTCCTCTCATTGCTGAGCGTATGATGTAGAAGAGAAAACCCAG \nAACAAGGACTGTTCCCAGGATACTCATCATTCCATCAGAGCTAGAATGCTGGTAACGGATCTGAAT \n>GLEAN3_07208|Scaffold38521|28790|28906| DNA_SRC: Scaffold38521 START: 28790 STOP: 28906 STRAND: +  \nCTCTCTACCAAAGACGATAGCACCATCATGCAGGAAGACATGGACTACATCAGACTCAGGATTCTTGCTC \nTCTCCACTGCCATCAGTATCATAATACGTTACTGACACATCCTTCAC \n3) There are matches to the model sequence on scaffolds 115226_1, 13 to 446 and 118973_1, 377 to 537. \n \n
GLEAN3_25264	SPU_025264		Alignment to best blast sequence suggests that the model is missing an internal exon, N-terminal sequence and C-terminal sequence.  See notes for GLEAN3-07208, which probably is the same gene, concerning the internal exon. \n \n
GLEAN3_07374	SPU_007374		#\nBlast alignments suggest that the model is good.  Sequence similarity with Hs sequence is 89% at amino acid level!\n
GLEAN3_02234	SPU_002234		Alignment with best blast sequence suggests that model is good.  88% amino acid identity over 389 residues!!\n
GLEAN3_16198	SPU_016198		Alignment with best blast sequence suggests that the N-terminal sequence in the model is not conserved although the remainder of the protein is 90% identical.  The N-terminal sequence cannot be confirmed by blast alignment.\n
GLEAN3_28593	SPU_028593		Alignment with best blast sequence suggests that the N-terminal sequence in the model is not conserved, altough the remainder of the protein is 90% identical.  Cannot confirm that the N-terminal sequence is correct.\n
GLEAN3_18934	SPU_018934		Alignment with best blast sequence suggests assembly problems with this model.  N-terminal half sequences in the model match C-terminal half sequences in the best blast hit and C-terminal half in the model match N-terminal sequences in the best blast hit.  The N-terminal sequence in the model is not conserved and cannot be verified.\n
GLEAN3_19415	SPU_019415		Alignment with best blast sequence suggests that an exon is missing between the following predicted exons: \n>GLEAN3_19415|Scaffold469|23524|23696| DNA_SRC: Scaffold469 START: 23524 STOP: 23696 STRAND: +  \nGCTAAGATGGATGAGCTTCAGCTCTTCCGTGGAGACACAGTCATGCTCAAAGGCAAGAAAAGGCGAGACA \nCCGTCTGCATTGTACTCTCAGATGACACCGTAACAGATGACAAGATTCGTGTCAACCGAGTTGTCAGGAG \nTAATCTTCGCGTTCGTCTAGGAGACATTGTCAG \n>GLEAN3_19415|Scaffold469|25849|26041| DNA_SRC: Scaffold469 START: 25849 STOP: 26041 STRAND: +  \nAAACCTCTTTGATGTATACCTGAGGCCGTACTTCCAGGAGGCGTACCGCCCCGTCAGGAAAGGTGACATC \nTTTCAAATCCGTGGAGGCATGAGGGCGGTAGAATTCAAAGTGGTGGAAACAGACCCCGGACCATACTGCA \nTCGTTTCACCTGATACAGTCATACACTTTGAGGGAGATGCAATCAAGCGAGAG \n
GLEAN3_22919	SPU_022919		Alignment with best blast sequence suggests that the only a small segment (~10%) of the model is conserved with the best blast hit. \n \nAlignment of the Genscan model shows a much longer alignment suggesting that GLEAN3_22918, 22919, 22920 and 22921 should be combined in one model.  However two exons in the Genscan model are not conserved and cannot be confirmed.  They are: \n>Supertig39397_1|Scaffold39397|21179|21262| DNA_SRC: Scaffold39397 START: 21179 STOP: 21262 STRAND: +  \nGAGTGCATCCAACAGCTGACGTCAGATGATGCGTGGTATCCGTGCGGAGAAGGGCGCGAAAATTCAACAG \nATTATTGGCTGAAG \n>Supertig39397_1|Scaffold39397|26430|26602| DNA_SRC: Scaffold39397 START: 26430 STOP: 26602 STRAND: +  \nGGGCTATCTGCAAGGCTAAATGTCGTTTGGGTCGGTCATACCAGATTGCCCTGGAGGGTGCTACCTACAG \nAGAGCGGGGGCTGCTGGGAAGGCATGAGGTCATCCTGTGCACGGCCGGTGGTGTCTGGTACCCCAACCTC \nGACCAGATAGTATGTCATGAAAAATGCTTGGAG\n
GLEAN3_18513	SPU_018513		Alignment with best blast sequence verifies that all but the N-terminal sequence is conserved.   This is another haplotype copy of the combination GLEAN3_22918, 22919, 22920 and 22921 (see Genscan 61252_1 or annotation notes for GLEAN3_22919)\n
GLEAN3_12016	SPU_012016		Alignment with best blast sequence shows that only part of the gene is present on this scaffold, 1962.\n
GLEAN3_06212	SPU_006212		Alignment with best blast sequence shows that model does not include N-terminal sequence.\n
GLEAN3_13070	SPU_013070		Alignment with best blast sequence suggest that the N-terminal exon may not be part of the protein; it is not conserved and the remainder of the model matches the entire length of the best blast hit.  The questionable exon is: \n>GLEAN3_13070|Scaffold31758|4248|4352| DNA_SRC: Scaffold31758 START: 4248 STOP: 4352 STRAND: +  \nTCAGGCTGCAGAAACAGCAGGAATAGACTTTGTGATGACAGCTCTTTGAGAAGGGATTCTGTGAGGTAGC \nTCAGGGCAGTTGCAGTACTGTCTAACCTGGATGGC\n
GLEAN3_01178	SPU_001178	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The sequence doesn't begin until the 12th base pair and continues until it reaches 989. There is an overlap between the 2 scaffolds at 914-989, but from there the rest of the sequence is completed. Excluding this overlap, if the 2 scaffolds were combined the sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with values being 5 or greater.	Alignment with best blast sequences shows that that all but a short N-terminal region the model is conserved; an internal exon consisting of a string of serines may be missing.\n
GLEAN3_16511	SPU_016511		Alignment with best blast sequence suggests that the model may lack N- and C-terminal sequence; conserved sequences are located in the central region of the best blast hits.\n
GLEAN3_10215	SPU_010215		Alignment with best blast sequence suggests that the model lacks conserved N-terminal sequence.\n
GLEAN3_16327	SPU_016327		Alignment with best blast sequence suggests that the model is nearly complete, but lacks an internal exon between the following predicted exons: \n>GLEAN3_16327|Scaffold102457|14501|14625| DNA_SRC: Scaffold102457 START: 14501 STOP: 14625 STRAND: +  \nTTGAGAGCCTCTCTTGAGAGAGACAGGTAGATTCCTACACACTCTGCTCTACATTCTTCATAAGGTGAGG \nCAATCACAGGAAACTTGGAATCCCAAACCTCTCCTGGCATGTACCATGATGAGAT \n>GLEAN3_16327|Scaffold102457|18405|18511| DNA_SRC: Scaffold102457 START: 18405 STOP: 18511 STRAND: +  \nCTTGTCTGTATCTGTCAAGAAGGTGATCGTCTGATCTTTACCGTAAGCAGACAGCACATTGCCTAGAGAT \nACATTCTTAAATCCTTCATCCTGACGAATGTCATCAT\n
GLEAN3_28721	SPU_028721		TWO COMMENTS: \n1) partial CDS; lacking C-terminal half and possibly short N-terminal region. \n2) There is a match to a short segment of this model on Scaffold40239_1, 687-798, that is not on the glean3 list.\n
GLEAN3_04886	SPU_004886		partial CDS; lacking both N- and C-terminal sequences, probably because this model is on a short scaffold.\n
GLEAN3_26005	SPU_026005		Alignment with best blast sequence suggest that the model is good. \n  \n
GLEAN3_15187	SPU_015187		TWO COMMENTS: \n \n1) Alignment of best blast sequence suggests that the model is good. \n2) There is an excellent match on Scaffold131610, 127-321 that is not on the glean 3 list.\n
GLEAN3_00976	SPU_000976		partial CDS, lacking C-terminal half\n
GLEAN3_23965	SPU_023965		TWO COMMENTS: \n \n1) Alignment with top blast sequences suggusts the model is good. \n2) There is an excellent match to a portion of this model on Scaffold17468_1, 217-384.\n
GLEAN3_23686	SPU_023686		Alignment with best blast data and blasts with individual exons strongly suggest that the only exons in the model that should be included are: \n>GLEAN3_23686|Scaffold1004|11066|11134| DNA_SRC: Scaffold1004 START: 11066 STOP: 11134 STRAND: +  \nATGGCGGATGAGCGCTACGTCATTGACGTTCTGGTTTGTTGTTGTCAAGAGATAAGGGTAGGGTTGCAT \n>GLEAN3_23686|Scaffold1004|143554|143633| DNA_SRC: Scaffold1004 START: 143554 STOP: 143633 STRAND: +  \nGTCGCAGTGGACATGGAGTTTGCCAAGAACATGTTTGAGTTACATAAAAAGGTGAACTCATGGGAGAACA \nTTGTAGGATG \n>GLEAN3_23686|Scaffold1004|143857|143994| DNA_SRC: Scaffold1004 START: 143857 STOP: 143994 STRAND: +  \nGTATGCAACAGGACGTGACATCACAGGTCATTCAGTGCTGATACACGACTACTACTCAAGAGAGTGTCAA \nAACCCGATTCACGTCACGGTCGATACAACGATGGTAGACCTCAATATGTCAGTCAAGACATGGGTTAG \n>GLEAN3_23686|Scaffold1004|144623|144714| DNA_SRC: Scaffold1004 START: 144623 STOP: 144714 STRAND: +  \nGCAAAATATGGGCGTACCAGACAAGTCACAAGGCACCGTGTTCATTCCAATTCCCATGAAAATCTCCTTC \nCATCAACCAGAGAAAGTAGCAA \n>GLEAN3_23686|Scaffold1004|145417|145553| DNA_SRC: Scaffold1004 START: 145417 STOP: 145553 STRAND: +  \nTGGATGCGCTTATAAGGGAGACAGAACCAAACAGAAAAACCATTGAGTTGACGACCGATCTTCAGTACGT \nGTCTAAAGCCTCTGGTAAACTTCAAGAGATGTTGACAAGAGTGCTCCAGTATGTTGATGATATCCTG \n>GLEAN3_23686|Scaffold1004|146767|146880| DNA_SRC: Scaffold1004 START: 146767 STOP: 146880 STRAND: +  \nAGTGGAAAGATTCAAGCCGATAACCAGATTGGCCGGTTTCTGATGAATCTAGTTTCCAATGTTCCTAAGT \nTGCAGCCTGATGAGTTTGATGAGATGCTCAATAACAGTATGAAG \n>GLEAN3_23686|Scaffold1004|147401|147496| DNA_SRC: Scaffold1004 START: 147401 STOP: 147496 STRAND: +  \nGATCTTCTGATGGTAGTCTACCTGTCCGGTCTGATTGAGACCCAGCTCACTCTCAACGAAAAGCTGACGT \nTATCCAAAGCAGCTAATGCAGTTGCA \nOthers blast to either different proteins or nothing.\n
GLEAN3_02210	SPU_002210		"Additional evidences of the existence of the gene" have been obtained in Sphaerechinus granularis\n
GLEAN3_12936	SPU_012936		SubgroupB thrombospondin. Large gap in the middle of the gene.\n
Sp-ACE I-like	SPU_030002		Genes or parts of genes predicted by Genscan that are not in the Glean3 list.\n
Sp-TAF1-like	SPU_030003		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
Sp-leishmanolysin-like	SPU_030004		#\nGenes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-TAF2-like	SPU_030005		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
GLEAN3_13106	SPU_013106		This may be 2 concatenated genes because of the (TSP3)n-TSP_C-LamGL domain architecture. It's also likely to be a partial gene model missing the N-terminus due to end of contig.\n
Sp-MT4-mmp-like	SPU_030006		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-mmp2-like	SPU_030007		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-mmp27-like	SPU_030008		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-matrix metallopeptidase2-like	SPU_030009		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-matrix metallopeptidase3-like	SPU_030010		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-matrix metalloproteinase2-like	SPU_030011		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-matrix metalloproteinase3/14-like	SPU_030012		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_17370	SPU_017370		This gene encodes a protein with an unusual domain architecture. The N-terminus contains: \nPfam:F5_F8_type_C-Pfam:GCC2_GCC3 - which occurs in metazoa \n \nthe C-terminus looks like a subgroupB thrombospondin. Therefore, it might be a concatenation of 2 genes or a novel urchin architecture.  \nThere is no est evidence or anything else to justify spliting it into 2. The NCBI gene model may be more accurate than the GLEAN3 model.\n
Sp-meprinA-like	SPU_030013		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-TLL1-like	SPU_030014		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_22667	SPU_022667		N-terminus may be truncated due to end of contig\n
Sp-tolloid2-like	SPU_030015		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-tolloid1-like	SPU_030016		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-meprinA, beta-like	SPU_030017		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAMTS6-like	SPU_030018		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAMTS6 metalloprotease-like	SPU_030019		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
Sp-metagidin-like	SPU_030020		#\nGenes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAM10-like	SPU_030021		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAMTS1-like	SPU_030022		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAM10 metallopeptidase-like	SPU_030023		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ADAMTS1 metallopeptidase-like	SPU_030024		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
Sp-ADAMTS1 metalloprotease-like	SPU_030026		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-endothelin converting enzyme1-like	SPU_030027		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ECE1-like	SPU_030028		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-carboxypeptiaseD-like	SPU_030030		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-aminopeptidase1-like	SPU_030031		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-dipeptidease-like	SPU_030032		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ACY1L2-like	SPU_030033		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-aminocyclase-like	SPU_030034		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_00959	SPU_000959		Glean3_0059 corresponds to N-ter domain of Sp-EF1B delta. \nGlean3_0059 contains three exons :  \nexon1 : scaffold1943|194310|194423|strand(-);   \nexon2 : scaffold1943|191630|191707|strand(-);  \nexon3 : scaffold1943|189643|189735|strand(-); \n \nTwo isoforms are expressed in sea urchin (Y14235 and AJ973181). Y14235 does not contain exon2. \n \nThe C-ter domain of Sp-EF1B delta is encoded by Glean3_00960 \nThe entire sequence is given in annotation for GLEAN3_00960 \n \n
GLEAN3_12281	SPU_012281		#\nSame sequence as GLEAN3_11711 except the most N-terminal part. \nDuplication most likely due to assembly process.\n
Sp-aminocyclase1-like	SPU_030035		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_11711	SPU_011711		Same sequence as GLEAN3_12281 except the most N-terminal part. \nDuplication most likely due to assembly process.\n
Sp-O-sialoglycoprotein endopeptidase-like	SPU_030036		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_09593	SPU_009593		Binds the stem-loop structure of replication-dependent histone pre-mRNAs and contributes to efficient 3' end processing by stabilizing the complex between histone pre-mRNA and U7 small nuclear ribonucleoprotein (snRNP). Could play an important role in targeting mature histone mRNA from the nucleus to the cytoplasm and to the translation machinery. Stabilizes mature histone mRNA and could be involved in cell-cycle regulation of histone gene expression (By similarity). \n
GLEAN3_24780	SPU_024780		see anotation to GLEAN3_09593\n
Sp-O-peptidaseD-like	SPU_030037		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-O-NAALAD2-like	SPU_030038		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
Sp-N-acetylated alpha-linked acidic dipeptidase 2-like	SPU_030039		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_20741	SPU_020741		This gene model may represent a pseudogene or contain a sequence error. 3'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin specific-Tlr Group ID.\n
Sp-N-acetylated alpha-linked acidic dipeptidase 1-like	SPU_030040		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-NAALAD2 metalloprotease-like	SPU_030041		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-NAALAD2-like	SPU_030042		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-NAALAD2-like protease	SPU_030043		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_21075	SPU_021075		This gene model may represent a pseudogene or contain a sequence error.5'UTR sequence matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
Sp-aspartate transcarbamylase-like	SPU_030044		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_24092	SPU_024092		Same sequence as GLEAN3_00814 except the most N-terminal part. \nDuplication most likely due to assembly process.\n
Sp-carbamoylphosphate synthetase-like	SPU_030045		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_00814	SPU_000814		Same sequence as GLEAN3_24092 except the most N-terminal part. \nDuplication most likely due to assembly process.\n
Sp-carbamoylphosphate	SPU_030046		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_21283	SPU_021283		The nucleotide sequence of this gene model has 100% identity to GLEAN3_02442. It may be caused by an assembly error. \n
GLEAN3_20322	SPU_020322		#\nmany similarities to plant HSP-90 also; see also (Glean3_20322) looks to be the same as (Glean3_01586) and above Glean\n
GLEAN3_21362	SPU_021362		Partial Toll-like receptor. This gene model is located at the end of a short scaffold.\n
Sp-allantoinase-like	SPU_030047		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_21502	SPU_021502		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IA.\n
Sp-YME1-like	SPU_030048		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-paraplegin-like	SPU_030049		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_23179	SPU_023179		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_23491	SPU_023491		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 99% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_24501	SPU_024501		Partial Toll-like receptor. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_28424	SPU_028424		this glean prediction contains only the c-term (tyrosine kinase domain) of the protein. it seems that glean3_07624 contains the N-term (extracellular domain).\n
GLEAN3_07624	SPU_007624		this glean prediction contains only the N-term (extracellular domain) of the protein. it seems that glean3_28424 contains the C-term (tyrosine kinase domain).\n
GLEAN3_24590	SPU_024590		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 97% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_24847	SPU_024847		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 96% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_23989	SPU_023989	After reviewing the excel data and performing a BLAST search, it appears that the sequence is distributed onto 4 different scaffolds. There are numerous gaps within the sequence and poor overall coverage. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5	Exons 3-7 are from this scaffold533 and GLEAN3_23989 prediction except for exon 5 which was only predicted by the Fgenesh++ prediction. Exon 2 is from scaffold65249 with no tracks predicting it. Exon 1 is incomplete and present on scaffold137005 with no tracks predicting it. Exons 8-19 are from  scaffold57107 and GLEAN3_20620 prediction. Refer to GLEAN3_20620 for the complete gene features of REJ2.\n
GLEAN3_27735	SPU_027735		792bp nucleotides of 5'UTR that was highly similar to other Sp-Tlr genes was accepted to a coding region by BLASTN search. Modified gene model is located at the end of a contig, which could make the model still incomplete. \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_20718	SPU_020718		Exons 1-7 are present on this scaffold772 and GLEAN3_20718 prediction. Exons 8-28 are present on scaffold772 and GLEAN3_09002 prediction. Refer to GLEAN3_09002 for the complete gene features of REJ3.\n
Sp-COP9signalosome/s6-like	SPU_030051		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
GLEAN3_10579	SPU_010579	From the BLAST results as well as the excel data, it was evident that this is the best fit for this particular GLEAN model. When reviewing the excel data there were no gaps or repeats present; however the end of the scaffold was truncated at 2430 when the entire sequence ended at 2695. Some of this missing sequence information is distributed onto:                                                          	Exons 1-12 and the begining of 13 are present on this scaffold601 and GLEAN3_10579 prediction. In the middle of exon 13 on scaffold601 there is a huge gap of N's, where the reamining exons should be present. The last 2 exons (named 14 and 15) are present on Scaffold17211. GLEAN3_18502 predicts exon 14 but not 15. Fgenesh++ predicts exon 15.\n
Sp-EIF3S3-like	SPU_030052		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-angiotensin I converting enzyme-like	SPU_030053		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_10180	SPU_010180		A part of the 3rd intron was accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction. Unknown sequence (NNN...) in the 1st intron of the current model could make this gene model incomplete.\n
Sp-mmp20-like	SPU_030054		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list. \n \n
GLEAN3_12211	SPU_012211		#\nThis gene model is located at the end of a short scaffold. Three exons were added at the 3'end of the GLEAN3 model by comparison to the corresponding Genscan prediction.\n
Sp-NAALAD2 protease-like	SPU_030055		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
GLEAN3_18915	SPU_018915		Partial TNF receptor. This gene model is located at the end of a short scaffold\n
Sp-leishmanolysin-like metalloprotease	SPU_030056		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-ACE-T-like	SPU_030057		Genes or parts of genes predicted by one or more of the prediction programs (Genscan, Gnomen, fgenesh) that are not in the Glean3 list.\n
Sp-angiotensin I converting enzyme isoform 1 precursor-like	SPU_030058		Genes or parts of genes predicted by Genscan that are not in the Glean3 list.\n
GLEAN3_10926	SPU_010926		Difficulty to confirm expression. \nHighly similar to Glean3_13313.  \n
GLEAN3_20740	SPU_020740		Unknown sequence (NNN...) in the intron of the current model could make this gene model incomplete.\n
GLEAN3_24584	SPU_024584		Partial TNF receptor. This gene model is located at the end of a short scaffold. The first exon was changed by comparison to the corresponding FgenesAB and ++ prediction. \n
GLEAN3_10230	SPU_010230		This gene model was modified based on FgeneshAB ++ prediction. The model is located at the end of a scaffold, which could make it still incomplete.\n
GLEAN3_20955	SPU_020955		Partial TNF receptor. This gene model is located at the end of a short scaffold. \n
GLEAN3_02328	SPU_002328	After reviewing the data, it appears that there is a somewhat orderly arrangement within the sequence however, there is very poor coverage. The coverage for this particular sequence is only 5K/25K. There also appears to be numerous gaps within the sequence denoted by N. There was also no Est information available and the transcriptome intensity scores appeared to be widely distributed and strong with values ranging from about 5-180.\nAdditional information from Baylor annotations (gene comments): last exon has large stretch of NNNNs, will examine later for correction.	last exon has large stretch of NNNNs, will examine later for correction\n
GLEAN3_06859	SPU_006859		Exons 1-4 are on this scaffold59004 and GLEAN3_036859. Exons 5-7 are on scaffold83735 and GLEAN3_21516.\n
Sp-Twist	SPU_030059		Determined by the sequence info from Lv-Twist\n
GLEAN3_12238	SPU_012238		glean3_02950 is a partial sequence identical to this one, but on a different scaffold\n
GLEAN3_19440	SPU_019440		Nectin sequence from L.variegatus hits 5 GLEAN predictions with a score of 0.0.  This prediction has 5 exons that are not in cDNA sequence, embryonic expression data agrees with cDNA and predicted protein mass is larger than that found in eggs.\n
GLEAN3_11009	SPU_011009		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 42.3% (aa level). Gene model modified to include 2 more exons as predicted by FGENESH+ homology to GLEAN3_14573\n
GLEAN3_13642	SPU_013642		Exons 5-12 are present on this scaffold85921 with GLEAN3_13642 and FgeneshAB predictions. Exons 1-4 are on scaffold6090 with GLEAN3_20643 and FgeneshAB predictions.\n
GLEAN3_20643	SPU_020643		Exons 1-4 are on this scaffold6090 with GLEAN3_20643 and FgeneshAB predictions. Exons 5-12 are present on scaffold85921 with GLEAN3_13642 and FgeneshAB predictions. Refer to GLEAN3_13642 for the complete Sp-HCN2 gene features.\n
GLEAN3_06462	SPU_006462		Scaffold1146 contains stretches of Ns in exon 1 of GLEAN3_06462 (GCM), and the length of this region is shorter than that of the corresponding region in experimentally verified mRNA sequence. Scaffold sequences and coordinate of exon 1 should be revised.  \n
GLEAN3_17511	SPU_017511		Although Type2 TGFbeta receptors belongs to the "Ser/Thr Protein Kinase", and that this sequence gives TGFbetaR2 as best genebank hits, NCBI Blast predict a "Tyr Kinase" domain for this gene.\n
GLEAN3_27254	SPU_027254		see GLEAN3_08876\n
GLEAN3_08197	SPU_008197		Highly similar to GLEAN3_03264.\n
GLEAN3_00960	SPU_000960	From the BLAST results and the excel data, it is evident that this is the best match for this particular GLEAN model. There is a large gap present from 423-514 resulting in poor overall sequence coverage. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	Glean3_00960 corresponds to C-ter domain of Sp-EF1B delta \nGlean3_00960 contains 7 predicted exons. However only six correspond to the known sequences for S. granularis EF1B delta (Y14235 and AJ973181). \nAmong the 6 remaining exons, one exon (Scaffold1943|187113|187206|) is strictly identical to exon3 of Glean3_00959 (scaffold1943|189643|189735|)  \n \nWe therefore propose to construct the CDS for Sp-EF1B delta from 2 exons of Glean3_00959 plus the common exon between Glean3_00959 and Glean3_00960 plus 5 exons of Glean3_00960 as follows : \nexon1 Glean3_00959 scaffold1943|194310|194423|Strand(-);   \nexon2 Glean3_00959 scaffold1943|191630|191707|Strand(-);  \nexon3 Glean3_00959 scaffold1943|189643|189735|Strand(-)  \n idem to Glean3_00960 scaffold1943|187113|187206|Strand(-); \nexon4 Glean3_00960: Scaffold1943|186520|186614|Strand(-); \nexon5 Glean3_00960: Scaffold1943|185409|185492|Strand(-); \nexon6 Glean3_00960: Scaffold1943|183100|183319|Strand(-); \nexon7 Glean3_00960: Scaffold1943|181425|181540|Strand(-); \nexon8 Glean3_00960: Scaffold1943|180516|180554|Strand(-); \n \n
GLEAN3_00229	SPU_000229		Closely related to GLEAN3_09517 \nProbably missing the N-terminus due to end of contig\n
GLEAN3_04340	SPU_004340		This sequence was modified by adding the 5' sequence from GLEAN3_06789\n
GLEAN3_26288	SPU_026288		Exons 2-9 are on this scaffold97931 and GLEAN3_26288 prediction. Exon 1 is on scaffold96 with GLEAN3_02214 prediction.\n
GLEAN3_06789	SPU_006789		This sequence has been added to the 5' end of GLEAN3_04340\n
GLEAN3_13709	SPU_013709		There was a long unknown sequence (NNN) between the 7th and 8th exons. The 1st to 7th exons were eliminated.\n
GLEAN3_23670	SPU_023670		also GLEAN3_06070 \n
GLEAN3_06070	SPU_006070		also GLEAN3_23670\n
GLEAN3_03292	SPU_003292		also GLEAN3_16634\n
GLEAN3_16634	SPU_016634		also GLEAN3_03292\n
GLEAN3_17117	SPU_017117		probable atp-dependent helicase ddx41 (dead-box protein 41)\n
GLEAN3_00526	SPU_000526		Exons 1-28 are on this scaffold80510 and GLEAN3_00526. Exons 29-58 are from scaffold1165 and GLEAN3_18112. Exons 49-58 are repeats and may be alternatively spliced. Exons 59-60 are from scaffold100796 and GLEAN3_08369.\n
GLEAN3_05854	SPU_005854		The 3' end GLEAN3_05854 is located near the edge of the scaffold. By comparison to the published cDNA (AAB67801), the model is missing 3' exons coding for 106 C-terminal aa. Exon 6 was modified to agree with known cDNAs. \n
GLEAN3_27915	SPU_027915		GLEAN prediction originally for 386 to 806 (end).  The front end 1-386 is on GLEAN3_27915  Use Annotations on GLEAN 20342\n
GLEAN3_20342	SPU_020342		GLEAN prediction originally for 386 to 806 (end).  The front end 1-386 is on GLEAN3_27915.  Exon 1 does not agree with cDNA sequence, two small intron exon boundary problems when aligned with BetaC cDNA\n
GLEAN3_14418	SPU_014418		glean3_14418, glean3_27648 are the same gene, two different alleles. \n
GLEAN3_27648	SPU_027648		glean3_14418, glean3_27648 are the same gene, two different alleles. \n
GLEAN3_02292	SPU_002292		GLEAN3_02292 has a tandem duplication of the N-terminal half of the C2A domain.\n
GLEAN3_15621	SPU_015621		partial CDS of N-terminal region of GLEAN3_24838\n
GLEAN3_12856	SPU_012856		GLEAN3_12856 has a tandem duplication of the N-terminal half of the C2A domain.\n
GLEAN3_11065	SPU_011065		CDS contains 26-631 or betaG cDNA sequence. First and last exons are missing from the scaffold\n
GLEAN3_15213	SPU_015213		dna mismatch repair protein mlh1 (mutl protein homolog 1)\n
GLEAN3_16009	SPU_016009		Prediction has small errors at exon boundaries by comparison with cDNA.  The last exon is incorrect.\n
GLEAN3_28261	SPU_028261		GLEAN3_28261 is the C-terminal match to MSH6 mouse. \nGLEAN3_12960 is the N-terminal match to MSH6 mouse.\n
GLEAN3_26225	SPU_026225		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 45.0% (aa level).\n
GLEAN3_21255	SPU_021255		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 37.9% (aa level). GLEAN3 gene model altered to follow FGENESH+ predictions and cDNA sequences (added exon 3 and 4).\n
GLEAN3_21588	SPU_021588		Predicted C-terminus sequence is longer than those of the other organism. \n
GLEAN3_12985	SPU_012985		Part of a previously unknown beta subunit.  The 5' end is missing because scaffold is incomplete.  Good evidence for embryonic expression.\n
GLEAN3_13102	SPU_013102		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 35.8% (aa level). By homology to other CYP3-like genes appears to be missing exons 1-4\n
GLEAN3_09766	SPU_009766		See GLEAN3_13107 (scaffold 457). \n
GLEAN3_13107	SPU_013107		See GLEAN3_09766 (scaffold 98369). \n
GLEAN3_07644	SPU_007644		Different parts of the cDNA are found in different strands in a non-linear organization. \n
GLEAN3_11635	SPU_011635		Different parts of the cDNA are found in different strands in a non-linear organization. \n
GLEAN3_17379	SPU_017379		GLEAN3_17379 matches SIX1_mouse and SIX2_mouse in Homobox with high identity. \n
GLEAN3_17380	SPU_017380		Six5 also hits the same Glean3 gene.\n
GLEAN3_12076	SPU_012076		This Glean model is a concatenation of 2 adjacent genes \nThe Cterminus is a Calcium channel alpha2 delta subunit. \nthe Nterminus is a homolog of human Q69YN2\n
GLEAN3_14621	SPU_014621		GLEAN3_04346 could be an alternate model of this gene.\n
GLEAN3_04346	SPU_004346		see also GLEAN3_14621\n
GLEAN3_20140	SPU_020140		GLEAN3_22433 has high sequence homolgy and could be a variant.\n
GLEAN3_18112	SPU_018112		Exons 29-58 are from this scaffold1165 and GLEAN3_18112. Exons 49-58 are repeats and may be alternatively spliced. Exons 1-28 are on scaffold80510 and GLEAN3_00526. Exons 59-60 are from scaffold100796 and GLEAN3_08369. Refer to GLEAN3_00526 for the complete Sp-EBR1 gene features.\n
GLEAN3_22433	SPU_022433		see also GLEAN3_20140\n
GLEAN3_09210	SPU_009210	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 3 different scaffolds. There is a small gap that occurs between the second and third scaffold that ranges from 553-581. Other than this small gap between the second and third scaffold, the overall sequence has an orderly arrangement without any repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be unusual in that the majority of the values were very large <80. However, there were several small values as well. 	This Model contains a partial sequence relative to GLEAN3_20140\n
GLEAN3_27905	SPU_027905		Likely haplotype of GLEAN3_28897, but 27905 gene model incomplete. See GLEAN3_28897 (Sp-apn6) for annotation.\n
GLEAN3_08369	SPU_008369		Exons 59-60 are from this scaffold100796 and GLEAN3_08369. Exons 1-28 are on scaffold80510 and GLEAN3_00526. Exons 29-58 are from scaffold1165 and GLEAN3_18112. Exons 49-58 are repeats and may be alternatively spliced. Refer to GLEAN3_00526 for the complete Sp-EBR1 gene features.\n
GLEAN3_23808	SPU_023808		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 45.0% (aa level). Edited to add exons 2 and 3 based on FGENESH+ prediction and cDNA evidence, and exon 5 was shortened. Possible missing or mispredicted exon 1. Removed from gene model.\n
GLEAN3_22916	SPU_022916		More than 90 % identity with Fz5/8 from P. lividus. AC number AM084899   \n
GLEAN3_23898	SPU_023898		Scaffold75943 covers exons 3-25 by merging GLEAN3_23898 and GLEAN3_23899 modified predictions. Exons 3, 6,13, and 19 have sequence gaps. Scaffold6755 covers exons 1-2 which have no GLEAN3 predictions. \n
GLEAN3_16056	SPU_016056		Highest homology to CYP3 family genes. Phylogenetically falls at base of CYP3 family, within CYP clan 3 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP3A4 (human) of 42.3% (aa level). Missing exons 1 and 2 due to incomplete assembly.\n
GLEAN3_07406	SPU_007406		Member of CYP1 family. Tentatively designated CYP1F1 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP1C1  (fugu) of 37.4% (aa level).\n
GLEAN3_06989	SPU_006989		Member of CYP1 family. Tentatively designated CYP1F2 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP1A (Danio rerio) of 41.7% (aa level).\n
GLEAN3_01262	SPU_001262		also GLEAN3_05908 is the cterminal portion only \n \nGLEAN3_01262 annotated as Sp-Birc6 but aa5240-6588 similar to hypoxia-inducible factor 1 alpha (HIF-1a).  The assembly of nt sequence seems consistent with a modeled intronic region being, rather an intergenic region.  The HIF-1a was added annotation as such by M. Hahn. \n
GLEAN3_05908	SPU_005908		this glean model is a subset of GLEAN3_01262\n
GLEAN3_20646	SPU_020646		aa 1-215 do not align to closest homologs\n
GLEAN3_23899	SPU_023899		Scaffold75943 covers exons 3-25 by merging GLEAN3_23898 and GLEAN3_23899 modified predictions. Exons 3, 6,13, and 19 have sequence gaps. Scaffold6755 covers exons 1-2 which have no GLEAN3 predictions. Refer to GLEAN3_23898 for the complete gene features.\n
GLEAN3_24610	SPU_024610		Ig2-Fz-TM-kinase \nBest Blast hit is MUSK, which has one more Ig domain - gene may be missing N-terminal piece.\n
GLEAN3_00310	SPU_000310		10 Ig-TM-kinase. Closest human protein is VEGFR-1, but only distantly related\n
GLEAN3_21021	SPU_021021		7 Ig - TM - kinase \nBest Blast hit is VEGFR1\n
GLEAN3_01905	SPU_001905		BLASTP search shows this gene model has very low similarity to MyD88 in other animals, but domain structure shows it could be a member of MyD88 gene family. \n
GLEAN3_22684	SPU_022684		See GLEAN3_11320, _22683, _28351. \n
GLEAN3_16914	SPU_016914		Missing first 756bp of coding gene sequence.  Scaffold 42354,65902, and 40781 contain the first 756bp and scaffold 23542 contains 1420 to 2189bp of the coding region.\n
GLEAN3_22683	SPU_022683		See GLEAN3_11320, _22684, _28351. \n
GLEAN3_28351	SPU_028351		See GLEAN3_11320, _22683, _22684. \n
GLEAN3_19779	SPU_019779		Overlaps with GLEAN3_23615\n
GLEAN3_23615	SPU_023615		Overlaps with GLEAN3_19779\n
GLEAN3_02050	SPU_002050		GLEAN3_02050|Scaffold111471|764|1126|corresponds to the last exon for encoding Sp-EF1A,  \nother exons are contained in Glean3-00595 (see for complete gene)\n
GLEAN3_00595	SPU_000595		EF1A is encoded by 5 exons in Glean3_00595(scaffold1277) plus one exon in Glean3_02050(scaffold111471) \nthe sequence was constructed on the basis of a fusion between the two scaffolds\n
Arnone1	SPU_030060		exon 1-7 are on Scaffold22273; exon 2-8 are on Scaffold59839\n
GLEAN3_12295	SPU_012295		See GLEAN3_15341 for annotation.\n
GLEAN3_03996	SPU_003996		See GLEAN3_15341 for annotation\n
GLEAN3_08008	SPU_008008		The protein aligns with dentin and other proteins with long stretches of serine/Asp/Gln. This is not real.\n
GLEAN3_22707	SPU_022707		This gene model has a Death domain and two TIR domains, which indicates a member of MyD88 family.\n
GLEAN3_04955	SPU_004955		 partial\n
GLEAN3_26723	SPU_026723	After reviewing the data from the excel file and performing a BLAST search, it appears that there is no sufficient GLEAN model that maps well onto V2.1. When reviewing the excel file there appears to be only a small amount of order within the sequence (some gaps are present) and sequence coverage appears to be low as well. There was Est. information available on GBrowse assembly V0.5 and the transcriptome information appeared to be widely dispersed and somewhat strong. This is an un-annotated gene as well so no additional comments were available. 	See GLEAN#_08008 for annotation\n
GLEAN3_11843	SPU_011843		Aligns to vertebrate dentin due to serine repeat-not true homology.\n
GLEAN3_00567	SPU_000567		Partial CDS.  This model contains the N-terminal 2/3 of the CDS and all of the predicted exons are correct based on alignment data.  A full length copy of hatching enzyme gene is adjacent on scaffold581, GLEAN3_00566.  Where these sequences overlap, they are >90% identical at the amino acid level.\n
GLEAN3_00343	SPU_000343		Allele of GLEAN3_07948\n
Sp-astacin protease	SPU_030061		Alignment with best blast sequence, a C. elegans hatching enzyme, suggests that the model may be incomplete at both N- and C- termini.\n
GLEAN3_28132	SPU_028132		GLEAN3_28132 is located near the edge of the contig; it appears to be missing 5' exons. GLEAN3_16698 appears to be an allele representing the 5' end of this gene, the overlapping region is nearly identical sequence. Sp-Syt15-1 has three (predicted) C2 domains.\n
GLEAN3_16698	SPU_016698		GLEAN3_16698 is located near the edge of the contig; it appears to be missing 3' exons. GLEAN3_28132 appears to be an allele representing the 3' end of this gene, the overlapping region is nearly identical sequence. Sp-Syt15-1 has three (predicted) C2 domains.\n
GLEAN3_22210	SPU_022210		GLEAN3_22210 appears to be a tandem duplication of the 3' end of GLEAN3_22209, located just 5' to GLEAN3_22210. The 3' exons of these two gene models are identical sequence. \nSp-Syt15b only has one C2 domain.\n
GLEAN3_21274	SPU_021274		Merge with GLEAN3_21274 and GLEAN3_21273 predictions.\n
GLEAN3_21272	SPU_021272		Merge with GLEAN3_21273 and GLEAN3_21274.\n
GLEAN3_07948	SPU_007948	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. There is a large gap on v2.1_scaffold27007 that spans from 819-1088. This portion of the sequence is distributed onto v2.1_scaffold79512. There was no Est support available from GBrowse assembly V0.5 and most of the transcriptome intensity scores appeared to be weak, except for two outlier values at approximately 45. 	Allele of GLEAN3_00343\n
GLEAN3_19127	SPU_019127		Also related to electric ray synaptotagmin C, P24507.\n
GLEAN3_04113	SPU_004113		This gene is in a cluster with 4 other genes closely related to SpAN which are called SpAN-like.  All of these genes encode proteins closely related to tolloid.\n
GLEAN3_18035	SPU_018035		Contains a 60 kb intron. Appears to have an allele, GLEAN3_20349, without this long intron.\n
GLEAN3_04114	SPU_004114		This gene is one of 4 clustered SpAN-like genes; this cluster also contains SpAN.  All of these genes encode proteins closely related to tolloid.\n
GLEAN3_20349	SPU_020349		GLEAN3_20349 appears to have an allele, GLEAN3_18035, containing a 60 kb intron.\n
GLEAN3_05832	SPU_005832		This gene model could not be a typical Toll-like receptor. There is only four LRR in the coding region and the nuleotides of the intron have no similarities. \n
GLEAN3_05850	SPU_005850		#\nUnknown sequence (NNN) in the first intron of the current model makes this gene model incomplete. The first exon only shows typical Toll-like receptor structures (signal peptide, LRRNT, LRR(22), LRRCT, TIR(partial)). \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_09970	SPU_009970		This gene model was fused to an adjacent glean model (GLEAN3_09969) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The sequence between these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_09969	SPU_009969		This gene model was fused to an adjacent glean model (GLEAN3_09970) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The sequence between these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_24207	SPU_024207		This gene model was fused to an adjacent glean model (GLEAN3_024206) to obtain a full sequence. The nucleotide sequence between 024206 and 024207 has 94% identity to another Sp-Tlr gene. This fused gene model may represent a pseudogene or contain a sequence error, but reflect best gene structure. \nThis is a member of sea urchin-specific Tlr Group IC.\n
GLEAN3_24206	SPU_024206		This gene model was fused to an adjacent glean model (GLEAN3_024207) to obtain a full sequence. The nucleotide sequence between 024206 and 024207 has 94% identity to another Sp-Tlr gene. This fused gene model may represent a pseudogene or contain a sequence error, but reflect best gene structure.\n
GLEAN3_07599	SPU_007599		There is another predicted gene GLEAN3_15362  has some domain as this one, maybe there are different isoforms of Glass protein.\n
GLEAN3_19810	SPU_019810		There are many homologs of this protein.\n
GLEAN3_04115	SPU_004115		This is one of 4 SpAN-like genes in a cluster that also contains SpAN.  All of these genes encode proteins closely related to tolloid.\n
GLEAN3_04116	SPU_004116		This is one of four clustered SpAN-like genes; the cluster also contains SpAN.  All of these genes encode proteins closely related to tolloid.\n
GLEAN3_04117	SPU_004117		This is one of four SpAN-like genes in a cluster; the cluster also contains SpAN.  All five genes encode proteins that are closely related to tolloid. \n \nBased on alignment with best blast sequence, it is likely that the model lacks the N-terminal exon.  Transcriptome expression data suggests that the missing exon lies between coordinates 13000 and 13500 on scaffold 61174.\n
GLEAN3_26629	SPU_026629		Glean3_26630 immediately downstream also has homology to fibulin, but differs from 26629.\n
GLEAN3_06628	SPU_006628		Partial CDS based on alignment with best blast data suggests that this model is missing both N- and C- exons.  The sequence inferred for the protease active site is significantly altered from other SpAN-like proteins or the closely related tolloid proteins.\n
GLEAN3_23280	SPU_023280		Partial cds based on alignment with best blast sequence suggests that this model contains only a portion of the gene encoding the N-terminal half of the protein.  The model is located at one end of a short scaffold (73008) making it likely that the remainder of the gene is on another scaffold.\n
GLEAN3_00881	SPU_000881		4 Glean3 models match the mvp sequence \nGLEAN3_07085 \nGLEAN3_00881 \nGLEAN3_18647 \nand GLEAN3_18164 partial\n
GLEAN3_18647	SPU_018647		4 Glean3 models match the mvp sequence \nGLEAN3_07085 \nGLEAN3_00881 \nGLEAN3_18647 \nand GLEAN3_18164 partial\n
GLEAN3_18164	SPU_018164		4 Glean3 models match the mvp sequence \nGLEAN3_07085 \nGLEAN3_00881 \nGLEAN3_18647 \nand GLEAN3_18164 partial\n
Sp-AN-like7	SPU_030062		Partial cds sequence based on alignment with best blast sequence suggests that this model encodes two CUB domains only.  The sequences of these domains are most closely related to those in SpAN, but it is not clear that they are linked to a metalloprotease domain.  Based on the position of this model on Scaffold16164, the remaining part of the gene is on another scaffold.\n
Sp-AN-like8	SPU_030063		Partial cds sequence based on alignment with best blast sequence suggests that this model encodes two CUB domains only.  The sequences of these domains are most closely related to those in SpAN, but it is not clear that they are linked to a metalloprotease domain.  Based on the position of this model on Scaffold37352, the remaining part of the gene is on another scaffold.\n
GLEAN3_15333	SPU_015333		Partial Toll-like receptor. This gene is located at the end of a short scafford.  Nucleotide seq has 94% similarity to another Sp-Tlr gene.\n
GLEAN3_21495	SPU_021495		One of 4 genes containing multiple EGF and TB domains.  Appears that gene is truncated at 5 end due to scaffold being incomplete\n
GLEAN3_01532	SPU_001532		One of two tandem genes containing EGF and TB repeats.\n
GLEAN3_24479	SPU_024479		Partial Toll-like receptor. This gene is located at the end of a short scafford.  Nucleotide seq has 95% similarity to another Tlr gene.\n
GLEAN3_01533	SPU_001533	After reviewing the data and performing A BLAST search, it appears that the data is distributed onto 2 different scaffolds. The second scaffold has a lower bit score and higher e-value compared to the other scaffold results, however, this scaffold contains more overall sequence coverage. There was some Est support available from GBrowse assembly V0.5 and the trancriptome intensity scores appeared to be strong with most of the values ranging from 5-45.	One of two fibrillin genes.  Other parts of this same gene are in GLEAN3_01533, 20166, 21495\n
GLEAN3_12550	SPU_012550		One of 4 genes containing multiple EGF and TB domains.\n
GLEAN3_11455	SPU_011455	The BLAST results and the excel data indicate that the sequence is distributed onto 2 different scaffolds. Both scaffolds are unique in that they both cover the entire scaffolds continuously from their beginning to end. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5. 	The nucleotide sequence of the first exon and the following intron have 100% identity to GLEAN3_11454. This gene model may be duplicated recently or produced by wrong prediction. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_13876	SPU_013876	According to the BLAST results in comparison with the excel data, it evident that this is the best fit for this particular GLEAN model. This scaffold is unique in that it completely covers that entire sequence (from 1-2382) without and breaks within the base pairing. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of values being <5	Unknown sequence (NNN...) in the first intron of the current model could make this gene model incomplete. The first exon shows typical Toll-like receptor structures (signal peptide, LRRNT, LRR, LRRCT, TIR(partial)).\n
GLEAN3_17530	SPU_017530		The first exon of this gene model was eliminated and a part of the intron was accepted as coding region by comparison to the corresponding FgenesAB and ++ prediction. \nThis is a member of sea urchin-specific Tlr Group I(orphan). \n
GLEAN3_19661	SPU_019661		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_26630	SPU_026630		This may be a splice variant of Glean3_26629.\n
GLEAN3_19882	SPU_019882		This gene model was fused to an adjacent glean model (GLEAN3_19881) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence between 19881 and 82 matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_19881	SPU_019881		This gene model was fused to an adjacent glean model (GLEAN3_19882) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence between 19881 and 82 matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_20258	SPU_020258		This gene model was fused to an adjacent glean model (GLEAN3_20257) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence of these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift). \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_20257	SPU_020257		This gene model was fused to an adjacent glean model (GLEAN3_20258) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence of these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift).\n
GLEAN3_20654	SPU_020654		This gene model was fused to an adjacent glean model (GLEAN3_20653) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence of these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift). \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_20653	SPU_020653		This gene model was fused to an adjacent glean model (GLEAN3_20654) to obtain a full sequence. This fused gene model may represent a pseudogene or contain a sequence error. The nucleotide sequence of these two models matches coding sequence of other Sp-Tlr genes and contains stop codons (frame shift). \nThis is a member of sea urchin specific Tlr Group IA.\n
GLEAN3_26338	SPU_026338		missing 5' of CDS \n5' of CDS probably encoded by GLEAN3_00937 + GLEAN3_28620 or GLEAN3_12324 \n
GLEAN3_15080	SPU_015080		#\nThere are 10 ESTs. \nGene required correction from EST data\n
GLEAN3_20849	SPU_020849		1 EST, CD342027 Has 1st exon and 5'UTR only \n \nGene model is missing the C-terminus \n
GLEAN3_26767	SPU_026767		No ESTs \nModel has only the nucleotide bing domain, is missing the C-terminal half od the protein.\n
GLEAN3_12874	SPU_012874		No ESTs \nMissing N-terminus.\n
GLEAN3_24785	SPU_024785		1 EST DN561873, contins central half of the gene. \nDuplicated exon in the middle of the gene removed \n \nMissing N-terminus, has the wrong C-terminus\n
GLEAN3_21184	SPU_021184		Amino Acid Sequence corrected. \n2 ESTs. CD311111 overlaps GLEAN_14007 \n
GLEAN3_14013	SPU_014013		#\nNo ESTs \nOn the same contig as GLEAN_14007 another ABCG2-like gene. \nMissing the N-terminus and Walker A domain\n
GLEAN3_15930	SPU_015930		Merged with GLEAN3_15929 and GLEAN3_15931. Refer to GLEAN3_15929 for the complete gene features of REJ4.\n
GLEAN3_15931	SPU_015931		Merged with GLEAN3_15929 and GLEAN3_15930. Refer to GLEAN3_15929 for the complete gene features of REJ4.\n
GLEAN3_28620	SPU_028620		missing 5' of CDS (domain I; HS attachment sites) \n5' of CDS probably encoded by GLEAN3_00937 \nmissing 3' of CDS \n3' of CDS probably encoded by GLEAN3_26338 \nhaplotype duplication of GLEAN3_12324 \nGLEAN3_28620 is shorter than GLEAN3_12324 and is missing a 29 residue piece in the middle of the sequence that is present in GLEAN3_12324\n
GLEAN3_09876	SPU_009876		Remove exons belonging to another gene.\n
GLEAN3_09718	SPU_009718		CDSs of this gene align exactly with CDSs of GLEAN3_28124 on scaffold 75874. Sequence between CDS very similar also.\n
GLEAN3_02758	SPU_002758		Similarity to vertebrate dentin due to serine repeats.\n
GLEAN3_26442	SPU_026442		Aligns with vertebrate dentin primarily due to serine repeats. Alignment included.\n
GLEAN3_19064	SPU_019064		Partial prediction for eIF4G. N-terminus part of the protein is predicted by GLEAN3_24859 (gene model modified)\n
GLEAN3_12174	SPU_012174		Aligns with vertebrate dentin due to serine rich repeat region.\n
GLEAN3_03725	SPU_003725		Missing exon 4 and 6. \n
GLEAN3_28124	SPU_028124		CDSs of this gene align exactly with CDSs of GLEAN3_09718 on scaffold 31633. Sequence between CDS very similar also.\n
GLEAN3_16409	SPU_016409		CDS of minus strand of this gene align exactly with CDS of the positive strand of Glean3_16406, on same scaffold.  Sequence between CDS as well as the repeat structure in the two regions is very similar as well. This gene could be a haplotype that was put on the wrong scaffold. The contig that it is on is attached to the very end of the scaffold right next to Glean3_16406. \n
GLEAN3_18861	SPU_018861		The ligand binding domain is in glean3_25239 on scaffold46634.\n
GLEAN3_16657	SPU_016657		see also Glean3_18404 for very similar glean\n
GLEAN3_16032	SPU_016032		Aligns with vertebrate dentin due to serine rich repeat.\n
GLEAN3_02821	SPU_002821		Aligns with vertebrate protein phosphatase 4 regulatory subunit and dentin due to repetitive regions.\n
GLEAN3_04861	SPU_004861		Alignment to vertebrate dentin due to serine rich repeat.\n
GLEAN3_15771	SPU_015771	From the BLAST results and the excel data, it is evident that the sequence is distributed onto two different scaffolds for this particular GLEAN model. When reviewing the excel data, it was apparent that there is a small gap within the second scaffold that ranges from 2684-2737. Excluding this gap within the second scaffold, if both scaffolds were to be combined, the overall sequence would have good coverage without any internal repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 10.	Contains Domain of Unkown Function (DOF)- highly conserved in vertebrates. \nAlignment to vertebrate dentin is due to serine rich repeat.\n
GLEAN3_13358	SPU_013358		Alignment with vertebrate dentin is due to serine rich repeat in dentin.\n
GLEAN3_19545	SPU_019545		Alignment to vertebrate dentin due to serine repeat in dentin. \nPossible additional exons 5' to Glean3 prediction.\n
GLEAN3_07317	SPU_007317		Blast data suggest that this model could be tolloid1 rather than suBMP1, as it is currently named based on cDNA sequence.  However, this conclusion is tentative because the model lacks the last three domains (EGF, CUB, CUB) domains characteristic of tolloid proteins.  This is undoubtedly because the scaffold is too short and these remaining exons are on another scaffold. \n
GLEAN3_26920	SPU_026920		Tiling data suggests exons in regions of repeated units.  These areas require further examination to determine if they are truly exons.\n
GLEAN3_19152	SPU_019152		because the sequence is so short it is possible this is actually a fragment\n
GLEAN3_13651	SPU_013651		some similarities to a part of a peptidase_M1 domain, but sequence is very short\n
GLEAN3_06947	SPU_006947		GLEAN3_02418 duplicate\n
GLEAN3_11869	SPU_011869		Tiling data suggests multiple exons not found by the glean3 and NCBI modles, but the tiling data is too messy for me to alter the GLEAN3 modle  \n
GLEAN3_05884	SPU_005884		great prediction of the N-terminus; c-terminus is in glean3_11914 (entered here)\n
GLEAN3_10829	SPU_010829		77.8% identity with Aedes aegypti elongation factor 2 (AAK01430) \n \n \npredicted exon1, i.e. GLEAN3_10829|Scaffold51549|194|325| has been deleted since it does not seem to exist in other eukaryotic EF2 sequences \nThe first exon in mRNA becomes GLEAN3_10829|Scaffold51549|2642|2856|which lacks a methionine \n \npredicted exon6 GLEAN3_10829|Scaffold51549|6024|6059| has been deleted since it was a repeated sequence of exon7\n
GLEAN3_11914	SPU_011914		this is the C-terminus of Sp-RACK, which will be completely annotated with its N-terminal prediction (glean3_05884)\n
GLEAN3_00923	SPU_000923		a duplicated exon of Sp-RACK, fully annotated as glean3_05884 \n
GLEAN3_06221	SPU_006221		#\nN-terminus probably truncated due to end of contig\n
GLEAN3_06547	SPU_006547		Single exon gene encoding a partial copine. This is a possible pseudogene.\n
GLEAN3_00906	SPU_000906		best match of 4 good matches\n
GLEAN3_01821	SPU_001821		one match of 4\n
GLEAN3_07865	SPU_007865		one of 4 matches no obvious haplotypes\n
GLEAN3_07866	SPU_007866		one of 4 matches no obvious haplotypes\n
GLEAN3_14221	SPU_014221		one of 4 matches no obvious haplotypes\n
GLEAN3_28566	SPU_028566		one of 4 matches no obvious haplotypes\n
GLEAN3_26438	SPU_026438		The ABCH subfamily is found only in insect, Dictyostelium and zebrafish, to date. \nThere are 2 ESTs.  One has an inserted sequence corresponding to an exon I can't find in the genomic sequence. \n \nGLEAN model is missing the 3' part of the gene.\n
GLEAN3_22633	SPU_022633		GLEAN3_22633 and GLEAN3_22634 appear to be a single gene (agrin) split into two models.  I have added the Gene features of 22634 to 22633.    \nGLEAN3_02025 contains a single NtA domain like N-terminus of agrin. Other GLEAN  predictions contain FOLN and KAZAL repeats and may comprise the next segment (especially GLEAN3_02467 and possibly GLEAN3_24994).  A fourth gene looks like the next piece (GLEAN3_22633)and the adjacent gene (GLEAN3_22634) contains LamG repeats that look like the C-terminus. These five gene predictions may be adjacent and comprise a full agrin gene.\n
GLEAN3_22634	SPU_022634		This appears to be the last 2 LamininG domains of Agrin.  The tandem gene, GLEAN3_22633 has been amended to include the exons originally predicted for this gene\n
GLEAN3_23247	SPU_023247		Small scaffold\n
GLEAN3_07946	SPU_007946		Complete 5' end sequence?\n
GLEAN3_03882	SPU_003882		Very short scaffold\n
GLEAN3_02955	SPU_002955		Alignment with vertebrate dentin due to serine rich repeat in dentin.\n
Sp-PGRP5	SPU_030064		Small scaffold.\n
GLEAN3_03669	SPU_003669		exons 107766-107856 & 108255-108332 & 132399-132485 do not have homology on a protein level to phospholipase C beta [Lytechinus pictus] but do have nucleotide level homology.   \n \nGLEAN3_22715 is a haplotype of this gene \n
GLEAN3_23216	SPU_023216		N-terminus of prediction is longer than nAChR of the other organism.  \n
GLEAN3_19709	SPU_019709		See GLEAN3_01774. \n
GLEAN3_13095	SPU_013095	After reviewing the three different subjects for query GLEAN3_13095 it was determined that this was the best match since there were no repeats and the sequence had sufficient coverage. A search using GBrowse V0.5 revealed that there is no EST support for this particular GLEAN model. There also appeared to be weak intensity scores after reviewing the transcriptome results. This was an un-annotated gene so no additional comments are available.	See GLEAN3_11220. \n
GLEAN3_04062	SPU_004062		uncorrect model : incomplete \nunvalid exons and missing exons \nsome 3' exons in GLEAN3_05456 \n
GLEAN3_03655	SPU_003655		#\nclose to Fz1,Fz2,Fz7, orthology to be precisely determined\n
GLEAN3_22373	SPU_022373		This model was fused to another glean model (GLEAN3_25278) to create a more accurate model for Sp-Triad. The Gene ID for such new model is: Sp-Triad.\n
GLEAN3_25278	SPU_025278		This model was fused to another glean model (GLEAN3_22373) to create a more accurate model for Sp-Triad. The Gene ID for such new model is: Sp-Triad.\n
Sp-Triad	SPU_030065		This model was created by fusing two overlapping glean models based on a manual inspection of multiple protein sequence alignments. Redundant exons were taken out from the final sequence.\n
GLEAN3_12877	SPU_012877		Other names: KIF27 and KIF7 \nThis gene is part of the Hedgehog signaling pathway\n
GLEAN3_03743	SPU_003743		PRP19/PSO4 pre-mRNA processing factor 19 homolog.\n
GLEAN3_14295	SPU_014295		part of the Hedgehog signaling pathway \nthis model has been modified by adding the sequences of glean3_03312 and glean3_03313 in front of it's sequence.\n
GLEAN3_26178	SPU_026178		Gene is intact! \nNo ESTs.  There are no introns, but the ORF seems intact. \nThis would be the first intronless ABC gene in a multicellular organism! \nChip data suggests this is an expressed sequence.\n
GLEAN3_17959	SPU_017959		This is an excellent match, although alignment is missing for the first 34 N-term AA's.  Entry appears to encode a complete ORF, however.\n
GLEAN3_16028	SPU_016028		First few exons appear irrelevant, matching to mannose receptor or extracellular proteins. A 200 aa bcl domian matches exactly with Mil2 (GLEAN3_01916).\n
GLEAN3_03241	SPU_003241		Three ESTs. Two extend the 5' end, but the extra sequences are not on the contig (could be on another contig).\n
GLEAN3_18342	SPU_018342		No ESTs \nGene apppears complete!!\n
GLEAN3_16850	SPU_016850	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. However, mRNA information was available from SpBase. When examining the excel data it appears that there are several gaps and internal repeats present within each of the different subject hits. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values ranging below 10. 	No ESTs \nAppears to have the wrong N-terminus\n
GLEAN3_07357	SPU_007357	For this particular GLEAN model no CDS were found on either the Baylor annotations page or in the SpBase search engine. There was also no gene features information available on either site.\nAdditional information:\nNo ESTs;\nExtra exons and the wrong N-terminus, missing exon in NBF;\nDeleted dubious C-terminus	No ESTs \nExtra exons and the wrong N-terminus, missing exon in NBF \nDeleted dubious C-terminus\n
GLEAN3_24666	SPU_024666		Missing C-term and N-term \nContig missassembled \n3 ESTs but they do not assemble into a single contig\n
GLEAN3_19656	SPU_019656		It seems a pseudogene. It contains a big part of the homeodomain, but nothing else similar to other genes. \nThe homeodomain sequences are identical to those in GLEAN3_24715. No Chip expression.\n
GLEAN3_19327	SPU_019327		Gene contains a frameshift in element 17, which may make this a pseudogene.  The scaffold stops short around 50-100 nt before the stop codon.  \n
GLEAN3_11836	SPU_011836		This 185 gene is partially present on Scaffold1870, but the location of the 3' end is unknown.  This scaffold contains the leader, intron, and part (elements 1-17) of the open reading frame.  The sequence contains subelement 15e, which makes it a member of the 185/333-E group, although the exact pattern is unknown due to the missing sequence.  The best BLAST hit is to Sp0368, or 185/333-E4.  A frameshift in element 1 may indicate that this is a pseudogene.\n
GLEAN3_02238	SPU_002238		1 EST with the central portion of the gene. \nModel is missing an exon in the NBD and perhaps the last exon. \n
GLEAN3_26825	SPU_026825		1 EST \nMissing N-terminus and C-terminus\n
GLEAN3_01916	SPU_001916		First few exons appear irrelevant, matching to mannose receptor or extracellular proteins. A 200 aa bcl domian matches exactly with Mil1 (glean3_16028).\n
GLEAN3_16525	SPU_016525		the evidence for this gene assignment is a similar domain organization\n
GLEAN3_28724	SPU_028724		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThe three last exons of this model do not align with corresponding Pelle/IRAKs from other animal groups. An overlapping Fgenesh++ prediction (S.P_Scaffold255.seq.N000008) does not include these three exons (which fall on a separate predicted gene), and would therefore appear as a better model. These three exons of GLEAN3_28724 do not align strongly with any other protein, and they do not code for any detectable protein domain, which argues against them representing a separate gene. In addition, they are very strongly supported by tiling array data. For lack of better evidence for either alternative, we have decided to accept this glean model in its present form. It is however left to be determined experimentally whether these exons do correspond to Sp-Pik2 or a separate gene. \n \nA closely related model (GLEAN3_00073) clusters with Irak4 in a multiple alignment tree. On the other hand, this model is equally distant to all other irak-related molecules, both based on alignments of their kinase domains or combined kinase and death domains. Because IRAKs and pelle proteins from insects are so similar, and because there is no clear co-clustering of both sea urchin homologs with specific Irak genes, we have decided to follow the approach taken for naming the C.elegans orthologs: to name them "Pik" genes after Pelle/Irak.\n
GLEAN3_10283	SPU_010283		Alignment with vertebrate dentin due to serine-asp rich repeat.\n
GLEAN3_15338	SPU_015338		Alignment to vertebrate dentin due to serine rich repeat in dentin.\n
GLEAN3_00073	SPU_000073		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThere seems to be duplicated exons towards the C-terminus of this model. For lack of better evidence, we cannot currently determine whether these are truly duplicated exons or due to assembly errors. If these "duplicated" exons are taken out, then the alignment with murine IRAK4 is significantly improved. \n \nThis model clusters with Irak4 in a multiple alignment tree. On the other hand, a closely related model (GLEAN3_28724) is equally distant to all other irak-related molecules, both based on alignments of their kinase domains or combined kinase and death domains. Because IRAKs and pelle proteins from insects are so similar, and because there is no clear co-clustering of both sea urchin homologs with specific Irak genes, we have decided to follow the approach taken for naming the C.elegans orthologs: to name them "Pik" genes after Pelle/Irak.\n
GLEAN3_08499	SPU_008499		Alignment to vertebrate dentin due to serine rich repeat in dentin. \nEST sequences align with Glean3 model.\n
GLEAN3_20082	SPU_020082		This model is part of a novel gene (Sp-Jak) that results from fusing GLEAN3_22023 and GLEAN3_20082. This modification was made based on a manual inspection of sequence alignments at both the aminoacidic and nucleotide levels.\n
GLEAN3_22023	SPU_022023		This model is part of a novel gene (Sp-Jak) that results from fusing GLEAN3_22023 and GLEAN3_20082. This modification was made based on a manual inspection of sequence alignments at both the aminoacidic and nucleotide levels.\n
Sp-Jak	SPU_030066		This model was created as a fusion of GLEAN3_22023 and GLEAN3_20082, based on a manual inspection of sequence alignments at both the aminoacidic and nucleotide levels, both between these models and between each model and vertebrate JAKs.\n
GLEAN3_28494	SPU_028494		Alignment with vertebrate dentin due to serine rich repeat.\n
GLEAN3_00825	SPU_000825		Glean3_00825 was found to be very similar to previously cloned sea urchin SM30 genes.  Comparison to a previously isolated genomic clone (Akasaka et al 1994, JBC 269: 20592-20598) indicates that glean_00825 is probably not SM30-alpha or SM30-beta. Glean3_00825, glean3__00826, glean3_00827, and glean3_00828 encode SM30 like proteins and they are tandemly arranged on Scaffold25604. \n \nMatched c-type lectin domain (cd00037).\n
GLEAN3_19821	SPU_019821		Only found the C-terminal end of the protein, there are no evidences of the N-terminal end.\n
GLEAN3_24454	SPU_024454		More of sequence on scaffolds 129415 and 21113\n
GLEAN3_13788	SPU_013788		See Glean3_12296 for AHR-like model; see Glean3_05022 for bHLH domain of AHR or AHRR homolog.  Glean3_05022 could be the missing N-terminus of this model (Glean3_13788) or of Glean3_12296.\n
GLEAN3_05762	SPU_005762		The C-terminal end of this sequence is also contained in GLEAN3_23715\n
GLEAN3_17036	SPU_017036		incomplete\n
GLEAN3_04230	SPU_004230		This glean result matches the C terminal reigion of Sp-PLC-delta.  The rest of the sequence is contained on scaffold 70915.  Part of scaffold 85759 is duplicated on scaffold 106154.  This gene was cloned by Coward et. al., 2003. Its accession number is NP_001008790.1. \n \n***NOTE only This annotation contains the fully complete data.\n
GLEAN3_12103	SPU_012103		This scaffold has the N terminal reigion of the PLC-delta sequence.  NOTE, The fully annotated sequence can be found on scaffold 85789.   Part of scaffold 85759 is duplicated on scaffold 106154.  This gene was cloned by Coward et. al., 2003. Its accession number is NP_001008790.1. \n
GLEAN3_23044	SPU_023044		#\nincomplete\n
puromycin-sensitive aminopeptidease	SPU_030067		partial CDS\n
GLEAN3_18530	SPU_018530		incorrect 5' exon prediction\n
GLEAN3_28479	SPU_028479		The predicted N-terminal region was incorrect. The predicted C-terminal region was largely incomplete. \n
GLEAN3_08595	SPU_008595		sequence incomplete\n
GLEAN3_14645	SPU_014645		GLEAN3_07315 is almost identical but shorter, duplication most likely due to assembly process. \n
GLEAN3_12278	SPU_012278		sequence probably incomplete\n
GLEAN3_07315	SPU_007315		Almost identical to GLEAN3_14645 but shorter, duplication most likely due to assembly process. \n
GLEAN3_05654	SPU_005654		We pulled the partial cDNA from an Sp egg library.  It aligns well with six of the middle Glean predictions.  Glean3_20904 also aligns well with the partial cDNA.\n
GLEAN3_04867	SPU_004867		Similar to SM30-alpha. Adjacent to SpSM30-like-B (glean3_04869), but on the opposite strand. \n \nMatches c-type lectin domain (cd00037) and pericardin like repeats (PR009765).\n
GLEAN3_08747	SPU_008747		See putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137538013-8866-40191825170.BLASTQ4\n
GLEAN3_04869	SPU_004869		Similar to SM30-alpha. Adjacent to SpSM30-E (glean3_04867) but on the opposite strand. \n \nMatches c-type lectin domain (smart00034).\n
GLEAN3_07404	SPU_007404		Member of CYP1 family. Tentatively designated CYP1F6 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to   () of % (aa level). Single exon gene.\n
GLEAN3_01365	SPU_001365		Duplicated Gene...see also GLEAN3_08207. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137535659-19561-20778833499.BLASTQ4\n
GLEAN3_08207	SPU_008207		Duplicated gene...see also GLEAN3_01365 \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537713-13786-83388701897.BLASTQ1\n
GLEAN3_18810	SPU_018810		Appears most similar to SpSM32 (Illies et al.)  But it also contains the first exon of SpSM50 (glean3_18811).  SpSM32 and SpSM50 share a first exon. \n \nMatches c-type lectin domain (cd00037).\n
GLEAN3_07081	SPU_007081		Alignment with vertebrate dentin due to serine rich repeats. \nEmbryonic and larval EST sequences conform to gene prediction. \n
GLEAN3_02589	SPU_002589		See GLEAN3_06123, _00154 (C-terminus only). \n
GLEAN3_10811	SPU_010811		Alignments with known proteins, including vertebrate dentin, are due to high number of repetitive amino acids.\n
GLEAN3_12324	SPU_012324		missing 5' of CDS (domain I; HS attachment sites) \n5' of CDS probably encoded by GLEAN3_00937 \nmissing 3' of CDS \n3' of CDS probably encoded by GLEAN3_26338 \nhaplotype duplication of GLEAN3_28620 \nGLEAN3_12324 is longer than GLEAN3_28620 and possesses a 29 residue piece in the middle of the sequence that is missing in GLEAN3_28620 (possible intron)\n
GLEAN3_09723	SPU_009723		Alignments with known proteins, including vertebrate denti, dis due to repetitive amino acids.\n
GLEAN3_04183	SPU_004183		See GLEAN3_06123, _02589, _00254. \n
GLEAN3_20107	SPU_020107		Missing N-terminus.  See GLEAN3_06123, _00154, _02589, _04183. \n
GLEAN3_17634	SPU_017634		Several high scoring hits, entry had a BLAST score of 0.00. \nBLAST of ABP-120 revealed the same GLEAN3 prediction with a score of 0.00.\n
GLEAN3_02605	SPU_002605		See GLEAN3_06123, _00154, _02589, _04183, _20107. \n
GLEAN3_19540	SPU_019540		Missing 230 amino acids found in mouse at the amino terminal.\n
GLEAN3_09340	SPU_009340		Blasted with human homolog of Myosin IIIA and Myosin IIIB and obtained the same three highest scoring predictions.\n
GLEAN3_19203	SPU_019203		N-terminus prediction is longer than those of other organism.  See GLEAN3_06123, _00154, _02589, _04183, _20107, _02605.  \n
GLEAN3_22763	SPU_022763		See GLEAN3_06123, _00154, _02589, _04183, _20107, _02605, _19203. \n
GLEAN3_12184	SPU_012184		Missing N-terminus.  High scoring hits to AChE (GLEAN3_06123, _00154, _02589, _04183, _20107, _02605, _19203, _22763). \n
Sp-PLC-delta	SPU_030068		Part of scaffold 85759 is duplicated on scaffold this scaffold (106154). NOTE  annotation on scaffold 85759 contains the fully complete data.  This gene was cloned by Coward et. al., 2003. Its accession number is NP_001008790.1.\n
GLEAN3_07745	SPU_007745		Pfam00484.11 match.   \n \nTranscriptome data indicate that it is expressed in embryos. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_14664	SPU_014664		N-terminus is missing.  See GLEAN3_14664.  \n
GLEAN3_24978	SPU_024978		See GLEAN3_13765.\n
GLEAN3_08910	SPU_008910		See GLEAN3_28455, _14664. \n
GLEAN3_18811	SPU_018811		Glean3_18811 is on the same scaffold as SM37. This is to be expected for SM50 [Lee et al (1999) Develop. Growth Differ 41: 303-312 PUB MED:10400392]. Glean3_18811 is most similar to the aa sequence of S. purpuratus SM50.  However it is missing the first exon and the intron one would expect from the canonical SM50 gene (Sucov et al and Katoh-Fukui et al.) Just 5' to this glean model is glean3_18810 that encodes SpSM32 which shares the first exon with SpSM50. Need to change this gene model to reflect this. \n \nMatched c-type lectin domain (cd00037).\n
GLEAN3_28783	SPU_028783		See GLEAN3_02787.  \n
GLEAN3_28784	SPU_028784		See GLEAN3_13765, _24978.  \n
GLEAN3_16008	SPU_016008		2 exons are missing : \nexon 1, containing the signal peptide, found on scaffold 82776 \nexon 4 (instead of the 5 aa exon "GLFCF"), containing the transmembrane domain, found on scaffold 7868 \n
GLEAN3_08058	SPU_008058		48% identity with corresponding region in human valyl-tRNA synthetase (VARS2) \n45% identity with human VARS2-like \nSp-VARS isoformA has 47% identity with the Sp-VARSisoformB (glean3_02908)  \n
GLEAN3_19101	SPU_019101		#\nTWO COMMENTS: \n \n1) Alignment with best blast hit sequence suggests that the first exon (see below) is incorrect. This conclusion is supported by transcriptome signals. \n>GLEAN3_19101|Scaffold112071|10738|10828| DNA_SRC: Scaffold112071 START: 10738 STOP: 10828 STRAND: +  \nCTGAAATTCTGAGATGTCGAATGGGGAGGTGCGTAGTGACAAGGGTGTGGTATAGGCCTAGCTGTAGAGG \nATGCTTCAGAAGAGCCGACAT \n \n2) This model is adjacent to a very similar gene model,  GLEAN3_19102\n
GLEAN3_21352	SPU_021352		This prediction of one of many ACE genes on scaffold 52540. \nAlignment with best blast sequence suggests there may be a missing exon between the following exons in this model: \n>GLEAN3_21352|Scaffold52540|47460|47651| DNA_SRC: Scaffold52540 START: 47460 STOP: 47651 STRAND: +  \nCAAGGAGATGAGCGGGTATAGGTCCTGTGGGGTTAACCTTGTCCGTCCCGTACTTCTCTCCCAGCTTTCT \nTCTCACAAAGGCATGGATCTGGAGGTACATCGGTTTGACTGCATCCCAAAGGGCGTCGATCTTCTCCACG \nAAGTGTGGGTCTTCATAACGACGACGGAGTAAGTCGCCACGATCTTCATAAC \n>GLEAN3_21352|Scaffold52540|49826|49971| DNA_SRC: Scaffold52540 START: 49826 STOP: 49971 STRAND: +  \nGTGATTCGCTCATCAGGTGTTGAAGGCCTGGTTCCATGTTCAAGCATTGCTCCGATCGTTTCTTCCTCTG \nTGCATGCTCTCTCTTCCGACAAACCTTTCCGGTAGCGAAGATAGTTGTCATGTTGTCCTGGACCTCACGT \nTCTTTA \n
GLEAN3_11641	SPU_011641		45% identity with human EF2 kinase (AAH32665)\n
GLEAN3_21837	SPU_021837		ESTs used to confirm model only cover 5' portion of gene (first 6 exons) and 3' UTR, however tiling array correlates well with model predictions throughout.  Multiple splice varints likely exist since some ESTs contain the 3rd exon in the prediction and some do not.  Length of 3'UTR based on tiling array data and the presence of AAUAAA and CA at most 3' end.  Start of transcript indicated here is different than that in origional prediction and is based on EST data (BCM Exonerate CD304782 and CX555128) and tiling data.  The end of the CDS based on the existance of a TAA at site indicated.\n
GLEAN3_26627	SPU_026627		Missing N-terminus and should be combined with GLEAN3_26626.  The prediction includes extra C-terminus.  \n
GLEAN3_25999	SPU_025999		See GLEAN3_25315. \n
GLEAN3_17426	SPU_017426		Missing N-terminus.  \n
GLEAN3_08988	SPU_008988		See GLEAN3_25315, _25999, _08988. \n
GLEAN3_07186	SPU_007186		Missing N-terminus.  See GLEAN3_25315, _25999, _08988.  \n
GLEAN3_20213	SPU_020213		Missing N-terminus.  \n
GLEAN3_23617	SPU_023617		Missing the N-terminal TM domins, they are off the contig. \nThere are two ESTs, one in the coding and one in the 3'UTR. \nOne exon in an NBD was deleted from the GLEAN model\n
GLEAN3_05577	SPU_005577		Glean3_05577 may be partial- at end of contig. It is similar to a pair of genes, 09924, 09925, that are adjacent to one another on a contig. The other of the pair (with 05723) may be 05723, which is also on a small contig and likely truncated.\n
GLEAN3_12621	SPU_012621		See Developmental Biology 204(1) 293-304 (1998) for more information. \n \nUnable to verify all exons experimentally, but likely correct. \nPublished mRNA sequence does not extent along the scaffolding as far as the upstream UTR.\n
Sp-Zic4-like	SPU_030070		complete cds\n
GLEAN3_26395	SPU_026395		This sequence represents only the N-terminus of the protein.  SP-ABCC1a has the full-length sequence of a closely related gene.\n
GLEAN3_28797	SPU_028797		Unable to differentiate between ABCC8 and ABCC9 families.  This is orthologous to the human ABCC8/9 families.\n
GLEAN3_05723	SPU_005723		Glean3_05723 may be partial- at end of contig. It is similar to a pair of genes, 09924, 09925, that are adjacent to one another on a contig. The other of the pair (with 05723) may be 05577, which is also on a small contig and likely truncated.\n
GLEAN3_09924	SPU_009924		Lies adjacent to a highly similar gene, 09925, on the same contig.\n
GLEAN3_09925	SPU_009925		Adjacent to Glean3_09924, a similar gene.\n
GLEAN3_24191	SPU_024191		The first 2000 bases on the 5' end of this gene match very closely with the 5' half of GLEAN3_20669.  The latter half of both genes are quite divergent from eachother.  This may be a case of duplication of all or half of one of these genes.  They are distant enough, however, that they are not being labeled as duplicates.\n
GLEAN3_02411	SPU_002411		This gene is quite similar to GLEAN3_20669.  However, there are several instances of large insertions/deletions as well as numerous amino acid differences.  Perhaps it is a relatively recent duplication.  The number of differences make it unlikely that this is simply due to haplotype variation.\n
GLEAN3_25903	SPU_025903		Unable to differentiate between ABCC8 and ABCC9 families.  This is orthologous to the human ABCC8/9 families.\n
GLEAN3_08351	SPU_008351		This prediction  is incomplete. The 5' end of the protein is not predicted. Refer to the modified sequence of GLEAN3_28479 for the corrected sequence.\n
GLEAN3_04417	SPU_004417		U5 snRNP-associated 102 kDa protein. First part of the gene on GLEAN3_24258. Latter part on GLEAN3_04417. Likely missing one exon between two parts.\n
GLEAN3_21366	SPU_021366		U1 small nuclear ribonucleoprotein 70 kDa like. Likely missing 5' exon(s). Can't find the missing exon in protein predictions.\n
GLEAN3_28726	SPU_028726	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When comparing the excel data and the BLAST results, it appears that if the 2 scaffolds were combined, the sequence would have an orderly continuous arrangement without any repeats or gaps present.  There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than about 10. 	First 21 exons encode plexin.  The other 19 exons encode sortilin-1.  \n
GLEAN3_27526	SPU_027526		N-terminus of this gene is GLEAN3_27525 and should be combined.   \n
GLEAN3_27443	SPU_027443		Also, likely ortholog of plex B1.  \n
GLEAN3_20916	SPU_020916		This gene model is possibly incomplete, since it is located at the end of the scaffold.\n
GLEAN3_03553	SPU_003553		The CARD domain is located downstream of the NACHT. This is not seen in mammalian Nod1 and Nod2, where the CARD domain (or both) are located upstream of the NOD.\n
GLEAN3_03539	SPU_003539		This gene model could be incomplete since it is located at the end of a scaffold.\n
GLEAN3_06610	SPU_006610		This gene model could be incomplete since it is located at the end of a scaffold.\n
GLEAN3_13619	SPU_013619		The DEATH domain encoded in this protein resembles the Dr5 receptor protein DEATH domain. \n
Sp-VC1_2	SPU_030071		#\nThis gene model has been predicted by fgeneshAB and ++ that is not in the Glean3 list.\n
Sp-Tnfsf_like4	SPU_030072		This gene model has been predicted by fgeneshAB and ++ that is not in the Glean3 list. The expression of this gene model has been confirmed with QPCR.\n
GLEAN3_06529	SPU_006529		This gene includes most of Glean3_06530. The missing exons were added to this model. The last exon does not belong to gene model as well as the first exon of Glean3_06530. Fgenesh prediction is right. The Glean3_06530 model was not modified to reflect these discrepencies.\n
GLEAN3_06530	SPU_006530		This gene model is incomplete and combines with parts of Glean3_06529 to make a complete gene model. The Glean3_06529 model was modified to include the exons in this model. Please refer to this gene model for further detail.\n
GLEAN3_24075	SPU_024075		Fgenesh prediction contains one additional exon in the 5' end, which contains the signal peptide. This exon has been added to the glean3 model.\n
Sp-NT1	SPU_030073		Neurotrophin found by Genscan but not by Glean. \nManual predictions agree with genscan except that the manual prediction has one main 3' exon containing most if not all of the translated part, whereas the genscan prediction will have the gene encoded by 2 separate exons.\n
GLEAN3_10719	SPU_010719		Member of CYP1 family. Tentatively designated CYP1F3 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP1A (Danio rerio) of 41.7% (aa level). Single exon, possibly part of tandem duplication.\n
GLEAN3_10720	SPU_010720		Member of CYP1 family. Tentatively designated CYP1F4 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP1A (Danio rerio) of 41.7% (aa level). Single exon, possibly part of tandem duplication.\n
GLEAN3_10721	SPU_010721		Member of CYP1 family. Tentatively designated CYP1F5 pending CYP nomenclature committee approval. Phylogenetically falls at base of CYP1 family, within CYP clan 2 (MrBayes v3.0.1 WAG+gamma+I model). Percent ID (on masked basis) to CYP1A (Danio rerio) of 41.7% (aa level). Single exon, possibly part of tandem duplication.\n
GLEAN3_11692	SPU_011692		Chip expression data indicates likely expression in embryo. Ortholog to honey bee voltage gated L type Ca channel. Related to Ca channel gene found in coral ( Stylophora pistillata) by Allemand grouop in Monaco.\n
GLEAN3_07770	SPU_007770		Chip expression data does NOT confirm embryonic expression. Closely related to glean3_11692, which is expressed in embryo. Best hit is L type Ca channel in the snail, Limnea stagnalis. Also related to the coral Ca Channel gene described by Allemand's group.\n
GLEAN3_19522	SPU_019522		Beginning of gene on GLEAN3_24095. End of gene is on GLEAN3_19522. Possible haplotype for GLEAN3_19522 is GLEAN3_07176\n
GLEAN3_07176	SPU_007176		Beginning of gene on GLEAN3_24095. End of gene is on GLEAN3_19522. Possible haplotype for GLEAN3_19522 is GLEAN3_07176. \nRECOMMEND DELETION\n
GLEAN3_24095	SPU_024095		Beginning of gene on GLEAN3_24095. End of gene is on GLEAN3_19522. Possible haplotype for GLEAN3_19522 is GLEAN3_07176\n
GLEAN3_06939	SPU_006939		This gene model was modified by comparison to the corresponding FgeneshAB prediction and other typical Sp-Tlr genes. \nThis gene model may represent a pseudogene or contain a sequence error. \n
GLEAN3_09600	SPU_009600		See GLEAN3_27526. \n
GLEAN3_19917	SPU_019917		Lacking N-terminus.  See GLEAN3_27526, _09600. \n
GLEAN3_00729	SPU_000729	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. However, there was mRNA information available from SpBase. When examining the excel data, it appears that subject gb|DS006912| would be the best match for this GLEAN model based on the orderly arrangement of the sequence (no gaps or repeats present.) There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be some what strong with most of the values being 5 or greater.\nAdditional gene information from Baylor annotation (comments): Lacking N-terminus.  See GLEAN3_28726, _27443, _08472.	Lacking N-terminus.  See GLEAN3_28726, _27443, _08472.  \n
GLEAN3_18059	SPU_018059		See GLEAN3_12977.\n
GLEAN3_28135	SPU_028135		polyadenylated histone H10\n
GLEAN3_02431	SPU_002431	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is evident that there are several small gaps within the sequence.  However, the sequence has an overall continuous and orderly arrangement without any repeats present between the 2 different scaffolds. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be 	Possibly only a partial sequence.  Appears to only contain C-terminus.\n
GLEAN3_08443	SPU_008443		Probably only a partial sequence.  It is missing ~200 amino acids off of the C-terminus and 400 off of the N-terminus.\n
GLEAN3_06899	SPU_006899		This gene encodes a precursor for a vasopressin/oxytocin/vasotocin-like peptide CFISNCPKGamide, which I suggest is named Sp-echinotocin. This is first vasopressin/oxytocin/vasotocin-like peptide to be identified in an echinoderm. It is likely that similar or identical peptides will be found in other echinoderms, which I suggest are known collectively as echinotocins. \n \nThe GLEAN model of this gene with 4 exons is wrong because the N-terminal signal peptide is encoded by a putative internal exon (exon 2 of the GLEAN model). The model that I have produced is comprised of 3 exons with: \n1. exon 1 encoding a signal peptide (confirmed by SignalP3.0 analysis), the echinotocin peptide and the N-terminal part of neurophysin. \n2. exon 2 encoding the main middle portion of neurophysin \n3. exon 3 encoding the C-terminal region of neurophysin. \nThe tiling data do not show signals that correspond with these exons, indicating that this gene is not expressed in the early stages of sea urchin development used as a source of mRNA for the tiling analysis. \nNo EST or cDNA data are available at present to confirm the prediction. \n \nThe model is however consistent with the structure of vasotocin/vasopressin/oxytocin genes in vertebrates, which are also comprised of 3 equivalent exons. \nBLAST analysis GenBank with the predicted 165 amino-acid precursor shows a high level of sequence similarity with vasotocin precursors in fish.\n
GLEAN3_15742	SPU_015742		divergently transcribed as a gene pair with GLEANH3_15741 \nSp-late-histone-H3b\n
GLEAN3_16523	SPU_016523		Classic zinc finger. Closest Ciona hit (ensembl:ENSCING00000008508)logged as Thyroid hormone receptor.\n
GLEAN3_20803	SPU_020803	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed on to 2 different scaffolds. When examining the excel data, it appears that there is one overlap within the sequence of >v2.1_scaffold22297. The overlap occurs at 228-275 and 228-377. The sequence continues to be orderly from 228-377.This is an un-annotated gene so no additional gene information was available from Baylor gene information (comments). There was also no Est information avaible from the GBrowse assembly V0.5 and the transcriptome intensity scores appear to be very strong with values averaging about a 15. 	SMART Confidently predicted domains, repeats, motifs and features: \n \nIG            begin: 134   end: 215 e-value 5.37e-04  \ntransmembrane begin: 359   end: 381 e-value -  \nTyrKc         begin: 451   end: 721 e-value 3.46e-132 \nBelongs to class II of TK receptors isdefined by  \n[DN]- [LIV]-x  (3)-Y-Y-R (Prosite PDOC00212) consensus activation loop.  \n
GLEAN3_11485	SPU_011485		Huntington like gene in Sea Urchin shows more than 50 % similarity with Human huntingtin gene. This part of the gene codes for 844 amino acids making N terminal hunting protein from the start codon. The Scaffold 54379 which has this gene is reverse complimented. The C terminal part of this gene has to be identified which might continue in another scaffold.\n
GLEAN3_15064	SPU_015064		GLEAN3_15064 has the first part of the gene. GLEAN3_17197 should have the latter half. GLEAN3_15899 is likely a haplotype of GLEAN3_17197.\n
GLEAN3_15899	SPU_015899		GLEAN3_15064 has the first part of the gene. GLEAN3_17197 should have the latter half. GLEAN3_15899 is likely a haplotype of GLEAN3_17197.\n
GLEAN3_18213	SPU_018213		This gene model was modified based on Genscan and domain structures: the first exon + the following 200bp intron show a typical Toll-like receptor.  The third exon could belong to next glean model(18214). \n
GLEAN3_24258	SPU_024258		U5 snRNP-associated 102 kDa protein. First part of the gene on GLEAN3_24258. Latter part on GLEAN3_04417. Likely missing one exon between two parts.\n
GLEAN3_18214	SPU_018214		This gene model was modified based on BLASTN search and domain structures: the third exon of GLEAN3_18213 + the upstream intron + this gene model + the gap between them show a typical Toll-like receptor.  This gene model may represent a pseudogene or contain a sequence error.\n
GLEAN3_14430	SPU_014430		GLEAN3_20121 has the first part of the gene. GLEAN3_14430 has the rest of the gene.\n
GLEAN3_16594	SPU_016594		U5 snRNP-specific protein, 116 kD. Haplotype of GLEAN3_14430\n
GLEAN3_01709	SPU_001709		one nt sequencing error (or else is a very new pseudogene since the 3' end is correct\n
GLEAN3_14099	SPU_014099		Given extreme similarity to TBP elsewhere in the genome, but many fewer nts suggests that this is an assembly error, not a real gene.  No transcription seen in the first two exons.  Plus we have evidence elsewhere that there is only one copy of TBP\n
GLEAN3_07759	SPU_007759		probably incomplete sequence at c terminus\n
GLEAN3_21279	SPU_021279		cleavage stage histone H3 \n \nNote I found the first 126 nts of the coding region on scaffold 24661 \n \nnts  14772 to 14897\n
GLEAN3_07165	SPU_007165		incomplete sequence tructaed at 3'end  \ncould also be ortholog of mrp8\n
GLEAN3_03833	SPU_003833		divergently transcribed gene pair with GLEAN3_03828 \nSp-late-histone-H3e\n
GLEAN3_03916	SPU_003916		incomplete\n
GLEAN3_03407	SPU_003407		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are unknown DEAD-box proteins similar to 200kDa snRNP protein and activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
GLEAN3_06432	SPU_006432		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are unknown DEAD-box proteins similar to 200kDa snRNP protein and activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
GLEAN3_13364	SPU_013364		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to human U5-snRNP-200kDa protein and activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are likely orthologs of activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
GLEAN3_17541	SPU_017541		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are unknown DEAD-box proteins similar to 200kDa snRNP protein and activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
GLEAN3_27604	SPU_027604		Alignment indicates the gene may be truncated at amino terminal- other gene models suggest additional sequences could be included at 5'end, but no cDNA data.\n
GLEAN3_00121	SPU_000121		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are unknown DEAD-box proteins similar to 200kDa snRNP protein and activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
GLEAN3_28486	SPU_028486		U5 small nuclear ribonucleoprotein 200 kDa helicase (U5 snRNP-specific 200 kDa protein) (U5-200KD) (Activating signal cointegrator 1 complex subunit 3-like 1). \n \nGLEAN3_03407 and GLEAN3_06432 are orthologs of human U5-snRNP-200kDa protein. \n \nGLEAN3_13364 and GLEAN3_28486 belong to second class similar to activating signal cointegrator 1. \n \nGLEAN3_00121 and GLEAN3_17541 are unknown DEAD-box proteins similar to 200kDa snRNP protein and activating signal cointegrator 1 complex subunit 3. \n \nGLEAN3_17541 is the longest predicted protein.\n
Sp-Tlr223	SPU_030075		Partial Toll-like receptor predicted by FgeneshAB and Genscan. The nucleotides of this gene model have 92% identity to a typical Sp-Tlr gene (GLEAN3_12257). The model is located at the end of a short scaffold. \n
Sp-Tlr224	SPU_030076		#\nPartial Toll-like receptor. A part of this modified gene model is predicted by Genscan and NCBI. The nucleotides have 96% similarity to a typical Sp-Tlr gene (GLEAN3_10940). The model occupies all sequence of a short scaffold. \n
Sp-Tlr225	SPU_030077		Partial Toll-like receptor predicted by FgeneshAB. The modified gene model occupies all sequence of a short scaffold and the nucleotides have 92% identity to a typical Sp-Tlr (GLEAN3_00615).  \n
Sp-Tlr226	SPU_030078		#\nPartial Toll-like receptor predicted by FgeneshAB and Genscan. The coding region of this gene model occupies all sequence of a short scaffold and the nucleotides have 89% identity to a typical Sp-Tlr (GLEAN3_23035).  \n
Sp-Tlr227	SPU_030079		Partial Toll-like receptor predicted by Fgenesh++. The nucleotides of the coding region have 92% identity to a typical Sp-Tlr (GLEAN3_15066). This model is located at the end of a short scaffold. \n
GLEAN3_15139	SPU_015139		paralog of Sp-tachykinin-receptor-2 (GLEAN3_15140); is located on scaffold 49502 adjacent to Sp-tachykinin-receptor-2 (GLEAN3_15140), indicating that these two genes arose by recent gene duplication event\n
GLEAN3_05990	SPU_005990		Matches c-type lectin domain (smart00034.10)\n
GLEAN3_15140	SPU_015140		paralog of Sp-tachykinin-receptor-1 (GLEAN3_15139); is located on scaffold 49502 adjacent to Sp-tachykinin-receptor-1 (GLEAN3_15139), indicating that these two genes arose by recent gene duplication event\n
GLEAN3_00025	SPU_000025		 partial, missing N- and C-terminus\n
GLEAN3_00111	SPU_000111		 partial, missing N-terminus\n
GLEAN3_00116	SPU_000116		 partial, missing N-terminus\n
GLEAN3_00187	SPU_000187		 partial, missing C-terminus\n
GLEAN3_00288	SPU_000288	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When reviewing the excel data appears that the second scaffold doesn't begin until the 27th base pair. Other than this small gap in the sequence both scaffolds appear to have an orderly arrangement overall, without any other gaps or internal repeats present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values greater than 10. 	 partial, missing C-terminus\n
GLEAN3_00383	SPU_000383		 partial, missing N-terminus\n
GLEAN3_00455	SPU_000455		 extra stretch of aminoacids in middle\n
GLEAN3_00590	SPU_000590		 missing C-terminus\n
GLEAN3_00597	SPU_000597		 extra C-terminus\n
GLEAN3_00607	SPU_000607		 partial, missing N-terminus\n
GLEAN3_00612	SPU_000612		 partial, missing C-terminus half\n
GLEAN3_00771	SPU_000771		 extra N-terminus\n
GLEAN3_00772	SPU_000772		 extra N-terminus\n
GLEAN3_00774	SPU_000774		 missing N- and C-terminus\n
GLEAN3_00781	SPU_000781		 extra N-terminus\n
GLEAN3_00799	SPU_000799		 missing some C-terminus residues\n
GLEAN3_00830	SPU_000830		 partial, missing N-terminus\n
GLEAN3_00835	SPU_000835	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is apparent that there are several internal repeats within both scaffolds. On v2.1_scaffold60368 there are several gaps present as well that range from 1136-1196 and 1331-1447. There was some Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong. 	 extra C-terminus\n
GLEAN3_01068	SPU_001068		 extra C-terminus\n
GLEAN3_01141	SPU_001141		 partial, missing N- and C-terminus\n
GLEAN3_01142	SPU_001142		 partial, missing N-terminus\n
GLEAN3_01201	SPU_001201		 partial, missing some N-terminus residues\n
GLEAN3_01214	SPU_001214		 extra N-terminus\n
GLEAN3_01525	SPU_001525		 partial, missing N-terminus\n
GLEAN3_01849	SPU_001849		 partial, missing N-terminus half\n
GLEAN3_01860	SPU_001860		 partial, missing C-terminus\n
GLEAN3_01864	SPU_001864		 missing N-terminus residues\n
GLEAN3_01934	SPU_001934		 partial, missing N-terminus\n
GLEAN3_02030	SPU_002030		 partial, missing C-terminus\n
GLEAN3_02051	SPU_002051		 partial, missing C-terminus\n
GLEAN3_02142	SPU_002142		 partial, missing N- and C-terminus\n
GLEAN3_02201	SPU_002201		 partial, missing C-terminus\n
GLEAN3_02209	SPU_002209		 partial, missing C-terminus\n
GLEAN3_02338	SPU_002338		 partial, missing N-terminus\n
GLEAN3_02429	SPU_002429		 partial, missing C-terminus\n
GLEAN3_02491	SPU_002491		 partial, missing some N-terminus residues\n
GLEAN3_02546	SPU_002546		 missing stretch in middle\n
GLEAN3_02650	SPU_002650		 missing C-terminus\n
GLEAN3_02686	SPU_002686		 extra residues on N-terminus\n
GLEAN3_02736	SPU_002736		 partial, missing C-terminus\n
GLEAN3_02773	SPU_002773		 partial, missing C-terminus\n
GLEAN3_02881	SPU_002881		 partial, missing N-terminus\n
GLEAN3_02901	SPU_002901		 partial, missing C-terminus\n
GLEAN3_02903	SPU_002903		 partial, missing N-terminus\n
GLEAN3_02909	SPU_002909		 partial, missing N-terminus\n
GLEAN3_02911	SPU_002911		 partial, missing C-terminus\n
GLEAN3_02963	SPU_002963		 extra stretch in middle\n
GLEAN3_02998	SPU_002998		 missing stretch in middle\n
GLEAN3_03021	SPU_003021		 extra stretch in middle\n
GLEAN3_03138	SPU_003138		 partial, missing C-terminus\n
GLEAN3_03325	SPU_003325		 extra stretch in middle\n
GLEAN3_03329	SPU_003329		 partial, missing N- and C-terminus residues\n
GLEAN3_03397	SPU_003397		 partial, missing N-terminus\n
GLEAN3_03415	SPU_003415		 partial, missing N-terminus\n
GLEAN3_03444	SPU_003444		 partial, missing N-terminus\n
GLEAN3_03457	SPU_003457		 partial, missing N-terminus\n
GLEAN3_03561	SPU_003561		 partial, missing C-terminus\n
GLEAN3_03632	SPU_003632		 missing N-terminus residues\n
GLEAN3_03667	SPU_003667		 partial, missing C-terminus\n
GLEAN3_03712	SPU_003712		 partial, missing N-terminus residues\n
GLEAN3_03714	SPU_003714		 missing N- and C-terminus\n
GLEAN3_03785	SPU_003785		 missing N- and C-terminus\n
GLEAN3_03839	SPU_003839		 extra N-terminus, missing C-terminus\n
GLEAN3_03913	SPU_003913		 missing two stretches in middle\n
GLEAN3_03922	SPU_003922		 partial, missing N-terminus\n
GLEAN3_03982	SPU_003982		 partial, missing C-terminus\n
GLEAN3_04022	SPU_004022		 partial, missing N- and C-terminus\n
GLEAN3_04075	SPU_004075		 partial, missing N- and C-terminus\n
GLEAN3_04123	SPU_004123		 partial, missing N-terminus\n
GLEAN3_04216	SPU_004216		 partial, missing N-terminus\n
GLEAN3_04271	SPU_004271		 missing N-terminus\n
GLEAN3_04304	SPU_004304		 missing N-terminus\n
GLEAN3_04310	SPU_004310		 partial, missing C-terminus\n
GLEAN3_04326	SPU_004326		 missing stretch in middle\n
GLEAN3_04376	SPU_004376		 partial, missing C-terminus\n
GLEAN3_04416	SPU_004416		 partial, missing N-terminus half\n
GLEAN3_04593	SPU_004593		 partial, missing N-terminus\n
GLEAN3_04673	SPU_004673		 partial, missing N-terminus\n
GLEAN3_04694	SPU_004694		 missing N-terminus residues\n
GLEAN3_04720	SPU_004720		 partial, missing N-terminus\n
GLEAN3_04769	SPU_004769		 partial, missing C-terminus\n
GLEAN3_04771	SPU_004771		 extra stretch in middle\n
GLEAN3_04795	SPU_004795	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is apparent that there are several repeats and gaps present within both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak, with most of the values being less than 5.	 extra stretches in middle\n
GLEAN3_04854	SPU_004854		 missing stretch in middle\n
GLEAN3_04877	SPU_004877		 partial, missing N-terminus\n
GLEAN3_04892	SPU_004892		 partial, missing N-terminus\n
GLEAN3_04946	SPU_004946		 partial, missing C-terminus\n
GLEAN3_04965	SPU_004965		 missing N-terminus residues\n
GLEAN3_05056	SPU_005056		 partial, missing N- and C-terminus\n
GLEAN3_05177	SPU_005177		 missing N-terminus residues\n
GLEAN3_05203	SPU_005203		 partial, missing N-terminus\n
GLEAN3_05246	SPU_005246		 missing N- and C-terminus residues\n
GLEAN3_05253	SPU_005253		 partial, missing N-terminus\n
GLEAN3_05261	SPU_005261		 partial, missing C-terminus\n
GLEAN3_05274	SPU_005274		 partial, missing N-terminus\n
GLEAN3_05285	SPU_005285		 partial, missing N-terminus\n
GLEAN3_05331	SPU_005331		 partial, missing N-terminus\n
GLEAN3_05356	SPU_005356		 partial, missing N-terminus\n
GLEAN3_05407	SPU_005407		 partial, missing N-terminus\n
GLEAN3_05423	SPU_005423		 missing N-terminus\n
GLEAN3_05523	SPU_005523		 extra N-terminus\n
GLEAN3_05534	SPU_005534		 missing stretch in middle\n
GLEAN3_05665	SPU_005665		 partial, missing N-terminus\n
GLEAN3_05675	SPU_005675		 partial, missing stretch in middle\n
GLEAN3_05757	SPU_005757		 missing N-terminus residues\n
GLEAN3_05811	SPU_005811		 partial, missing N- and C-terminus\n
GLEAN3_05848	SPU_005848		 partial, missing N-terminus\n
GLEAN3_05852	SPU_005852		 extra stretch in middle\n
GLEAN3_05863	SPU_005863		 missing stretch in middle\n
GLEAN3_05952	SPU_005952		 partial, missing C-terminus\n
GLEAN3_05977	SPU_005977		 missing stretch in middle\n
GLEAN3_06017	SPU_006017		 partial, missing N-terminus\n
GLEAN3_06102	SPU_006102		 missing N-terminus, extra stretch in middle\n
GLEAN3_06210	SPU_006210		 missing N-terminus residues\n
GLEAN3_06217	SPU_006217	After reviewing the data it appears that the sequence is distributed onto 2 different scaffolds.  If the 2 scaffolds were combined the sequence would have an orderly continuous arrangement without any gaps or repeats. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with values ranging from about 4-70. This is an un-annotated gene so no additional information was available from Baylor annotations (gene comments).	 missing C-terminus residues\n
GLEAN3_06291	SPU_006291		 missing N- and C-terminus\n
GLEAN3_06301	SPU_006301		 partial, missing C-terminus\n
GLEAN3_06339	SPU_006339		 partial, missing C-terminus\n
GLEAN3_06361	SPU_006361		 partial, missing C-terminus\n
GLEAN3_06381	SPU_006381		 partial, missing N-terminus\n
GLEAN3_06383	SPU_006383		 partial, missing N-terminus\n
GLEAN3_06414	SPU_006414		 partial, missing N-terminus and a stretch in middle\n
GLEAN3_06426	SPU_006426		 partial, missing N-terminus\n
GLEAN3_06437	SPU_006437		 partial, missing C-terminus\n
GLEAN3_06509	SPU_006509		 missing stretch in middle\n
GLEAN3_06565	SPU_006565		 partial, missing C-terminus\n
GLEAN3_06585	SPU_006585		 extra stretch in middle\n
GLEAN3_06603	SPU_006603		 partial, missing N-terminus\n
GLEAN3_06630	SPU_006630		 partial, missing C-terminus\n
GLEAN3_06663	SPU_006663		 partial, missing N-terminus\n
GLEAN3_06693	SPU_006693		 partial, missing N-terminus\n
GLEAN3_06790	SPU_006790		 extra N-terminus\n
GLEAN3_06852	SPU_006852		 extra stretch in middle\n
GLEAN3_06888	SPU_006888		 extra N-terminus\n
GLEAN3_06890	SPU_006890		 extra C-terminus\n
GLEAN3_06973	SPU_006973	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the two scaffolds were combined, the sequence would have an orderly continuous arrangement without any repeats or gaps present. This is an un-annotated gene so no additional gene information (comments) could be found from the Baylor webpage. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10.	 partial, missing N-terminus, stretches in middle\n
GLEAN3_06985	SPU_006985		 partial, missing N-terminus\n
GLEAN3_07041	SPU_007041		 missing C-terminus\n
GLEAN3_07114	SPU_007114		 partial, missing C-terminus\n
GLEAN3_07122	SPU_007122		 partial, missing C-terminus\n
GLEAN3_07155	SPU_007155		 partial, missing C-terminus\n
GLEAN3_07228	SPU_007228		 partial, missing stretches in middle\n
GLEAN3_07245	SPU_007245		 extra N-terminus\n
GLEAN3_07246	SPU_007246		 partial, missing C-terminus\n
GLEAN3_07249	SPU_007249		 partial, missing C-terminus, extra N-terminus\n
GLEAN3_07284	SPU_007284		 extra stretch in middle\n
GLEAN3_07499	SPU_007499		 partial, missing N-terminus\n
GLEAN3_07511	SPU_007511		partial, missing N-terminus\n
GLEAN3_07583	SPU_007583		 partial, extra N-terminus, missing C-terminus\n
GLEAN3_07672	SPU_007672		 partial, missing N-terminus\n
GLEAN3_07687	SPU_007687		 missing N-terminus residues\n
GLEAN3_07756	SPU_007756		 missing N-terminus\n
GLEAN3_07774	SPU_007774		 partial, missing N-terminus\n
GLEAN3_07833	SPU_007833		 partial, misisng N- and C-terminus\n
GLEAN3_07905	SPU_007905		 partial, missing N-terminus\n
GLEAN3_07906	SPU_007906		 partial, missing N-terminus and stretch in middle\n
GLEAN3_07909	SPU_007909		 partial, missing N- and C-terminus\n
GLEAN3_07920	SPU_007920		 missing N- and C-terminus\n
GLEAN3_07927	SPU_007927		 missing C-terminus\n
GLEAN3_08009	SPU_008009		 missing stretch in middle\n
GLEAN3_08080	SPU_008080		 extra N-terminus, missing C-terminus\n
GLEAN3_08094	SPU_008094		 missing N-terminus\n
GLEAN3_08135	SPU_008135		 extra N-terminus\n
GLEAN3_08142	SPU_008142		 missing C-terminus\n
GLEAN3_08176	SPU_008176		 missing stretch in middle\n
GLEAN3_08219	SPU_008219		 missing N-terminus and stretch in middle\n
GLEAN3_08265	SPU_008265		 missing N-terminus\n
GLEAN3_08318	SPU_008318		 missing C-terminus\n
GLEAN3_08349	SPU_008349		 missing N-terminus residues\n
GLEAN3_08403	SPU_008403	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. On v2.1_scaffold22310 the sequence doesn't begin until 107 and there is a small overlap between the 2 scaffolds from 1168-1230. However, if the two scaffolds were combined the sequence would have an overall orderly arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values ranging from 5-10.	 missing N-terminus\n
GLEAN3_08458	SPU_008458		 missing C-terminus\n
GLEAN3_08464	SPU_008464		 missing C-terminus\n
GLEAN3_08406	SPU_008406	From the BLAST results as well as the excel data, it is evident that this is the best fit for this particular GLEAN model. When reviewing he excel data, it was apparent that there were several internal repeats as well as gaps present. The scaffold is also truncated at 1225 when the entire sequence actually terminates at 1590. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values ranging greater than 10.	 missing C-terminus\n
GLEAN3_08542	SPU_008542		 missing C-terminus\n
GLEAN3_08553	SPU_008553		 missing stretch in middle\n
GLEAN3_08554	SPU_008554		 missing C-terminus\n
GLEAN3_08585	SPU_008585		 missing N-terminus\n
GLEAN3_08616	SPU_008616		 missing N-terminus residues\n
GLEAN3_08646	SPU_008646		 missing C-terminus\n
GLEAN3_08701	SPU_008701		 missing C-terminus, missing stretch in middle\n
GLEAN3_08705	SPU_008705		 missing N- and C-terminus\n
GLEAN3_08815	SPU_008815		 extra N-terminus\n
GLEAN3_08875	SPU_008875		 extra N-terminus\n
GLEAN3_08879	SPU_008879	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. Both scaffolds did not have any repeats present, however, on v2.1_scaffold40027 were two gaps present that ranged from 65-96 and from 393-654. However, v2.1_scaffold81019 filled in the missing sequence information from 392-655. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong. 	 extra stretch in middle\n
GLEAN3_08970	SPU_008970		 missing N- and C-terminus, missing small stretch in middle\n
GLEAN3_09066	SPU_009066		 missing N-terminus residues\n
GLEAN3_09082	SPU_009082		 missing C-terminus\n
GLEAN3_09100	SPU_009100		 missing N- and C-terminus\n
GLEAN3_09114	SPU_009114		 missing N-terminus\n
GLEAN3_09131	SPU_009131		 missing N-terminus residues\n
GLEAN3_09249	SPU_009249		 missing N-terminus, missing stretch in middle\n
GLEAN3_09359	SPU_009359		 partial, missing N-terminus;\n
GLEAN3_09372	SPU_009372		 extra N-terminus\n
GLEAN3_09392	SPU_009392		 missing N- and C-terminus, extra stretch in middle\n
GLEAN3_09437	SPU_009437		 partial, missing N-terminus\n
GLEAN3_09462	SPU_009462		 partial, missing N-terminus\n
GLEAN3_09491	SPU_009491		 partial, missing C-terminus\n
GLEAN3_09564	SPU_009564		 partial, missing C-terminus\n
GLEAN3_09617	SPU_009617		 partial, missing N-terminus\n
GLEAN3_09644	SPU_009644		 extra N-terminus\n
GLEAN3_09768	SPU_009768		 partial, missing N- and C-terminus\n
GLEAN3_09773	SPU_009773		 partial, missing N-terminus\n
GLEAN3_09853	SPU_009853		 partial, missing N-terminus\n
GLEAN3_09929	SPU_009929		 partial, missing N- and C-terminus\n
GLEAN3_09998	SPU_009998		 missing N-terminus\n
GLEAN3_10108	SPU_010108		 partial, missing N- and C-terminus\n
GLEAN3_10205	SPU_010205		 missing N-terminus\n
GLEAN3_10209	SPU_010209		 missing N-terminus residues\n
GLEAN3_10247	SPU_010247		 missing N-terminus\n
GLEAN3_10311	SPU_010311	After reviewing the data and performing a BLAST search, it appears that there is no sufficient match for this particular GLEAN model. The BLAST results indicate that the best match is v2.1_scaffold13759 based on e-value and bit score. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several scores that were weak (>5) and several that were strong (<10)	 missing stretch in middle\n
GLEAN3_10500	SPU_010500		 missing stretch in middle\n
GLEAN3_10557	SPU_010557		 partial, missing N-terminus\n
GLEAN3_10618	SPU_010618		 missing N-terminus\n
GLEAN3_10627	SPU_010627		 extra stretches in middle\n
GLEAN3_10742	SPU_010742		 missing stretch in middle\n
GLEAN3_10804	SPU_010804		 missing N-terminus residues\n
GLEAN3_11041	SPU_011041		 partial, missing N-terminus\n
GLEAN3_11234	SPU_011234		 missing N-terminus\n
GLEAN3_11283	SPU_011283		 missing short stretch in middle\n
GLEAN3_11329	SPU_011329		 missing C-terminus\n
GLEAN3_11566	SPU_011566		 missing N-terminus residues\n
GLEAN3_11589	SPU_011589		 partial, missing C-terminus\n
GLEAN3_11596	SPU_011596		 partial\n
GLEAN3_11713	SPU_011713		 partial, missing C-terminus;\n
GLEAN3_11928	SPU_011928		 partial, missing C-terminus\n
GLEAN3_11980	SPU_011980		 partial, missing N-terminus\n
GLEAN3_12279	SPU_012279		 missing some N-terminus residues\n
GLEAN3_12344	SPU_012344		 missing some N-terminus residues\n
GLEAN3_12348	SPU_012348		 missing some N-terminus residues\n
GLEAN3_12369	SPU_012369		 missing stretch middle\n
GLEAN3_12390	SPU_012390		 partial, missing N-terminus\n
GLEAN3_12615	SPU_012615		 missing stretch in middle\n
GLEAN3_12642	SPU_012642		 partial, missing C-terminus\n
GLEAN3_12644	SPU_012644		 extra N-terminus\n
GLEAN3_12659	SPU_012659	After reviewing the data and performing a BLAST search it appears that this is the best and only results for this particular GLEAN model. When examining the excel data, it was apparent that there was a gap from 293-346 and the sequence has poor coverage overall. This is an un-annotated gene so no additional information was available from Baylor annotations. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong with the majority of values being <10	 missing stretch in middle\n
GLEAN3_12695	SPU_012695		 missing N-terminus\n
GLEAN3_12698	SPU_012698		 missing stretch in middle\n
GLEAN3_12790	SPU_012790		 partial, missing N-terminus\n
GLEAN3_12817	SPU_012817		 missing stretch in middle\n
GLEAN3_13130	SPU_013130		 partial, missing N-terminus\n
GLEAN3_13232	SPU_013232		 partial, missing N-terminus\n
GLEAN3_13283	SPU_013283		 partial, missing N-terminus\n
GLEAN3_13353	SPU_013353		 missing stretch in middle\n
GLEAN3_13586	SPU_013586		 partial, missing C-terminus\n
GLEAN3_13757	SPU_013757		 partial, missing N-terminus\n
GLEAN3_13853	SPU_013853		 extra stretch in middle\n
GLEAN3_13906	SPU_013906		 missing stretches in middle\n
GLEAN3_13936	SPU_013936		 missing stretch, extra stretch in middle\n
GLEAN3_13966	SPU_013966		 missing stretch in middle\n
GLEAN3_14019	SPU_014019		 partial, missing N-terminus, missing stretch in middle\n
GLEAN3_14062	SPU_014062		 missing N-terminus, extra stretch in middle\n
GLEAN3_14176	SPU_014176		 missing N-terminus\n
GLEAN3_14341	SPU_014341		 partial, missing N- and C-terminus\n
GLEAN3_14342	SPU_014342		 partial, missing N-terminus\n
GLEAN3_14372	SPU_014372		 partial, missing N-terminus\n
GLEAN3_14460	SPU_014460		 partial, missing C-terminus\n
GLEAN3_14504	SPU_014504		 partial, missing N-terminus;\n
GLEAN3_14595	SPU_014595		 partial, missing N-terminus\n
GLEAN3_14653	SPU_014653		 missing central stretch\n
GLEAN3_14694	SPU_014694		 partial, missing N-terminus\n
GLEAN3_14710	SPU_014710		 partial, missing N-terminus and a stretch in middle\n
GLEAN3_14718	SPU_014718		 missing N-terminus and central stretch\n
GLEAN3_14939	SPU_014939		 partial, missing C-terminus\n
GLEAN3_15089	SPU_015089		 partial, missing N-terminus\n
GLEAN3_15155	SPU_015155		 partial, missing C-terminus and stretch in middle\n
GLEAN3_15257	SPU_015257		 partial, missing C-terminus\n
GLEAN3_15283	SPU_015283		 partial, missing N-terminus\n
GLEAN3_15310	SPU_015310		 missing C-terminus\n
GLEAN3_15323	SPU_015323		 missing N-terminus\n
GLEAN3_15348	SPU_015348		 missing N-terminus and stretch in middle\n
GLEAN3_15372	SPU_015372		 missing N-terminus\n
GLEAN3_15485	SPU_015485		 missing N-terminus\n
GLEAN3_15486	SPU_015486		 missing N-terminus\n
GLEAN3_15507	SPU_015507		 missing N-terminus residues\n
GLEAN3_15737	SPU_015737		 extra N-terminus\n
GLEAN3_15738	SPU_015738		 extra N-terminus residues\n
GLEAN3_15770	SPU_015770		 missing C-terminus\n
GLEAN3_15842	SPU_015842		 missing N- and C-terminus\n
GLEAN3_15851	SPU_015851		 missing C-terminus\n
GLEAN3_15894	SPU_015894		 missing C-terminus\n
GLEAN3_15966	SPU_015966		 missing C-terminus\n
GLEAN3_15988	SPU_015988		 missing C-terminus\n
GLEAN3_16076	SPU_016076		 missing C-terminus; extra N-terminus residues\n
GLEAN3_16082	SPU_016082		 partial, missing N-terminus\n
GLEAN3_16205	SPU_016205		 missing N-terminus\n
GLEAN3_16277	SPU_016277		 missing N-terminus\n
GLEAN3_16344	SPU_016344		 missing N-terminus residues\n
GLEAN3_16345	SPU_016345		 missing N-terminus residues\n
GLEAN3_16350	SPU_016350		 missing N-terminus residues\n
GLEAN3_16377	SPU_016377		 missing N-terminus residues\n
GLEAN3_16383	SPU_016383		 missing N-terminus\n
GLEAN3_16494	SPU_016494		 partial, missing N-terminus\n
GLEAN3_16520	SPU_016520		 missing N-terminus\n
GLEAN3_16651	SPU_016651		 missing N-terminus\n
GLEAN3_16702	SPU_016702		 missing C-terminus\n
GLEAN3_16825	SPU_016825		 missing C-terminus\n
GLEAN3_16826	SPU_016826		 missing C-terminus\n
GLEAN3_16831	SPU_016831		 missing N-terminus\n
GLEAN3_16860	SPU_016860		 missing stretch in middle\n
GLEAN3_16882	SPU_016882		 missing stretch in middle\n
GLEAN3_17058	SPU_017058		 missing N-terminus\n
GLEAN3_17086	SPU_017086		 missing N-terminus\n
GLEAN3_17099	SPU_017099		 missing C-terminus\n
GLEAN3_17257	SPU_017257		 missing N-terminus\n
GLEAN3_17403	SPU_017403		 missing C-terminus\n
GLEAN3_17417	SPU_017417		 missing C-terminus\n
GLEAN3_17478	SPU_017478		 missing N- and C-terminus; very similar to DAG kinase zeta form\n
GLEAN3_17538	SPU_017538		 missing N-terminus\n
GLEAN3_17573	SPU_017573		 missing C-terminus\n
GLEAN3_17585	SPU_017585		 missing C-terminus\n
GLEAN3_17661	SPU_017661		 missing N-terminus\n
GLEAN3_17759	SPU_017759		 missing N-terminus\n
GLEAN3_17992	SPU_017992	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the 2 scaffolds were combined the sequence would have an orderly continuous arrangement with no internal repeats or gaps. The transcriptome intensity scores were difficult to determine due to a very large outlier of about 125 that distorted the scale of the graph. However, the rest of the values appeared to be weak (less than 5.) There was also no Est information available from GBrowse assembly V0.5. This is an un-annotated gene so no additional gene information was available from Baylor annotations ( gene comments).	 missing N-terminus\n
GLEAN3_18109	SPU_018109	After reviewing the data and performing	 missing N-terminus\n
GLEAN3_18110	SPU_018110		 missing N- and C-terminus\n
GLEAN3_18127	SPU_018127		 extra N-terminus, missing C-terminus\n
GLEAN3_18177	SPU_018177		 missing N-terminus\n
GLEAN3_18270	SPU_018270		 missing N-terminus, missing stretch in middle\n
GLEAN3_18322	SPU_018322		 missing N-terminus residues\n
GLEAN3_18421	SPU_018421		 missing N-terminus\n
GLEAN3_18431	SPU_018431		 missing C-terminus\n
GLEAN3_18433	SPU_018433		 missing N-terminus\n
GLEAN3_18446	SPU_018446		 missing C-terminus\n
GLEAN3_18466	SPU_018466		 missing N-terminus residues\n
GLEAN3_18527	SPU_018527		 missing C-terminus\n
GLEAN3_18618	SPU_018618		 missing C-terminus\n
GLEAN3_18620	SPU_018620		 missing N- and C-terminus\n
GLEAN3_18748	SPU_018748		 missing N-terminus\n
GLEAN3_18758	SPU_018758		 missing C-terminus\n
GLEAN3_18907	SPU_018907		 extra N-terminus\n
GLEAN3_19001	SPU_019001		 missing N-terminus\n
GLEAN3_19016	SPU_019016		 missing N-terminus\n
GLEAN3_19288	SPU_019288		 partial, missing N-terminus\n
GLEAN3_19357	SPU_019357		 missing stretch in middle\n
GLEAN3_19398	SPU_019398		 missing stretch in middle\n
GLEAN3_19420	SPU_019420		 missing N-terminus\n
GLEAN3_19421	SPU_019421		 missing N-terminus\n
GLEAN3_19428	SPU_019428		 missing N-terminus\n
GLEAN3_19468	SPU_019468		 partial, missing C-terminus\n
GLEAN3_19471	SPU_019471		 missing C-terminus\n
GLEAN3_19505	SPU_019505		 missing N-terminus\n
GLEAN3_19692	SPU_019692		 missing N-terminus\n
GLEAN3_19744	SPU_019744		 missing N-terminus\n
GLEAN3_19914	SPU_019914		 missing C-terminus\n
GLEAN3_22002	SPU_022002		 extra stretch in middle\n
GLEAN3_22028	SPU_022028		 missing N- and C-terminus\n
GLEAN3_22045	SPU_022045		 missing C-terminus\n
GLEAN3_22129	SPU_022129	After reviewing the data and performing a BLAST search it appears that this is the best fit for this particular GLEAN model based on the sequence coverage and orderly arrangement.  There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores were widely dispersed with several high values (40-70) and several moderate values (9-15). This is an un-annotated gene so no additional information was available from Baylor annotations assembly V0.5.	 missing some N-terminus residues\n
GLEAN3_22252	SPU_022252		 missing N-terminus\n
GLEAN3_22260	SPU_022260		 missing short stretch in middle\n
GLEAN3_22263	SPU_022263		 missing N-terminus\n
GLEAN3_22326	SPU_022326		 missing stretch in middle\n
GLEAN3_22378	SPU_022378		 missing C-terminus\n
GLEAN3_22397	SPU_022397		 missing C-terminus\n
GLEAN3_22399	SPU_022399		 extra stretch in middle\n
GLEAN3_22597	SPU_022597		 missing stretches in middle\n
GLEAN3_22651	SPU_022651		 missing N-terminus\n
GLEAN3_22750	SPU_022750		 missing C-terminus, extra stretch in middle\n
GLEAN3_22807	SPU_022807		 missing stretch in middle\n
GLEAN3_22949	SPU_022949		 missing N-terminus\n
GLEAN3_22986	SPU_022986		 missing N-terminus\n
GLEAN3_23118	SPU_023118		 missing N-terminus\n
GLEAN3_23270	SPU_023270		 missing N-terminus, missing stretch in middle\n
GLEAN3_23332	SPU_023332		 missing N-terminus\n
GLEAN3_23511	SPU_023511		 missing stretch\n
GLEAN3_23691	SPU_023691		 missing N-terminus\n
GLEAN3_23764	SPU_023764		 missing N-terminus\n
GLEAN3_23829	SPU_023829		 missing C-terminus\n
GLEAN3_23834	SPU_023834		 missing stretches\n
GLEAN3_23842	SPU_023842		 extra stretch in middle\n
GLEAN3_23859	SPU_023859		 missing C-terminus\n
GLEAN3_23890	SPU_023890		 missing N-terminus\n
GLEAN3_23942	SPU_023942		 missing N-terminus\n
GLEAN3_20026	SPU_020026		 missing C-terminus\n
GLEAN3_20089	SPU_020089		 missing N-terminus\n
GLEAN3_20200	SPU_020200		 missing some N-terminus residues\n
GLEAN3_20211	SPU_020211		 missing some C-terminus residues\n
GLEAN3_20302	SPU_020302		 missing C-terminus\n
GLEAN3_20368	SPU_020368		 missing N- and C-terminus\n
GLEAN3_20397	SPU_020397		 missing C-terminus\n
GLEAN3_20402	SPU_020402		 missing N-terminus\n
GLEAN3_20445	SPU_020445		 missing C-terminus\n
GLEAN3_20497	SPU_020497		 missing N-terminus\n
GLEAN3_20566	SPU_020566		 missing N-terminus residues\n
GLEAN3_20576	SPU_020576		 missing C-terminus\n
GLEAN3_20707	SPU_020707		 missing N- and C-terminus\n
GLEAN3_20739	SPU_020739		 missing N-terminus\n
GLEAN3_20886	SPU_020886		 missing stretch in middle\n
GLEAN3_20970	SPU_020970		 missing N-terminus\n
GLEAN3_21058	SPU_021058		 missing N-terminus\n
GLEAN3_21355	SPU_021355		 missing N-terminus\n
GLEAN3_21387	SPU_021387		 missing C-terminus\n
GLEAN3_21465	SPU_021465		 missing C-terminus\n
GLEAN3_21628	SPU_021628		 missing stretch in middle\n
GLEAN3_21658	SPU_021658		 missing C-terminus\n
GLEAN3_21772	SPU_021772		 missing N-terminus\n
GLEAN3_21788	SPU_021788		 missing C-terminus\n
GLEAN3_21802	SPU_021802		 missing N-terminus\n
GLEAN3_21854	SPU_021854		 extra N-terminus\n
GLEAN3_21867	SPU_021867		 missing N- and C-terminus\n
GLEAN3_21895	SPU_021895		 missing C-terminus\n
GLEAN3_21933	SPU_021933		 missing N- and C-terminus\n
GLEAN3_21934	SPU_021934		 missing C-terminus\n
GLEAN3_21979	SPU_021979		 missing N-terminus\n
GLEAN3_24016	SPU_024016		 missing N-terminus residues\n
GLEAN3_24174	SPU_024174		 missing N-terminus\n
GLEAN3_24224	SPU_024224		 missing N-terminus\n
GLEAN3_24261	SPU_024261		 missing C-terminus\n
GLEAN3_24267	SPU_024267	When reviewing the Excel data, subject gb|DS015797| was unique in that it appeared to have an orderly continuous arrangement with out any gaps or internal repeats present until the sequence reached about 1000. However, after performing a BLAST search additional sequence information came up that was not present in the excel data and the sequence appeared to be distributed onto 2 different scaffolds. The transcriptome intensity scores appeared to be somewhat weak (less than 5) with exception to 2 outliers of about 17 and 55 that were present. There was no Est. information available from GBrowse assembly V0.5. This is an un-annotated gene so no additional gene information (comments) were available from Baylor annotations. 	 missing C-terminus, extra stretch in middle\n
GLEAN3_24383	SPU_024383		 missing N-terminus\n
GLEAN3_24483	SPU_024483		 missing N-terminus\n
GLEAN3_24535	SPU_024535		 missing C-terminus\n
GLEAN3_24622	SPU_024622		 missing N-terminus\n
GLEAN3_24755	SPU_024755		 extra N-terminus; missing C-terminus\n
GLEAN3_24775	SPU_024775		 missing C-terminus\n
GLEAN3_24846	SPU_024846		 missing N-terminus\n
GLEAN3_24873	SPU_024873		 missing N-terminus\n
GLEAN3_24970	SPU_024970		 missing N-terminus, missing stretch in middle\n
GLEAN3_25092	SPU_025092		 missing N-terminus\n
GLEAN3_25100	SPU_025100		 missing C-terminus\n
GLEAN3_25411	SPU_025411		 missing N-terminus\n
GLEAN3_25567	SPU_025567		 missing N-terminus\n
GLEAN3_25661	SPU_025661		 missing C-terminus\n
GLEAN3_25697	SPU_025697		 missing N-terminus\n
GLEAN3_25702	SPU_025702		 missing N-terminus\n
GLEAN3_25770	SPU_025770		 missing N-terminus\n
GLEAN3_25858	SPU_025858		 missing C-terminus\n
GLEAN3_25989	SPU_025989		 missing N-terminus, missing short stretch in middle\n
GLEAN3_26127	SPU_026127		 missing C-terminus\n
GLEAN3_26212	SPU_026212		 missing C-terminus\n
GLEAN3_26291	SPU_026291		 missing some C-terminus residues\n
GLEAN3_26437	SPU_026437		 missing N-terminus\n
GLEAN3_26552	SPU_026552		 missing N-terminus\n
GLEAN3_26556	SPU_026556		 missing N-terminus\n
GLEAN3_26625	SPU_026625		 missing C-terminus\n
GLEAN3_26639	SPU_026639		 missing some N-terminus residues\n
GLEAN3_26702	SPU_026702		 missing C-terminus\n
GLEAN3_26737	SPU_026737		 missing C-terminus\n
GLEAN3_26881	SPU_026881		 extra N-terminus residues\n
GLEAN3_27010	SPU_027010		 missing stretches in middle\n
GLEAN3_27078	SPU_027078		 missing some N-terminus residues\n
GLEAN3_27209	SPU_027209		 partial, missing C-terminus\n
GLEAN3_27304	SPU_027304		 missing some N-terminus residues\n
GLEAN3_27344	SPU_027344		 missing some N-terminus residues\n
GLEAN3_27388	SPU_027388		 partial, missing N-terminus\n
GLEAN3_27650	SPU_027650		 partial, missing C-terminus\n
GLEAN3_27669	SPU_027669		 missing some N-terminus residues\n
GLEAN3_28001	SPU_028001		 missing N-terminus residues\n
GLEAN3_28105	SPU_028105		 missing stretch\n
GLEAN3_28139	SPU_028139		 extra N-terminus\n
GLEAN3_28141	SPU_028141		 missing some N-terminus residues\n
GLEAN3_28167	SPU_028167		 partial, missing C-terminus\n
GLEAN3_28178	SPU_028178		 missing N-terminus\n
GLEAN3_28238	SPU_028238		 partial, missing N-terminus\n
GLEAN3_28504	SPU_028504		 missing N-terminus, missing stretch in middle\n
GLEAN3_28572	SPU_028572		 partial, missing N-terminus\n
GLEAN3_28573	SPU_028573		 partial, missing N-terminus\n
GLEAN3_28728	SPU_028728		 extra N-terminus residues\n
GLEAN3_03616	SPU_003616		 partial, missing N-terminus\n
GLEAN3_03731	SPU_003731		 partial, missing N-terminus\n
GLEAN3_04870	SPU_004870		 extra N-terminus\n
GLEAN3_08806	SPU_008806		 missing N-terminus half\n
GLEAN3_10539	SPU_010539		 missing N-terminus half\n
GLEAN3_13979	SPU_013979		 missing N-terminus\n
GLEAN3_15618	SPU_015618		 missing most of the C-terminus\n
GLEAN3_18531	SPU_018531		 missing N-terminus half\n
GLEAN3_20972	SPU_020972		 missing some N-terminus residues\n
GLEAN3_23656	SPU_023656		 missing N-terminus half\n
GLEAN3_24406	SPU_024406		 missing N-terminus half\n
GLEAN3_24714	SPU_024714		 missing some N-terminus residues\n
GLEAN3_24790	SPU_024790		 missing N-terminus half\n
GLEAN3_26832	SPU_026832		 missing most of the C-terminus\n
GLEAN3_27484	SPU_027484		 missing N-terminus half\n
GLEAN3_27859	SPU_027859		 missing N-terminus half\n
GLEAN3_00068	SPU_000068		 partial, missing N-terminus\n
GLEAN3_00334	SPU_000334		 extra stretch in middle\n
GLEAN3_00967	SPU_000967		 partial, missing N-terminus\n
GLEAN3_01285	SPU_001285		 missing stretch in middle\n
GLEAN3_01563	SPU_001563		 partial, missing C-terminus\n
GLEAN3_02346	SPU_002346		 partial, missing N-terminus\n
GLEAN3_02465	SPU_002465		 partial, missing N-terminus\n
GLEAN3_02709	SPU_002709		 missing N-terminus\n
GLEAN3_03103	SPU_003103		 extra N-terminus\n
GLEAN3_03394	SPU_003394		 partial, missing N-terminus\n
GLEAN3_03812	SPU_003812		 partial, missing N-terminus\n
GLEAN3_04751	SPU_004751		 partial, missing N-terminus\n
GLEAN3_04937	SPU_004937	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed on to 2 different scaffolds. There are several repeats within each scaffold that is apparent from both the BLAST results and the excel data. The sequence does however have an orderly arrangement. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being above 5 and several that were above 10.	 extra N-terminus\n
GLEAN3_04990	SPU_004990		 partial, missing N-terminus\n
GLEAN3_05195	SPU_005195		 partial, missing N-terminus\n
GLEAN3_05196	SPU_005196		 missing stretches in middle\n
GLEAN3_05590	SPU_005590		 partial, missing C-terminus\n
GLEAN3_06347	SPU_006347		 extra stretch in middle\n
GLEAN3_06438	SPU_006438		 partial, missing N-terminus\n
GLEAN3_06617	SPU_006617		 partial, missing N- and C-terminus\n
GLEAN3_09308	SPU_009308		 partial, missing N-terminus and stretches in middle\n
GLEAN3_11046	SPU_011046		 extra N-terminus\n
GLEAN3_11345	SPU_011345		 missing stretch in middle\n
GLEAN3_12046	SPU_012046		 missing some N-terminus residues\n
GLEAN3_12806	SPU_012806		 extra stretch in middle, missing another stretch\n
GLEAN3_13454	SPU_013454		 missing stretches in middle\n
GLEAN3_22704	SPU_022704		 partial, missing N- and C-terminus\n
GLEAN3_27953	SPU_027953		 partial, missing C-terminus\n
GLEAN3_11059	SPU_011059		 extra N-terminus\n
GLEAN3_11934	SPU_011934		 extra N-terminus\n
GLEAN3_12498	SPU_012498		 missing N-terminus\n
GLEAN3_12676	SPU_012676		 missing C-terminus\n
GLEAN3_12814	SPU_012814		 partial, missing N-terminus\n
GLEAN3_13310	SPU_013310		 missing stretch in middle, missing C-terminus\n
GLEAN3_13361	SPU_013361		 partial, missing N-terminus\n
GLEAN3_13539	SPU_013539		 missing N-terminus\n
GLEAN3_13587	SPU_013587		 partial, missing C-terminus\n
GLEAN3_13735	SPU_013735		 extra stretch in middle\n
GLEAN3_14024	SPU_014024		 missing C-terminus/central stretch\n
GLEAN3_14152	SPU_014152		 missing C-terminus\n
GLEAN3_14338	SPU_014338		 partial, missing C-terminus\n
GLEAN3_14519	SPU_014519		 missing C-terminus\n
GLEAN3_14596	SPU_014596		 extra C-terminus\n
GLEAN3_14615	SPU_014615		 missing N-terminus\n
GLEAN3_14937	SPU_014937		 missing N-terminus, extra stretch in middle\n
GLEAN3_15162	SPU_015162		 missing C-terminus\n
GLEAN3_15314	SPU_015314		 partial, missing N- and C-terminus\n
GLEAN3_15853	SPU_015853		 extra stretch in middle\n
GLEAN3_15946	SPU_015946		 missing C-terminus residues\n
GLEAN3_16164	SPU_016164		 missing C-terminus and  stretch in middle\n
GLEAN3_16679	SPU_016679		 missing N-terminus\n
GLEAN3_16853	SPU_016853		 missing N-terminus, missing stretch in middle\n
GLEAN3_16947	SPU_016947		 missing C-terminus\n
GLEAN3_17326	SPU_017326		 extra C-terminus\n
GLEAN3_17681	SPU_017681		 extra stretch in middle\n
GLEAN3_06711	SPU_006711		 missing C-terminus\n
GLEAN3_07009	SPU_007009		 missing C-terminus, extra stretch in middle\n
GLEAN3_07217	SPU_007217		 partial, missing C-terminus\n
GLEAN3_07403	SPU_007403		 missing C-terminus\n
GLEAN3_08057	SPU_008057		 extra C-terminus\n
GLEAN3_08192	SPU_008192		 extra C-terminus\n
GLEAN3_08230	SPU_008230		 partial, missing N- and C-terminus\n
GLEAN3_08397	SPU_008397		 missing N- and C-terminus\n
GLEAN3_08467	SPU_008467		 extra C-terminus\n
GLEAN3_08631	SPU_008631		 partial, missing N- and C-terminus\n
GLEAN3_08633	SPU_008633		 missing N-terminus, extra C-terminus\n
GLEAN3_08869	SPU_008869		 partial, missing N- and C-terminus\n
GLEAN3_08934	SPU_008934		 partial, missing N- and C-terminus\n
GLEAN3_08937	SPU_008937		 missing C-terminus\n
GLEAN3_09014	SPU_009014		 partial, missing C-terminus\n
GLEAN3_09204	SPU_009204		 partial, missing C-terminus\n
GLEAN3_09336	SPU_009336		 extra C-terminus\n
GLEAN3_09416	SPU_009416		 missing C-terminus\n
GLEAN3_09458	SPU_009458		 partial, missing C-terminus\n
GLEAN3_09494	SPU_009494		 missing N-terminus\n
GLEAN3_09871	SPU_009871		 extra N-terminus\n
GLEAN3_14156	SPU_014156		 missing N- and C-terminus, extra stretches in middle\n
GLEAN3_14280	SPU_014280		 partial, missing N-terminus half\n
GLEAN3_14789	SPU_014789		 missing N-terminus\n
GLEAN3_15067	SPU_015067		 extra N-terminus\n
GLEAN3_16878	SPU_016878		 missing some N-terminus residues\n
GLEAN3_16897	SPU_016897		 partial, missing N-terminus\n
GLEAN3_16916	SPU_016916		 partial, missing C-terminus\n
GLEAN3_17350	SPU_017350	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed on to 2 different scaffolds. When reviewing the excel data, it is apparent that there are several internal repeats present as well as gaps. There was Est support from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed, with values ranging from about 2-36. 	 extra C-terminus\n
GLEAN3_17736	SPU_017736		 extra N-terminus residues\n
GLEAN3_17988	SPU_017988		 partial, missing N-terminus half\n
GLEAN3_18396	SPU_018396		 missing N-terminus\n
GLEAN3_18653	SPU_018653		 extra N-terminus\n
GLEAN3_19095	SPU_019095		 partial, missing N-terminus\n
GLEAN3_19683	SPU_019683		 missing steches in middle\n
GLEAN3_20426	SPU_020426		 partial, missing N-terminus half\n
GLEAN3_20467	SPU_020467		 partial, missing N- and C-terminus\n
GLEAN3_20639	SPU_020639		 partial, missing N-terminus\n
GLEAN3_20860	SPU_020860		 missing N- and C-terminus\n
GLEAN3_21110	SPU_021110		 missing N-terminus\n
GLEAN3_21455	SPU_021455		 missing N-terminus\n
GLEAN3_21878	SPU_021878		 extra stretch in middle\n
GLEAN3_23232	SPU_023232		 extra stretch in middle, missing C-terminus\n
GLEAN3_23320	SPU_023320		 missing C-terminus\n
GLEAN3_23426	SPU_023426		 missing N-terminus\n
GLEAN3_23457	SPU_023457		 partial, missing N-terminus half\n
GLEAN3_23702	SPU_023702		 partial, missing C-terminus\n
GLEAN3_23760	SPU_023760		 partial, missing N-terminus half\n
GLEAN3_23846	SPU_023846		 partial, missing C-terminus half\n
GLEAN3_25439	SPU_025439		 missing C-terminus\n
GLEAN3_25470	SPU_025470		 missing N-terminus\n
GLEAN3_25929	SPU_025929		 missing N-terminus\n
GLEAN3_25934	SPU_025934		 partial, missing N-terminus helf\n
GLEAN3_26466	SPU_026466		 partial, missing N- and C-terminus\n
GLEAN3_26857	SPU_026857		 missing C-terminus\n
GLEAN3_27797	SPU_027797		 missing central stretch\n
GLEAN3_27850	SPU_027850		 extra stretch in middle\n
GLEAN3_28290	SPU_028290		 extra C-terminus\n
GLEAN3_28310	SPU_028310		 extra N-terminus\n
GLEAN3_28802	SPU_028802		 missing C-terminus\n
GLEAN3_03330	SPU_003330		 partial, missing middle stretch and C-terminus\n
GLEAN3_03559	SPU_003559		 partial, missing C-terminus\n
GLEAN3_03859	SPU_003859	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is evident that the 2 scaffolds have an orderly arrangement about them. However, the sequence is truncated at 950 resulting in poor overall coverage. There was no Est. information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall (excluding one outlier at 77) with most of the values being less than 10.  	 partial, missing C-terminus\n
GLEAN3_04907	SPU_004907		 partial, missing C-terminus\n
GLEAN3_05150	SPU_005150		 partial, missing N-terminus\n
GLEAN3_06125	SPU_006125		 partial, missing C-terminus\n
GLEAN3_06705	SPU_006705		 partial, missing N-terminus\n
GLEAN3_06845	SPU_006845		 partial, missing C-terminus\n
GLEAN3_07043	SPU_007043		 missing N-terminus\n
GLEAN3_18060	SPU_018060		 extra stretch on C-terminus\n
GLEAN3_19510	SPU_019510		 extra N-terminus half\n
GLEAN3_20937	SPU_020937		 missing C-terminus\n
GLEAN3_21169	SPU_021169		 missing C-terminus\n
GLEAN3_24001	SPU_024001		 missing N-terminus\n
GLEAN3_24891	SPU_024891		 extra N-terminus\n
GLEAN3_25779	SPU_025779		 missing N-terminus, extra C-terminus\n
GLEAN3_26067	SPU_026067		 missing N-terminus\n
GLEAN3_26271	SPU_026271	After reviewing the data and doing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first scaffold doesn't start until the 9th base pair and continues until 466. If the two scaffolds were combined there would be a small overlap between the two sequences from 465-556. Other than the overlap, if the two scaffolds were combined, the sequence would have an orderly, continuous arrangement. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 10. 	 extra N-terminus\n
GLEAN3_26777	SPU_026777		 extra N-terminus half\n
GLEAN3_26817	SPU_026817		 missing N-terminus half\n
GLEAN3_26904	SPU_026904		 extra strectch in middle\n
GLEAN3_26913	SPU_026913		 missing C-terminus half\n
GLEAN3_27400	SPU_027400		 extra C-terminus\n
GLEAN3_27510	SPU_027510		 missing N-terminus half, extra C-terminus\n
GLEAN3_27914	SPU_027914		 missing N-terminus half\n
GLEAN3_28042	SPU_028042		 missing C-terminus half\n
GLEAN3_28587	SPU_028587		 missing C-terminus\n
GLEAN3_28628	SPU_028628		 missing C-terminus, extra N-terminus\n
GLEAN3_28876	SPU_028876		 missing stretch in middle\n
GLEAN3_00862	SPU_000862		 partial, missing N-terminus\n
GLEAN3_01188	SPU_001188		 missing N-terminus\n
GLEAN3_01363	SPU_001363		 partial, missing C-terminus\n
GLEAN3_01561	SPU_001561		 partial, missing N- and C-terminus\n
GLEAN3_02713	SPU_002713		 partial, missing N-terminus\n
GLEAN3_03022	SPU_003022		 missing C-terminus\n
GLEAN3_03273	SPU_003273		 partial, missing N-terminus\n
GLEAN3_03429	SPU_003429		 partial, missing N-terminus\n
GLEAN3_04380	SPU_004380		 partial, missing N-terminus\n
GLEAN3_04666	SPU_004666		 missing N-terminus\n
GLEAN3_05157	SPU_005157		 partial, missing N-terminus\n
GLEAN3_06877	SPU_006877		 partial, missing N-terminus\n
GLEAN3_11374	SPU_011374		 partial, missing N-terminus\n
GLEAN3_12901	SPU_012901		 partial, missing N-terminus\n
GLEAN3_15291	SPU_015291		 partial, missing N-terminus\n
GLEAN3_15488	SPU_015488		 partial, missing N-terminus\n
GLEAN3_16048	SPU_016048		 partial, missing N- and C-terminus\n
GLEAN3_17522	SPU_017522		 partial, missing N-terminus\n
GLEAN3_19648	SPU_019648		 partial, missing N-terminus\n
GLEAN3_19864	SPU_019864		 partial, missing N- and C-terminus\n
GLEAN3_20156	SPU_020156		 partial, missing N-terminus\n
GLEAN3_20751	SPU_020751		 partial, missing N- and C-terminus\n
GLEAN3_21188	SPU_021188		 partial, missing N-terminus\n
GLEAN3_21571	SPU_021571		 partial, missing N-terminus\n
GLEAN3_21685	SPU_021685		 missing N-terminus\n
GLEAN3_23801	SPU_023801		 partial, missing N-terminus\n
GLEAN3_24740	SPU_024740		 partial, missing N- and C-terminus\n
GLEAN3_24922	SPU_024922		 partial, missing N-terminus\n
GLEAN3_27274	SPU_027274		 partial, missing N-terminus\n
GLEAN3_27775	SPU_027775		 partial, missing N-terminus\n
GLEAN3_27849	SPU_027849		 partial, missing N-terminus\n
GLEAN3_28164	SPU_028164		 partial, missing C-terminus\n
GLEAN3_28496	SPU_028496		 partial, missing N-terminus\n
GLEAN3_10026	SPU_010026		 partial, missing C-terminus\n
GLEAN3_10208	SPU_010208		 partial, missing N-terminus\n
GLEAN3_10211	SPU_010211		 extra stretches\n
GLEAN3_10270	SPU_010270		 partial, missing C-terminus\n
GLEAN3_10310	SPU_010310		 partial, missing N-terminus\n
GLEAN3_10479	SPU_010479		 partial, missing N-terminus\n
GLEAN3_10549	SPU_010549		 partial, missing C-terminus\n
GLEAN3_10644	SPU_010644		 partial, missing C-terminus\n
GLEAN3_10645	SPU_010645		 partial, missing N-terminus\n
GLEAN3_10646	SPU_010646		 partial, missing N- and C-terminus\n
GLEAN3_10683	SPU_010683		 partial, missing C-terminus\n
GLEAN3_10686	SPU_010686		 extra N- and C-terminus\n
GLEAN3_10778	SPU_010778		 extra N-terminus\n
GLEAN3_10785	SPU_010785		 extra steches in middle\n
GLEAN3_11036	SPU_011036		 partial, missing N- and C-terminus\n
GLEAN3_11133	SPU_011133		 extra residues on N-terminus\n
GLEAN3_11187	SPU_011187	From the BLAST search and the excel data it was evident that the sequence was distributed onto 2 different scaffolds. It appears that both scaffolds have an orderly and continuous arrangement without any gaps or repeats. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 10. 	 partial, missing stretch in middle, missing C-terminus\n
GLEAN3_11213	SPU_011213		 partial, missing N-terminus\n
GLEAN3_11438	SPU_011438		 partial, missing N-terminus\n
GLEAN3_11449	SPU_011449		 partial, missing N-terminus\n
GLEAN3_11499	SPU_011499	After reviewing the data it appears that there is no sufficient GLEAN  model that fits GLEAN3_11499. The sequence appears distributed onto 2 different scaffolds. On >v2.1_scaffold48174 there is a gap from However, the sequence coverage doesn't start until about 1500 but from that point on there is an orderly arrangement of the sequence with no gaps. The total coverage of the sequence is a little low (3066/4155) but there appears to be no repeats and the BLAST search indicated that there was a low e-value of zero coupled with a high bit score. \nAdditional comment found: extra at N-terminus; extra stretch in middle.	 extra at N-terminus; extra stretch in middle\n
GLEAN3_11604	SPU_011604		 extra N-terminus\n
GLEAN3_11845	SPU_011845		 partial, missing N-terminus\n
GLEAN3_11893	SPU_011893		 missing C-terminus residues\n
GLEAN3_11978	SPU_011978		 partial, missing C-terminus\n
GLEAN3_12289	SPU_012289		 extra N-terminus\n
GLEAN3_12430	SPU_012430		 extra residues on C- and N-terminus\n
GLEAN3_12473	SPU_012473		 partial, missing C-terminus\n
GLEAN3_12675	SPU_012675		 missing N-terminus; extra C-terminus\n
GLEAN3_12789	SPU_012789		 partial, missing C-terminus\n
GLEAN3_12834	SPU_012834		 extra N-terminus residues\n
GLEAN3_13010	SPU_013010		 missing stretches in between\n
GLEAN3_13037	SPU_013037		 missing some N-terminus residues\n
GLEAN3_13046	SPU_013046		 partial, missing N-terminus\n
GLEAN3_13105	SPU_013105		 partial, missing N- and C-terminus\n
GLEAN3_13242	SPU_013242		 extra N-terminus; extra residues in center\n
GLEAN3_13338	SPU_013338		 extra N-terminus\n
GLEAN3_13376	SPU_013376		 missing some N-terminus residues\n
GLEAN3_13428	SPU_013428		 extra N-terminus residues\n
GLEAN3_13440	SPU_013440		 partial, missing N-terminus\n
GLEAN3_13463	SPU_013463	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. The first scaffold doesn't begin until the 9th base pair, however, it continues in an orderly arrangement until the scaffold is truncated at 234 where the rest of the sequence is carried on the second scaffold. There are no internal repeats present on either scaffold. There was some Est support availablr from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong. 	 partial, missing N-terminus and center stretch\n
GLEAN3_13750	SPU_013750		 partial, missing N-terminus\n
GLEAN3_13912	SPU_013912		 missing N-terminus\n
GLEAN3_13942	SPU_013942		 partial, missing C-terminus\n
GLEAN3_13943	SPU_013943		 partial, missing C-terminus\n
GLEAN3_14025	SPU_014025		 extra residues in center\n
GLEAN3_14135	SPU_014135		 extra N-terminus\n
GLEAN3_14136	SPU_014136		 extra N-terminus\n
GLEAN3_14251	SPU_014251	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data and comparing that to the BLAST results it is apparent that the sequence has an orderly continuous arrangement. If the two scaffolds were combined there would be good sequence coverage without any internal repeats or gaps present. There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed with some values either at or below 5 and some that were above 5. 	 partial, missing stretch in middle\n
GLEAN3_14315	SPU_014315		 partial, missing N-terminus, matches on small C-terminus part\n
GLEAN3_14406	SPU_014406		 partial, missing N-terminus\n
GLEAN3_14507	SPU_014507		 partial, missing N-terminus\n
GLEAN3_14569	SPU_014569		 partial, missing C-terminus\n
GLEAN3_15048	SPU_015048		 partial, missing N-terminus\n
GLEAN3_15062	SPU_015062		 extra N-terminus\n
GLEAN3_15069	SPU_015069		 partial, missing C-terminus\n
GLEAN3_15156	SPU_015156		 partial, missing N-terminus\n
GLEAN3_15169	SPU_015169		 extra N-terminus\n
GLEAN3_15343	SPU_015343		 missing N- and C-terminus\n
GLEAN3_15380	SPU_015380		 partial, missing N- and C-terminus\n
GLEAN3_15497	SPU_015497		 extra C-terminus\n
GLEAN3_15568	SPU_015568		 partial, missing N-terminus and central stretch\n
GLEAN3_15573	SPU_015573		 partial, missing N- and C-terminus\n
GLEAN3_15586	SPU_015586		 partial, missing N-terminus\n
GLEAN3_15781	SPU_015781		 missing C-terminus\n
GLEAN3_15793	SPU_015793	After reviewing the data and performing a BLAST search it appears that the data is distributed onto 2 different scaffolds. According to the excel data and the BLAST search, if the 2 scaffolds were combined the sequence would have an orderly, continuous arrangement. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be very weak excluding a very large outlier at about 100.	 partial, missing central stretch\n
GLEAN3_15814	SPU_015814		 partial, missing C-terminus\n
GLEAN3_15837	SPU_015837		 partial, missing C-terminus\n
GLEAN3_15963	SPU_015963		 partial, missing C-terminus\n
GLEAN3_15981	SPU_015981		 missing N-terminus\n
GLEAN3_15997	SPU_015997		 extra N-terminus\n
GLEAN3_16152	SPU_016152		 partial, missing central region\n
GLEAN3_16169	SPU_016169		 extra stretch in middle\n
GLEAN3_16302	SPU_016302		 missing N-terminus, extra C-terminus\n
GLEAN3_16324	SPU_016324		 missing N-terminus\n
GLEAN3_16347	SPU_016347		 partial, missing C-terminus\n
GLEAN3_16408	SPU_016408		 extra C-terminus\n
GLEAN3_16410	SPU_016410		 extra N-terminus\n
GLEAN3_16456	SPU_016456		 extra C-terminus\n
GLEAN3_16569	SPU_016569	After reviewing the subjects for GLEAN3_16569, it appears that there may be an assembly error in the sequence. The sequence appears to be ordered on the first and second scaffolds with internal repeats. There seems to be only one internal repeat that is repeated in the genome. There was no EST information available. It was unclear whether the transcriptome scores were weak or strong since they ranged from 3-15. This is an un-annotated gene so no additional comments were available on the Baylor page.	 extra N-terminus\n
GLEAN3_16627	SPU_016627	From the excel data and the BLAST results, it is evident that this is the best fit for this particular GLEAN model. When examining the excel data, it was clear that the scaffold contained an orderly arrangement without any internal repeats present, however, the entire sequence terminates at 1659 but it was apparent that this scaffold was truncated at 1199. There was no Est support available from GBRowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values ranging less than 10. 	 partial, missing C-terminus\n
GLEAN3_16652	SPU_016652	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. Both scaffolds appear to have an orderly arrangement without any gaps or repeats present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong.	 extra N-terminus\n
GLEAN3_16713	SPU_016713		 missing C-terminus and stretch in middle\n
GLEAN3_16747	SPU_016747		 partial, missing N-terminus and stretch in middle\n
GLEAN3_16824	SPU_016824		 extra N- and C-terminus\n
GLEAN3_16988	SPU_016988		 missing N-terminus\n
GLEAN3_16996	SPU_016996		 missing N-terminus\n
GLEAN3_17107	SPU_017107		 partial, missing N- and C-terminus\n
GLEAN3_17212	SPU_017212		 extra stretch in middle\n
GLEAN3_17281	SPU_017281		 partial, missing C-terminus\n
GLEAN3_17347	SPU_017347		 missing N-terminus\n
GLEAN3_17415	SPU_017415		 extra N-terminus\n
GLEAN3_17467	SPU_017467		 extra N-terminus\n
GLEAN3_17481	SPU_017481		 partial, missing N- and C-terminus\n
GLEAN3_17553	SPU_017553		 partial, missing N- and C-terminus\n
GLEAN3_17563	SPU_017563		 partial, missing N- and C-terminus\n
GLEAN3_17662	SPU_017662		 partial, missing C-terminus\n
GLEAN3_17695	SPU_017695		 partial, missing C-terminus and central stretch\n
GLEAN3_17801	SPU_017801		 extra N-terminus\n
GLEAN3_17836	SPU_017836	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When examining the first scaffold, it was apparent that there were 2 internal repeats (duplicates) present that occurred within 668-800 and 885-1016. There was also a sequence over that occurred within the second scaffold that occurred from 1208-1272 that was not apart of the sequence. The was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being about 5 (excluding one outlier value of 67.)	 partial, missing C-terminus\n
GLEAN3_17838	SPU_017838		 partial, missing N- and C-terminus\n
GLEAN3_17965	SPU_017965		 partial, missing C-terminus\n
GLEAN3_17990	SPU_017990		 partial, missing N-terminus\n
GLEAN3_18166	SPU_018166		 partial, missing C-terminus\n
GLEAN3_18179	SPU_018179		 partial, missing C-terminus and stretch in middle\n
GLEAN3_18240	SPU_018240		 partial, missing N-terminus\n
GLEAN3_18250	SPU_018250		 extra N-terminus\n
GLEAN3_18251	SPU_018251		 partial, missing N-terminus\n
GLEAN3_18271	SPU_018271		 missing stretch in middle\n
GLEAN3_18471	SPU_018471	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 3 different scaffolds. Within the first scaffold (v2.1_scaffold71909) there is a gap that spans from 294-1164 and the missing portion of this sequence is filled in by the second scaffold (v2.1_scaffold68450). There is an overlap between these 2 scaffolds within the beginning portion of the sequence as well. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values that were greater than 5.	 partial, missing N- and C-terminus\n
GLEAN3_18489	SPU_018489		 partial, missing C-terminus\n
GLEAN3_18522	SPU_018522		 ;partial, missing C-terminus\n
GLEAN3_18751	SPU_018751		 partial, missing N- and C-terminus\n
GLEAN3_18872	SPU_018872		 partial, missing N- and C-terminus\n
GLEAN3_18882	SPU_018882		 partial, missing N-terminus\n
GLEAN3_18894	SPU_018894		 partial, missing C-terminus\n
GLEAN3_18900	SPU_018900		 partial, missing N- and C-terminus\n
GLEAN3_18975	SPU_018975		 extra N- and C-terminus\n
GLEAN3_19004	SPU_019004		 extra stretch in middle\n
GLEAN3_19007	SPU_019007		 partial, missing C-terminus\n
GLEAN3_19074	SPU_019074	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When examining the excel data, it is apparent that there are numerous gaps within the scaffolds as well as internal repeats. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	 partial, missing N- and C-terminus\n
GLEAN3_19118	SPU_019118		 partial, missing N-terminus\n
GLEAN3_19177	SPU_019177		 extra stretch in middle\n
GLEAN3_19278	SPU_019278		 extra C-terminus\n
GLEAN3_19297	SPU_019297		 partial, missing N-terminus\n
GLEAN3_19409	SPU_019409		 partial, missing C-terminus\n
GLEAN3_19417	SPU_019417		 partial, missing N-terminus and central stretch\n
GLEAN3_19634	SPU_019634		 extra stretch in middle\n
GLEAN3_19640	SPU_019640		 partial, missing N-terminus\n
GLEAN3_19712	SPU_019712		 extra C-terminus\n
GLEAN3_19771	SPU_019771		 partial, missing N-terminus\n
GLEAN3_19781	SPU_019781		 extra N-terminus\n
GLEAN3_19811	SPU_019811		 partial, missing N-terminus\n
GLEAN3_19885	SPU_019885		 partial, missing N-terminus\n
GLEAN3_19970	SPU_019970		 partial, missing N-terminus\n
GLEAN3_20038	SPU_020038		 missing C-terminus\n
GLEAN3_20060	SPU_020060		 extra N-terminus\n
GLEAN3_20118	SPU_020118	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the two scaffolds were combined, the sequence would have an orderly continuous arrangement, without any gaps or repeats present. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed with values ranging from about 2-40. 	 missing N-terminus\n
GLEAN3_20138	SPU_020138		 missing N-terminus\n
GLEAN3_20143	SPU_020143		 extra N-terminus\n
GLEAN3_20155	SPU_020155		 partial, missing N-terminus\n
GLEAN3_20162	SPU_020162		 missing N-terminus\n
GLEAN3_20250	SPU_020250		 missing central region\n
GLEAN3_20362	SPU_020362		 extra N-terminus, missing stretch in midddle\n
GLEAN3_20435	SPU_020435		 missing N-terminus\n
GLEAN3_20530	SPU_020530		 missing N-terminus\n
GLEAN3_20673	SPU_020673		 extra N- and C-terminus\n
GLEAN3_20679	SPU_020679		 extra N-terminus\n
GLEAN3_20757	SPU_020757		 extra stretch in middle\n
GLEAN3_20758	SPU_020758		 partial, missing C-terminus\n
GLEAN3_20759	SPU_020759		 missing N-terminus\n
GLEAN3_20773	SPU_020773		 extra N-terminus\n
GLEAN3_20819	SPU_020819		 extra N-terminus\n
GLEAN3_20850	SPU_020850		 extra N-terminus\n
GLEAN3_20880	SPU_020880		 extra N-terminus\n
GLEAN3_21035	SPU_021035		 missing stretches in middle\n
GLEAN3_21103	SPU_021103		 missing N-terminus\n
GLEAN3_21267	SPU_021267		 missing N-terminus\n
GLEAN3_21559	SPU_021559		 partial, missing N- and C-terminus\n
GLEAN3_21598	SPU_021598		 extra N-terminus\n
GLEAN3_21625	SPU_021625		 partial, missing C-terminus\n
GLEAN3_21652	SPU_021652		 extra N-terminus\n
GLEAN3_21775	SPU_021775		 missing N-terminus residues\n
GLEAN3_21781	SPU_021781		 missing N-terminus\n
GLEAN3_21862	SPU_021862		 extra N-terminus\n
GLEAN3_21941	SPU_021941		 missing N-terminus\n
GLEAN3_21997	SPU_021997	After reviewing the data and performing a BLAST search it appears that for this particular GLEAN model there is no good match. There are numerous repeats, poor sequence coverage, and gaps within the sequence. There was no Est support available from GBrowse assembly V0.5.	 partial, missing N-terminus\n
GLEAN3_22108	SPU_022108		 extra N-terminus\n
GLEAN3_22135	SPU_022135		 missing C-terminus\n
GLEAN3_22159	SPU_022159		 missing N-terminus\n
GLEAN3_22186	SPU_022186		 missing N-terminus\n
GLEAN3_22349	SPU_022349		 missing N-terminus\n
GLEAN3_22476	SPU_022476		 extra N-terminus\n
GLEAN3_22548	SPU_022548		 missing C-terminus\n
GLEAN3_22549	SPU_022549	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. There are several gaps present within v2.1_scaffold81745 however; this missing sequence information is present within v2.1_scaffold18034. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong. 	 extra N-terminus\n
GLEAN3_22584	SPU_022584		 missing N-terminus\n
GLEAN3_22628	SPU_022628		 missing C-terminus\n
GLEAN3_22639	SPU_022639		 partial, missing N-terminus\n
GLEAN3_22796	SPU_022796		 missing N-terminus\n
GLEAN3_22900	SPU_022900		 extra stretch in middle\n
GLEAN3_22926	SPU_022926		 missing N-terminus\n
GLEAN3_23011	SPU_023011		 missing N-terminus\n
GLEAN3_23059	SPU_023059		 extra N-terminus\n
GLEAN3_23144	SPU_023144		 missing N-terminus\n
GLEAN3_23220	SPU_023220		 extra N-terminus\n
GLEAN3_23237	SPU_023237		 extra N-terminus\n
GLEAN3_23311	SPU_023311		 partial, missing C-terminus\n
GLEAN3_23630	SPU_023630		 partial, missing N-terminus\n
GLEAN3_23634	SPU_023634		 partial, missing C-terminus\n
GLEAN3_23638	SPU_023638		 partial, missing N-terminus\n
GLEAN3_23689	SPU_023689		 partial, missing N-terminus\n
GLEAN3_23693	SPU_023693		 extra N-terminus\n
GLEAN3_23816	SPU_023816		 partial\n
GLEAN3_23909	SPU_023909		 extra N-terminus\n
GLEAN3_23972	SPU_023972		 missing C-terminus\n
GLEAN3_23979	SPU_023979		 missing C-terminus\n
GLEAN3_24234	SPU_024234		 missing C-terminus\n
GLEAN3_24264	SPU_024264		 missing C-terminus\n
GLEAN3_24298	SPU_024298	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence (v2.1_scaffold161) doesn't begin until the 5th base pair and continues until 537. The rest of the sequence is completed on v2.1_scaffold40446 from 531. However, at 531 within this scaffold there is an internal repeat present. Excluding this repeat, if the two scaffolds were combined, the sequence would have an orderly continuous arrangement. There was Est information available from GBrowse V0.5 and the transcriptome intensity scores appear to be strong with most of the values ranging from 5-10	 missing N-terminus\n
GLEAN3_24312	SPU_024312		 partial, missing C-terminus\n
GLEAN3_24355	SPU_024355		 partial, missing N-terminus\n
GLEAN3_24450	SPU_024450		 missing N-terminus\n
GLEAN3_24522	SPU_024522		 missing N-terminus\n
GLEAN3_24629	SPU_024629		 missing N-terminus\n
GLEAN3_24668	SPU_024668		 partial, missing C-terminus\n
GLEAN3_24736	SPU_024736		 partial, missing C-terminus\n
GLEAN3_24774	SPU_024774		 partial, missing N- and C-terminus\n
GLEAN3_24862	SPU_024862		 partial, missing N-terminus\n
GLEAN3_24895	SPU_024895		 partial, missing N- and C-terminus\n
GLEAN3_24949	SPU_024949		 extra N-terminus\n
GLEAN3_25014	SPU_025014		 partial, missing N-terminus\n
GLEAN3_25024	SPU_025024		 missing N-terminus\n
GLEAN3_25036	SPU_025036		 extra N-terminus\n
GLEAN3_25052	SPU_025052		 partial, missing N-terminus\n
GLEAN3_25123	SPU_025123		 missing C-terminus\n
GLEAN3_25196	SPU_025196		 missing some N-terminus residues\n
GLEAN3_25227	SPU_025227		 extra N-terminus, missing C-terminus\n
GLEAN3_25313	SPU_025313		 missing C-terminus and stretch in middle\n
GLEAN3_25469	SPU_025469		 partial, missing C-terminus\n
GLEAN3_25545	SPU_025545		 missing C-terminus and stretch in middle\n
GLEAN3_25546	SPU_025546		 missing N-terminus\n
GLEAN3_25709	SPU_025709		 missing N-terminus\n
GLEAN3_25728	SPU_025728		 missing N-terminus and stretch in middle\n
GLEAN3_25751	SPU_025751		 missing N-terminus\n
GLEAN3_25758	SPU_025758		 missing N- and C-terminus\n
GLEAN3_25917	SPU_025917		 missing N-terminus\n
GLEAN3_25958	SPU_025958		 partial, missing C-terminus\n
GLEAN3_26160	SPU_026160		 extra stretch in middle\n
GLEAN3_26220	SPU_026220		 extra N-terminus\n
GLEAN3_26273	SPU_026273		 missing C-terminus\n
GLEAN3_26311	SPU_026311		 missing N-terminus\n
GLEAN3_26351	SPU_026351	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The beginning of the sequence (v2.1_scaffold3418) covers the first 925 base pairs and overlaps with the rest of the sequence (v2.1_scaffold63737) at 481. The beginning of the sequence (v2.1_scaffold3418) has a slightly higher bit score than some of the other scaffold results from BLAST. When reviewing the excel data it is apparent that there is an internal repeat on the second scaffold (v2.1_scaffold63737). This repeat is also visible from the BLAST results. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be weak overall with most of the values being >5	 extra N-terminus\n
GLEAN3_26352	SPU_026352		 missing N-terminus\n
GLEAN3_26475	SPU_026475		 partial, missing N- and C-terminus\n
GLEAN3_26558	SPU_026558		 missing N-terminus and stretch in middle\n
GLEAN3_26714	SPU_026714		 extra N-terminus\n
GLEAN3_26718	SPU_026718		 partial, missing C-terminus\n
GLEAN3_26807	SPU_026807		 extra N-terminus\n
GLEAN3_26833	SPU_026833		 partial, missing N-terminus\n
GLEAN3_26931	SPU_026931		 missing C-terminus and stretch in middle\n
GLEAN3_27152	SPU_027152		 partial, missing N- and C-terminus\n
GLEAN3_27176	SPU_027176		 extra N- and C-terminus\n
GLEAN3_27179	SPU_027179		 partial, missing C-terminus\n
GLEAN3_27374	SPU_027374		 partial, missing N-terminus\n
GLEAN3_27419	SPU_027419	After reviewing the data and performing BLAST searches, it was determined that no orderly GLEAN model fit sufficiently. There were matches on four different scaffolds. Scaffold v2.1_scaffold50623 was unique in that it matched the first one hundred bases pairs. However, it is difficult to distinguish between the remaining scaffolds due to the similar number in base pair matching and bit scores. The GBrowse assembly V0.5 revealed that there is EST information available and the transcriptome intensity scores appear to be very strong as well. Additional information found on the Baylor page under comments:	 missing stretch in middle, extra C-terminus\n
GLEAN3_27728	SPU_027728		 partial, missing C-terminus\n
GLEAN3_27756	SPU_027756		 partial, missing C-terminus\n
GLEAN3_27758	SPU_027758	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data it appears that if these 2 scaffolds were combined, the entire sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was some Est. support available from GBrowse assembly V0.5 and the transciptome intensity scores appeared to be weak with most of the values being less than 5.	 partial, missing C-terminus\n
GLEAN3_27814	SPU_027814		 missing N-terminus\n
GLEAN3_27852	SPU_027852		 extra N-terminus, missing stretch in middle\n
GLEAN3_27870	SPU_027870		 partial, missing C-terminus\n
GLEAN3_28096	SPU_028096		 extra N-terminus\n
GLEAN3_28097	SPU_028097		 extra stretch in middle\n
GLEAN3_28111	SPU_028111		 partial, missing C-terminus\n
GLEAN3_28274	SPU_028274		 missing C-terminus\n
GLEAN3_28305	SPU_028305		 missing N-terminus\n
GLEAN3_28451	SPU_028451		 partial, missing C-terminus\n
GLEAN3_28493	SPU_028493		 extra N-terminus\n
GLEAN3_28554	SPU_028554		 extra N-terminus\n
GLEAN3_28562	SPU_028562		 extra N-terminus\n
GLEAN3_28586	SPU_028586		 missing N- and C-terminus\n
GLEAN3_28763	SPU_028763	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data it is apparent that the two scaffolds had an orderly arrangement without any gaps or repeats present. If the two scaffolds were to be combined the overall sequence would have good coverage overall. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5l.	 extra N-terminus, missing C-terminus\n
GLEAN3_28794	SPU_028794		 partial, missing C-terminus\n
GLEAN3_28836	SPU_028836		 partial, missing C-terminus\n
GLEAN3_00314	SPU_000314		  missing some N-terminus residues\n
GLEAN3_02759	SPU_002759		 partial, missing N- and C-terminus\n
GLEAN3_02977	SPU_002977		 partial, missing N-terminus\n
GLEAN3_03360	SPU_003360		 partial, missing N- and C-terminus\n
GLEAN3_04289	SPU_004289		 extra N-terminus, missing stretch in middle\n
GLEAN3_05521	SPU_005521		 partial, missing stretch in middle\n
GLEAN3_05684	SPU_005684		 partial, missing C-terminus\n
GLEAN3_05984	SPU_005984		 partial, missing C-terminus\n
GLEAN3_06142	SPU_006142		 partial, missing C-terminus\n
GLEAN3_06330	SPU_006330		 extra stretch in middle\n
GLEAN3_06470	SPU_006470		 partial, missing N-terminus\n
GLEAN3_06968	SPU_006968		 partial, missing stretch in middle\n
GLEAN3_07015	SPU_007015		  partial, missing N-terminus\n
GLEAN3_08652	SPU_008652		 missing C-terminus\n
GLEAN3_08683	SPU_008683		 extra residues on N-terminus, missing stretch in middle\n
GLEAN3_10567	SPU_010567		 extra stretch in middle\n
GLEAN3_11965	SPU_011965		 partial, missing C-terminus\n
GLEAN3_11983	SPU_011983		 partial, missing N-terminus\n
GLEAN3_14671	SPU_014671		 extra C-terminus\n
GLEAN3_17318	SPU_017318		 missing stretches in middle\n
GLEAN3_18102	SPU_018102		  partial, missing C-terminus\n
GLEAN3_19330	SPU_019330		 missing N-terminus\n
GLEAN3_23389	SPU_023389		 missing C-terminus\n
GLEAN3_23503	SPU_023503		 missing N-terminus\n
GLEAN3_21699	SPU_021699		 missing N-terminus\n
GLEAN3_21700	SPU_021700		 missing N-terminus\n
GLEAN3_21800	SPU_021800		 missing C-terminus\n
GLEAN3_24133	SPU_024133		 missing C-terminus\n
GLEAN3_27188	SPU_027188		 missing stretch in middle\n
GLEAN3_24028	SPU_024028		 missing N-terminus half\n
GLEAN3_10213	SPU_010213		 extra C-terminus\n
GLEAN3_14975	SPU_014975		 extra N-terminus\n
GLEAN3_06894	SPU_006894		 missing N-terminus and stretch in middle\n
GLEAN3_09793	SPU_009793		 partial, missing C-terminus\n
GLEAN3_16830	SPU_016830		 missing N-terminus\n
GLEAN3_18973	SPU_018973		 missing N-terminus\n
GLEAN3_21020	SPU_021020		 missing N-terminus\n
GLEAN3_26855	SPU_026855		 missing central and C-terminus\n
GLEAN3_28730	SPU_028730		 extra stretch in middle\n
GLEAN3_21364	SPU_021364		 missing N-terminus residues\n
GLEAN3_23364	SPU_023364		 extra N-terminus and missing C-terminus\n
GLEAN3_23440	SPU_023440		 missing N-terminus\n
GLEAN3_25997	SPU_025997		 missing N-terminus\n
GLEAN3_26183	SPU_026183		 missing N-terminus\n
GLEAN3_28101	SPU_028101		 extra C-terminus\n
GLEAN3_28199	SPU_028199		 extra N-terminus\n
GLEAN3_20635	SPU_020635		 partial, missing N-terminus\n
GLEAN3_10555	SPU_010555		 partial, missing N- and C-terminus\n
GLEAN3_12120	SPU_012120		 partial, missing C-terminus\n
GLEAN3_13820	SPU_013820		 missing N-terminus; unrelated stretch in middle\n
GLEAN3_14753	SPU_014753	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. There are several gaps and numerous repeats present within the sequence. The excel data indicates that the sequence begins from 1-113 on subject gb|DS010782| however; this information is not reflected from the BLAST results. The absence of the scaffold from the BLAST results was probably due to very low bit scores and high e-value results. See below:	 missing N-terminus\n
GLEAN3_14982	SPU_014982		 partial, missing C-terminus\n
GLEAN3_16013	SPU_016013		 partial, missing N-terminus\n
GLEAN3_17727	SPU_017727		 \n
GLEAN3_18659	SPU_018659		 extra C-terminus, missing stretch in middle\n
GLEAN3_18893	SPU_018893		 extra C-terminus\n
GLEAN3_19341	SPU_019341		 partial, missing N- and C-terminus\n
GLEAN3_24989	SPU_024989	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When reviewing the excel data it is evident that there are no repeats or gaps within each of the subject queries. This is reflected within the BLAST results; however, if the three scaffolds were to be combined there would be a sequence overlap between the second scaffold (v2.1_scaffold61057) and the third scaffold (v2.1_scaffold16316). There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with half the scores being about 5 and the other half being more than 10. This is an un-annotated gene so no additional gene information (comments) was available from Baylor annotations.	 extra N-terminus\n
GLEAN3_25126	SPU_025126	After examining the five different subjects from this query, it was determined that this was the best match. It was initially thought that this subject was not the best match due to several repeats in the sequence. However, when the BLAST search was preformed the results indicated that this was the better match since it covered more of the sequence. 	 extra N-ter,  missing C-terminus and stretch in middle\n
GLEAN3_26673	SPU_026673		 extra N-terminus and stretch in middle\n
GLEAN3_28146	SPU_028146		 partial, missing N-terminus\n
GLEAN3_28175	SPU_028175		 extra stretches in middle\n
GLEAN3_28446	SPU_028446		 missing N-terminus\n
Sp-Tlr228	SPU_030080		Partial Toll-like receptor predicted by Fgenesh, NCBI and Genscan. This gene model occupies all sequence of a short scaffold and the nucleotides have 96% identity to a typical Sp-Tlr (GLEAN3_27798).  \n
GLEAN3_21908	SPU_021908		#\nPartial Toll-like receptor. The nucleotides of TIR domain have 99% identity to GLEAN3_21907. Only 200bp of nucleotides in 5' upstream has high similarity to another Sp-Tlr gene. This gene may represent a recent duplication or assembly error. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_21907	SPU_021907		Partial Toll-like receptor.  The nucleotide sequence has 94% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor related gene although no LRR is found in the upstream sequence.  \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_04792	SPU_004792		Partial Toll-like receptor. The nucleotides of this gene have 94% identity to a typical Sp-Tlr gene (GLEAN3_05950). This is located at the end of a cintig and Unkown sequence (NNN) in the upstream region could make this gene model incomplete.\n
GLEAN3_06278	SPU_006278		Possible duplicated gene: GLEAN3_09260\n
GLEAN3_09260	SPU_009260		Possible duplicated gene: GLEAN3_06278\n
GLEAN3_10513	SPU_010513		3' Partial   \nGlean3_10488 is the 5' part of this gene\n
GLEAN3_10488	SPU_010488		5'partial\n
GLEAN3_20282	SPU_020282		Possible duplication: GLEAN3_22904\n
GLEAN3_22904	SPU_022904		Possible duplication: GLEAN3_20282\n
GLEAN3_21512	SPU_021512		Best empirically verified GenBank hit is poly(A) polymerase in Carassius auratus (goldfish), accession BAB39139, with E-value 0.0 and bit score 642. \n \nPSSMs producing significant alignments (indicating conserved domains) include pfam04928, "PAP_central, Poly(A) polymerase central domain" and pfam04926, "PAP_RNA-bind, Poly(A) polymerase predicted RNA binding domain". \n \nThe GLEAN gene model has been modified as follows: \n* The 3' UTR has been added, based on the Samanta embryonic expression data. \n* The first exon has been extended 5', based on BCM:Exonerate, NCBI:Splign, and the Stolc tiling array data.\n
GLEAN3_21760	SPU_021760		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
GLEAN3_13202	SPU_013202		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
GLEAN3_21651	SPU_021651		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
GLEAN3_05522	SPU_005522		Different parts of this gene are found in different scaffolds in a non-linear organization. \n
GLEAN3_07087	SPU_007087		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
GLEAN3_13967	SPU_013967		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
GLEAN3_13930	SPU_013930		Different parts of this gene are found in different scaffolds in a non-linear organization.\n
Sp-VC1_3	SPU_030081		This gene model is located at the end of a short scaffold (Scaffold120560).The nucleotides have 87% identity to another Sp-VC1 gene.\n
GLEAN3_17609	SPU_017609		Duplicate prediction for GLEAN3_15676\n
GLEAN3_02696	SPU_002696		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThere is an overlapping NCBI model that includes less exons but shows a similar alignment to vertebrate Map3k7. The size of this model is closer to that of its vertebrate counterpart, and thus we have decided to accept this model in its present form.\n
GLEAN3_05254	SPU_005254		This model was annotated based on a manual inspection of multiple protein sequence alignments.\n
GLEAN3_03955	SPU_003955		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nIt is possible that there is some N-terminus sequence missing from this model, as based on alignments to vertebrate Tab2/3 and given that this model is located next to a region of various gaps between contigs. \n \nThere seems to be a duplication of this model (GLEAN3_12219). See the Gene Duplication page for further details.\n
GLEAN3_12219	SPU_012219		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nIt is possible that there is some N-terminus sequence missing from this model, as based on alignments to vertebrate Tab2/3 and given that this model is located next to a region of various gaps between contigs. \n \nThere seems to be a duplication of this model (GLEAN3_03955). See the Gene Duplication page for further details.\n
GLEAN3_18598	SPU_018598		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nAll other gene prediction protocols provide an identical structure for this gene, which is also supported by the genome-wide tiling array hibridization data.\n
GLEAN3_00742	SPU_000742		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis glean model is, in its present form, slightly largert than vertebrate Ube2 genes. An overlapping NCBI model (XM_791573.1) is slightly shorter; however it does not show a better alignment to vertebrate Ube2 genes. Therefore, in lack of additional evidence to favour either model, we have accepted the glean sequence in its present form.\n
GLEAN3_28607	SPU_028607		first exon with cadherin-like sequences is most likely irrelevant\n
GLEAN3_06829	SPU_006829		likely histone H2a pseudogene\n
GLEAN3_12627	SPU_012627		likely histone H2a pseudogene\n
GLEAN3_20123	SPU_020123		Sp-Elf has two splice variants differing in the 5' region: \nSp-Elf A       GLEAN3_20124 \nSp-Elf B       GLEAN3_20123 \n
GLEAN3_19879	SPU_019879		Sequence xp_786867 has been  predicted by automated computational analysis.  \nThe next best match is AB051576.1 Shiwa,M., Murayama,T. and Ogawa,Y. Molecular cloning and characterization of ryanodine receptor from unfertilized sea urchin eggs \n  JOURNAL   Am. J. Physiol. Regul. Integr. Comp. Physiol. 282 (3), R727-R737 (2002) \n \n            This record is derived from an annotated genomic sequence \n            (NW_791670) using gene prediction method: GNOMON, supported by EST \n            evidence. \n
GLEAN3_11042	SPU_011042	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. The BLAST results do not completely coincide with the excel data. There are differences between the base pairing information when comparing the 2 sets of information. However, both sets of data have an orderly arrangement without any repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak. 	#\nThis model was annotated and modified based on a manual inspection of multiple protein sequence alignments. \n \nVertebrate and insect SARM proteins contain only 2 SAM domains, whereas the original version of this glean model coded for 3 SAM domains. When the nucleotide sequence of this model was inspected in detail, we noticed that there were some identical exon sequences that likely resulted from assembly problems. We have modified this model accordingly (following the NCBI model), and this modified glean model now presents a domain structure like that found in mammalian and insect SARM.\n
GLEAN3_21673	SPU_021673		This model was annotated based on manual inspections of multiple protein sequence alignments. \n \nThis model is identical in aminoacidic sequence to GLEAN3_26252, and an inspection of the models strongly suggests the duplication is due to an assembly error. \n \nPlease refer to GLEAN3_26252 for further annotation details (exon structure, sequence, etc).\n
GLEAN3_26252	SPU_026252		This model was annotated based on manual inspections of multiple protein sequence alignments. \n \nThis model is identical in aminoacidic sequence to GLEAN3_21673, and an inspection of the models strongly suggests the duplication is due to an assembly error.\n
GLEAN3_10374	SPU_010374		Gene model includes 21 tandem Fibronectin Type 3 repeats\n
GLEAN3_14498	SPU_014498		Predicted protein sequence matches exactly to est-derived prediction Sp-Gg1d, except an intron is found in 3' UTR\n
GLEAN3_18408	SPU_018408		The gene model contains two exons that are not present in this protein as determined by cDNA sequencing.\n
GLEAN3_05096	SPU_005096		ATP-dependent RNA helicase A (Nuclear DNA helicase II) (NDH II) (DEAH-box protein 9)\n
GLEAN3_04517	SPU_004517		This model was modified and annotated based on a manual inspection of multiple protein sequence alignments. \n \nWe found that there was a gap in the alignment of the original version of this model with vertebrate/insect Pellino, which mapped to exon#5. The corresponding NCBI model, otherwise identical, has a slightly shorter exon#5, and shows a better alignment to other Pellino proteins. Therefore, we have decided to modify the GLEAN3 prediction accordingly. \n \nNB: The CDS for this model does not end with a STOP codon (i.e. there might be some C-ter sequence missing for this gene).\n
GLEAN3_08228	SPU_008228		There is unknown sequence (NNN) in the intron of this gene model. So It is still unknown if this model is intronless or not. \n
GLEAN3_19314	SPU_019314		cyclin G associated kinase/DnaJ (HSP 40) homolog. Gene prediction is not complete. GLEAN3_00818 is a related/similar protein.\n
GLEAN3_12096	SPU_012096		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nNote there is a slightly different FgeneshAB model for this gene; however, it does not provide a better alignment with other Ecsit proteins, and we have therefore decided to accept this model in its present form until additional evidence is obtained.\n
GLEAN3_09399	SPU_009399		Sp-MAP2K5 spans two glean prediction: \nGLEAN3_09399 and GLEAN3_09398\n
GLEAN3_00818	SPU_000818		cyclin G associated kinase/DnaJ (HSP 40) homolog. Gene prediction is not complete. GLEAN3_19314 is a related/similar protein.\n
Sp-Gg2	SPU_030082		created gene model on basis of est data\n
GLEAN3_23706	SPU_023706		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nWhile the best Blast hit for this model is to Transmembrane protease, serine 4 (Membrane-type serine protease 2)(MT-SP2), a careful inspection of its size and domain composition reveals that it more generally resembles members of the granzyme family and vertebrate marapsin. We therefore propose to name this and related genes Sp-Gra[nzyme]mar[apsin]-like. \n \nThe location of this model in the scaffold (away from ends or gaps), and the fact that other gene prediction protocols generated identical models strongly suggest that this is not an incomplete model.\n
GLEAN3_01588	SPU_001588		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nWhile the best Blast hit for this model is to sea star regeneration-associated protease SRAP, a careful inspection of its size and domain composition reveals that it generally resembles members of the granzyme family and vertebrate marapsin. We therefore propose to name this and related genes Sp-Gra[nzyme]mar[apsin]-like. \n \nThe location of this model in the scaffold (away from ends or gaps), and the fact that other gene prediction protocols generated very similar models strongly suggest that this is not an incomplete model.\n
GLEAN3_16107	SPU_016107		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nWhile the best Blast hit for this model is to sea star regeneration-associated protease SRAP, a careful inspection of its size and domain composition reveals that it more generally resembles members of the granzyme family and vertebrate marapsin. We therefore propose to name this and related genes Sp-Gra[nzyme]mar[apsin]-like. \n \nThe location of this model in the scaffold (away from ends or gaps), and the fact that other gene prediction protocols generated very similar models strongly suggest that this is not an incomplete model.\n
GLEAN3_22264	SPU_022264		E1B-55kDa-associated protein 5 isoform a. GLEAN3_22264 and GLEAN3_22265 are likely incomplete/incorrect predictions for this gene.\n
GLEAN3_22265	SPU_022265		E1B-55kDa-associated protein 5 isoform a. GLEAN3_22264 and GLEAN3_22265 are likely incomplete/incorrect predictions for this gene.\n
GLEAN3_06871	SPU_006871		Inspection of the tiling array suggests that glean may have missed the following exons: KLYPVTLWKAHLVTKSAKMQEMGWVRKIMHGRLRNDTSGLSAGRSSLNLLLTGCCGWGGCGCCWSCCCCCCGGGPRGCWSTCCGWPIARPPPAGCGWTG,YQHVSQRSLQDKSKLSKILFSHMAKYYSKLHSLKCDIKIIIMQNIVSPGLLLILNLLQKLVYCDCALAIPSLGLKTPFLEK\n
GLEAN3_07011	SPU_007011	After reviewing the data and performing a BLAST search, it appears that the gene sequence is on 3 different scaffolds. The best matches were on >v2.1_scaffold4300 and >v2.1_scaffold2310. However, there were 2 internal repeats on >v2.1_scaffold2310 and a gap within >v2.1_scaffold4300. There was no EST information available on GBrowse V0.5 and the transcriptome scores appear to be somewhat strong with scores averaging about a 10.\nAdditional information was found on the Baylor page under comments:\nInspection of the tiling array suggests that glean may have missed the following exons\nDINCVPLDLSIKRTNPQETSENEQEVGEEPLVEEPRMGEEPREEEPLVEELSMGEEPKGEESMQGGLLMREEPSEGELEGEEEGFEEQPGEYDSLDEELWVGGKPIKEEPLDEEQEREENIGLWEGALVEGPEGEEESLEEEPLVEEPEGEKEPLEEEEPEGEEEPEGEEPEGEESPEDSAAWIPV	Inspection of the tiling array suggests that glean may have missed the following exons: DINCVPLDLSIKRTNPQETSENEQEVGEEPLVEEPRMGEEPREEEPLVEELSMGEEPKGEESMQGGLLMREEPSEGELEGEEEGFEEQPGEYDSLDEELWVGGKPIKEEPLDEEQEREENIGLWEGALVEGPEGEEESLEEEPLVEEPEGEKEPLEEEEPEGEEEPEGEEPEGEESPEDSAAWIPV\n
GLEAN3_07046	SPU_007046		Inspection of the tiling array suggests that glean may have missed the following exons: NGKHGEEGQREGGKENGKHREEGQREGMRRMRDMGIRDREREGGEWETWRGGTERGREENKKHREEGQREGGRRMRNMGMRDRERE\n
GLEAN3_07360	SPU_007360		Inspection of the tiling array suggests that glean may have missed the following exons: EDDEDNDYFDPNESVVEEEESMEQSGTDGEDDDGVGDRGQVLPKEPKRADTKKGRSTSNKALICNICGLECEHGKALKQHLISHDPKSLQCSYCKRYFKRKGCLVFHLRTKHQVSIGKKWSRHEKEDLMTPSKIDEKGDDGDNDYFDPNERVVEKDESQEQSGTDGEDDEWVEDRGKELRKRRKRAVTKMSRNTSDKALICKICGLECEHSKSLKQHLISHDPNALQCSYCKWYFRRKSCLVFHLRTKHRVNVGKKWSRGNELVKNAVRAKAPKALEPKDLGDLQGGASTQLEDSSTTTILYSCKFCTKKFTKPDFLLKHEAIVHVNFRRYRCRVCRKAFSTKYALQSHSHIHVGEKRFECFICNRKFNSNSLLVRHLMHHDKPDNSDLVFAAMQPHVLSESGDPIDETVEAESAAPSTIDV,AESDPRSTHERHDKDQEQSQKLDPKEVKDQTQDKIFEQDDENKKTLPVCEICGEECKHNMALKQHLLSHDPTTYQCQYCDWFFKRKGCLIFHLRTKHKISAGRKWLRGTIDELLERDNALEDDSEEARLEEELRKQKRLQRLANNPKGPRHRCKLCGKECEHTRALKQHVMSHDPKSFQCKFCKWYFKRKGCLVFHLRTKHQVSVGKKWSRLEKEDLMTRSKTDEEENDEDNDYFDPN\n
GLEAN3_07361	SPU_007361		Inspection of the tiling array suggests that glean may have missed the following exons: VREGIPHQKVHAEAQAKAAHDTIQTLQVQLLRQDFQRQHWARAPRAPPQGYPASCLPDLWKGIWDQVFLADAPAGPYRREEVLVPHLRSEICLEQHPYTASPAA,KKFPSEGHLKEHAAFHKEMRDVRPICEVCGLECKHNKALKQHLLSHNPHAYQCEFCKRYFRRRGCLVYHLRHIHQTFVGKKWTRGNAQEQMTRYVGPGEEDEEEEFHPELAKDMLGKHILIKSKKPRVFKCQFCPKKFIRHNLVCKHERTVHMNNGRFKCEFCTKTFMEEYNYTLHKRKHTKERPFKCTECPQSFASEKALINHQPEHRGERPFKCDECGKAFRTRKYMLKHKRRQHMTPSRLFKCSYCDKTFKDNTGRERHERRHKGIRPHVCLTCGKAFGTKYSLQTHLQVHTGEKKFSCHICDQRFALNNTLIRHLLRHDKVAASEDPALITMQEEVTVQNNSTASTGLQEVQL,GSNVQGHYQQEPAPHSQHGMPLQQIPSLPSAQQPSPPLQQGQAHTGSGDAYHIEDLSHNTSHSRLATSTSGGPMNTTPGSIGGCANTTKSNSRVIASTGRTPKKRAAPGSSPKQPSKPPSMRAPMTQIVTPQAYQQAQMLQQQQQHQPRPRECMSFSKHCQTEPLFQQVYNASMQFTKENDLPDGLEFTEDEKEKKISGVVATKDFEPGVEFGPFTGEFVKEGLGCFNPNTWEVIEQNHK\n
GLEAN3_08377	SPU_008377	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data it was evident that there were several gaps present within both scaffolds; however both scaffolds did contain an orderly arrangement about them. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10.	Inspection of the tiling array suggests that glean may have missed the following exons: SVSGRFGSKSCIDRVWGRLFRCIKSLLRPKASYFRFSFNSAKSISIWKCACNSCWSIKDTRLLDIFSFCWLMLVVLRKNSLWKGLSQGVYSPFSSSIKSSCCFC\n
GLEAN3_08563	SPU_008563		Inspection of the tiling array suggests that glean may have missed the following exons: SVCNKKYRSSSDLKIHLRTHSGEKPFLCSFCGERCTDIGSLANHTIRFHREWTQQCPQCSKKSVSKSHLKTHMMVHTGEKPYQCPECSKRSATKSSLNKHMLTHNGEKRYECFECKKTYANKGDLYTHKGLIRSITLRNMSKLTPVKGLIDVL,IDIKDAVKVSTKLNGDAISSLRMQSGKQYHQCSVCGRKCPSKSDLARHLRTHTGEKPYPCPECDKRFSDKSSIPQHMLIHSGEKPYECSECSSRFNCKSNLRQHMKQHSTTKFHQCPKCDKK\n
GLEAN3_09474	SPU_009474		Inspection of the tiling array suggests that glean may have missed the following exons: CQVACEPTGPIPYAPEHTETHEAEIIRVQQHPNISVQGVHCSEEWQRLYRERLWTQQTSTSSTSKLEAAIHSEVQRRSPGANGQRPYVYPASALIQSDASV\n
GLEAN3_09553	SPU_009553		Inspection of the tiling array suggests that glean may have missed the following exons: IVCMGCSKMFCLEENMSQHLRRCKGLRELLKRKKNLRKISSNSDDDDDDGFIPNEEEESCGVLKALEGPDRAMGQSTMGDLNETETFGVETNRQGEDSKGRVIDMLGVRSFQDASKPFKCSYCTKRFLTKNRLLRHKHNRHPKSSVFKCDHCDQTFPYKHRLLKHLPTHNKDKRYKC,KQPLEGRSHLEPPETSWSIRELLCDKEEHFVCVTCGKHFPTNGRLKAHERFHESTCEKFECDMCGAVFKTSLSLMRHKKIHTEIQFKCTLCFKKYTCRSHLSRHMHTAHGFERVRGKILCMGCSRKFLLEDDMLKHLKSCKG\n
GLEAN3_09642	SPU_009642		Inspection of the tiling array suggests that glean may have missed the following exons: RSLFLFLPLFFFAFSAKASTRLFWICSCSGSMMAPASISFAAYSRLIPSMDDVRLGPVLALQLWFFKYFSLAYPLLHLSQWKGKSLVWSFMCSARLGLRLNVLEQWKHLKGLIPLCVMM,CSFGTRFGVTVMVLQVLLPGISLVALVTMERKVVGVELHVFSQVRPATECLGAVEAFEGFDPTVRDDVSFELVGSVERHVAACHRVEWTLEFLIRFMDQHVSFEFVLTVELCGADLTAEWFLAGVNENVRLQIVLTLKLLVTDEAFMQGLGAVGDEMASQVPLTSKDLVAFWTVELM,LVYFYKELTPFWLSADEIFDKHVIPILITMPWKAVCESLLPRKLSTEMNIIFCYLFTGGGTKHSFNYQSTEKTIRQSLTGLKKKSFCLSNSKKGFTIREFTD\n
GLEAN3_09685	SPU_009685		Inspection of the tiling array suggests that glean may have missed the following exons: LRPLLDAAICSLFCVFLPSNSLSLSLSLSLSLSLSLSPPPPPPPLRLYLYLSLPLLFSVFLPSNSISVSLPLPHSSLFVSSF,FVSLSSSLPLPPSSLFCVFLPPNSLSFFLSLSLTLYLFLSLLLSVYSNLLILSLSLFLSLPLLFSMSSYLLPSLSLPISSLM,MLLYVLFSVSSFLLILSRSLSLSLSLSLSLSLLLLLLLLFDSTSTSPSLFSFLSSFLLILSLSLFLSLTLLFLCLPSNLSLSLPLYLSLPLLFSVSSYLLILSPSSSPSLLLSTSSSLFSCPCIPTS,FSLLLPLPLSYSLPLPLSSLVRVFQPPNSLSLSLPLPPSSLFYVFLPPSLSLSPHIVFNVRQSIHTVIALLVVCLEMALSCRSSTVGSDCLAQGSSKTVRPRDSSSPSGLGICHRTKQ\n
GLEAN3_09831	SPU_009831		Inspection of the tiling array suggests that glean may have missed the following exons: ANILSLNQQSATGGQQIVTNGNVQYNITPQYQIDSEGNLITTHVATPVSVSQAQAQTTQTVVRQAPTTTSQATNVAGSNVAQIALPGAGVQYIQNGQIIQAVAPQPAPTQQRIMLGSQTITLQLAPNAVMSNANSHDQPVVTPVYNIISSTPPQGNQDNSAQTQQLQIQDQLQNAQIISNSGQIQAGNAANSNQATFVQVAGKPGQIILQQPQQAGQVQQIQVSGLQTSNTNVQTTQIRQQSGKVVASVIPQQVQQVQQVQQVQQQQTTTSQIISQQSSQTVQQQPQAQVIQIHQPIQGQAGTQISLQQQPGTGYYTI,VKPKLKLPKQLSGKHPLQHPKQQTWLEAMLHKLLCLVPGYSIFRMGRSFRRLHHNQHRLSNALCSALKPSPYNLLRMQSCPMQIHMINLSLHQSTISSAPPRHRATRTILPKLSSFRSKINFRMHKSSQIVAKFKQGMPPTVTRQHLCKLPVNQDKLSCNNHNKQDRCNKYKLAGSRHRIQMFRRRRSDNKVARLWHLLSHSRYNKCNKCSKYSSSRQQHHRLFHSSHHRQFNSSHRHRSYRSINPSRGKRALKSHYSNNQVLATTQ,CKSSPSSNYPNSCQASTHYNIPSNKRGWKQCCTNCFAWCRGTVYSEWADHSGGCTTTSTDSATHYARLSNHHPTTCSECSHVQCKFT,RAPDIEYKCSDDADQTTKWQGCGICYPTAGTTSATSAASTAAADNNITDYFTAVITDSSTAATGTGHTDPSTHPGASGHSNLITATTRYWLLHN,AAKSEHQFPCDWISMFVLVKRQPSGLLLEWNKVQKHNCGGPAVSSLTDIYKNPLFSKSTDWGNRYVRALVRLCRNWSRETTLVKRLPSCKSILTQIHAA,PPPLSLSLSSPPHTPATQQQITQQQQQQISQALVGMKAEKQQQASWQGVIQTEATGGAQGGTVTTISTNGTNYPPTMASYELQIDQSQIPKQEPPKKVRRLACTCPNCKDGDGR,LILSSLISFFFVLGEKKFVCKSCGKKFMRSDHLAKHQRTHIRKPGTVSMKGQSGGGDAPLQVDLSQGVDEFDEEMEKVMRDPQHEAMVQDAVQEAVY,SVLQSIQQINGGQFMQNPIVLKAPQTQVQTVHLQHGGTPVATPQSSSSQVQQQVITTDVSPAISTANNSTIAGLPNYNVHLAPLSPGPGPAGNTSAVNINTSTAGMFNSHLLNIP,VIFRPSDQFQLAPLYVFPHLFVCLFFCRNSEKGKKQHICHIADCGKIYGKTSHLRAHLRWHTGERPFVCDWLFCGKRFTRSDELQRHRRTHTGIPTQIII\n
GLEAN3_09832	SPU_009832		Inspection of the tiling array suggests that glean may have missed the following exons: ALLAATCSKIGTPAEGQAGAVNQGQTVTVLGQNQGQAVQIPGGFINAANAQQIQQALGLPPGFPLQFTTASAGVAGQQGTAAGGPMYIEVGPGGNIPSSSVGGATPTKSINAANILSLNQQGATGGQQIVTNGNVQYNITPQYQIDSEGNLITTHVATPVSVSQAQAQTTQTVVRQAPTTTSQATNVAGSNVTQIALPGAGVQYIQNGQIIQAVAPQPAPTQQRIMLGSQTITLQLAPNAVMSNANSHDQPVVTPVYNIISSTPPQGNQDNSAQTQQLQIQDQLQNAQIISNSGQIQAGNAANSNQATFVQVAGKPGQIILQQPQQAGQVQQIQVSGLQTSNTNVQTTQIRQQSGKVVASVIPQQVQQVQQVQQVQQQQTTTSQIISQQSSQTVQQQPQAQVIQIHQPIQGQAGTQISLQQQPGTGYYTI,VKPKLKLPKQLSGKHPLQHPKQQTWLEAMLHKLLCLVPGYSIFRMDRSFKRLHHNQHQLSNALCSALKPSPYNLLRMQSCPMQIHMINLSLHQSTTSSAPPRHRATRTILPKLSSFRSKINFRMHKLSQIVAKFKQGMQPTVTRQHLCKLPVNQDKLSCNNHNKQDRCNKYKLAGSRHLIQMFRRRRSDNKVARLWHLLSHSRYNKCNKCSKYSSSRQQHHRLFHSSHHRQFNSSHRHRSYRSINPSRGKRALKSHYSNNQVLATTQY,FRGKFDHHSCCYPCQCKSSPSSNYPNSCQASTHYNIPSNKRGWKQCYTNCFAWCRGTVYSEWTDHSSGCTTTSTNSATHYARLSNHHPTTCSECSHVQCKFT,YKCSDDADQTTKWQGCGICYPTAGTTSATSAASTAAADNNITDYFTAVITDSSTAATGTGHTDPSTHPGASGHSNLITATTRYWLLHN,ILILSSLISFFFVPGEKKFVCKSCGKKFMRSDHLAKHQRTHIRKPGTVSMKGQSGGGDAPLQVDLSQGVDEFDEEMEKVMRDPQHEAMVQDAVQEAVY,SVLQSIQQINGGQFMQNPIVLKTPQTQVQTVHLQHGGTPVATPQSSSSQVQQQVITTDVSPAISTANNSTIAGLPNYNVHLAPLSPGPGPAGNTSAVNINTSTAGMFNSHLSNIP,LPPPPLSLSSPPHTPATQQQITQQQQQQISQALVGMKAEKQQQAPWQGVIQTEATGGAQGGTVTTISTNGTNYPPTMASYELQIDQSQIKQEPPKKVRRLACTCPNC,SIPTDTPVRLPSFICMPFFCRNSEKGKKQHICHIADCGKIYGKTSHLRAHLRWHTGERPFVCDWLFCGKRFTRSDELQRHRRTHTGIPTQIII\n
GLEAN3_10295	SPU_010295		Inspection of the tiling array suggests that glean may have missed the following exons: IYLFSIPYIHSHHQPCLPLLLFVCPISSRDNVAQVNIAALVGNALRSVVPSPLLITRIPSLRILFRMTSMLPLYLFFPSNPSA\n
GLEAN3_10922	SPU_010922		Inspection of the tiling array suggests that glean may have missed the following exons: ISDSRVQQLDDNLSSQAERGTPLDSSESDSSTSKCGPCNYANDSGKKQSVAERETLDQGLSHHVYGQMDTERTKECDEKCEETLIEGIGQSSQFGGTEENLAKPLFTLVAGEPYISQCRIL,SFNPHEKPPSRKPYQCLVCTVTGCGLYRNRVWFVPQQGVVCTATGCGLYRNRVWFVPLQGGVCTATGCGSYRNRVWFVPQQGVVCTATGCGLYRYRVGFVPQQGGVCTFTGCGSYRNMVWFVS,FRIVEYNSLMTIYLHKRSGELLWIHPKVTVLQVSVALVTMPTILVKNKVSLKERHWTRGFHIMFTDRWIQKEQKNVTRSVKRL,LKALDNLPSLEEQRKILPSPCLPSLQESRTSASVEFCNKEFSNVLDVESHTCLMRPTREMIFKCSLCNKKFTQSTHLLRHATDARNHKGMKSLYQCSLCNQRFFYLSSLLKHVKLHSQRYPCLVCDLRFSSEKRLSMHSWSHRKDEPCECAVCKKTFPDAMSLAYHTRTHVGRNPYQYSACDLRFSDEKRPTRATRYERNRTGKNLFECHICNKIYLYERALTSHMETHTIKKLLCSSCGELFHDNFDLSLHMRSSHTGEKPYQCSVCSERFSKANSLLIHMKSHPAENPTNVWFVP,QGVVCIATGCGLYRNRVWFVPQQGVVCTATGCGLYRYRVGFVPQQGVVRTATGCGSYRNRVWFVPQQGVVCTVTGWGLYRNRVGFVPLQGVDRTATWCGLYRNRVWFVPLQGGVCTVTGWGLYRNRVGFVP,KATQQKTLPMFGLYRNRVWFVSQQGVVCTATGCGLYRNRVWFVPQQGVVCTVTGWGLYRNRVWFVPQQGVVRTATGCGLYRNRVWFVPLQGGVCTATGWGLYLYRVWIVPQHGVVCIVTGCGSYRYRVGFVP,QGGVCTATGWGLYRNRVWFVPLQGGVCTVTGWGLYRNRVWFVPQQGGVCTVTGCGLYRYRVWFVPLQGGVCTVTGCGLYRNRVWFLP,ESSEEGDTLHHRESQPGRQRHGYAIGHRTPNFTRATTAITSSHDVQDPPPPDSHPSSRLHLTERQSYKKSTSSSFHQTWYIL,NSLEKVQRRGIRFITGNHSREDSVTAMQLDIGLPTLQERRLQSRLAMMYKILHHQIAIPLPDYISQKGRATRSQHHLRFTRLGTSSDSYKNSFFPRTMKGWDELP,XXXXXXXXITRKDKDGDCAEEDGAIEMNLQLSDQVHVNDGCSKEDGGKEIHASQLGEVNYLTLTGYVKEEPLDSDSMEGVLNGANWGMIKPQPLGEEG\n
GLEAN3_10924	SPU_010924		Inspection of the tiling array suggests that glean may have missed the following exons: VLPYKCKICDKGYCYKKGLSAHMRTHTAKRSHKCTVCNERFLNIKRHMKIHSGLIQCSICNQGFSNHGNLTQHRKIHRKQNR,YQCMYCDVRFSRVDTLSRHIRSHTGEKPYECSFCNKKTFSQTAHLTRHIKIHTGERPFECSICSKMFAERSHLTDHQKIHTGEKPYLCSVCEKRFG,ATNLLHLMAVALSSPSWHHELHLPWSHYPKVLPLHSTLRSGNSLHPVVMHESPCHRLPYYSHRLRVLDLIVVNSSLWHHLPLCNHLLCLHS,IHPVPVGNRFIHPVPADNGLIQPGPECNGFIHPVLAGNGFTQPVLAGNRFFYPVHVGNGFKYAREYFLAFVVRVVILATSMITLFSICT\n
GLEAN3_11270	SPU_011270		Inspection of the tiling array suggests that glean may have missed the following exons: PLIPFRSSSPDRHLPVFPLAVFPWAIASALSSFPFPSCVGLRRGRLVGHDLVGCIYAIVNRCVGLTRQQNLIGRCHLMRLCVLRGDFLASPFRRRLQVLAHVRVLEPLDRAPEVTCLLPAVDKGHE\n
GLEAN3_11272	SPU_011272		Inspection of the tiling array suggests that glean may have missed the following exons: GWSSDGTSHHTRDIGRAFHPCGLFGGTARKLSKKIVFHIPHTRVTFPLRETSVYEKQGSPSKNKFYHNQSNGYAQHPGVFSKRVHVDCPL,VTWCTKTFLCHFALEPVARDRTQDTWCTITFLRHFALEPVARDRTQDTWCTITFLRHFELEPVARDRTQDTWCTITFLRHFALEPVARDQTW,TAHCSKFFITLFAFVWPSVFTIGMYSQMVILQGTLCEEFPTAFGTMIQTFFRVKSHDVSIATPLGSELDIAQGTRKSLDTGMYCKLMSLKVTGIAKCFVAL,PPEEVREPMRLSIVDSHVYTDEALLTEGLITDGTLMFFSFLHLSFKHLSFLHLSFLHLSFKHLSFLHLSFKHLSLKHLSFKHPFFLHLSFLHPSFKHLSFIHLSLKHISVMHLSFLHLSFMQSHMFRKATICKKLFITLRTLRRLLIIVPSHVHREGTLSQEPDTTH,VTYTKEPFIALWAWKRLLDIFRTFSPCIMTIFTMFFSGSGGSCKTVDQDLTSSFCRVDCVRITTLLSQRAYSCLFLTFECEQWSVQSTDDICH\n
GLEAN3_11280	SPU_011280		Inspection of the tiling array suggests that glean may have missed the following exons: PSPYPISDIHVHTRGSLNVITEKRSSLKTWEDLTPGHKAFRPAQLKAHHTGPKLTALSNAILIFITGWMFDLKSHYDGGGGGTFVISDFDHIVQCEIIYMFSC\n
GLEAN3_11583	SPU_011583		Inspection of the tiling array suggests that glean may have missed the following exons: NLPYPFLISLCLSLICGFRVILFLPLETYRPTSVNNSGIGSYGLLENTTFEQIRRVMETNFFGAVRMTQEVIPIMKKQRSGRIINISSTTGIFGEWKDAFIIE\n
GLEAN3_12083	SPU_012083		Inspection of the tiling array suggests that glean may have missed the following exons: CISVSVSAVSLSLSLSSSIFSNNMSLSVTLFSLTTSLFSGFYLLPSFTSSHSFLYNLYISLSLSDLPSLSPTTCHIHCIMKTYT,YKFKHISIDAYQSPSLLSLSLSLSLLLSSLIICLCLSLSSPSLHLSFQVFTFFHLLLPLIHSFIISISLSLSLTYPPSLLLHVTYIVL\n
GLEAN3_12546	SPU_012546		Inspection of the tiling array suggests that glean may have missed the following exons: LVVHSHRRGRGSKQQCKTVHEFHSFEEEGVVRCELYGDVLEPAFRALGQTIYHLTSVPAIVLSFLIQYHTDISELAPVEVEAL,LSAIKASVLSLIDIANWLRSILRSWWCIVTEGAGGPNSSARLFMNSIPLKKRGLSVVNCMVMSWSLRSGLSARRFITSQAFLRLSSPF,IFLMGESPEYAGKCVVNLAADKDVIKKTGRVLLTMELAEEYGFTDVDGHRPMNYRQLKALALMGGHTWVGAMIPGFIKIPFWALAAFTHKF,LLDGREPRVRRQVRGQSRGRQGRHKEDGPCAPHDGACRRVRFHGRRRPQADELPSAEGAGPDGWTHLGWSYDPWLHQDSLLGPGRLYSQILIGHIFLFLIVKNMEEERPEISMCNSLLYVFLYNDRKKSYMLAMQYCQS\n
GLEAN3_12632	SPU_012632		Inspection of the tiling array suggests that glean may have missed the following exons: CSLCSGGDSSLQSSYQSMRNSQEMSGDRSCNGARHLPEDFPLQDYLGTQSDSSERESCTSENNSYIIPSQVPYFHEKRAANRETPDATTSYHKPGGMDKERNEDIMMELDETINQSSQLCATGSVQNDPSCSLAKEKPFLCCVCSKGFALRISLSRHMTIHG,EDFPLQDYLGTQSDSSERESCTSENNSYIIPSQVPYFHEKRAADRETPDATTSYHKPGGIDKDRNEDIMMEFDETINQSSQLCATGSVQNDPSCSLAKEKPFLCCVCSKGFALRISLSRHMTIHGG\n
GLEAN3_12911	SPU_012911		Inspection of the tiling array suggests that glean may have missed the following exons: KSPSRKRGRPRGKTSSSVKSPKIAFSPEPATSQDDPGHLGRGMRKRKKKLSLDEVNSEDVDDGDDDDGDDDHGDDDDEEDILDDEEHDNDSEVEDESTEEQISNEANKNTIKLKPLEFKPRRRGRPRKNERRTHRKRDRNWSTMDDVKPKNEERVKLNLVVPMKVFKKILRDRADEWQKTYDDEHSLNLFRCEVERCRGMGMPEAEFDVHMKCHVSNMEGFRCFICQFMCLHWRNMRHHYCKVHDQMLSKVTCDFEGCQKEFPKYGALRTHVTISHIKPDLVTKLSTSTSDLSEFSSYLDNVKIKKIGKEDHEEDDEDDGEEPRKQRGRPPKRKRPVGRPPKDTEGYSRRQNLQRRVHGEDREKFQRRLVAFVCEVCSAKFNEEEKLMEHSLRHYHNDNDQINCTECEAFVTAEESSLRIHMSEEHKRLLQLHRCDKCNFSSNRFHDLKKHNIVHTGAKNFMCDKCGKCTTTPYNLKVHYRRMHASDEEKKIKCISCEYRCADKAVLKVTFHL,LFSNFIGATNKIISEHVMCKHANVRPYHCNICGWSTAYSGNMWKHVDTHQKELGDKMPEFPVNVVSTENHSVPTPLRAPSGKKRGMNKASNFKLKLAKPGKTRRQQQAQKTQQMEQTATISILDDNVQTIQVQAGGNLPEGVLMQVSE,QPTCIDCLISFYPKVPIHMGPNGQMMVTKGLSEEESIGSSALSRLAAAVASAQEVHIIQGNEGLEGGGQHQEHRIIATQVCLAFCPFF\n
GLEAN3_12912	SPU_012912		Inspection of the tiling array suggests that glean may have missed the following exons: SKSEIGNHFLTDHMEAYVSPIKEEGKTVSTNDSKEPDGEKEMEEKAGEEIGEDEENLAEEVEEKRPAPRPRGRPPKKRNQKPIKKQPYYYIEEVEVKDEGEETGEGEEEPKEFGRGMRRRKKAIPRHILRYELDDDEEFMEDEDYNENEERNVLPKKRVPTIPPIVMKGKRGRPRSSGSQEKGRQDEGSPKLKTPKIFKKTPTVSKLPLSDKVIERILDDRVNEWYLVFQEKHVLPIPCPFDGCALDVTQAELDVHLQCHAANLEGFRCPIDECNFLCPHWSNMRVHYRKTHEPTFYRLLCDLDGCAMVFPRIDKKSIHIHVTRKHLRPELHDELSSKDFNIEKYEKYISIVKAEEADLMKQVGEQAEDDEGEAGTSLGEHSNVVHVQQMSAVEQLETESDFDTENSRLKKRGRPRGSTKAAKLARIAAGEVFEKKGKRDKKGKGKRSFQRVMNNVSFFCDVCGGKYKSEQAVFDHKTLHYRDENCNVLRCTECTEYSTEESSELREHVALSHKSLLHLHRCDECQFSTNRYHDLKKHILVHSGSKDYMCDKCGTCTTTAYNLRVHWRRYHAPESEKNVKCFACDYMCADNGILKVLFFSFSF,VTRASCICIAVMSVNSQPIATMISRSTFSSIRAARTTCAISAAPVPPRLTISESTGVATTLLNRRRMSSASLVTTCVLIMAF,LFIFILQEHIRSKHGLMVYGKDFDNARPLPTYACSQCDYIGRKKSSLAYHMRIHTENRQFKCHICPYASKTKNNLLLHIRTHEGLQPLKCPECDFRGKNDKGLDNSFER,HLGNYVHIFYPFLQDHIKQHHKSMLKNPIYHNCPHCDYVGHKRQSLEFHMRIHMEQRRFKCHLCPYASKTKNHLKIHMQTHDGFQSASCPDCNFKGL\n
GLEAN3_12913	SPU_012913		Inspection of the tiling array suggests that glean may have missed the following exons: QEIIKMSTSGDDTQFNCRLCNFVGGSKTEIAEHFLTEHIEQYVSLSKATPSKASKNETETVKETKEKEEEQKEVESDNEDVISEEQAKEEEKKSSKTLDEILKRKLEKDAGPGFGRGMRRRKKATPIKYSMDDDEEEDEWLPRKEPEPSKTIYVRQPSILNKGRGRQRKKGKRGRPPMVGFKKSKPKSAPPPPPKQPPCKTKIDIRKRSDEKNLPIPYYVFNRILDDRVESWFKNYKEKRDAIEMIRCPNDRCANVMSVDEMAVHEKCHVPNLDGFRCCECGYISLHWAKMRVHYRSDHNSKLNAATCDFEGCEKVFPCIGTKLLQSHAIKAHFKPQLLARLKSPDFDPSEYDKFMQAPENEADRQSIARKRGNKRTADVEDEEEEEEEEEEEENDTTSKPNPKKPRRKKFKRYKVHVCTICFARFKDEGEMLSHKDAHYKDNSKDIIYCTECTEYNASEEEPMRNHLATVHKRMLHLHRCDECNNFSTNHYHDLKKHLVTHTGAKNYMCELCGRRTTTPFNLRIHYRRIHASEDEKKHHCMSCDYKCADKGILKVRANE,RALTLIHQSMTSSCKPLKMRLTGSQLQENVVISALQMLKMKKKKKRKKKRKRMILLVSQIQKSLVGKSLSATRSMYALSALPDSRMKGRCYLTRMHITRTIPRILSTVQSAQNTMPLKKNPCVIT,MVNVEPAHDVQPHHVKLSADNQIIRHHPMIEYSSVPPVSVAQQIIAAGNDMAQVISEQQYHQIQQQQQQQHQQQQQQQQQQQQQQQQQQQQQHQQQQQQQQQQQQQANHGHPPQSHTPLAPPLVLHSQQPKPPLHPVPLEIHAVPIHPTTHSNIPTPVVEQIVNTDHAVHHLVAAMMPRW,STAVCHRCLSPSRSSQQAMTWPKSYQSNSTIRSNNNSSSNTSSSSNNSSNNNNSNSSNNNSSINSNNSSSSSSNSKPTMGIRHSPTHL,HGPSHIRATVPSDPTTTAAATPAAAATTAATTTTATAATTTAASTATTAAAAAATASQPWASATVPHTSSTSSCTTQPAAQTTLAPGASGDPRCTHPSYHPQ,IQVDASNGMNQQEIIEYTVPNTQQIITSSGDITEVITSDHHYHASHHISDHHQLQQQHHQQQQQQQQHQQQHGGDSSIQQAAMHAGIPTTSAAEQVPTAIVEQIVRATPHSEDQNVVHNLVASMIPHDAELVMSMMQIQHVPQVQHVQQVQQIHHVHQPNVSQ,TNRRSSNTQFQILSRSSPAVATLPRLSLAITTTTPVTTSATTISCSNSIINSSNNNSNINSNMAATAAYSRQLCTQVYPQPALPSKSRLR,MHSGATNKIVSEHIMSKHAKVRPYRCNVCGWTAAYNGNMWKHVENHQKILGDQMPEFPVSVLSNIDDLNMPTPLRAPSGKKRGQGKDPTSPSSPTSRPKAKRSRPKYAEPPSGVILANRSVVQEVPVTVTVTHVEQEQPQAQAPPPPQAHTLPL,LFSNFIGATNKIISEHVMCKHANVRPYHCNICGWSTAYSGNMWKHVDTHQKELGDKMPEFPVNVVSTENHSVPTPLRAPSGKKRGMNKASNFKLKLAKPGKTRRQQQAQKTQQMEQTATISILDDNVQTIQVQAGGNLPEGVLMQVSE,YLVTCHLIHTLYFIIPQEHIKHKHGLLLNKDSYGRPKPTPTYACTVCDYVGRKPKSLEYHSRIHKENRQFKCHLCPYASKTKNNLVLHVRTHEGLQPNKCPHCDFKG,QPTCIDCLISFYPKVPIHMGPNGQMMVTKGLSEEESIGSSALSRLAAAVASAQEVHIIQGNEGLEGGGQHQEHRIIATQVCLAFCPFF\n
GLEAN3_12914	SPU_012914		Inspection of the tiling array suggests that glean may have missed the following exons: SKSEIGNHFLTDHMEAYVSPIKEEGKTVSTNDSKEPDGEKEMEEKAGEEIGEDEENLAEEVEEKRPAPRPRGRPPKKRNQKPIKKQPYYYIEEVEVKDEGEETGEGEEEPKEFGRGMRRRKKAIPRHILRYELDDDEEFMEDEDYNENEERNVLPKKRVPTIPPIVMKGKRGRPRSSGSQEKGRQDEGSPKLKTPKIFKKTPTVSKLPLSDKVIERILDDRVNEWYLVFQEKHVLPIPCPFDGCALDVTQAELDVHLQCHAANLEGFRCPIDECNFLCPHWSNMRVHYRKTHEPTFYRLLCDLDGCAMVFPRIDKKSIHIHVTRKHLRPELHDELSSKDFNIEKYEKYISIVKAEEADLMKQVGEQAEDDEGEAGTSLGEHSNVVHVQQMSAVEQLETESDFDTENSRLKKRGRPRGSTKAAKLARIAAGEVFEKKGKRDKKGKGKRSFQRVMNNVSFFCDVCGGKYKSEQAVFDHKTLHYRDENCNVLRCTECTEYSTEESSELREHVALSHKSLLHLHRCDECQFSTNRYHDLKKHILVHSGSKDYMCDKCGTCTTTAYNLRVHWRRYHAPESEKNVKCFACDYMCADNGILKVLFFSFSF,VTRASCICIAVMSVNSQPIATMISRSTFSSIRAARTTCAISAAPVPPRLTISESTGVATTLLNRRRMSSASLVTTCVLIMAF,TVLDGQTITLAEGISEEAAMGASALSRLSQGGEITVREVHFLQGGDNQQQHHEMITYNVPPVAVTQQIIAGGDMGHVISEAHYQQQQQQLQQEHEVEQHHYVEAGLQTVQVVTSHQDNGVPRAVTEQVVHAMPHPQDHHQDHHQDHRGADQNQDQPVVHQLIPMTLPHEAELVMSMMQAHQASISQSQ,SRDGTGGARHAAPSRPPPGSSSGSQRCRSKPGSACRAPAHSDDSATRGRASHEHDAGTPSKYLSVTVTYHQEICGSRGGAGAMRPFI,LFIFILQEHIRSKHGLMVYGKDFDNARPLPTYACSQCDYIGRKKSSLAYHMRIHTENRQFKCHICPYASKTKNNLLLHIRTHEGLQPLKCPECDFRGKNDKGLDNSFER\n
GLEAN3_13406	SPU_013406		Inspection of the tiling array suggests that glean may have missed the following exons: FLSFSLSLSLSLHAPPLSLPLYNLFXXXXXXXXXXXXXXXXXESSEVLKEQAASCCNSNISGERNFAQLDSHLHHAPNIGIGKIESKVMFKANATRHWLQNKPQGSRKELIRNNIKAGAKERKREMENKVTYREKVKRRVREKQQKLTEKAEKARNKVEELIESILQDGVIEHRDECETKVNGMSKTRATGMLKAQIQFRTKILGQDIGKHALSKCSVDELKNILLSIPEPTDKNFLTDVKDPGQIINREFAQKWEKDEKEQWYNATVINLKDGEFEVSYQGNNELFYMTIAEFFTDIHL\n
GLEAN3_13407	SPU_013407		Inspection of the tiling array suggests that glean may have missed the following exons: YHLITFLILWSFLRSYPSHNTRSFNTRSSNMKPADAGVRRGAILVLILPHPPHQLKHQANPSRLLNPSLQSPSLQKQLQASPFW,TKFGWMGFPQAAPINEESRLLYLVIQLMKNHPAVSSLSPTKTATNIRLRYKTICDRIMDDPLLSTLNLPLPNINSKSITNFISKQETKYNLMSTAQPKVVSHRRVISFDNIPDPVELPEVIPKPQYQELQYKIVQHEAGRRRGEKRRHPGSDPSSSTTPTEASGEPITSPKPIAPKPIAPKAAAGIPILVV\n
GLEAN3_14197	SPU_014197		Inspection of the tiling array suggests that glean may have missed the following exons: TPFLLHLEISHYLLSLPSFLLPSLVLTDFFCTLYLLSPFYLFSFLPGSHILFLACFLSLSLSLSLSLSLPLYVYFSLLTHHPSSLSILHELRIMRICSNPEFSLLSA,CSLQVALHVSSLSSRLSFSLPLSLSLSPSLHFSDSLPLRGYYVVCGFANELELPGRCAAMYKCHWPLRTHTHTRCIHLQHTLALSHTHNTHTHSELSAHTAGKGGLIHAYRTKGLCS,CSCSHRALHSKLLLVSCASSSMLKIKPLLMILPTHSPCYYLYCHNTCHLVKICNSWFLFCAHPNICFRSLSSSPARFVVNIQVMNLYFREKSHSLLMSIPHPSDI,TGVGCMLLADYAIHDRAVHQSDRGTVFLHLDHNSRNCTTPLTTNCWKCIISLKEKEREGKNTLPTQHSNKSRKKRGTVITDFDWVCMYDIGSAPPIHGAGSVTHSRHLNTVHAWSSNDNEER\n
GLEAN3_14643	SPU_014643		Inspection of the tiling array suggests that glean may have missed the following exons: NIWDNSHIELSCTPDNATYVLLYSHQTLYDPTTSKFRKSESSDIKSSTISEVIILKCTDWDLSIFKLCVDIWDHLLTMNWSVSNFHKYE\n
GLEAN3_14684	SPU_014684		Inspection of the tiling array suggests that glean may have missed the following exons: LNFSDAQKFQCSLCDGLFTSAKLILRHIRCEHNSGDEMIPVLTWKKKKKKKYVAIEVSSKQQLATQIKVSDQEESAFKCGTCKKVFPSFGRLMAHELFHEKEQASPVNVKDSLTVTKQMKPQGSKQYVCSECTKEYKSWRSLNRHEREAHGYRCDFCLERFPKKKDCLTHEQTHQAFKSSQPAGKSKASPTKSRAASTGIQPSEPAPDEPKDMLGRPTDYYKRPYKCRFCTKRYSSRGTAERHEKEVHKGEGDFKCSYCTKVFATVSRLKDHLVLHKYVNMYRCTECPRSFASESALNNHQGEHTGLKPFKCEVCGRGFRTRKLALKHKRRIHQERPKRFLCTFCDKGFADKSDWKVHERRHKGIRQYVCLECGKGFTSSTSLAAHKQAMHIKVKPFSCAVCSKSFALNHQYNHHMAKHRLEGEGNALASMQQS,SGPVHSSANHLHHAHLQRPVAFGGHGVKFSSLPASYTPPITAQEPAVERNDVPVSLTTCVPVVERTDLPTTTRESVIDIPPISMRESPTDSCIPPTTKQELAEDSYMSPSTAHESPIDSSISPTTTQESVEDSSIPATTTSEQVTDSINIEPVATSEVAMDDDIPPAKSWGTVEERSIRLVTTHKPQAEGNTLRAQGEDSL,LGQSSIKSAPSLQSTDHRKQCLPSSKPFHQANASPPAPDVSAAEPPLFAALNLTKTISVMDLPLSLSLRVASQDKVEGVVAKDTVEKGVEFGPYTGTLLDEEQGSSKETTWEV,VSHLLNQLLHYKAPTIESSVYLRQSRFIKLMQVLRPQTSLLPNLLFLQRSISQRQSPSWIFHCLSHYELHRRTRSKEWLPRIQLRRGWSLDPTQEHCWMRSRDRLRRQPGRY,ANIIHQTPPQPPVTLPVHIRGHGVKSSTLSANYAPPINTTHGAVEEERNDLQIATHGSVEASKMLPLAFHKSVVERNFQPTTSYESGLHVESNALPIITCKSA,TRCPGVRDRTLVVTQRSVNKLFVARNCHSFTLKKPRQFILWQFTAPYLNKPLLSLSLSLSPSLYATSLNLENELSSASTDSNLTLYH\n
GLEAN3_14685	SPU_014685		Inspection of the tiling array suggests that glean may have missed the following exons: SPVISDAQKFQCSHCEGLFSSAKLILRHIRCEHSDGEPCEMMPALAWKRKGKKKGREKSVAIKFKFNHPINHPIVRKRKNSEEEEECDFRCGTCVKSFPSLGRLKEHELFHEMMHGDKPYECSECNQRYTAQSSLNRHEREVHGFLDDYKPRSRPKRLKAHVPKKPLHCRYCGQGYKSRGALANHERRIHGSRHPIREPDLPNDEPKDMLGRPSDYYQRPFKCRFCPKRYVSWTTVEQHEKEVHTREGTFKCSHCPKVCASESRLKEHLVVHKYMHMHRCTLCPRSFASESALNNHQGEHTGLKPFKCEICSRGFRTRKLTLKHKQRMHQERPKRYICSICNKGFAEKCNLKVHERRHKGIRQFVCLECGKGFTARFSLTAHMQAMHIKERPFACEICGKSFALNHHYNHHMAKHRLDGDDSIPQ,RRMYRKSHFTVVTVAKGTNHAVHSRTTRGESMALGTRFGNRTYRTMSPRICLVDPLITTSDPSSADFVQRDTFPGQRLNNTRRRSTREKALSSAVIVPRFAPVRAV,SVKEIQTIKQREQCSSSSHQASASSSSSDTSNPTPNTSKDESQLLAALNLKKTKSIQDLPQNLLFRATPEGKVDGVVAKERIEKGVEFGPYAGTLLDEEQGWTRDTTWEVRRAVFHKTVF,FPLDSAHGVSNAGIIHQARQQLPVHLRGHGVKSSTLSANYAPPITTHEPIRERNDLPITTHESVSSIIQPLTTPESGAKSNVPRPQGTVCNFCLVGFC,MKKHEPKFYRCKKCNQKCKTKTALNKHEREVHGHQCRFCSERFFKKSECMKHEQTHQAFKSLKPAVKKHESLSKTQASSPTLIHQPSEPSPSEPKDMLGKSTNYY,TRCPGVRDRTLVVTQRSVNKLFVARNCHSFTLKKPRQFILWQFTAPYLNKPLLSLSLSLSPSLYATSLNLENELSSASTDSNLTLYH\n
GLEAN3_15071	SPU_015071		Inspection of the tiling array suggests that glean may have missed the following exons: STTGFPSWSIPVPLERSGWTKAMQSCFFFIMRRGDYGPMVLIPMLFLENKKQIIFDSPRLEGETGKVMRSLWPWTNHVVTI,STTGFPSWSALVLMERRGWRSFSVQVIHCHRVGEAMINNWFPFMEHPCTIGEIRMDKSNAIMFFFYHAEGRLWTNGTNTNALLRKQKTNHL,LVFTEPTPPPPTPTPPPKSPTPPPKEPTPPPPKPKPKRKAIVKKTKAVPPPPPPKTPTPQPPTPKPPTPKIPTPQPPTPTPTPPKDPTPPPPSPPPKVTLGKFMCTCFVLFKIQIYI,LFLQNQHHHHQHRLLHQSHLPHPQRSLHLPLPSLNPSVRQSSRRPKQSHHPLHQRPPPLNLPPPSHQPPKYQLLSHPPLPQHPLKTPPLPLLHRHRKSHLVSSCVHVLFFLKFKSIL,LYDDRCVHDEHHKYRQDEFQKDGYRLKHSPEEGVREKCEDTSVVLEMGKEARRELKQESDSPWESCYELHASGVRLTPEADGEENRTKTLKCH\n
GLEAN3_15137	SPU_015137		Inspection of the tiling array suggests that glean may have missed the following exons: CLGSPLSNFTMMLKSRMCLGYLQSWKEKKSELAFNQLFFQSSIFNKEHLLIQLSLTLLPLVSILAKWLLRHCRLINFSNP,LNYNFQQNILLTWVKHFTTSPLPPPPLPLSHTLSSAPPLPSPPPQCPPQHSNAGATPPPPPPSPPLPLSHTLSSAPPLPSPPQCPPQHSNAGATPLPPPSP,NILQLHLFLHRLFLYPILSPRLLLFHLLLLSVLLSTPMQVPLLLLLLPRHPFLYPILSPRLLLFPLLLSVLLSTPMQVPLLFLLPR,FSTKHIANMGKTFYNFTSSSTASSSIPYSLLGSSSSISSSSVSSSALQCRCHSSSSSSLATPSSIPYSLLGSSSSLSSSVSSSALQCRCHSSSSSLA\n
GLEAN3_15358	SPU_015358		Inspection of the tiling array suggests that glean may have missed the following exons: STASWRTESVSALTLMTSSDLRAWQCGSTVLSMSDVRAVLALLHHSASSSRCSKGVLFKTLPYSPEVKLALSDIMKTRSFRR,PIIFFHFFVRLSPTAYHSNFCEGYCPFPLDSHFNGTNHAAVQAILHTRKMKRRDGRRIPSPCCVPNSFTGLSVLYLNEEKNVVIKDFEQMVATSCGCH,RVLPLPPRLSLQRDQSCCRTSYSTYQENETQRWKTDSKSLLCTQQFHWSLGALFERGEERGHQGLRTNGCYKLRMSLMSVMQPRDRHRLWFHQ,LHGLDNQADDVSDGLLIIVAIASLIVLWSLSLVRFLFLSLSFGSSEFGLVISMVPHLEVLGWYEGCDEESRHDDQAQIQPAQMENFKRKTRSM,AGTGGCIVGAGSTPSEAAGTGLVAEGAACNGVGACGAVARLVLGAVRVLGAPYGFSVTLVVVAGAGGVGLVLGGAVAVLGAWLMMVVVVAAPGVRNWVCC,PSPHPFSSSPLFHVSLSLSTSPLFPFPLLSLTLLLYISLHLSFTLAAPLSKYIHNPLLNRPPSSLPAFMPFISFFPLVSPSRSLHYPISPSSLSSSSSIIFPLYILLFPKEEPC,SKKPNPGFYENILLFVIDPKPVLTFSSPLFFLSSLPCLTVSVYLSFIPLPPVIPYPLALYFSPSFLHFSCTSLKIYTQSSA,KPAFSMIHTHTHTHAPESHTQTHAHILQYSKPKLSMYVNTPLPASIQHAPLRTYSSLHRTPRKQAVLYIEAFDLKINRSRTEM,CDGLVLLVAPPFSSGIVHLKEGINGHGSLLLGLIATDNLYVFTQSDRNDATLLGARILCRGNKVSGNESQRSHMRNENTILFFLFSILKT,VHSKVNQVGLYHYRFLPAYRTHQTGLPFNNRCWLSSNDISRHLTQFNPHTITMHESGKPTSAMDRKARFTQRTTPDSQLASKVCIGFIARVVN,INRSRSTVKLIKLVCIIIGFYQHIEHIKRAYHLITGAGYHQMISPVTLLSSIHTLSPCTKVVSLQARWIGKRDSHNAPHQTHN,EPRSPPRSSTKGLHPRSQQGFEQTQTLFILPCRLNILNITIIDFDVDDGAYDDEHDDSADHDHGGVQWTSDDVDVDEAMSRSSMLQSTLK,GNLTLTGYKDEKSFKRSSSSTTKTSTLGRQRKTSSAPVETIRLMTSLKVHKFEMKTKKKVENALIIVNGSHPKFFRPDNK,RLRRRWKMLSSSSMDRILNSSVQITSELIIKIYCVYSLCLPCPLPLCPSLSFSLSLSLSLSLSLSYNLSYLLLPQFIYMTQSLLYV,WLDGKKKRNTLFLVFSKQYTLSTTTYCRKKKFLAKGVHSVQHMTVFRVHNHPLNMKQKHFFYAINLTIRMREKQTVKQGSISRTNH,LNDFMFQQERDSGISASFTRSTCESERSSSSSMSSEELNDLSPQRFFPTTVSFDLEDDDHNTSPVPSPTLREPSGRQANSIVRRSTFYRNSEGNTQNIY,AYYTTAKTEVTNGIPYLYDDDVLVLGRLLYFVKCGVFSSRCSHEGSQNHDDQLHHSTCYMHCYSNCIPAKICPTARSQWST,SLFHLSSLLSLFSLFSLSPSSHFLLYFLLSVSLFLIYKVYIQTEIIISNNYVCKERRRGTECGLPLQLCGSCLVLVQLHVHVFHLYSN,FSSKGYIFPQNHIHFVPLCIGTKLDVEGSIDIYCGGYISISYFYDIDRIFCLNELHMNIVLAKQMHSLLSLFVFDRLTRY\n
GLEAN3_15640	SPU_015640		Inspection of the tiling array suggests that glean may have missed the following exons: LIHQKEYGMKSKSWRANFHLQHQMSSSTVHVSVCPNFLYRLLINRRKCKRRAHHPAKGHDEMLSIGSNKYHRVFLESSSTRKGERF,SLVYRHTWFTLLRVVCSKYFNINLVHSQLFYHSSRTFVLLFLPINSSMIYIFFLKTASPFPLSAQALSAAPFSTTWPLLGPRGSLQSCPQSEMFH,SELVNSLERQPTSIALYLMCYLGGGMSSKGYRLRSVRRLHVSRAQERKKRPVRFLCNLSPYRVAVLMTIFGHLVAWQSWRGHHLFSYHCAPILFISFHLSLSLSLSLS,SHCLSASLSLSLFIFLFYPPQSLVTSPSCCPSLHFFKSLNLSSHLPLFSTFSPSLPISSLSFSLVISLISSLDLSCKIYPGHFSFLPPQLYLIPFIMYLYLYFIITFVF,KPTRPIFPHSRFALFPYGINQCDAATASTSLSFPLALFTPLSIYLYHHSFFLEGERKKNKSRHTEARPGFSPKLGSEILKQ,SARNRKQNLTVTTRHLDYFSYDNQLPLTTPAPPFSISLSLSLSLSLSLSFYFFLPLSLSPSLSSSSPSPLLLCNHPFLLFSP,STPSYHPRSPFLYLSLSLSFSLSLSLFLFLSPPLSLSLSLFFFSLPSTPLQPPFSPLFSIILFLSNFLSLSFHNASAISLPVLSSN\n
GLEAN3_15688	SPU_015688		Inspection of the tiling array suggests that glean may have missed the following exons: LAYSNSMLPTCRHHPFSDLNANFSHLSFIDFKFLVQNAIHNQRHIGITCYLINTDNYFVFIPPNQAILAPVCKFHQSSSTG\n
GLEAN3_15772	SPU_015772		Inspection of the tiling array suggests that glean may have missed the following exons: PLLILPKTDFEFPVSSPEKNSMSKRQRMHMVIESTCYDQFVVEPIRSGKRGFFTYLVGLIFVRIVDDIFSHLSLPLFYLVLLLQ,IYQVSISVEFVSPTYLHVFICFPLLSLALSIVYTNATSFSMFVCFHFLNLFSPLLFVFSSSLFKTKSFLIFSPLLLPPSILCSVFILISSPSLFNSLLDINLHLSISLSPLTLPPSLPHAHCVTHFPFFTRNEFKEGYFLA,TCLYLVIKVPPTLASMCSSKVPTLASMCSSKVESYFGNRKHIWGKERDVMNLPSIYFCRICLPNLSTCIYMFSSTLFSLVHCLYQCNLIFYVCLFSFFKFVFSFVVCVLLISL\n
GLEAN3_16250	SPU_016250		Inspection of the tiling array suggests that glean may have missed the following exons: RSCLPNSSQSKSSTLELSARTFMYLNTDTMPFLFKGLIRPILEYGQAAWSPYRLGEQRILESVQRRATKIIPGLRNLSYQERLTQLQLPTLIHRRIRGDMIDVY\n
GLEAN3_16490	SPU_016490	After reviewing the data for GLEAN3_16490 is appears that there is an overlap of the sequence on two scaffolds. There is EST information available on GBrowse V0.5 and the transcriptome intensity score appear to be strong as well ranging from about 10-140.\nAdditional comments from the Baylor under gene information (comments):\nInspection of the tiling array suggests that glean may have missed the following exons: PTSVRVCGMKKLTQTWKQQKVSKLRDSSFTTASSKTQLVPSQDGGKISQGSTGSHGATSAAPTPSASGGSRKRLGQHIPRQAVPVSVPKPVHDGAEGEKREKENEEDLAKKRKVGGVIGDQSKRRSPRLQKGRDPSR	Inspection of the tiling array suggests that glean may have missed the following exons: PTSVRVCGMKKLTQTWKQQKVSKLRDSSFTTASSKTQLVPSQDGGKISQGSTGSHGATSAAPTPSASGGSRKRLGQHIPRQAVPVSVPKPVHDGAEGEKREKENEEDLAKKRKVGGVIGDQSKRRSPRLQKGRDPSR\n
GLEAN3_16518	SPU_016518		Inspection of the tiling array suggests that glean may have missed the following exons: EDSELSYLENFSPVSHQIVLKVIRSHKIKSCSLDPLPASVFSRCIDCLLPAITDIINDSLKAGVCPEPLKTALIVPTLKKSSLDPENLKNYRPISNLSFISKVIERVICTQLMTYLASNSLLASRQSAYRENHSVETVLLRVQNDLLLSLDSGNEALLVLLDLTSAFDTVDHQLLSRLEKCYGISGTAAKWFESYLSGRKQQVIIDGITSDPALLRWGVPQRSVIGPLLFICFTTLIQDIIHSHGFTSMMYADDKQLYITVKPSVINHITQKLDICLQEIRLWMQHNFLFLTIVKWQFFTCLLNLENRMSCYQYLLIIRQFIVPRQFAISELFWTTICLCAATLILCAAKHHLLLEE,DLLIVREDDESVEEVKVIHSMSSDHAAIAFTLCESDIVQSFASQTSLDEMDINQKVDMYNKTLFSFLDRHAPEMRQNVQLRPHAPWYNITLKELKQNLRAKERIWRKSKHRSPVMKDELQNSTKEYFSTLKRFRREHHRQTISSASTQELYKEIDNMTIEKAKAVLPTHKSKSDMVIFFILHSSKTKYAD,CNQPHNTETGYMSSGNPFVDATQLSLLNDSKMAILHLSSKFRKSNELLPISVNNTPVHCSKTVRNLGVILDNHLSLRSHINTVCRKASFALRRIGKVRRFFNKASTEILIHSFVSSLLDNCNSLLIGICDKDVNKLQRIQNSAARLVSLQKKCQHITPILKDLHWLPVKFRIQFKIVLLTFKSLNDLSPEYQSDLVLQYVPSKS,KQTEISDFIITENLDVLAITEAWLTGDSRDSTTIADIQNTLQDFKFLKLPRKGKRGGGICVILRNLFDCKARPYSFVTFECLEVTIRSTHKDTVSLFAIYRPPVCPRSVPVSHFFTEFS,TALCSKEFSLGQWCYSSYSLLLLLCCYVEAMVMVFTLTPVHVLTTVSKSESSSLLPHRKIYGSPIPYYSNSTSTFQLQLLDCGDVNPNPGPHTERSTTHSSTPIYHGVNSYHTPNRKYDIPFLKSLNLLSRNDPQV\n
GLEAN3_17427	SPU_017427		Inspection of the tiling array suggests that glean may have missed the following exons: PPSSLPSSILSSILSPSLSLSLFLFLPLSLSPTPFVSPSPLLSHSFLYSSSFTLPHLHMILSCPSESHCFHFLFFSFLSFSSITPSLPSSSILSFSLSLSHPLCVSLSLIQSLFSLLQHIYPPPPPSTLIC,LCEDRIWTCMILSCPSENHFVFIFYFLFFILLFYNPLPPFPPPFSPPFSLPLSPSLSFSFSLFLSPPPPLCLHLPYSVTLFSTRAHLPSPIYI,FFLAHLKTTSFSFSIFSFLSFSSITPFLPSLLHSLLHSLSLSLPLSLSLSPSFSLPHPLCVSISLTQSLFSLLELIYPPPSTYDSFLPI,NRKKISYMATMGAPNNSYRYTCSWACPRYLAMWCPNKILWIYEHNFTSNLSFYIVSCLTLDPVELLINVKAKRFGTEHLITNSKMILQISRSKK,DCTESSLLTVPLSRVDSFFVVLEALTFVLLLDNSDCTESSLLDVPLSRVDSFFAVLEALTFALLLDNLDCAVLSLLLEILIDFVFFVVLSFKGSQSSTFLTIGNLLDFVLLLL\n
GLEAN3_17656	SPU_017656		Inspection of the tiling array suggests that glean may have missed the following exons: IFGSFHHHHCFPLHHYMSSSFLRHFWISESYRHHHHYHHHLFLLLLLHYYQWLISFSFSSVAGYFEPANLLHYSHAHYHYPYLPS,LSIFDPLLFPGVLLHFWITSSSSPSSSSSSSSSLSSSSLLLSIFDLLLLLGVFVDFWIIPSSPLLSSSSLHVFFFPEAFLDF\n
GLEAN3_17750	SPU_017750		Inspection of the tiling array suggests that glean may have missed the following exons: YLERSKFTLSLLSLILLLFIARCFLGRCRLRGLVQTGLSSRQMTEAMNHVQDWYNDPRSIHANEVEPEVERVTILAMGKSITELGHEENDSTCNEHIL,DLFSYSCERSSYQKVASLKTASSIYIIYRNTNGPLSSNSYRADICTLRKDNCLKLSFTGITSKLQNADCTRPLTKNIESTIHKTCTWKIFTVR\n
GLEAN3_17811	SPU_017811		Inspection of the tiling array suggests that glean may have missed the following exons: GFRNQVAKILSISQQCSCAPARGQQMMCLLHTILETPGIVHITSCTCAVSERSLFIYLFLKYIPLFIIAAKVPSRCIWLAKCRDMPGEHEALFRLRNPLL,IQMEFLRCYCVMYVHENGRIFLIIDQNSLCYFILFVHSNTHLGLPQSGCQNFKHFAAMFVRTCAWPADDVFTTHNTRNSRHCAYNQLHLRSFRTFSIHLFISKIHSAVHYCC,TVYVLRSRREHMSCFAYGCGWFVFTVGKKIFIAYVSLVILCVVYFLFSLMFTHVVVYCERGNDTKYEGHILGRYVQFVTRWRQDDRVVSQ\n
GLEAN3_17846	SPU_017846		Inspection of the tiling array suggests that glean may have missed the following exons: ESPVNPIELWKKGPIVARSVVKTDEDVLPVRVFNPTREQRNIKAGTAIASLSAVEEVGTEMCNQTMPTETSASRNMPPDKQGNKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHEVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPPRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDLTKSSCQKTPESPQHDGSEAS,RCVNCLSGCSIQQGNMGSQRQALPLPHCQRWKKWAQKCAIRRCPLKPVQAETCHPTSRETRTLMPEVRLPTSFNVTLVPKVMRRKTL,DFLVQINAVLDCQKMELRTEWGIIPCLDSEGESFCRRIVAGEEYSIPPGHEMVLANRVTGEKIALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLIACQGVQFNKGTWDHKGRHCHCLIVSAGRSGHRNVQSDDAH,LPVRVFNSTREHGITKAGTAIASLSALEEVGTEMCNQTMPIETSASRNMPPDKQGDKDVDARGTTTHFFQCDSCTKSYEKKDSL,WCQKWDPVTMAQGRNRQNRPTPGEMMKHLERSMTEVDQMMEQLRASMSQVPVADPEPGRQEAFPPTLSGQTASGPTADSQMPAAQRVRFSSTPREQSRRLPSAGTGVHITPPLFSPP\n
GLEAN3_18373	SPU_018373		Inspection of the tiling array suggests that glean may have missed the following exons: LKMKDKVSRMETLLVELHKVVVERSKQTTVRGQPYEGNESLILIGDSEHAVHVDKRRFQRAVHLATSVHLTKQTTERGQPYKAVEGNESLILIGNSEYAVHVDKRRFQRAVHLANSGRALLLKLMSMVIQPDELGNFSYRGDRNMEPPLDSLIDDDRFKAIQLQVRKSFPCFDAPKNLRRIRDAVNGKCRKLRRISSPS\n
GLEAN3_18619	SPU_018619		Inspection of the tiling array suggests that glean may have missed the following exons: MTKYFLFISLFMACFMVTTCHDLRVCICTPIDLLSPSCPHICNSQLIVKIICTNNCQSLVNQSPYHFVLPEMLQLQAAEPH\n
GLEAN3_18812	SPU_018812		Inspection of the tiling array suggests that glean may have missed the following exons: RVLSKKYHYNPLRIHRGETLLKVCTLYIVIRESLPKDNITCHLRIRYIQAREKQFGFGAGQEFQGAHCFLPPPPSTPGTSAYI\n
GLEAN3_18850	SPU_018850		Inspection of the tiling array suggests that glean may have missed the following exons: VRDIYINTQTCTYQVASDHINNMSVSKNIYSSSTDVCTFAVQNFLYIKQKQLESGLRKFYIVSVCYLVLLLNLISCKLYIDSL\n
GLEAN3_19435	SPU_019435		Inspection of the tiling array suggests that glean may have missed the following exons: YCEEVYMLERNEYALCRLCELCKPLQSFIVYGSSWRSCYFGNPHGSCPQNICQIFLDFRRCCILVSCKMIRYVYCRHEEYQSL\n
GLEAN3_19652	SPU_019652	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it is apparent that there is one sequence overlap that occurs within v2.1_scaffold83941 that ranges from 1725-1779 that isn't apart of the sequence. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values ranging greater than 10.	Inspection of the tiling array suggests that glean may have missed the following exons: GFSFSWTARTCNCSECLAVNPFPHIPHSCGRIPVCRSACFFMSHCLANLLLQNEHGNLSSTTLCTLPMCAFTSSFLPKPFPHTSHLNGFRPVCSPWWFFRAFSDAKERGHSVHL,SGFLPSISLLEVSPLSSCTTSMFCTTLLGLLTLFNFIRLWVVLFSGLTSSWIVSSLCGALSWFLSCSFSCSLIPCFHLNCKGHMLQV,VLLSIFLFNSLEDFDDSLPSRVCLWVLWLVEGGMWLVEGGLWLVEDGLWLVEDGLWLVEDGIWLVEDGIWLVEDGIWLVEDGIWLVEGGISLEFLLSCSVR,FFPPEALPAYVALKWFQACVFPMVVLQGILGCEGTWTFCTLVEFICNVMFLLVNFITTAIHKLLWTIQALISPPAFVDLSLMGLHTANRGVSLRAIFTLESKRWILEIIRLLAKHILVGSIPIIFMYNIHVLYHSSWFVDFVQLYTFVGRPVLRVDEFLDREFSLWRPLLVPLMLLLVFSHPMFPSELQRAHATSVVLVRLPVNFLFMAIQGFG,EVCLRTLLALVDHSLMASHVSHKLMLTGLCFSADWAIVFSCECVKHHVCLKPALCGERLTTSHANKIFPRLGGRLHIRFHFMSSTFNILVQFPGRL,TRRKPKLKRNKPSSESIFFNTGQNSKGSCSHDPTQFDANACRAALNGACAQLAINQNKQTSVDFMGFPIQVVEMVRFFYNVNGPKDQLACSKTHLLLLL\n
GLEAN3_19824	SPU_019824		Inspection of the tiling array suggests that glean may have missed the following exons: NCQVSFSTEASPAGLARKRPDPGVGAEVISQIRACAELLVTYPALVRLLACMHAYMLDEVTLRCVSLGTYSAAVGSLPGMTTHVEGEASLCTAGFVAQLAGEWTFTCMNSHVNCQVSFTTEAPPAGLARKRLDPGVGAEVISQVGRLAERFLADGACEWLLTRMATHVAFKVASFDKALVADTAFEWSL,CAMVLWPLARCSFQRICLFVLFGYKIYLTVNSNSRFTNHSHTLSRGFYIQTLPAIYLNHVIGWRSWICIRYMLALKLSCTQTIFHQVISIRSL\n
GLEAN3_19825	SPU_019825		Inspection of the tiling array suggests that glean may have missed the following exons: AEFFQTYITFVRFLVGVGSHVPGQIRACAELLVAYAALVRLLACMHADVLDEVALRFKRLWTCSAAIRSLPSMATHVKGQVALRTAGLVAQLAGEWTFTCMNSHVNCHVSFTTKAPPAGLARKGPDPGVGA\n
GLEAN3_20311	SPU_020311		Inspection of the tiling array suggests that glean may have missed the following exons: STNGIHNIAYAQCRKQLRTPKCDEVARSMPKRCGRTEGSCSVAQVSLEPRERSNRPRFQTNKQHHVVRIILKTEHGPIISIFYRFSKVFRSEFVECI,LCKLFFVKCLFEVQMVFTISLMRNAESSFGRRSVTKLLEACRKDAVEPRVRVASLKCRLSLANVPTDQDFRPINSTMLSE\n
GLEAN3_20481	SPU_020481		Inspection of the tiling array suggests that glean may have missed the following exons: MARVFAFACPSAHTGQACSWLFSWNNSCAFSLPQEGKIFPQVPHLKTNSFLSSLSFMSLSSFSFLTCCLMVNSVSAGFFFFFLYVRTGIISSGIRLVCSHLMCLRISLADVNRPSHWLHWNF,RLESLVCLLMLHTFTLLKKPFRTEPALMPMDFPLMFIEGSLCFALLVAFLTPVELGFMFFHVNGKGLCIRLSFSTHRTSMFLAVFMEQLVCLQPTAGRKDLPTGSTLENQLFLVVTVIHVTV,KHEREVHGHQCRFCSERFFKKSECMKHEQTHQAFKSLKPAAKKHESLSKTQASPPTIHQPSEPSPSEPRDMLGKSTNYYLQQRPFKCRFCPKRYVLRKKVNEHEKECHTGEAAFKCTHCPKIFTSKAAMMIHMKCHEQHRMYRCTLCPRSFASESALNNHQGEHTGLKPFKCEVCGKGFRVKKAVYAHRRRMHQERPKRFFCSVCDKGFADKANLVKHERRHKGIRPY,RKKSQCKEQSAPATNSSSSDTAQLLVALNLKKNAIISSSQELPKGLIFKVASEDKVEGVVAKDTMEKGVEFGPYTGTLLDEEQGWSKDTSWEVGRNSGNNNIDRMFL,VRNHNVKNKVLQPQTPHPRIRPSFSWLSISKRMQSYPRVKNFQRASSSRWLRRTRWREWLPRIQWRKGWSLDHTQEHYLMRSRDGPRTHHGR,ISGTQKFQCSQCEGLFTSAKLILRHIRCEHTSRIPDEMIPVLTYKKKKKKPAETELTIKQHVRKEKLDSDMNDSDDTKELVFKCGTCGKIFPSCGRLKAHELFHENSQEHACP\n
GLEAN3_21205	SPU_021205		Inspection of the tiling array suggests that glean may have missed the following exons: PRPFYYLLHILSPPVCFFLPLFISSPPPPPLSVSLSAYISPPSPFPSASRSFSLSPLLPSLSAYQSLLSPLPSASSSLSEAPLLLLLLSLSLSPLSPPIYLHPIPSRLLLTPSLSLLLLLLLSLSLSPPISIRFFLSRPHSL,FDHALSTTCSTSSPLPSASSSLSLSPLLLLLLSLSLSPPIYLHPLPFRLLLAPSLYLLSFPLSPRINLYSLLSRLLLPPSLKLLSSSSSSLSLSPRSLRLYISTRSPPVCFSLPLFLSSSSSFSLSLSLRLSLSASSSLVPIPYKTQFT,LFLPRTIDLTTPFLLLAPHPLPSRLLLPPSLYLLSSSSSSLCLSLRLYISTLSLSVCFSLLLFISSPSLSLRVSISTLSSPVCFFLPL\n
GLEAN3_21382	SPU_021382		Inspection of the tiling array suggests that glean may have missed the following exons: FGEEDPFTERLEHFMTMPQDQGQFRYTDMMDTNLPHTMDGEEYLQQTMDTSSNSFTCDLCQKPCGNRTNYFVHLRIKHAEELPYECGICEERFSTEIILKDHIHSSHTGDEGDFFKCDVCAARFSEKQYLHAHMLSHNEYATFPCHICRKTFMKRKDLQKHLSSHAKTIRESIPCPMCNKVFRLPRNLAIHLRTHSATFICQLCSEKPQSQEVPTSDDHKDGLGRTSTPEENCTEQEEIPGDEQQLFCKVCSITFTDQEDRIGHKCKVNRCSNCLKDFSCPSLLAADGESCSSEENPMCDTCNKAFSLMDKVKPDEHASKVEKGTVQCRMCLKRFPKPILNYSKPISNHPKRNQPCAAKEASNSPLRFPCRICGNIYFMKSTLRKHVKVHEREHIYLCEICDTIFHKKKTYKRHLKVHDEKRLKCPKCKVT,GPTVLPLYVSFVLRNLSRRKFLHLMTTKMALVGLLLLKKTVQSRKRYQGMSNNSFVKYVASLLQIKKTASDINARLTAAVTA\n
GLEAN3_21383	SPU_021383		Inspection of the tiling array suggests that glean may have missed the following exons: IEKSPSVQEIKSDLDTKGGENQPQEALHQKIHLPGESQTSSENLVAVGSSPEQLPRHHCIYCKEEFQSMQTLLHVDHACVRINSTTKRCPICPKQVGSRKKFRQHIVSHNLTCKTRHKNQAANRMMDISQVKSPGQTCRCEWCHKEFDDRNKLIDHTRIHFAYGRNQCPVCERWYTNTTYFRQHVRVHGIILSDKDRQYNIQQAGPSKEQHRRCLWCDKKFQSSDKLAAHTYLHVARGRNCCPVCGKWVGRRKRSVFRAHLLTHGIKTFCDDKGQSCKRSVKGSHATQGLKQEVKSVVDGLTQSEVQGENSSRVPTHRCAWCHLGFHDLETLVNHTHDHIHSGMNRCPVCCLSFSGVWGFKLHLSVHGIPTPYRK,QDDVLSLGRSPLSSHTQTFADSLTSGTPPCVSNSDQGPPHPDENVQAILDTVSMAASQVQTSCRLPEHCCQWCGREFDGLEELVAHTRIHIKDGKNQCPVCKKWLSNKSNFKQHLKRHGIIPPFENKSVSIEHIRSNSMFISQMGKQRKSSTSIKKSYLSRKSSKYKTSLSIGSSDLKSLRSAHKRMLIDHPSSASSHSQRDEMSLDLKSLRSAHKRMLIDHPSSASSHSQRDEMSLDLKSLRSAHKRQLIDHPSSASSHSQRDEMSLDLKSPRSAHKRQLIDH,VIDGREKGLLEENVAGDLESPITRVVRVQITKENYQDLIEDETFDFQNEMDPCQELPHAKEPCILSDPQSGPFEYGTCRLCKKACGNRRMYLMHLRTKHSEELPYECQVCKARYLDEEDLESYNRKRVGQIENGSKEENTCELCGDKTSYACHTCNMEFDNRPKLIVHQKVHCKGKHCYCHLC,SGELPVSQIDSKDEDVHRAKGQNDDISCKRPHLAKDTWTMREQTKAETPHGDGRCDEVQQSSDERGRSNSSKSHVPHVLPVGRQRQNQESQQSLEKKQNVQITGNEMSGSQVGQKGMKKQTAGLGKNDDSVLTLY,SLDANSDASASQGQRKEVEQQTNQCHIDHISHSQRSLHLTSDTQALTESPEPANQPIHRCLWCVQEFSSLNELNVHLQIHIVKGKNRCPVCKKWLCNEYYFKTHLQLHGFKIEKQRQIKSLKKKSETNQITKSQIIFRRAPVRGQRH\n
GLEAN3_21843	SPU_021843	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence (v2.1_scaffold28760) had an orderly continuous arrangement; however the rest of the sequence (v2.1_scaffold28760) had several internal repeats. There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5	Inspection of the tiling array suggests that glean may have missed the following exons: PFFSLHHFFMNFSALLLLFTSACSLGFCPLYTSSLIHLLSSCSCFFLNFNLPVDSPRLTNYFGACRKTKQITTDVHKKRSKFLSAKEIPSNFFFYLILLTGQER,TFLPCFCFSRVLAPSGSVLSIRLPLFTCYLLALVFSLILIFLLTLQDSPTTLGPAEKQNKLQPMSIKNVQSSFLPRKYPPIFFFI,LKKKKYSWLQHYYPKVTNEQAIVACSYSQKIGARNHMYSMIFSKACAAMRMYQAVESVDVLCGEHISKLFQKEVLCWPVQSYAHVDTIHVMQEAVCRLRTC\n
GLEAN3_22474	SPU_022474		Inspection of the tiling array suggests that glean may have missed the following exons: SEDGCLEVDNEGASEGEHLKKEDHEDSESGWLPVLEGISAVSQDSSQVQPRLENSSQSERELDESMNDPTNSPPTCDEDSTGKQGFDCTKCKKRFSVESDLGSHVMTCQSARRFSRQSRI\n
GLEAN3_22892	SPU_022892		Inspection of the tiling array suggests that glean may have missed the following exons: KASFSIASLPSLFSILLRLPSPSLVLPLSFPWSLSFLYLVLSLSLTLSPSSSHCFPLPLPLSLCLCLSFSTLSPFYLHHLRLAHLYL,RHLSQLHLSPLSFLSCSVFLHLVLFCPSPSLGLSHFYISFSPSLLHFHPLPLTVSLSLFLSRSVSVSLSLLSPLFICIISDWLIYIYNNLIQGNSRQFLSFGHSLPCTLTIVSRFHIIGVGY\n
GLEAN3_22893	SPU_022893		Inspection of the tiling array suggests that glean may have missed the following exons: NPSMETGKNFKCKKCSRLFTTSGGLRKHLLRCHEKHHVMKSRLRSRQTQTPEACMTRAGRDSSDQPIDTIKTSAGNLTPGEESEEFCFKTKCLNQRKVCNICLKSFAMFKF\n
GLEAN3_22955	SPU_022955		Inspection of the tiling array suggests that glean may have missed the following exons: ISICPIFEHSMNLNSRIINSFAVSLPSLLYSHHPRLLTLHPFLLSHQRHLLLAASHPRPRLCCCSATCVTGWYSYVCTTRCGGSGGLQRDAVAYGWPVLGSIRLFDVAFQSIPACTDLAPLTTTLI,IRGLLIRLPFLSRLFCIRIILASSHFILSSSPISAICFLPHPILVLGFVVVAPPASPDGTVTSAPPDAVAAVGFSGTQLPMGGLYSEVSGSSM\n
GLEAN3_23727	SPU_023727		Inspection of the tiling array suggests that glean may have missed the following exons: LALYALYQLHERPPMQVPPPPQQDRFLNICLGGAVRHKRSFSKFASRITFWGLKFFWSREWAINLLYRNCPHPPLIFIKTARS,QVAICGWYRDGDVNAAIQKIVQIVQNTICPLIALHIYQPPFYFAPPPPPFSLSFSLCPSVKLEPYPAPFSPSLFPLKTSAWSNKE\n
GLEAN3_24745	SPU_024745		Inspection of the tiling array suggests that glean may have missed the following exons: HMCVQSEERPYQCSLCNKTFSTSRQCLKHIRAHADGKPYQCPHCPRRYAEESTLVGHVSRAHSAGKTYQCPLCNKSFSRMSNLKLHARSHTGEKPYKCSVCGKAFSRMSNLTRHTRFHTGAKPFECSYCNGRFTEKRNLIQHMRIHTGEKPFECSICNVTFSRNGSLTR,FFSSDGNCSLDSTKQKTRLLHQNLGERELKDEQRQLLDGKVKCVVIRKRGSDSLQTVEIHVEAPVDKSKREDLLNGSSCSCSACNRETRFVETAERNTCTSDPASFEDEGGWITIEKEKEFDDE,LNADHNNHRGSCSEEKLIQCLDHDKKPYKCSFCGKAFSIMSNLTQHSRFHTGAKPFECSYCDGRFTEKKNLIKHMKIHTGENKPYECSICNKTFSRYGSLTSHKRT\n
GLEAN3_24748	SPU_024748		Inspection of the tiling array suggests that glean may have missed the following exons: RIDICGKQSTGSLSVRSSFDNPVSASSTARLLSRGSPVTLSSVGIASAVPPSASSFVESSSGESPAALSSSLGLVVEASVCNTGPTSFSPFCSSPKLLAVSGTFSEMDITCLSNPSHGIDCQGVHSSILRQSSSACI,RSWAASWLTGPPPDISLAASSTAKPVPGGSLVNASPARGSTSVGSNQQALCLSGHHLTTLYLRPRLLGCCPVGLQLLCLLLG,GSYTVISTMKNKPPICNSMAAFPRESKCEPSAWCMVFTTLCKTGCLPSPILLKTLSAMNDCVDPVSIRNVTGCPSTLATMKRPCRF,FSHQRKRRRSLSSLGSRTCENMPLISAIKATLSCRNLVKTPNSVFVRSGPCNSSRLRQKTPNSAEQSKTTLSFFFCGCTTPWCGRYQTQPSSLLVSCSGTFSI\n
GLEAN3_24877	SPU_024877		Inspection of the tiling array suggests that glean may have missed the following exons: DNHGSPTVFPHRTFILSEALCGFDFFIFVSPVSFISNLPNLRTSSEKVISVNPKARSSLANLDIIIVLFLQAGSSFFSLILTHSHLLELVL\n
GLEAN3_24901	SPU_024901		Inspection of the tiling array suggests that glean may have missed the following exons: FFLSLFRFLHMKKSMCVDQSTQTMPDIYSCSHCGSSLLAKPMPPLARPQPSGPQQLPGLESSQARCINKKDNGDDYTYDDGIVTLDNEKELLGPDKWVSEAPDHPVAIFVKEEVMSDLEQSNSMHSDKEESFFDPRQDQEFRSDSKLEHSETGWYEEEEEEEEEEEEDEDEMEDDEEIDYEALERLDPTYDPFIGSRPHQLVSIVNF\n
GLEAN3_24953	SPU_024953		Inspection of the tiling array suggests that glean may have missed the following exons: MDLQNHRVPVQCSDSSEDILPCSDYIKFVQNSFNKTYKDSLYSERGVQTRCQHNIKKYHYRNSYEEVAMPVLSLFIRRQWNSGVWVHLDLDEIYLNLLDAPCDLRVSRE,WIYRIIGCLFNVQTLQKIFCPVQTILNLFKTHLTKRIRIHCIVKGAFRLDANIISRNTITEIVMKRSLCQCFLCLLDVNGTPGCGFI,ISIIITVIIIFTFINLILTNIYFTTINFTIFTSIIITVIIIFTFISIIVTFIIIFITIMEIILLIIVITNINVISITRSPHFSSFFWSSSSSSSLSYII,SASSLLSSSSSPSSTSSSRTFTLLPSISPSSPASSLPSSSSSPSSASSLPSSSSSLPSWKSSSSSLSSPTSTSSALPVPLIFLPSSGLPLHHHHCHISYR,QHHHYCHHHLHLHQPHPHEHLLYYHQFHHLHQHHHYRHHHLHLHQHHRYLHHHLHYHHGNHPPHHCHHQHQRHQHYPFPSFFFLLLVFLFIIIIVIYHIE\n
GLEAN3_25848	SPU_025848		Inspection of the tiling array suggests that glean may have missed the following exons: NEVFFILFFNLLCSIPLTLGFIVPSFTFENLYPPLIQTDTKSLGKLRGIAFCLLFHFLHISIISFLAPCDLLLCCFPSSFSEAWTFQGFCLRWKTFFIFISFCLPFCTFQFLKFLHYFRMSHVVIMNFA,LLSMCLFRSCFVSNCESQSRHTMAFSSFSIDVLMVVKSFLSLPTSNLLSSFVSFCAVSWIAFCSIISLGSHVPSCDSWTFKSAVCLFLSK,TLRKWATIKALLRNDFLHWSQLYTGTCRMKIFDAVQSCNLCRSKSHVLFKITWQSGHGCPSLGWRVELGQSLSTTFAVPDVWVSLSSPSSGSVIHAA,LHALAKDEHIIGVFMDLSKAFDTLDLTLDHDILLHKLYHYGVRGVSLNWSCSFLSCRSQYTVFDNAKSTMSSFTCGNVCHRALF\n
GLEAN3_25850	SPU_025850		Inspection of the tiling array suggests that glean may have missed the following exons: IYSCLFFSFLFSVSIHFGDEQHIKTEHGVAQPVQTEDTVPIREKLVTPATVSQHQDSKEVQIDYDDDEDNSYAAWMTEPDDGDDNDTQTSGTAKVVESDCPSSTRQPREGHPCPDCHVILNSTWDLDLHRLHDCTASKIFIRHVPVYNCDQCRKSFRRRALMVAHLRKVHDNNMTHSEIVEKLEELKSTERQTKGNKDEKCLPSETKALKRPGLREGRGKTTKKKVAGSKKRDDGDMQEMKQQTKSNAPQLSKRLCIRLNKGWIKILESERWNDEPQGQGNTAEQVEEKDEEDLILTRQVELVIEDGERETSTMMPISESKISEAVLKEAAEGVEENQDASTISHLERKRQTADLKVHESQEGTCEPREMIEQKAIHETAQKETKEESKLDVGKDKNDFTTIKTSIENEENAIVCLLCDSQFETKHDRNKHMLNSHTEHRQLYKCSTCGKTFVQK,HIKETKYTCELCGKLFYTTGAIKLHVDSHNKERAFKCEECGKGFLRAYLLKVHNETVHSNASHCLCEVCGSAFKSQSNLKQHNLTAHTDVYKYSCDVCGKKFKRTTHRNAHMKVHSNDPANKPFKCKLCSKVFAAQARLKVHMDWHYNIRSHTCDVCGKSFLTKGNLDKHQYVHKDKKPHECQICFHGFVD,RSRPGAFLVLGGFCLGAGIGLLGAGTGLCCTVLSASLCNNPPSDMSCICSVSVSAWTPTPSSTRSSVCDIPLSSSDWSTC\n
GLEAN3_26209	SPU_026209		Inspection of the tiling array suggests that glean may have missed the following exons: SRDGARHLPEDFPLQDYLGTQSDSSERESCTSENDSYLIPSQVAYFHEKRAADRETPDATTSSHEPGRMDIDRNADITMEYNEMNNQSSQLHATGSDQNDPSCSLTEEKRFLCHVCSKGFYFKCRLSRHMEIHGIEKAPSKKSHQCMVCDLRFSRV,EGSHFKSKIHISSHSLCSGGDSSLQSSYQSMRNSQEMSGDRSCDGARHLPEDFPLQDYLGTQSDSSERESCTSENDSYLIPSQVACFHEKQA,SVCGKSFREKSTHTKHMTTHSGEKPHVCLICNKAFSNTSGLSRHNKIHTGERPYECSFCKKTFSQTHHLSRHIKIHTGERPFECSVCSKTFSERGYLTEHQRVHTGEKPYFCSICEKRFTSNSCCKRHMRIHTGEKPFPCS\n
GLEAN3_27147	SPU_027147		Inspection of the tiling array suggests that glean may have missed the following exons: KPEWRPIPFYVELPPFWLLADEISSPIPHIFPLSPFRATPPFLPQVFLVPNPYTPPCPLAYTTFYLLMYTTLAHSSCHTPIPSRLPPLYHNSHLLLLTTPFFTLSFIIFVLGGVLYLSNVFQEGKKKRGKDN,STYYYTCTCTIVIVAAFFCTLLVCCNDEKPYLTLKSAPHHASLWSRLNLYSLEASSAAMRLARMLVPLCPGLWCQVKRFCGRHPGHYNKVISKDRPKEKQKVVSIDRWSLY\n
GLEAN3_27709	SPU_027709		Inspection of the tiling array suggests that glean may have missed the following exons: SSTVFHHQHLSSPLHAPPSCARSKPSANCRPAHIHYIQRVLSCDPVCDSLGTTCWGKRSRNSGKGSNPIGHRPPLLLHLLRGPDSGGTSHA,LVFVHELTCAKPTLADIALVRFLTCVFLLVVIKCAFRGKGSGAKGALERSVLGMLLDMNQQLAFHVKAFLTEAAGEFAAAEVDLVLVALENVAIDEPSRTKLALERAGTIVIVHGFPEHVDLFDLVLVLCWVGSILFNLAYLRCLLYLGTSSCQWLNTMMLPQMIMQEAVSDECLTADIASSRP,CHIGSCWKNSNLSFGSSLLQVTGMWFNAMYFLKMCIKGCWVCETLYTKITFKITRPLSFGTTELVLSFKHFPMFLLLVSRQALLIDVSLVTICTRPRGKRRVEGAWLVGGGSTMFHTFMSHYKLFLCVHQPTHITSIWSFIVILDMCFERVFVRAIKPTLGALVFVTFCLTGLSFSIGGTVPFLKMKQKSVLLFIKTSTLRAPKGIQLVSFHMALKLLRTCTGVATSLAKILTRSVPPYPQLNLQPFFTTSTFRVHCMLRLLVPGQSLPRIADLPTFITSKGCFLVTQYVIR,PLKGPKRTPAGSVRSATTQTVFCPSFLYASPSGRGPPRGASSGSLVVSAASTGASACSDGARLGAELVCGGPDCSPAGTMCG,YAIHNNIMCRAPKGENRGEKQISMRLFYKTNRIYLQICTHDPLTKLPTRDLHRLGDLLFETFITLILLVRFISNFCHSVHLVYLLSYK\n
GLEAN3_27753	SPU_027753		Inspection of the tiling array suggests that glean may have missed the following exons: KNFTHIKKGYHQSPPTVCYQLPLFVCLVFPSLSPSVSVSLSHSVSTALYPPVSPSLSVSLSLSKVYLSHLLASCKSFSFRCLRIVSLSEDDLENRLE,SSQPATHMGLCCCCSLRRAASNAAALTLCLSLPAALLPEARKNRSEWLADGWSTEVESGACLVADVSWLAADGGAEYVDDVDGCEWDVPC\n
GLEAN3_27912	SPU_027912		Inspection of the tiling array suggests that glean may have missed the following exons: RSFPLALELAVGQTSPEEIVALQHKWLESHLSPVGAASISARACTFLTVLTRNPPLGSGHCIMIFSCMSLLLRDQTPASSDLRNQPKLAELW,SGSFFRAFASRLSLTSSFSSGGSSSSSGGSSSSSHAHSTSSSAPSTSSNGPSTSSNGPSTSSDGPTSMSASGSPAVSASVAWLVGSSSYCVHESLV,GFRFPFEPDFIFFLRWLILFFRWLILFLPCPLYFLQCPFYFLQWPIYFLQWPIYFLRWPHFYVCFRVSSCVCFSSLAGWILLILCP\n
GLEAN3_27919	SPU_027919		Inspection of the tiling array suggests that glean may have missed the following exons: NVSDAQKFQCGMCEDLFTSAKLILRHMRCEHSQDRTEDLYPVLMWKRKKKKKALETENSSQEDLTIENKVSEQGDLVFKCRTCEKVFSCHGRLKEHESFHKFSQGHACPVCDKKETNSRTLAKHMKTHEPLVLKCKECNRIYKTKSALRKHLNEFHGHQCRICSERFPYMTDCKKHEQTHQGSTKNGAHTGKSLLSASSPEEPKDMLGKTSNYYQRPFKCRYCPKRYSLRSSVKTHEKERHTGDLVFKCPHCPKVFGREYRLIDHLRSHEENRMYRCKLCPKTFGSESALTNHQGEHTGLKPFSCDICSKGFRIKKAVQDHKRRIHQKRQMRFFCSVCNKGFADKGNFTKHERRHKGVRPYVCLECGKGFTAKSCLTTHIKAMHTAEKPFSCELCGKTFSLNQNYTYHMFRHKEQGDISSIQQ,SYLKDSGPVHSSANHLHHAHLQRPVAFGGHGVKFSSLPASYTPPITAQEPAVERNDVPVSLTTCVPVVERTDLPTTTRESVIDIPPISMRESPTDSCIPPPTKQELAEDSYMSPSTAHESPIDSSISPTTTQESVEDSSIPATTTNEQVTDSINIEPVTTSEVAMDEDIPPAKSWGTVEERSIRLVTTHKPQAEGNT,LYFFCHFFLSPGQTPIAPPAPSKPLHSKEQRKQDTSSSSQSNTSPPASAVEPSLLAALNLKKSRMLSSTPELPQGLSFRWTLEGKVEGVVAKGTVEKGSEFGPYPGSLMNEEQGLSKD,ALTSNLVEFFPLYHMTTCWLDSNIVFEGIAYDRVVLEAVLCNLEIYYGRAIKFCVICHNQQTEVALLNNIIDPGFGEPTHLAIG,SSFYEQWKVITPLVTTSLKGINKKTDNSNNRHHSSNHNHNSNHSSSNMLHTIPLICQYHTTHPLSISLSIWGTINRPEFSQYHTQDLWHKCSVQFIMVILINLILIF,AMEGNYPPGYYLPERDQQENRQQQQPPPQQQPQPQQQPQQQQHVAHYPFDLPIPYHTSSQHQSEYMGDDQPTRVFPVPHPGPMAQMFSPVYHGNINKSDFDI\n
GLEAN3_28222	SPU_028222		Inspection of the tiling array suggests that glean may have missed the following exons: ADNCSGLQGFLIFHSFGGGTGSGLNALLMERLSVDFGKKSKLEFAIYPAPQVSTAVVEPYNSILTTHTTLEHSDCAFMVDNEAIYDICRRNLDIERPSYQNLNRLIGQIVSSITASLRFDGALNVDLTEFQTNLVPYPRIHFPLVTYAPVISSEKAFHEQLSVSEITTSCFEPLNQMVKCDPRHGKY,TLSSWSVSPSTSARSPNWSSPSIRHLRFPPLLSSHTTPSLPLTPPSSTPTVPSWSTTKPSTISAVVISTSSVRHTRTSTV,LAKVQRAVCMLSNTTAIAEAWGRLNHKFDLMYAKRAFVHWYVGEGMEEGEFAEAREDLAALEKDYEEVGIDSCDAEAEDDEDY\n
GLEAN3_28746	SPU_028746		Inspection of the tiling array suggests that glean may have missed the following exons: FRSFTEIDFWQNEEDGRCFDGETRESGDDDHGGNNKSCECKVTEPSTGSVCSYCQKKHERTDSGGKPSLQCFFCDCSFSIECHLTRHLQFHVGMKTYDTFHCSLCKKSFLSKSDLVKHKTKCTGEKPYECIHCTSTFAKQTDLKVHIRTHNQVKNILTVQTQDQTEHSYGQSQSQCPYCKRAFKTKSTLDSHIGTMTFENSYSCSHCSSTFRSKCSLTLHNRTHKYQCFLCNKRFASLDGRNTHVKWHTGVKPHHECSYCSKKFSKKCHLDEHVRIHTGEKPYRCSYCEKGFRTKGNFTKHLKIHNGGNNEEG,KQQGEREQRLTPVKEVGLCLACYMKEESSMEFYIKEEKLLFYEAETGKSDRDQESLQEDVKQSCVDEKGWIDMFAEPEIASSALSSDPQASETVLIAVGQESVLEDDERIGDTSQESSERESVPPTQ\n
GLEAN3_28753	SPU_028753		Inspection of the tiling array suggests that glean may have missed the following exons: RISCYNTNIHDEALLLYAISHVYFSHPMMQTSCYIENIDIYVSCCEELDYTWIEMYCYNENMNKNFQRSLFLCVSQGWTLF\n
GLEAN3_00017	SPU_000017		Inspection of the tiling array suggests that glean may have missed the following exons: KLKPLETRMMDRLEERHQKERPWETRMIDRPEERHQKERPWETRMDRPEERHQKEKPWKTKMMERPWETRMMERPWETRVMDRPEERHQKEKPWETRMIDRPEERHQKEKPWETRMMDRPEERHQKEKP,QTLAEVKVETIGDKNDGQAGGKTSKGETMGDENDRQTGGKTSKGETMGDEDGQAGGKTSKGETMEDENDGETMGDENDGETMGDESDGQTRGKTSKGETMGDENDRQTGGKTSKGETMGDENDGQTGGKTSKGETIGDENDSSIQFISHHF\n
GLEAN3_00437	SPU_000437		Inspection of the tiling array suggests that glean may have missed the following exons: VVALLPADTDLLVVELEVVGAQDLAVAAEYSDSDFLDVVSLDVVVLEAVGLFVVAAGLTDVADVVLFEVAVGLHDVVVFALHLVDDVMSEDQQEVVPGVALLLVAVILDYWAVPGSHLVDILVIGLHFH,HIIIKYRSADVVCTRRLGYWFQTTELAHEVHCNITQAMGRRRAKSIFGTGVISTPLFSTKLFGTGNVDNLANMKSHFGTLYYRPM,FEAEQFCMFIYNNITIYMFLHSVSVDTIVQCRYYPCLCHFTVPTSQIIHLQIAMLCIKITRSPRTPKWYPNQPRYISHNTPSPCFALETKIHTSNIELPETCI\n
GLEAN3_00440	SPU_000440		Inspection of the tiling array suggests that glean may have missed the following exons: PLQEFTLFFAPNCVLFLHESVMLSVQSREFGKKKFCCVTSSLFLSLRQTGSVHSTVGVHFNYDDAGTSRMHESLSSLSLPFQYQ,SRCLRIRPRRLLRRLTKRLPLRLSRRLLLRLSQRLLKRLPRRLTRRRPRRFLRRPLRSPPRRLSKSESSLRTPRVESMSS,IKMPQNKTKKAAEKANEKAPIKAVKKASVTAVAKTSEKAAKKAHEKAPKKVPQETSEKPAEKVVKKRVKSEDPKSGVHVFLGAKDFPRWCVMKEKSGCKTNVQLMRWLLGIAEKYFG\n
GLEAN3_00540	SPU_000540		Inspection of the tiling array suggests that glean may have missed the following exons: IQININKGENHITRRERILFLHSDNYFDLPVSNSLAFKQLTMAHTLIVSYPTHLAPTGLFHHLTSLKNRHTHDLPNDYFC\n
GLEAN3_00578	SPU_000578		Inspection of the tiling array suggests that glean may have missed the following exons: THTREKLYECSHCQKSFSHKGNLTQHLLTHTGEKPYECCSCKKGFSQKSTLNCHILTNGRKAIRVFTTKTHIAEKPYVHIVVNGFLKKVLLNIYVRKVFLTNAISHPTPTNTHRKKAL\n
GLEAN3_00603	SPU_000603		Inspection of the tiling array suggests that glean may have missed the following exons: HKIFIYFWSGCVNGGGQIKAEEGITAKELLVEVEELYNSAPTRTPEENEVVQTLYEEWLGGVGSEKARTMLHTQYHALEKNTNALNIKW,TEEGRSRLKKASQRRNYSWKLKNCTTLLLRGHPRRTRWSRLFTRSGSEEWAVRKQGLCCIRSTMRWRRTQMLLTSSGRTGIGICRLGN,QPNINYLFLTQQRGXXXXXXXXXXXXXXXXXXAQSLRVKPDCIYHVTVMPCYDKKLEASRDDFYDDVYRTRDVDCVITSGMYLLFCLALF\n
GLEAN3_01483	SPU_001483		Inspection of the tiling array suggests that glean may have missed the following exons: NPKQNIFVFFLNPEKPWQGEDKPSPSSQPKKSSSSRMKRKIKEENTPNDEVLDHVTRVTGLETVDGRIVDEEGEPEPESKKKMKKAKRKHI\n
GLEAN3_02586	SPU_002586		Inspection of the tiling array suggests that glean may have missed the following exons: SLSCVYYPKKYIFIKFKHLPGLPYINYAVLAWGKSLITQLDKLFLAQKRVIRTICNADFCAHTNPLFYHHRILKVEDIYYMQLGSLMYDLNSGVLALAKIFKKNNQIHNYTTRSASAFHLPHARTKFTLNSLVCNRPRFWSTLVLTPVSICLQT,QGQTITKAFFILPSPYFIPGYFLCFSFLFPIPLLYLQMFYSLFLLHLSVPPPPPTNLSSSIILSIYLLFLIFSPLSHSLSISPLHLFPFCVKGSIAFFNFL,INNIFILPIYIHVHLPWIHLCTAYVHHLGYQGVFLPCNHSKGKLSQKLSSSSQVLISFLGIFFVFLFFSPFPFCIFKCSIPFSFFISLYPPPPPPISLPQSYFPYISFS,IISLFYQFIYMCIYLGFIYAQRMCTISATRECSYHVIIARANYHKSFLHPPKSLFHSWVFSLFFFSFPHSPFVSSNVLFPFPSSSLCTPPPPHQSLFLNHTFHISPFLNLFPPLSLAIHFPPPPIPLLCQRFNCFLQFS,LYSVTCTCTCMLGKLENPFCQNSPFPESLKKECTTNILFTLIILIILLDCTFPLNFCGALKIMMRSIIIKYLASSVPGRRKAN\n
GLEAN3_02674	SPU_002674		Inspection of the tiling array suggests that glean may have missed the following exons: SWNSSDLMLRVSASWRVCSQLLVPFGCELHPTLMILFLSGSETCVWYSGILYLLLSISNKMYNGGIRWRISILDSMVNLNGASTCGC\n
GLEAN3_02708	SPU_002708		Inspection of the tiling array suggests that glean may have missed the following exons: LNEWLTSKSTCTCALTTSFLINLLVLKLLPIGLACISSTNELLILPSGHSILKPWHGHINIPRTRVSCAWPKCFFNQGKGPLQWNNVV,LSSRFLHMKKRVCVDQATQTMPYVYSCNHCGSSPLANKPMPPLARIQPSGPQKIKLMDRVSNPLFSGPKTPNIKVIMIKKSTAGVLKRDSRMTPQALNNLMQLGSNLQAAQPLQGNLTGSS\n
GLEAN3_03207	SPU_003207	According to the BLAST results, this is the best match for this particular GLEAN model. When examining the excel data it appears that there are several internal repeats present as well as sequence overlaps. There is also a large gap present that spans from 1183-1925. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 10. 	Inspection of the tiling array suggests that glean may have missed the following exons: ERFHRNQALKKHMMRHAGNEPYPCSECNVRCLSKPGLVRHMATHSGTKDHQCCKCGKMFARPHDLRKHEQSHEEEPETYLCFICGQTFDHKKNYHAHIGTHTRRQHGGRPTCKAKDSSETSHLLNTSDGRIS,DVYPSLVLCGIWLHIVVQKTTSVANVGKCLLDHMISASMNNHMRKNLRLICVLYVARRSIIRKITMLILVRIREDSMVDVQHAKQKTLPKPLIC\n
GLEAN3_03490	SPU_003490		Inspection of the tiling array suggests that glean may have missed the following exons: IGWFQRIRSVAAKMLILLPQPTMDLTACTNAKNTILKVRFSYSLLEFWYILLQPSPSALSVTFLSPKEKQLVLPAYTYLYVSC,RVALRDYVFSHGPTRGPIEYNTNTDDATAAITCVQHTQKPKCEENFGLFSGTRADHVEIKLLTNSNQVHDVNQHLPKLHPL\n
GLEAN3_03649	SPU_003649		Inspection of the tiling array suggests that glean may have missed the following exons: IYSVSRKDGYRPYYIFPLMKYHISNEPPVIIYNFIKKKIHLLVRKVTYNSSRHHSVTSSPITELCKADRVKTSAVIAPGVNTSEEIARGKTGVLADRCVCVCR\n
GLEAN3_03848	SPU_003848		Inspection of the tiling array suggests that glean may have missed the following exons: EGESCKKAKRKRKPTKPIKFMEYCSEAERIVVGNLPSPKATASYSTDQVMLMETDSSALNNEQFSQSSTQHMYCLLNFKIYECLQCGLGFASEKAMNVHIRTHSKEKPYWCTECDIGFTEHQLYMAHKQSHRPCKCDECGASFGNGSTLKNHKLLHLQSKNFKCSVCPKMFKQRAGLTCHMRSHTDERPYLCKECGAAFVDNKSLQNHMSVHSDEKAFKCSVCPKMFKQRAGLAHHMKHHNDDKQYLCKECGAAFAYNIHLQNHKAIHSDEKTFKCTICPKMFKQRAGLTGHMKAHTDEKPFMCELCGKSVKTKSTLKKHRMIHSEEKPYQCPLCPQAFKQRAGLSQHSHKHGEGNPY\n
GLEAN3_04802	SPU_004802		Inspection of the tiling array suggests that glean may have missed the following exons: KLMNTFLLLFPPQSFYKNQTPLRARQTIKMKLKKQQKNPITTIVIMMMMTALQEKEVGRKAPWLEGVKAPNLRRNRQRNHLNHHRQRNTNATSVTLS,YLEINEHFSFTFSSPVILQKPDTSQGKTDNKDETQEAAEESNHNNCNNDDDDSPAGEGGREEGTLAGGGQGPEPEAEQAEEPFEPPPPEEHECNICNSVLTSLWELD\n
GLEAN3_04806	SPU_004806		Inspection of the tiling array suggests that glean may have missed the following exons: EGYSIFHTFLVVHLALSFPTRDLHLRPLVLLLLLLLELRGRGTYCALFGLCDLFLGMYLHVSVKMETFADLGADGTGLELGRGRFTHVGVHVGIDLHLI,GIQHLPYLSRSPSRPELPYSRSSSEASCAAAAAAAGAAGKRNLLRPLWPLRSLPWNVSSCVGKDGDLCRPWCRWDRSGAGARALHPCWRSRRN\n
GLEAN3_04807	SPU_004807		Inspection of the tiling array suggests that glean may have missed the following exons: NPVWMFVCLFMSFWELNSLKQCGHGCFRFGSVRSGCSCAANVPPLMSWWRRSFFWARDLQMSQMYENSVRSGEVSSWFKTSASFPQPPHLTLTVQGSRK\n
GLEAN3_06058	SPU_006058		Inspection of the tiling array suggests that glean may have missed the following exons: FTKYQLSIHMKNHPEVKPFQCSACDKRFSLKSYLAQHMKYHSDKKTHQCPMCPKGFIRNSVLQEHIKTHASEKPFECAMCGKRFSSKISLAVHMKKVCKKRPDREQQDNPLPSV\n
GLEAN3_09398	SPU_009398		Sp-MAP2K5 spans two glean prediction: \nGLEAN3_09399 and GLEAN3_09398\n
GLEAN3_00751	SPU_000751		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix D.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_01794	SPU_001794		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix C.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nIt is noteworthy that GLEAN3_01797 (Sp-MACPF-C.2), a model adjacent to this gene, also contains the MACPF domain. A comparison of their protein sequences reveals high similarity but a fair number of differences as well. It is to be determined whether this fact reflects the erroneous assembly of different haplotypes (both genes are indeed located in an area of numerous contigs) or if reflects a true gene duplication event.\n
GLEAN3_05223	SPU_005223		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix A.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNB: Given the position of this model in a small scaffold and its similarity to typically larger models, it is likely that this is an incomplete model. Also, other gene prediction protocols generated slightly different models for this gene. For lack of better evidence, we have decided to accept this glean model in its present form. In addition, GLEAN3_22091 shows high sequence similarity to this model, but some differences as well (including sequence gaps); it is yet to be determined if these two models might represent haplotypes wrongly assembled as two different genes.\n
GLEAN3_06818	SPU_006818		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix G.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNB: This model is located at the end of a scaffold. We cannot rule out that this model has been forcedly truncated during the gene prediction process. In fact, its only functional domain (MACPF) lies next to the end C-terminus of the predicted protein, which is uncharacteristic of the other members of this family of genes.\n
GLEAN3_14677	SPU_014677		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix C.3 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNB: Other gene prediction protocols incorporate additional C-terminus sequence in their models for this gene, but without adding new identifiable domains to the predicted protein. For lack of better evidence, we have therefore decided to accept this model in its current form.\n
GLEAN3_16546	SPU_016546		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix B.3 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nThe structure of this gene is highly supported by the embryonic genome-wide tiling array hybridization data, and by identical models generated by almost all gene prediction protocols.\n
GLEAN3_17952	SPU_017952		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix A.4 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_22091	SPU_022091		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix A.2 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNB:  GLEAN3_05223 shows high sequence similarity to this model, but some differences as well (including sequence gaps); it is yet to be determined if these two models might represent haplotypes wrongly assembled as two different genes.\n
GLEAN3_07159	SPU_007159		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix E.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNB: This model is located at the end of a scaffold. We cannot rule out that this model has been forcedly truncated during the gene prediction process. In fact, its only functional domain (MACPF) lies next to the end C-terminus of the predicted protein, which is uncharacteristic of the other members of this family of genes.\n
Sp-Gg1d	SPU_030084		Gene model derived from est matches. Predicted protein sequence matches exactly to GLEAN3_14498 (Sp-Gg1), but an additional intron seems to be present in 3'utr.\n
Sp-Gg3	SPU_030085		glean inferred from est data\n
Sp-Gg4	SPU_030086		Gene predicted based on homology to human sequences, but this locus might be a pseudogene (because it doesn't have introns).  It seems that it is expressed though based on tiling array data.\n
GLEAN3_00986	SPU_000986		This gene model is located at the end of a  very short contig. The nucleotide sequence of the first exon has 94% similarity to that of another Sp-Tlr gene. It could be a member of Toll-like receptor. The second exon that was 100000bp separated from the first was eliminated. \n
GLEAN3_13279	SPU_013279		#\nGene fragment- likely haplotype of Glean3_18156- see annotation of that gene.\n
GLEAN3_18156	SPU_018156		Complex gene with varying exon predictions among different models. Tiling data inconclusive.\n
GLEAN3_07980	SPU_007980		Haplotye of Glean3_25345- see that gene for annotation.\n
GLEAN3_05834	SPU_005834		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin C, and it strongly co-clusters with it in a NJ phylogenetic tree. We have therefore decided to name it "Cathepsin1" but note this high similarity by making "CathepsinC" one of the synonyms for this model.\n
GLEAN3_09042	SPU_009042		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 2" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like1" one of its synonyms. \n \nNB: The structure of this model is highly supported by the fact that other gene prediction protocols generated almost identical models and by the genome-wide tiling array hybridization data.\n
GLEAN3_09368	SPU_009368		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin L2; however, a NJ multiple protein alignment tree shows that its sequence is equally related to that of Cathepsin L, L2, K and S. For this reason we decided to name this model only "Cathepsin 3" and not group it with other Sp-CathepsinL-like models. \n \nNB: The structure of this model is highly supported by the fact that other gene prediction protocols generated almost identical models.\n
GLEAN3_09601	SPU_009601		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts similarly to Cathepsins X, Y and Z, and it strongly co-clusters with them and GLEAN3_13893 in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 4" for consistency, while noting its high similarity to Cathepsin Z by making "CathepsinZ-like1" one of its synonyms. \n \nNB: Other gene prediction protocols generated noticeably different models for this gene. However, the genome-wide tiling array hybridization data indicate that all exons of this glean model would be expressed at similar levels during embryonic development, which supports the idea that they all belong to the same gene. For lack of better evidence we have decided to accept this glean model in its current form.\n
GLEAN3_13893	SPU_013893		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts similarly to Cathepsins X, Y and Z, and it strongly co-clusters with them and GLEAN3_09601 in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 5" for consistency, while noting its high similarity to Cathepsin Z by making "CathepsinZ-like2" one of its synonyms. \n \nNB: The structure of this model is highly supported by the fact that other gene prediction protocols generated almost identical models and by the genome-wide tiling array hybridization data. It must be noted, however, that the N-terminus for this model is located very close to one end of the scaffold. For this reason, we cannot rule out that more N-ter sequence not included in this model exists.\n
GLEAN3_14767	SPU_014767		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 7" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like3" one of its synonyms. \n \nThe structure of this model is highly supported by the fact that other gene prediction protocols generated almost identical models. It must also be noted that both adjacent models (GLEAN3_14766/GLEAN3_14768) show signficant similarity to this model, which raises the question of whether these may reflect gene duplications. A careful inspection of their sequences and domain structure suggests that GLEAN3_14766 is a true cathepsin gene (annotated as such by Esther Miranda - Duke University), whereas GLEAN3_14768 may have been generated as a result of an assembly error (see GLEAN3_14768 for more details). Both models show significant differences at the aminoacidic level with GLEAN3_14767, which would argue that they are due to true gene duplication events, something that is also observed among vertebrate cathepsin L and L-like genes.\n
GLEAN3_14768	SPU_014768		This model was annotated based on a manual inspection of multiple protein sequence alignments. Based on such analysis, we propose that this is very likely an incomplete/wrong model. A careful analysis of its sequence and structure domain indicates that its true last coding exon (containing a STOP codon) may fall in a sequence gap \nadjacent to its second last coding exon (i.e. the last coding exon may have been forcedly incorporated to this model). For lack of supporting evidence for this claim, we have accepted this glean model in its present form. \n \nThis model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 8" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like4" one of its synonyms. \n \nThe (wrong?) structure of this model is highly supported by the fact that other gene prediction protocols generated almost identical models. It must also be noted that its adjacent model (GLEAN3_14767) shows signficant similarity, which raises the question of whether this may reflect a gene duplication. A careful inspection of their sequences shows some noticeable differences at the aminoacidic level which would argue that they are due to a true gene duplication event, something that is also observed among vertebrate cathepsin L and L-like genes.\n
GLEAN3_14765	SPU_014765		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments. We have modified this model based on the models generated by NCBI and FgeneshAB, both of which show a good alignment to cathepsin L-like genes. The original glean3 prediction presented exon and domain structures that clearly resembled artificially fused genes. In fact, the remaining of the original glean3 prediction, which is represented as well by respective NCBI and FgeneshAB predictions, closely resembles genes present in othey phyla, which supports the claim that the original version of GLEAN3_14765 wrongfully fused to separate genes. \n \nThis modified model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 9" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like5" one of its synonyms. \n \nNB: The adjacent model (GLEAN3_14766) shows signficant similarity to this modified GLEAN3_14765, which raises the question of whether this may reflect a gene duplication. A careful inspection of their sequences shows some noticeable differences at the aminoacidic level which would argue that they are due to a true gene duplication event, something that is also observed among vertebrate cathepsin L and L-like genes. Although we cannot rule out, based on this analysis, that they may be different alleles of the same gene and that the apparent duplication be due to an assembly error.\n
GLEAN3_28748	SPU_028748		Amino terminal domain truncated- no obvious 5' exons. Tiling data not consistent with gene models.\n
GLEAN3_14914	SPU_014914		This model was annotated based on a manual inspection of multiple protein sequence alignments. \n \nThis model Blasts best to Cathepsin F, and it strongly co-clusters with Cathepsin F in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 10" for consistency, while noting its high similarity to Cathepsin F by including "CathepsinF-like1" as one of its synonyms. \n \nNB: The structure of this model is highly supported by the fact that other gene prediction protocols generated very similar models and by the genome-wide tiling array hybridization data.\n
GLEAN3_20838	SPU_020838		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments. We have modified this model based on the corresponding model generated by NCBI, which shows a better alignment to cathepsin L-like genes (although it generates an incomplete cds - i.e. no stop codon). The remaining (C-ter) of the original glean3 prediction does not Blast back to any known sequence in nr nor does it contain any identifiable functional domain, which suggests it may represent an artificial fragment fused to the model for lack of an earlier stop codon in the scaffold. \n \nThe modified model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 13" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like8" one of its synonyms. \n \nNB: The adjacent model (GLEAN3_20837) shows signficant similarity to this modified GLEAN3_20838, which raises the question of whether this may reflect a gene duplication. A careful inspection of their sequences shows some noticeable differences at the aminoacidic level which would argue that they are due to a true gene duplication event, something that is also observed among vertebrate cathepsin L and L-like genes. Although we cannot rule out, based on this analysis, that they may be different alleles of the same gene and that the apparent duplication be due to an assembly error.\n
GLEAN3_20837	SPU_020837		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments. Its structure is supported by almost identical models generated by all other gene prediction protocols. \n \nThis model Blasts best to Cathepsin L, and it strongly co-clusters with Cathepsin L and several other glean models in a NJ multiple protein alignment tree. We have therefore decided to name this model "Cathepsin 12" for consistency, while noting its high similarity to Cathepsin L by making "CathepsinL-like8" one of its synonyms. It should be noted, however, that this model lies close to one end of the scaffold, and we cannot therefore rule out that additional N-ter sequence has been left out in the assembly process. \n \nNB: The adjacent model (GLEAN3_20838) shows signficant similarity to this model, which raises the question of whether this may reflect a gene duplication. A careful inspection of their sequences shows some noticeable differences at the aminoacidic level which would argue that they are due to a true gene duplication event, something that is also observed among vertebrate cathepsin L and L-like genes. Although we cannot rule out, based on this analysis, that they may be different alleles of the same gene and that the apparent duplication be due to an assembly error.\n
GLEAN3_17141	SPU_017141		This gene is present on three GLEAN predictions. GLEAN3_17141 contains the first ~890 AA and GLEAN3_12930 and GLEAN3_21952 have the rest. GLEAN3_12930 is the largest piece with ~1400 aa.\n
GLEAN3_12930	SPU_012930		This gene is present on three GLEAN predictions. GLEAN3_17141 contains the first ~890 AA and GLEAN3_12930 and GLEAN3_21952 have the rest. GLEAN3_12930 is the largest piece with ~1400 aa.\n
GLEAN3_21952	SPU_021952		This gene is present on three GLEAN predictions. GLEAN3_17141 contains the first ~890 AA and GLEAN3_12930 and GLEAN3_21952 have the rest. GLEAN3_12930 is the largest piece with ~1400 aa.\n
GLEAN3_02324	SPU_002324		embryonic lethal, abnormal vision, drosophila, homolog-like 1; Hu antigen R\n
GLEAN3_10853	SPU_010853		Putative pre-mRNA splicing factor ATP-dependent RNA helicase DHX15 (DEAH box protein 15)\n
GLEAN3_08911	SPU_008911		This model was annotated based on a manual analysis of multiple protein sequence alignments and domain composition. Its structure is supported by almost identical models generated by other gene prediction protocols. \n \nThis gene aligns well with and shows a domain structure that resembles genes of the heme-dependent peroxidase superfamily, which includes a large number of genes from various phyla. These genes are most typically identified based on their biological function (e.g. ovoperoxidase, lactoperoxidase, myeloperoxidase, eosinophil peroxidase, etc), but they all present a secretory signal peptide and a single haem-peroxidase domain. \n \nNB: The N-ter of this model is likely missing, based on the structure of the genes with which this gene aligns best and the fact that this model is located at the end of a scaffold.\n
GLEAN3_19097	SPU_019097		This model was annotated and modified based on a manual analysis of multiple protein sequence alignments and domain composition. \n \nThis modified model aligns well with and shows a domain structure that resembles genes of the heme-dependent peroxidase superfamily, which includes a large number of genes from various phyla. These genes are most typically identified based on their biological function (e.g. ovoperoxidase, lactoperoxidase, myeloperoxidase, eosinophil peroxidase, etc), but they all present a secretory signal peptide and a single haem-peroxidase domain. \n \nNB: Different gene prediction protocols generated noticeably different models for this gene (mostly towards the N-ter of the model). The NCBI model shows the best alignment with other heme-dependent peroxidases to which this model Blasts back, and thus we chose it to modify this glean prediction.\n
GLEAN3_02004	SPU_002004		This model was annotated based on a manual analysis of multiple protein sequence alignments and domain composition. Its structure is supported by almost identical models generated by other gene prediction protocols and by the embryonic genome-wide tiling array hibridization data. \n \nThis gene aligns well with and shows a domain structure that resembles genes of the heme-dependent peroxidase superfamily, which includes a large number of genes from various phyla. These genes are most typically identified based on their biological function (e.g. ovoperoxidase, lactoperoxidase, myeloperoxidase, eosinophil peroxidase, etc), but they all present a secretory signal peptide and a single haem-peroxidase domain. \n \nNB: The N-ter of this model is missing, based on the structure of the genes with which this gene aligns best and the fact that this model is located at the end of a scaffold.\n
GLEAN3_10343	SPU_010343		DEAD Box containing protein. Similar to either DEAD (Asp-Glu-Ala-Asp) box polypeptide 5 OR DEAD (Asp-Glu-Ala-Asp) box polypeptide 17.\n
GLEAN3_24969	SPU_024969		the original glean is a C-terminus of snx13 gene; the N-terminus is on the same scaffold - the previous glean GLEAN3_24968; I've combined predictions here.\n
GLEAN3_06014	SPU_006014		Added 3' exon from Angerer model due to alignment to mmp13.\n
GLEAN3_24968	SPU_024968		n-terminal piece of Sp-RGS-PX1; full annotation is in Glean GLEAN3_24969\n
GLEAN3_18333	SPU_018333		First exon position is uncertain: some discrepancy with the est. Neither est nor homology searches give a nice fisrt exon prediction, the exact region must not be in the assembly yet.\n
GLEAN3_07675	SPU_007675		pre-mRNA splicing factor SF3a (60kD). GLEAN3_16711 is a duplicate prediction.\n
GLEAN3_16711	SPU_016711		Duplicate prediction. GLEAN3_07675 is complete gene.\n
GLEAN3_11993	SPU_011993		Human gene is significantly longer. But the prediction may be complete as Elegans and Drosophila genes are about the same size. \n \n \nThis gene encodes subunit 2 of the splicing factor 3a protein complex. The splicing factor 3a heterotrimer includes subunits 1, 2 and 3 and is necessary for the in vitro conversion of 15S U2 snRNP into an active 17S particle that performs pre-mRNA splicing. Subunit 2 interacts with subunit 1 through its amino-terminus while the single zinc finger domain of subunit 2 plays a role in its binding to the 15S U2 snRNP. Subunit 2 may alsofunction independently of its RNA splicing function as a microtubule-binding protein.\n
GLEAN3_04897	SPU_004897		Likely missing 3' exon not presenton contig.\n
GLEAN3_02029	SPU_002029		Haplotype of Glean3_04897 (Sp-mmp24d).\n
GLEAN3_13513	SPU_013513		Duplicate prediction for GLEAN3_26806\n
GLEAN3_26806	SPU_026806		Distal part of the SF3a120. First part is predicted by GLEAN3_27895.\n
GLEAN3_27895	SPU_027895	After reviewing the data and performing a BLAST search, it appears that there is no sufficient fit for this particular GLEAN model. The sequence is distributed onto 2 different scaffolds that contain several gaps as well as overlaps. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong overall with most of the values being <10	GLEAN3_27895 contains the first part of the gene. GLEAN3_26806 codes for the tail end. GLEAN3_13513 is a duplicate prediction for GLEAN3_26806.\n
GLEAN3_00866	SPU_000866		Likely missing 5' exon based on blast alignments.\n
GLEAN3_27966	SPU_027966		PHD finger-like domain protein 5A (Splicing factor 3B associated 14 kDa protein) (SF3b14b)\n
GLEAN3_02161	SPU_002161		This gene encodes one of four subunits of the splicing factor 3B. The protein encoded by this gene cross-links to a region in the pre-mRNA immediately upstream of the branchpoint sequence in pre-mRNA in the prespliceosomal complex A. It also may be involved in the assembly of the B, C and E spliceosomal complexes. In addition to RNA-binding activity, this protein interacts directly and highly specifically with subunit 2 of the splicing factor 3B. \n            This protein contains two N-terminal RNA-recognition motifs (RRMs), \n            consistent with the observation that it binds directly to pre-mRNA.\n
GLEAN3_00101	SPU_000101		Likely has an extra-exon predicted.\n
GLEAN3_17104	SPU_017104		The nucleotides of the second exon has 93% identity to another typical Sp-Tlr gene, while that of the first exon doesn't. This gene model may be a member of Toll-like receptor. \n \n
GLEAN3_03478	SPU_003478		Missing one or more exons at the beginning of the gene. GLEAN3_03477 may represent one of the missing exons or it may be a duplication of the GLEAN3_03478.\n
GLEAN3_03477	SPU_003477		Missing one or more exons at the beginning of the gene. GLEAN3_03477 may represent one of the missing exons or it may be a duplication of the GLEAN3_03478.\n
GLEAN3_13007	SPU_013007		GLEAN3_17802 is a duplicate prediction.\n
GLEAN3_17802	SPU_017802		This is a duplicate prediction for GLEAN3_13007\n
GLEAN3_21629	SPU_021629		similar to mitosis-specific chromosome segregation protein SMC1. GLEAN3_26628 appears to be a duplicate prediction over the last exon(s).\n
GLEAN3_26628	SPU_026628		GLEAN3_26628 appears to be a duplicate prediction over the last exon(s).\n
GLEAN3_12607	SPU_012607		small nuclear ribonucleoprotein D1 polypeptide (16kD); snRNP core protein D1; Sm-D autoantigen"\n
GLEAN3_08908	SPU_008908		Modified gene model to reflect cloned cDNA. Included sequences present on small scaffolds 85810, (167442 and 56237 both match same region), 160020. \n
GLEAN3_24030	SPU_024030		GLEAN3_10849 is likely a duplicate prediction for GLEAN3_24030\n
GLEAN3_10849	SPU_010849		GLEAN3_10849 is likely a duplicate prediction for GLEAN3_24030\n
GLEAN3_19147	SPU_019147		GLEAN3_21883 is a duplicate prediction of GLEAN3_19147\n
GLEAN3_21883	SPU_021883		GLEAN3_21883 is a duplicate prediction of GLEAN3_19147\n
GLEAN3_12978	SPU_012978		Lacking C-ternimus.  See GLEAN3_28726, _27443, _08472. \n
GLEAN3_05089	SPU_005089		This gene model has no TIR domain. But the nucleotides encoding signal peptide, LRRNT, LRR(15-23), CT has 88% similarity to another typical Sp-Tlr. The unknown sequence (NNN) in the 3'end could makes the gene model incomplete. \n
GLEAN3_01231	SPU_001231		#\nThis gene model has no TIR domain. But the nucleotides encoding LRR(12-21) and CT has 85% similarity to another typical Sp-Tlr. Unknown sequence (NNN) at the 3' end of this gene model could makes the gene model incomplete. \n
GLEAN3_06611	SPU_006611		This gene model has no TIR domain. But the nucleotides  encoding SP, NT, LRR(12-23), CT has 86% similarity to another typical Sp-Tlr. The unknown sequence (NNN) at the 3'UTR could makes the gene model incomplete. \n
GLEAN3_08150	SPU_008150		#\nThis gene model has no TIR domain. But the nucleotides encoding SP, NT, LRR(12-23), CT has 86% similarity to another typical Sp-Tlr. The unknown sequence (NNN) at the 3'UTR could makes the gene model incomplete. \n
GLEAN3_09172	SPU_009172		#\nThis gene model has no TIR domain. But the nucleotides encoding SP, NT, LRR(9-21), CT has 85% similarity to another typical Sp-Tlr. Unknown sequence (NNN) at the 3' end of this gene model could makes the gene model incomplete. \n
GLEAN3_28478	SPU_028478		Sp predicted similar to CG5680-PB\n
GLEAN3_00151	SPU_000151		dulicate accession: NP_999689 \nGLEAN3_10284 is a nearly identical internal duplicate\n
GLEAN3_16748	SPU_016748	From the BLAST results and the excel data, it is evident that this is the best results for this particular GLEAN model. When examining the excel data, it was apparent that the sequence did not begin until the 164th base pair and the there was a small gap within the sequence as well that spanned from 280-407. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	One of 2. Overlaps with GLEAN3-17385, and matches exactly in the overlap. This sequence is longer in the C term, and appears to lack the start codon. Only the c-terminal 2/3 of the protein matches with cdc2L1, and the string of Es appears to not be correct.\n
GLEAN3_17385	SPU_017385		One of 2, GLEAN3_16748 overlaps with this sequence. They match exactly (DNA & protein). This gene is longer at N term. and appears to contain the start codon, although internal sequences appear to be bad, and the long runs of Es aren't 'real'.\n
GLEAN3_08111	SPU_008111		matches middle of human/mouse gene sequence \nappears to be missing exons on beginning/end\n
GLEAN3_24925	SPU_024925		Notes:  \n-1 bp missing after 34146 \n-1 bp insertion at 36263 \n-2 bp insertion at 39462-3 \n-2 bp missing after 39814 \n-18 bp missing after 41009 \n-15 bp missing after 41117 \n-mismatch btwn 41178-94 \n-12 bp missing after 41291 \n-6 bp insertion after 41348 \n-mismatch btwn 41465-73 \n-6 bp missing after 41496\n
GLEAN3_13575	SPU_013575		Prediction of last exon is likely to be incorrect.\n
GLEAN3_14846	SPU_014846	From the BLAST search, it appears that this is the best fit for this particular GLEAN model. This scaffold appears to have the best overall sequence coverage when compared to the rest of the results. However, it does not have the highest bit score or lowest e-value results. When reviewing the excel data, it is evident that not all of the BLAST values coincide with the excel values resulting in base pair differences between the two sets of data. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall with the majority of valse being <5	1 of 2, the other is GLEAN3_18780. These 2 are overlapping and nearly identical where they overlap, although each has gaps with respect to the other.\n
GLEAN3_18780	SPU_018780		1 of 2, other is GLEAN3_14846. These are overlapping genes that are nearly identical where they overlap, although each has gaps with respect to the other. This gene BLASTs with an e of zero to both CG7337-PA (XP_781545.1) and to MAPKBP1-like.\n
GLEAN3_11815	SPU_011815		not the greatest homology. Possibly not correct.\n
GLEAN3_09620	SPU_009620		1 of 2, the other is GLEAN3_09206, which seems to be an internal fragment of this gene. They are identical in the overlapping region. Both genes hit XP-794873.1 as well.\n
GLEAN3_09206	SPU_009206		1 of 2, other is GLEAN3_09620. This one is a shorter version of 09620, and is identical. Both genes BLAST to both XP_794873.1 and XP_783732.1\n
GLEAN3_24798	SPU_024798		GLEAN3_24798 represent the first half of the gene. GLEAN3_09840 is the latter half.\n
GLEAN3_09840	SPU_009840		GLEAN3_24798 represent the first half of the gene. GLEAN3_09840 is the latter half.\n
GLEAN3_19790	SPU_019790		glean prediction looks like a duplicated part of larger rgs12-containing region. See full annotaion with GLEAN3_04238.\n
GLEAN3_02103	SPU_002103		Obtained Glean3_02103 from S. purpuratus genome by using N-terminal peptide sequence of NM_003972 (human gene BTAF1) with score of 72 bits and E-value 3e-13. Other Glean3 candidates had poor scores.  \n \nThe best genbank hits (XP_795066, E=3e-59; and XP_788365, E=2e-55) are predicted partial peptides similar to TBP-associated factor 172 (TAF-172) (TAF(II)170) of Strongylocentrotus purpuratus and are incomplete at the carboxy terminal. Predicted TAFs from other organisms also appear with high scores. \n \nBlasting Genbank yields some sequence support from empirical data, yet raises some warning flags. Human BTAF1 RNA polymerase II, B-TFIID transcription factor-associated,170kDa, has an E-score of 2e-24. However, sequence data for an endonuclease/reverse transcriptase (presenilin gene) from Branchiostoma floridae (amphioxus) is also returned, with an E-value of 1e-24, and other reverse transcriptases from schistosomes, several mosquito species and chickens. \n \nUpon examining the Genboree presentation of Glean3_02103, one observes: 1) The Glean, NCBI, and FgeneshAB predictions appear to span two different contigs while the Genscan model does not, by virtue of omitting the segment at the 5? end (this gene is on the ?strand); 2) The NCBI model lacks the largest exon predicted by all other gene models; and 3) No Exonerate or Splign data appear.  \n \nMicroarray tiling data from Systemix seem to indicate weak signals supporting two of the seven exons (5175-5338; 3816-4478), no support for two exons (1323-1481; 317-496), and questionable support for the remainder. \n \nUpon observing the questionable support for the gene in its entirety, I selected the sequence corresponding to the exon missing in the NCBI data and blasted the translated peptide sequence against Genbank. This time, my query obtained a hit corresponding to XP788888, which is the predicted CDS for an endonuclease/exonuclease/phosphatase family and RNA-directed DNA polymerase (5R694) of Strongylocentrotus purpuratus. Moreover, when I blasted the N-terminal 240 aa against Glean3, the best hit was GLEAN3_03468 (484 bits, E=e-137) instead of the initial GLEAN_02103, and is a stronger match than the original.   \n \n
GLEAN3_09223	SPU_009223		#\nThis gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(8-22), CT has 90% similarity to another typical Sp-Tlr. The unknown sequence (NNN) in 3'end of the coding could makes the gene model incomplete.\n
GLEAN3_01621	SPU_001621		internal duplication: looks like the ends of two contigs in the scaffold (aagj01193203 and aagj01193210) are actually overlapping... hence one of the exons was duplicated in prediction & deleted in annotation (actually, 2 exons are duplicated because of this, but only one extra made it into prediction)\n
GLEAN3_09450	SPU_009450		This gene model may represent a pseudogene or contain sequence error. A part of intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified model contains some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_11156	SPU_011156		This gene model doesn't have a TIR domain But the nucleotides encoding SP, NT, LRR(15-22) has 90% similarity to another typical Sp-Tlr. The unknown sequence (NNN) in 3'end of the coding could makes the gene model incomplete. \n
GLEAN3_16689	SPU_016689		Sp-seawi is made up of glean3_16689 on scaffold 19510 and glean3_24002 on scaffold 55433\n
GLEAN3_25996	SPU_025996		This is the only N-terminus and should be combined with GLEAN3_25997.  \n
GLEAN3_11775	SPU_011775		Partial Toll-like receptor. The nucleotides encoding SP, NT, LRR(5-17) has 89% similarity to another typical Sp-Tlr gene. This gene model is located at the end of the scaffold. \n
GLEAN3_12584	SPU_012584		#\nThis gene model doen't have a TIR domain. The nucleotides of the fist exon has 90% similarity to another typical Sp-Tlr gene, while the second exon could be wrong prediction. The fist exon is located at the end of the contig. That could make the gene model incomplete. \n
GLEAN3_24002	SPU_024002	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data and comparing it tot the BLAST results it is evident that the sequence doesn't begin until the 27th base pair (on  v2.1_scaffold23028 ) and continues until approximately the 590th base pair. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10	See Glean3_16689\n
GLEAN3_01826	SPU_001826		GLEAN3_01826 predictions may be incomplete. GLEAN3_09266 matches partially completely with GLEAN3_01826.\n
GLEAN3_18535	SPU_018535		GLEAN3_12748 appears to be a duplicate prediction for GLEAN3_18535.\n
GLEAN3_02257	SPU_002257		GLEAN3_18588 is a duplicate prediction for GLEAN3_2257.\n
GLEAN3_18588	SPU_018588		GLEAN3_18588 is a duplicate prediction for GLEAN3_02257.\n
GLEAN3_02983	SPU_002983		Partial sequence. \nNaked cuticle-2 is an EF hand calcium-binding domain protein similar to the recoverin family of myristoyl switch proteins.\n
GLEAN3_25144	SPU_025144		GLEAN3_25144 contains the first part of the gene. GLEAN3_23888 contains the second half. Both the predictions overlap significantly.\n
GLEAN3_23888	SPU_023888		GLEAN3_25144 contains the first part of the gene. GLEAN3_23888 contains the second half. Both the predictions overlap significantly.\n
GLEAN3_16974	SPU_016974		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(7) has 98% similar to another Sp-Tlr gene(GLEAN3_13751).  This gene model is located in a short scaffold, which could make it incomplete. \n
GLEAN3_14928	SPU_014928		#\nGLEAN3_14928 was divided into two Sp-Tlr genes (Sp-TlrP10 and Sp-TlrP11). This gene model doesn't have  a TIR domain, but the nucleotides encoding LRR have high similarity to other typical Sp-Tlr genes. The model is located at the end of a contig.\n
Sp-TlrP11	SPU_030087		GLEAN3_14928 was divided into two Sp-Tlr genes (Sp-TlrP10 and Sp-TlrP11). This gene model has unknown sequence in the 3' region, which could make the model incomplete.  \n
GLEAN3_25813	SPU_025813		In complete prediction.\n
GLEAN3_09091	SPU_009091	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. Both scaffolds appear to have an orderly arrangement without any internal repeats or gaps present. If the two scaffolds were combined, the sequence would have a continuous arrangement with good overall coverage. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to weak with most of the values being less than 10.	Scaffold_79280 missing 5' (leader) and 3' end (remainder of serine protease domain), probably because of incomplete sequence data.\n
GLEAN3_28187	SPU_028187	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. These two scaffolds both had the highest bit score and the lowest e-value results compared to the others. If these two scaffolds were combined, the sequence would have a continuous orderly arrangement. There was some Est information available on GBrowse assembly V0.5 and the transcriptome intensity scores were not very strong with values ranging from about 1.5-8. This is an un-annotated gene so no additional gene information (comments) was available.	Scaffold_80160 missing 5' start (leader sequence), one exon of vWF domain and 3' end (remainder of serine protease domain), probably because of incomplete sequence data.\n
GLEAN3_11546	SPU_011546		Prediction possibly too long.\n
GLEAN3_14932	SPU_014932		GLEAN3_14932 was divided into two Sp-Tlr genes (Sp-TlrP12 and Sp-TlrP13). This gene model doesn't have  a TIR domain, but the nucleotides encoding SP, NT, LRR (13-23) have high similarity to other typical Sp-Tlr genes. The model is located at the end of a contig. \n
GLEAN3_28188	SPU_028188		#\nScaffold_80160 is incomplete.  Appears to be complement factor B, but missing vWF domain and most of serine protease domain, probably because of incomplete sequence data.\n
Sp-TlrP13	SPU_030088		GLEAN3_14932 was divided into two Sp-Tlr genes (Sp-TlrP12 and Sp-TlrP13). This gene model doesn't have  a TIR domain, but the nucleotides encoding SP, NT, LRR (16-23), CT and TM have high similarity to other typical Sp-Tlr genes. The model is located at the end of a contig. \n
GLEAN3_15299	SPU_015299		This gene model doesn't have a TIR domain, but the nucloetides encoding SP, LRR(7-13) has 89% similarity to another Sp-Tlr gene. This gene model is located at the end of a scaffold, which could make it incomplete. \n
GLEAN3_15511	SPU_015511		#\nThis gene model may represent a pseudogene or contain a sequence error. 450bp of 3'UTR was accepted to a coding region that encodes a TIR domain.  \n
GLEAN3_25703	SPU_025703		Incomplete gene model: expected 5' part of the gene is missing\n
GLEAN3_15534	SPU_015534		This gene model doesn't have a TIR domain. But the nucloetides of the first exon has 91% similarity to another Sp-Tlr gene (GLEAN3_15533).  The first exon is located at the end of a contig, which could make it incomplete. \n
GLEAN3_15789	SPU_015789	From the excel data as well as the BLAST results, it is evident that this is the best fit for this particular GLEAN model. When examining the excel data, it was apparent that there were no internal repeats or gaps present within the scaffold resulting in good overall coverage. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	#\nThe first and second exons and the intron between them were accepted to this modified gene model. The nucleotides of them has 88% identity to another Sp-Tlr gene (GLEAN3_15066). The 3'end of the model is located at the end of a contig. \n
GLEAN3_18099	SPU_018099		The gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(6013) has 89% identity to another Sp-Tlr gene(GLEAN3_15066). This gene model is located in a short contig, which may make it incomplete. \n
GLEAN3_19041	SPU_019041		#\nThis gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(4-14) has 90% idenitity to another Sp-Tlr gene(13751). This exon is located at the end of a contig, which could make this model incomplete. \n
GLEAN3_07261	SPU_007261		Partial sequence. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537553-3176-9890171404.BLASTQ4\n
GLEAN3_19835	SPU_019835		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(3-7)have 98% identity to another Sp-Tlr gene (26274). It is located at the end of a contig that is far from the next one. That may make this model incomplete. \n
GLEAN3_20045	SPU_020045	After reviewing the data and performing a BLAST search, it appears that there is no sufficient fit for this particular GLEAN model.. There are numerous repeats and sequence gaps within the scaffolds. There was no Est support available and the transcriptome intensity scores appeared to be very weak.	#\nThere is 1414bp of unkown sequence (NNN) in this gene model, which could make it incomplete.  The nucleotides except unknow sequence have 85% identity to another Sp-Tlr gene(15303). So it could be a member of Toll-like receptor. \n
GLEAN3_01665	SPU_001665		Similar to Dual specificity phosphatase 11. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137536261-8980-151643868263.BLASTQ4\n
GLEAN3_20666	SPU_020666		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(9-21), CT have 88% identity to another Sp-Tlr gene (05950). There seems to be an assembly error in the contig of this model, which makes it incomplete.   \n
GLEAN3_14625	SPU_014625	After reviewing the data it appears that there is no sufficient match for this particular GLEAN model. The data appears to be dispersed between 3 scaffolds.  There is EST information available from GBrowse V0.5 and the transcriptome intensity scores appear to be some what strong (averaging about a 15).\nAdditional information found on Baylor under gene information (comments): Does not clade with human Ppm1g in phylogenetic analysis.	#\nDoes not clade with human Ppm1g in phylogenetic analysis.\n
GLEAN3_24003	SPU_024003		Based on position on contig and alignment with best blast hit (human) it is likely this gene model is missing a 3' exon.\n
GLEAN3_28884	SPU_028884		GLEAN3_28884 represents 5' end (exons 1-3) of the Sp-Stx16 gene. GLEAN3_28885 represents the 3' end (exons 4-11) of this gene.\n
GLEAN3_20266	SPU_020266		Similar to Protein phosphatase 1D magnesium-dependent delta isoform .\n
GLEAN3_28885	SPU_028885		GLEAN3_28885 represents the 3' end (exons 4-11) of the Sp-Stx16 gene. GLEAN3_28884 represents the 5' end (exons 1-3)of this gene.\n
GLEAN3_05149	SPU_005149	After reviewing the data and performing a BLAST search, it appears that there is no sufficient match for model_GLEAN3_05149. The sequence was distributed onto 4 different scaffolds. The BLAST search indicated that the best results were in fact a short stretch of the sequence that only covered 231/996. This portion of the sequence did not have very good coverage but did however have the lowest e-value and bit score when compared to the other results. If the 4 scaffolds were combined, the sequence would have a continuous, orderly arrangement with no repeats present. There was no Est information available from GBrowse assembly V0.5 and transcriptome intensity scores appeared to be strong (5-42) and had a bell shape curve distribution.  This is an un-annotated gene so no additional comments were available from Baylor annotations (comments). 	Incomplete gene model: expected N-terminal parts are absent\n
GLEAN3_21194	SPU_021194		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(8-18) has 91% identity to another Sp-Tlr gene (00615). This gene model is located at the end of a scaffold. \n
GLEAN3_07282	SPU_007282		Similar.  Portions of this sequence are nearly identical to Tyrosine-protein phosphatase, non-receptor type 23.  \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537628-5867-112430593701.BLASTQ4\n
GLEAN3_21299	SPU_021299		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(18-23) has 89% identity to another Sp-Tlr gene (14548). This gene model is located at the end of a contig, which could make it incomplete. \n
GLEAN3_18583	SPU_018583		It's missing 5' and 3' end of the gene.  It overlaps with Glean3_07337.\n
GLEAN3_16411	SPU_016411	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data it appears that both sequences have an orderly arrangement without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with half of the values being less than 5 and the other half greater than 5.	Blasts to PTPRM, but phylogenetic analysis showed that it was does not clade with the PTPR K/M/T/U group.  Renamed PTPRorph1. Partial sequence. \n
GLEAN3_21421	SPU_021421		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(13-23) have 87% identity to another Sp-Tlr gene (16468). This gene model is located at the end of a contig that is far from next contig. \n
GLEAN3_21425	SPU_021425		The nucleotids of the first exon + the following 200bp have 87% identity to another Sp-Tlr gene. The first exon is located at the end of a contig and far from the second one(76000bp).  The second exon and below could be a wrong prediction. \n
GLEAN3_23505	SPU_023505		#\nSplign sequences support gene model.\n
GLEAN3_00663	SPU_000663		This is clearly a partial sequence of a DUSP.  It has been tentatively identified as DUSP24.  Duplicate of GLEAN3_18623.   \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137535380-30970-10768348136.BLASTQ4\n
GLEAN3_21914	SPU_021914		Partial Toll-like receptor. The nucleotids SP, NT, LRR(14-23), CT, TM have 86% identity to another Sp-Tlr gene(00615). Unknown sequence (NNN) in the 3'end of the coding could make this model incomplete. \n
GLEAN3_09937	SPU_009937		Similar to Intestinal acid phosphatase PHO-1.  Partial sequence.\n
GLEAN3_23031	SPU_023031		Patial Toll-like receptor. The nucleotides of the fist exon + the following 683bp in intron have 90% identity to another Sp-Tlr gene (21162). This first exon is located at the end of a contig. and far from the second one (13500bp).  \n
GLEAN3_18623	SPU_018623		#\nDuplicated gene.  See GLEAN3_00663.\n
GLEAN3_17995	SPU_017995		Similar to amPTPN3 and to Gallus gallus protein tyrosine phosphatase, non-receptor type 1.\n
GLEAN3_16144	SPU_016144		The sequences for GLEAN3_16143 were added in front of 16144.  They appear to be part of the same gene.  Also, there is an RVT domain in this gene that probably should not be there.  May be a sequencing error. \nMultiple duplications.  See also GLEAN3_08253, GLEAN3_16053, GLEAN3_19852, GLEAN3_20604, GLEAN3_22839, GLEAN3_24537, and GLEAN3_27101.  Blasts to PTPRA but part of a novel clade in phylogenetic analysis.\n
GLEAN3_23193	SPU_023193		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(10-21), CT have 87% identity to another Sp-Tlr gene (28576). There seems to be an assembly error in the contig of this model, which may make it incomplete.   \n
GLEAN3_16053	SPU_016053		Blasts to PTPRA, but in phylogenetic analysis it forms part of a novel clade with PTPRLec1, PTPRLec2, PTPRLec3, PTPRLec5, PTPRLec6, PTPRFn1, and PTPRFn2.  \n
GLEAN3_22839	SPU_022839		Blasts to PTPRA, but forms part of a novel clade in phlyogenetic analysis.  See also PTPRLec1, PTPRLec3-6, PTPRFn1, and PTPRFn2.\n
GLEAN3_23934	SPU_023934		Partial Toll-like receptor. The nucleotides of the first exon have 90% identity to another Sp-Tlr gene (09435). The first exon is located at the end of a contig and far from the 2nd - 4th exons.  The 2nd - 4th exons were eliminated. \n
GLEAN3_23936	SPU_023936		#\nThis gene model doesn't have a TIR domain. But the nucleotides encoding SP, LRR(9-19) have 87% identity to another Sp-Tlr gene (14352). This gene model is located at the end of a contig, which could make it incomplete. \n
GLEAN3_27101	SPU_027101		Multiple duplications.  See also GLEAN3_08253, GLEAN3_16053, GLEAN3_16144, GLEAN3_19852, GLEAN3_20604, GLEAN3_22839, and GLEAN3_24537.\n
GLEAN3_22506	SPU_022506		Missing N-terminus.  \n
GLEAN3_00971	SPU_000971	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 4 different scaffolds. However, the data appears to continuous if scaffold >v2.1_scaffold29491 was combined with scaffold >v2.1_scaffold79426. There was no Est information available from GBrowse V0.5 and the transcriptome intensity scores appeared to be strong with values ranging from 5-33. There was no annotated information available through the SpBase website regarding this gene. 	Missing C-terminus.  See GLEAN3_22506.  \n
GLEAN3_01698	SPU_001698		Missing C- and N-terminus, but highly hit to GABA transporter.  For N-terminus, this prediction should be combined with GLEAN3_01697.  See GLEAN3_06561. \n
GLEAN3_00076	SPU_000076		Missing N-terminus.  See GLEAN3_06561, _01698.  \n
GLEAN3_25257	SPU_025257	After reviewing the data and performing a BLAST search, it appears that this is the best results based on the overall coverage, bit score, e-value, and orderly arrangement of the sequence. However, when examining the excel data and the BLAST results, the entire sequence appears to be repeated within the same scaffold results.	This gene model doesn't have a TIR domain. The nucleotides encoding SP, NT, LRR(15-23), CT have 88% identity to another Sp-Tlr gene (21420). This model is located at the end of a contig, which could make it incompelte.  \n
GLEAN3_25613	SPU_025613	From the BLAST results as well as the excel data, it is evident that this is the best fit for this particular GLEAN model. When reviewing the excel data in comparison with the BLAST results it was clear that this scaffold contained an orderly arrangement without any gaps present. However, there was two base pair duplicates within the scaffold (from 1-321 and from 322-1746) that may be apart of the sequence. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 6.	#\nThis gene model doesn't have a TIR domain.  The nucleotides encoding SP, NT, LRR(9-19), CT has 94% identity to another Sp-Tlr gene(07850).  The model is located at the end of a contig, which could make it incomplete. \n
GLEAN3_14977	SPU_014977		Missing N-terminus.  See GLEAN3_03832. \n
Sp-TlrP34	SPU_030089		The nucleotides of the first and second exons in GLEAN3_26438 and the intron between them have 88% identity to another Sp-Tlr gene (GLEAN3_15066). The second exon is located at the end of a contig that is far from the third one. The third to 10th exons could belong to Sp-ABCH1. \n
GLEAN3_27445	SPU_027445		#\nThis gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(15-23), CT, TM have 88% identity to another Sp-Tlr gene (23035). This gene model is located at the end of a contig. \n
GLEAN3_28380	SPU_028380		This gene model doesn't have a TIR domain. But the nucleotides SP, NT, LRR(8-15) have 88% identity to another Sp-Tlr gene (14548). This gene model is located at the end of a scaffold, which could make it incomplete. \n
GLEAN3_07205	SPU_007205		#\nMissing N-terminus.  See GLEAN3_03832, _14977. \n
GLEAN3_09356	SPU_009356		Missing N-terminus.  See GLEAN3_03832, _14977, _07205.  \n
GLEAN3_08246	SPU_008246		Missing N-terminus.  See GLEAN3_03832, _14977, _07205, _09356.  \n
GLEAN3_08617	SPU_008617		Full length.  See GLEAN3_03832, _14977, _07205, _09356, _08246.  \n
GLEAN3_16011	SPU_016011		Missing N-terminus.  See GLEAN3_03832, _14977, _07205, _09356, _08246, _08617. \n
GLEAN3_27712	SPU_027712		highly homologous to glean3_27713, located on the same scaffold\n
GLEAN3_27713	SPU_027713		higly homologous to glean3_27712\n
Sp-PLN	SPU_030090		Absence of a complete cDNA prevents further identification of the gene, but it probably extends to another scaffold based on the protein's size and the position of the exisiting cDNA on the end of the scaffold.   \nNOTE:  \n- exon 8 is missing 9 bp after 2555.   \n- exon 4 probably falls in the poly-N region \n
GLEAN3_22727	SPU_022727		Haplotype of Glean3_05376.\n
GLEAN3_18472	SPU_018472		Has similarity to hatching enzyme. Tiling data supports glean3 gene model.\n
GLEAN3_05663	SPU_005663		Similar to Protein phosphatase 1 regulatory subunit 12B. Partial sequence. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537271-5077-208204720659.BLASTQ4\n
GLEAN3_05182	SPU_005182		Sp-064 has many exons that are found on several Glean3 models; 05182, 12439, 18503, 17239.   \nThere are additional Glean3 models that overlap and may represent the other allele.  These are:  \n08381 overlaps with 05182;  \n10678 also overlaps with 05182 but in a different region than 08381;  \n18503 overlaps with 12439;  \n10474 overlaps with 18503; \n07883 overlaps with 17239. \n \nGLEAN3-05182 has exons 1 through 14 of 23.\n
GLEAN3_13607	SPU_013607		Similar to Sp-R-PTP-delta.  Partial sequence.  May be a portion of a duplicate gene.  Another Sp-R-PTP-delta, GLEAN3_00831, is not on the same scaffold.  GLEAN3_00831 is probably a duplicate.\n
GLEAN3_12439	SPU_012439		This scaffold has exons 16 through 18.  Exon 15 is missing from the assembly. \nOther Glean models overlap this region and may be the other allele.  GLEAN3_12439 overlaps with 18503.\n
GLEAN3_25759	SPU_025759		Similar to R-PTP-delta. Partial sequence. See also GLEAN3_00831 and GLEAN3_13607. See structure of this gene at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137533949-23252-71761336427.BLASTQ4\n
GLEAN3_18503	SPU_018503		GLEAN3-18503 includes exons 19-22 and part of exon 23 of this gene, which is also found on GLEAN3-05182, GLEAN3-12439 and GLEAN3_17239.  However, it does not include the 3' end of the gene which is located on GLEAN3-17249. \nGlEAN3_18503 also overlaps with GLEAN3_10474.\n
GLEAN3_17239	SPU_017239		Glean3-17239 includes the 3' end of this large gene.  It overlaps with Glean3-07883\n
GLEAN3_10823	SPU_010823		Similar to R-PTP-mu. See also GLEAN3_22405.\n
GLEAN3_22686	SPU_022686		Similar to R-PTP-mu.  Partial sequence. See also GLEAN3_06528, GLEAN3_16411, GLEAN3_18743,  and GLEAN3_26582. \n
GLEAN3_16669	SPU_016669		Similar to Dual specificity protein phosphatase 3. Partial sequence.\n
GLEAN3_28174	SPU_028174		no domains were detected\n
GLEAN3_18743	SPU_018743		Similar to R-PTP-mu. Partial sequence.  See also GLEAN3_06528, GLEAN3_16411, GLEAN3_26582, and GLEAN3_22686.\n
GLEAN3_06528	SPU_006528		Similar to R-PTP-mu. Partial sequence. See also GLEAN3_16411, GLEAN3_22686, GLEAN3_26582, and GLEAN3_18743. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537418-19107-58243529506.BLASTQ1\n
GLEAN3_23162	SPU_023162		Blasts to PTPRT, but in phylogenetic analysis it forms part of a novel clade that also includes PTPRLec1, PTPRLec2, PTPRLec3, PTPRLec4, PTPRLec6, PTPRFn1, and PTPRFn2.  \n
GLEAN3_01207	SPU_001207		Incomplete prediction/assembly problem. First 305 AA from the 544 AA protein are present. \n
GLEAN3_01597	SPU_001597		A description of an homologue of this gene appears in: \nEmery,P., So,W.V., Kaneko,M., Hall,J.C. and Rosbash,M. Cell 95 (5), 669-679 (1998) \nCRY, a Drosophila clock and light-regulated cryptochrome, is a \nmajor contributor to circadian rhythm resetting and \nphotosensitivity\n
GLEAN3_11174	SPU_011174		Matches_GLEAN3_08752. The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: FRELDSLYEGISSITSSRRRVEVTTFLLDQLESNDRSNGQVMVAQQPTNPTNNHVNNNMNSGNLEQPMHHDSESDEGFEEMDTGVEGAVGQSHSVSPTPSDE\n
GLEAN3_08752	SPU_008752		Matches_GLEAN3_11174.\n
GLEAN3_17642	SPU_017642		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: STYIGPWCAGFGDLRARGSSGGSRPYTCTCTYTRILAIWGTGMIRDPVMKNKKFTVWFPKIGVHLIHRDLRYCAFVINLLICSSLATCFLSDGCQCLFKLL,DRFLSLFAHLECVSETLIGILSVCLLSSSQTWKWKSFIPVNIFLHLSCLQERCAANCGICFCYLQYLRMVVENIIKTFRD\n
GLEAN3_06683	SPU_006683	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data and comparing it to the BLAST results, it is evident that both scaffolds have an orderly arrangement about them. If the 2 scaffolds were combined, the sequence would have an orderly continuous arrangement without any gaps or internal repeats present. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be weak with all of the values being >10.	Model_must_be_split_in_2. Transcriptome data indicates that Glean may have falsely predicted the following exons: 3,14.\n
GLEAN3_23867	SPU_023867		Exon 2 is alternatively spliced in Paracentrotus lividus, Arbacia punctulata, and Sphaerechinus granulavis.\n
GLEAN3_09262	SPU_009262		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: ASLSPTPHVHQRYPQGLGHLQSSPASLAYQTASRSQETCWRGVYQMVKLHQGQYSSVTVSCDLNSFGVCRRVLFQRLPSAGSASLVSSGCSCPVLGVFRIPLMRDIYQVTLASSLQTLYYL,QFQGTLCLLFWKVFLSFASLAFLILHQYLVYLLFLVYLFFVSPLLLLDFVCGSHWCLSLLNEFLLVWCVLHDYSQGFVLSEG,NGFYQQILRWLNSLYPSYTKDTSSRFLDREENGGLFLWLRFNDRFSLYPDLLHDLHDFLHGRLADDIRHKQVFLQHLTFTKGIHKV,TVFIPHTRRIHPQDFLIEKKMEVFFCGCGSTIDSLCTPTSFTIFMTSFMAVWLTISVISKSFSNTSRSPKVSTRSRASSKLTRFFGLPNSKSLSGNMLARGLSNGETSSRTVLICDSFLRSKLFWRLSASSFPTFTFSGLRFFGFFRMLLPCAWCVPDSFDERYLSSDTCFLTSDTVLSVTNSLVGIGLVTEVSSGALPNS,SRLLVESMPASLNVLSADGSSTSSILDSVCVCLILASFSGDWLLCALLSITISGDPVSTVLEGLPLFRFFGFSDPASVSGLPAFLGLPLFRFSSPSAGLCVWLTLVSFPTE,PMLISFSAGWCVTTASSKPTRDSVLNSLGGLPLFCFSSSCFISKMHALDLSFKAPLLSSVNLSNLISTVSILDLSIVDVLATTLGDKQDPGTIPLLGLPCFCLSPFPDPELDIFGWYGLNSLKGLPTFPVLFFSDSTSDSNCSVRASLPVD,FSFRGLPLFLFSLLSVPESIPDRLTSSSLSVDWLSSTLACKLSTDESDIDTFRGLPLFLFSELESTSESVPDFPNPAILSNDWPRLTLDPLLGLPLFLFCGLSAEVSTPS,SSIHTLPLHCANKHGFIPSPPTLSSSGLPPPPPSPPPPRPLLGSHKHAASPTTLAATTAATAATTAAESGPATGAVATVACRHAPPGSFHSSSH,TISSPCYVSPDVHKYRYTSACCLGLFCCASAIQIEIKILCWVIQNLYFLLLQYQGFYLSQQTKLRFSICVCVYLFMNYLCTCTMQPLIPCHCSLAPGFTFIWRV\n
GLEAN3_23730	SPU_023730		Transcriptome data indicates that Glean may have falsely predicted the following exons: 5,8.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: AEPKLCRGTPTHSITVQHSIIWSMLTKPCASMHTFSRSNQGEKSKHFFIDIWTGSVKHVIFESCANSIDLSPSAYAIGDCLCRSRYAQTPMVDFIRTLKLVQ,QSSFDSGIILVTSLHSICHTNPLFPHHQSSPCRNLTQTKILHSPLHTTNLPHAETSRKPRYSIPLSHHPSSPCRNLTQTKILHPPLHRHTDYKSTDDLQHLSYSEQCNFLLGLLLD,HLYTPSATPTPSSHTTNLPHAETSRKPRYSTPPFTPPIFPMQKPHENQDTPSPFHTTHLPHAETSRKPRYSTPLYTATLTTSRPMIYNILAIRSSVTFCSDFCWIDIDLTGVTGVLWMRMGNIIFQQIIIKRGLL,FRFGHHPRDIFTLHLPHQPPLPTPPIFPMQKPHANQDTPLPPSHHQSSPCRNLTKTKILHPPFTPPIFPMQKPHANQDTPPPFTPPH,VERSSNGIKPRATSLRPPLEYIITRLLQEQKPDRGKLDRETRTLRKRLVTLRASLVIPSAPPAQKAQKHGCSPDLIMLSLTVISSSLHPSRAAVHHSNPPPPLFHPIPSLHFILPLFLVPLRFRV,PGNKDPEETFSDITRFTCNSLCSTRPEGAKTRMQSRSHHAIPYGDIIVVASFQSSRASFQPPPPSLPSHSIPPLHPSSVSCPSSLPCR,LNHVLNRIKTMKEQDTLRHHSLHLAFREPYPTIPVSQRRQLHDNGTEISLTLSNMQRPKHLLHSPPLLLSHLNYHPKCSIHKKTQNQIESSTLIRSCPLFSITLVLYP\n
GLEAN3_08177	SPU_008177		Transcriptome data indicates that Glean may have falsely predicted the following exons: 16.\n
GLEAN3_20637	SPU_020637		Matches_GLEAN3_26099.\n
GLEAN3_18126	SPU_018126		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: TSSGGMFHLNTPLKLGKALHQWGGIVFIFTSILWCIKIIRPIFFQAIQPTGDKAIYFSGSSDRDKRAHLMIVLPVNSRGRPLM,VQAGQSQSRSNPIEISKVSTMKLPFNLFLNSHTPFLKPCFQSACCLLIVETIRHSFLTPYSILVHVSCTNVERTLNPIIASTCSHTRLR,NRHTPTITHTSQTGCHPHVPQSSTGFNLNDKSASSTRCRFFAHIHRLIKTYGSENVDIEIRHVYSLSWQHMGASMRFVLVCSGSSGGVHKQCNVYRNRLPHRV,KPIHPRWFCSFGTGVALGMIGRFEKGQNVRLIRREKVWHNHDNGNSEPRITLRPQLSKFIWRTPSRTLLTSVSAFPLIRGIKPGGFIRQNEYLPVYANTDIERACA\n
GLEAN3_14539	SPU_014539		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: PYPLRPIAHTPLSITIRRSLPLPCSFPFPFPSLLPLYLACPLSSLLPSLFLLFSLSFPCLLSLLPLSTSPLSILCSLLPLYFTSFST,PTPRSPSRSVDHSLSLALSPFPSPLSSLSILHVPFPLFFPLFSFSSPSLSHVYCLSSPSPPRPFLSSAPSSLFTLLHFLLDIIAVYAFPNTNSLPLLN,LPPSSHSPHPALHHDPSITPSPLLFPLSLPLSPPSLSCMSPFLSSSLSFPSLLPLFPMSIVSPPPLHLAPFYPLLPPPSLLYFIFYLILLQCMLSQTPTLCLFLIEF\n
GLEAN3_16343	SPU_016343		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: SHTNPSFFQSLYLPKEWYKSLLTYSVCFPRNSSEDTKIAIKSRDKNPFPFLLYDLLFMITVHFPRGLLLHSDKSPSLSQDV\n
GLEAN3_02677	SPU_002677		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LHASPPCSPSPYLHLFFPPSLSIPSTMIAPRITSLLSVSLSPSFLPSIALHPFRCDHLAPRIPSLLSVSLSPSFLPSIALHPFRHDHL,SLYFSFSLSFRLPISPSILPSVVLHPFRHDHVAPRIPSLLSVSLSPSFLSSIALHPFHHDSSTHHLLAFRLPISIFSSLHRSPSLPL\n
GLEAN3_02603	SPU_002603		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RARSEKRSTSLSKLLIDLPINCQKYARYCSEIRAAVSLFKQNSSNCLQCANLLRTILVQYTSHCYCYFIHLLYLVTSSRLDTSFVCLTSNFEETVNCHFYILYNEQSGNNRIRIR\n
GLEAN3_05435	SPU_005435		Matches_GLEAN3_23739. The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: CMSFSLLVLYLFFCISLSISPSLPLHYASLSSFNLCLSVSVSLSLSLSLSLFPFLSLSLPFSIMVMSLSIHLPSLIFPLPLSFYVHLGLFLLLCLSFPIF\n
GLEAN3_23739	SPU_023739		Matches_GLEAN3_05435. The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: VIITTTIIIIIISNLTTPPLSLTMTLEIPFKRGWDLSRRGCSSLCLVNTDKLNNVYTVCVVCHPFSTHRINGTFPCMCGHLQGILRVRTTISFSPCHL\n
GLEAN3_04598	SPU_004598		Matches_GLEAN3_06159.\n
GLEAN3_06159	SPU_006159		Matches GLEAN3_04598. Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,4.\n
GLEAN3_14461	SPU_014461		Matches GLEAN3_24163. Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_24163	SPU_024163		Matches_GLEAN3_14461.\n
GLEAN3_25486	SPU_025486		Matches_GLEAN3_25486. Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,3.\n
GLEAN3_23868	SPU_023868		Matches_GLEAN3_23868.\n
GLEAN3_20346	SPU_020346		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RASLLYPTFPDSLCLSLSLSLSLSVCVCVCVCVCVCVCLPPSLSLSLSLPALLAFSIFPSPHIFFTLSFLDSILHLGQPVFLSSLSLSLSLYMYLPTLSFSNPPSFCYVFLNYLFAYSLSKFSFFFLNL,LISSPLFQYCHVFICHNTISHPLNAPLYSTLLSLILSVSLCLSLSLCPSVCVCVCVCVCVCVSPPLSLSLFLSQLSWPSPSSPLPIFSLP\n
GLEAN3_14177	SPU_014177		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_11315	SPU_011315		Matches_GLEAN3_11315. The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LHVCRLLQAICLSSSLFPLTLSSVSSSLKCSHFPIYSLSSSSPFSLSVSLPPSLSLSPSPFHSPPLIILCLSGDASSLALSLPSMPAHFTAVC,KCIEGDYMYVVCFRQSVCLLPSFPSPSPVSLHLSNAPISPSTPSLLRLRSLYLSLSLLPSLSLHLPFILPRSLFSVSLAMLLRLLFPSRPCRPTSQLFA,ILRDENVLRVITCMSFASGNLSVFFPLSPHPLQCLFISQMLPFPHLLPLFFVSVLFICLSPSFPLSLSISLSFSPAHYSLSLWRCFFACSFPPVHAGPLHSCLP,HQHSLPVAPCKTPCRSRETIPIIFFHFKTLMQPPQMTNIVQYFIIKTRARGLLCTFEAALPNPPPLLLSLSLLHHPQTIQSTHPRPKQMPGKFTNDHTSGGSRIL\n
GLEAN3_17983	SPU_017983		Matches_GLEAN3_17983.\n
GLEAN3_18954	SPU_018954		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LDDFKRINKPNGVDDNLQKKYQVLERHVTSLRQYGQKMNQRLEMLETSNNGVCVWKIANYNEKKKDAMKTNVKSICSPPFYTSQYGYKLCGRVFLMGDGVGKGTYISLFLTIMKGSFDAVLPWPFKERITFQLVNQDDSINKSIVEAFRPDPASSSFKKPTTEKNIGAGCPLFAKIQIIEDPKSGFIRDNTMYLKIICQTSDVPEIK\n
GLEAN3_06753	SPU_006753		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: XXXXXRLHLSGVWCSLSDNAAGSETLDFMCLVFSVKPLDLEVFYLHYLKFGVRQGYWIWSTDLTAIIWCEVPANVTSFGALHLT\n
GLEAN3_13305	SPU_013305		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: ALDKGNTWQFPELHPNWNLNATSNLMKIFSHIFIRSTFLSLLILSVPVPMITSITTIPKIFNLVVANYHVFTKNTSSYQLKI,ISLSGGRGHLCDCLGGQSSSSIEAAYCYSHGRVGDGVTSPSTSSSLTKTTSTKLSPHSSLSSSSGCYIIPNLMGHVIYLIIFLWDSPCNYFI\n
GLEAN3_08936	SPU_008936		Matches_GLEAN3_08936.\n
GLEAN3_27487	SPU_027487		Matches_GLEAN3_08936.\n
GLEAN3_16650	SPU_016650		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: GMSKVIWYSLLQSDLSAEEIQAWLDRKVHGTVLESIHTCRDTDPAMQDNNYARYERSDKALKPLIGQVMCTRPSISCATCRSSIVNFPSPGASEDQLEQLGLREQVSMVYMIEWMCCTIF,SYYVDKHDQRERTEPSPFSFRLPSVALFSHLTFHPLFCFQFDIKGKCPHSRATPLMLLICPCTKASVNKTSDIGRRRPLTGSPLFSV\n
GLEAN3_06150	SPU_006150		Transcriptome data indicates that Glean may have falsely predicted the following exons: 4,7.\n
GLEAN3_15456	SPU_015456		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RSWTEAMTERIRDVHILREGERERGGDREKERERRRERERGDRGLEEMRKKAYAWWEKQQCSIHRRVQSSMRMYPGGMTGCYARPQDHRKQKRKTTTTTT\n
GLEAN3_00485	SPU_000485		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: SLSVSLYIYIFPLFFPFFDTTLSPSLPPFFYFSTTQDLYMSPSIFYFYFSSLFHLYLSLSLSLSLFISLSPSLSLSHSFLSSYSVLSSLSVSSFSFFLSPTENA,LSLSLSIYISFPYSFPFLTPLFLHPFPPFSIFPPHKTSTCLPLFSISISLRSSISISLSLSLSLSLSLSLPLSLFLTPSSLHIPSFLLSPSPLSLSFFLQLRTR,HHSFSIPSPLFLFFHHTRPLHVSLYFLFLFLFALPSLSLSLSLSLSLYLSLSLSLSFSLLPLFIFRPFFSLRLLFLFLSFSN\n
GLEAN3_02592	SPU_002592		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: IFATKRPYIVVAVQPVIKCELLYILFLPMSPLHAPSLPFFLDNIIWSSSFLCLSISLKYPPDTNATFSNIITIITQNRLLLSPSISPILPFVYIK,VAGSKKKTLVSFISCPKPESGRACTPLVPLWPVFTRCKPPKRGQTLLDIEWMRTYLIRLPPSRLLCATLEIFSQTYESNS,SLLSLSLTPLTSLSYLPPFPFLPSLSFSYSSSLSSSSSFPLLLLLFVFLFVFFPFPFSRLLSLFLSFSVSPSSWVLAGGSRGHCPQISCPCRCPSFGHIVMWLLMCLSSTMTQQ,HFSLFLLPLSLHYPIFLPSLFSPLSLSLILLLYLLLLLFLFFFFFLSSSSCSSLSPSLVSSPCFFHSQSHPLPGSLQEDLEGIAPRFRAHADVLRLVTLSCGC,FVNRRTPLTQNLSLRLLSQPYHLPPPSLSLTPFLFQFNHTSTRIHCLIGLFSDKRGNKSTKLMSRPDDDTISKTNMSERINYQNIMERGCKSNSSCDGNGCIGIDG\n
GLEAN3_16685	SPU_016685		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: PLVTYAYLPDCLSGSSRGQVRKARRSGACFVEFQFHFCLPSVSLPLSPSLSLSLSLPLSPSLHLFLHDSFHPSLTLFSPLQ,LPTLTFLTVYRGAPGVRYEKLVARGPVLSSSNFIFAFPPSLSHSLPLSLSPSLSLSPLLSISFFMTLFIPRLLFFPPCNENGLYVSHLSVV\n
GLEAN3_13843	SPU_013843		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,4.\n
GLEAN3_13689	SPU_013689		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1,2.\n
GLEAN3_27446	SPU_027446		Matches_GLEAN3_27446.\n
GLEAN3_13178	SPU_013178		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: MEQVDPPKDDATSSESLGEQREKTVPDVNSAGDQCSKESDGNEGKEETAKIPNSKEEDPALPSTSTGEEGMTADSSSHDDPEGNADEKMEESKDTDDKIEERQGTDDKGAKQVDGDDQLEEGEDRNNEHPGREPRDAEFTSEIFKIMLRNLPTRFGFQVGV\n
GLEAN3_18392	SPU_018392	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. If the two scaffolds were to be combined, the overall sequence would have an orderly arrangement without any gaps or repeats present.  There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the values either being 5 or less. 	The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LFSAGKLDKSGSPRHIFDDGIAAKRPRITLPPAPRLRSLPYNTPPVAPLRSEAHRREVAPQTQPSFHPRHGQAVTSPNDIEDQRQVLVSDHAQRPARSHLVQSHHILQRNHLQRQQQHHHHHLLPQQHSLVSLLREPVVTTSPAFERLGIGPRAVTGNEAGSASGMPQTRASPVCDSCTDGAGCWKEMTGIGCKLETKELWDRFHELGTEMII\n
GLEAN3_00129	SPU_000129		Matches_GLEAN3_00129. Transcriptome data indicates that Glean may have falsely predicted the following exons: 4,9,12.\n
GLEAN3_22971	SPU_022971		Matches_GLEAN3_22971.\n
GLEAN3_15425	SPU_015425		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: FKSFQKTTNGGMSVSFSPNLSSNLEKNTLQNFDLNCHDKTVFYDIGHRSRLMQDMCTYKTFMQKKISQNNIQEHYYLFEEKQNGRRNASNPCSTVSISILNIRILIP,HDLNLSKRQQTGECLFHFLLILAAIWKRIRSKTLILTAMTKLYSMILDTEAGSCKTCVLIKPSCKRRYHRIIYRSITIYLKKSRMEGEMHPILAALCQFPF,LGVTLRDRRRNKEIRKELKVGNILELARDMRLRWFGQSEWADEGKPAKDRMTRAVEGSRGRGRPETCWKEGYLKKELNLTAAQTGNRREWRLRIRPTNPC\n
GLEAN3_11080	SPU_011080		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_05022	SPU_005022		This is the bHLH domain of Sp-ahr.  The C-terminal sequence is  either in glean3_13788 (more complete) or glean3_12296 (one PAS domain only).  \n
GLEAN3_25737	SPU_025737		Matches_GLEAN3_25737 The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: AIFFNSLESLTREALPSSPTLESVLPYPESSVNLPNAPPACSILVRGVLSEEFSSTLGPVLLGVLSSVGRGRINEVMSEPGGALSLFAGTSWPVLG,RILLFTVAFCLSHIFQQLGVPHKRSPAIIPHTGIGASIPRVQCKSSKCTTGLFHSRKGGAVRGVQLYFRTSALGCIVFSRAGENQ\n
GLEAN3_25589	SPU_025589		Matches_GLEAN3_25589 The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: ITSFMRQGPDLLLSSIRDLLTDNGHNILETIPLYLIQFSLVKRRRKELSLRQGTIRRTHFFEFVSEKAFSVIRKQIFFIFPQQ,IYSIYFTGGGELSNQLMFPTSMLSGSTSKFTSPNIFSKLEFSIESFPESLPSTYLLGVIIRDLRELRIPPDVLGMFLFTGDNMWCLGVLECPTRLL\n
GLEAN3_11348	SPU_011348		Matches_GLEAN3_11348. Transcriptome data indicates that Glean may have falsely predicted the following exons: 5.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: TGGEPRQNIWTLIRSRRYGTTSSRSYDRLYMENNGLNWWRTPVESPDINPIEKVWNDLKRFLRQIVHGKQWTQLVANPSRISRH,TGGEPQQNLQISIRSRRYGTTLRGSYERSYMENNGLNWWRTPAESPDINPIEKVWNDLKIDIYIYIIIFAFFFILYTREWKTATKPEHLIGIEVF,CWKRCSLFASGSICNFEGHCRRSLTKNVKYRSIETETAVDHGVLNLEQEKSSKFLIIKYRLEGTRMKSLALLSRIVESSIARFIVESFQGPGSSP,VSNTFLFCQQVPHIFDTSGDKVKVRTEVYSHDALKMWWAALIACGIYIHAHLTMLCQRLINKYREKFMYGAPTIISIARPG\n
GLEAN3_27598	SPU_027598		Matches_GLEAN3_27598. The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RISRRSELSNVKGSFLLRETTSELATSRVQFRIGFSTIQYSTDGTISRSPAKVMLEALQYICLRKHLQLRRPLQNGHRQRMINTVV\n
GLEAN3_24486	SPU_024486		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,3.\n
GLEAN3_14405	SPU_014405		Transcriptome data indicates that Glean may have falsely predicted the following exons: 8.\n
GLEAN3_17375	SPU_017375		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1,2,3,6,7.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: EELTVEDGKDSHVPPLLIYLNDEYSESVLDMLLLAHLKIEDKIGIYHVLPLHCWTHQFHYHWTGFYLHHSTAISEWEVQLLRYYSLGQGSVVFCLLSLPSRHELDEYYRHHLLY,PPIPYQDGIATKIGAKPTFKSLFLKDPILALKCFFGPAVPASYRLQGPHVWSGARDTIMNVWQNTVSGTKFRDTPIANGPEGYPIALKLIFLVCIVAGLYLAMM\n
GLEAN3_00749	SPU_000749	After reviewing the data and performing a BLAST search it appears that this is the best match for this particular GLEAN model. When comparing the excel data to the BLAST results it is apparent that the sequences do not correspond. The excel data indicates that the sequence is distributed onto 2 different scaffolds that have a continuous and orderly arrangement until about 528. However, there are several internal repeats present within this scaffold. The BLAST results display the same results, but the sequence terminates at 342 instead of 528. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 10.	ligand binding domain is found in glean3_11061\n
GLEAN3_27623	SPU_027623		Matches_GLEAN3_27623.\n
GLEAN3_19444	SPU_019444		Matches_GLEAN3_19444.\n
GLEAN3_28093	SPU_028093		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: KPRWTYKQVACQNITAAKARQKWHKQSSKIGCSGGVSRFSVCTLPILMSVILFINESNLKLGSKLRIRGVVPYFVPLNNSLAIIEAKNRITWAKFLNRHKHVL\n
GLEAN3_28148	SPU_028148		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: SYLSLHPYLFVCLPVFLSFCLSGFCRPLHILTPPVYICLLVCLSVCLSVWLSLCLRLSVCSCLYISLSLSVCPCMPVCLSVCVCMRVWVGGGVRVSLSFSILATPQPSSNYPALSPSLPLSQSLSHSFSLSSSLSF,LILSFSPSIPLCLSACLSVFLSLWLLPTTPHLNSPCLYLFVSMSVSLSVCMVVSLSPPLCLFLSVYISFSVCLSVYACLSVRVRMYACVGGWGRTCVSLFFNLSDPSTILKLSSSFPFSPPLPISLPFFLTLFISLILTLMSLFSLWAQLHSDPPS,ERQIKREKYRDKVCVSVLPVCVCVRVRERERERERGRERGGGAGKREKGVNERDRERERNTEIKCALVCCLCVRVCVCVCVCM\n
GLEAN3_03704	SPU_003704		see glean3_09520 for information about correct glean model assembly. \n \nnote missing sequence: \nITPKCGVPNVFPSPLRLGE \n \nalso there is either alternate splicing, or a extraneous exon.\n
GLEAN3_11576	SPU_011576		Transcriptome data indicates that Glean may have falsely predicted the following exons: 5.\n
GLEAN3_01519	SPU_001519		Matches_GLEAN3_01519. Transcriptome data indicates that Glean may have falsely predicted the following exons: 5.\n
GLEAN3_03920	SPU_003920		Matches_GLEAN3_03920. Transcriptome data indicates that Glean may have falsely predicted the following exons: 4.\n
GLEAN3_27215	SPU_027215		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: VKWVCACYERYYFVLSCRFMVLNIITTDGGVVQITIYGGGGGVKSRLETHLCLHVKHSARYKRPSHSSAITRNGNVLSHTPVTLTILAPDRHQTDTRPTPDRHYPEQ\n
GLEAN3_12491	SPU_012491		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: NSLLSCREQVSRKLAKLYFLHLGIRGGPISLIMITLASKIGLLKFTWKKVFDTGIANPLALMSPRHRSSQIQYFSTSTAGNRFKLSLSVKWDRPRILRNQSSSSIILRTFFQFIKNMMVSLIRYHFLMFLI,REREWWRMRYMQGGEDERGRDRWLDREKGRKRDRERERERRGYKWVQKIDHGLFEIDLYHHLSIPLFLPLHTIISCQKSKHSNACLTKYSLSLFKP,TTEPQTVFVCHVCSITLHLYLPLSLSLSITLYFSFSCSFYYYFSPFFHSHYFSSNALLPPHFRSLALSLSTNHLVSFSRSLLSLLFIFSYSPLRSLYGYDTCMWQIYRGS\n
GLEAN3_13047	SPU_013047		Transcriptome data indicates that Glean may have falsely predicted the following exons: 4.\n
GLEAN3_11297	SPU_011297		Transcriptome data indicates that Glean may have falsely predicted the following exons: 3.\n
GLEAN3_27334	SPU_027334		Transcriptome data indicates that Glean may have falsely predicted the following exons: 3,4,6.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: YCCPKPFRLVTATLRTMVKLTNEAMDIFSKTLSFLNIEPKMINATRPVGRPFSLGKEDVSVFHAVRHDHHHHIITMSDHHLMLIPDMNDEVTDAH\n
GLEAN3_18351	SPU_018351		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: TQPCFKLNVFENRDARVKKCRLNAPLAKHPDKTKPSCPFLHLRPGSKLLCLQICAKRLGVGNSEKLRNFEACFIFFIKLLPIWFPQLILFCTQGLSYQDCYFPPIFIYFRFGIFNQGNRRSVIF,VSKSMVFTNTHLFLPFLISVGEALKVRFLLIKFAYSTSTLSLYIRSAPSGTYRIPVISMCDIILNIQLYANGNDHTFSNIKD,LSLVLLTIFYDTGRRSREFRLRAEPHFRSVFVYSSILFPYFQKLLNSKYSKINCFHGNLERNFPMPLNDCMRRIAATQCDL\n
GLEAN3_07981	SPU_007981		Transcriptome data indicates that Glean may have falsely predicted the following exons: 5.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: PSISKSPFLYQSSIIPPPFLHHSSTIPPPFLHHSSTGRLSFLHHSSTIPPPFLHHSSTSLLPFLHQSSSIPPPFLHHSSTSCLQFLHHSSTSPLLFLHQSSTNPPPLFHHPSTTPTLNLYQSFTNHNPKCLPH,FLHHSSTIPPPFLHHSSTIPPPVVYHSSTIRLPFLHHSSTIPPPVFFHSSTSRLPFLRHSSTIPPPVVYNSSTIPPPVLYYSSTNPPPILHHSSTIPPPLPHSISTSHLPITIPSVFPI,KSIPLPILHNSSTIPPPFLHHSSTIPPPFLHRSSIIPPPFVYHSSTIPPPFLHQSSSIPPPVVFHSSAIPPPFLHQLSTIPPPFLHQSSTIPPPILHQSSTTLPPSLHHSHTQSLPVIYQSQSQVSSPY\n
GLEAN3_04414	SPU_004414		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: ASARVLCLYSLCIVRFLILPSDLCQGFPNLQWVHFCRRCHLHVHSMCMMDLSSSCKSFNICSNLQFKPFRSLPQFRLITFGFPVCRSHFKEILNYKFMA\n
GLEAN3_00424	SPU_000424		Matches_GLEAN3_00424.\n
GLEAN3_04844	SPU_004844		Matches_GLEAN3_04844.\n
GLEAN3_17725	SPU_017725		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: KRGIRSASTQLIVHFVRLAFGYLHVKFFYLTCLHVCKMLIEIQQYYGLFNSMLTKEHCTLHVVAYSKGDREQVIFLNGFPCSRLKC,CPSHNMHSSLFQILVANMMSWLYSSQHSTHFNISICFAKFVIFHPGHVFLIRVSGRICSCSIASCKHTNCVFLPIHVYCLGCSHYSYNFFSIIRVGIKLYSPYLIRLYSEFTFF\n
GLEAN3_01998	SPU_001998		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: NTTCAARKLVYANETQNHHLFPIIRIMLFCHIFGIDLHLCGPYTGMHFHCKKYIKNEMIMSSPKINERIFPDVLASLLRSPFEIEEAPVAACVELYSSLINTTRCRRGNSEHNFIFA,LIRHDVEEATVSIISFLLSQPASQGVPHHTNYKYFHFIRASHCPRAFSFWLLAGFARIFGSWGRGFSCCSCDNRGKENMVGKEEEKNRFNIVSKKRVVAESVYHRLPRST,FSNNHHQSFLFLPSLSLSLSLCLSLSFCRLIIVCELFLYQSMASPLQSPPPIVRLPPLGTKLSTGWSCMHAEKTPQFWSP,LFSPFLSFLIITISHSYFYPLSLCHSLSVFLSLSVALSLFVSFSCTSQWLAPFNPLHPLFACLLWAPSCPQVGHACMQRRLHSSGPPKWPCLPCVVLRLPFFGGIDSSPIVSASVFLSPHSSGFGS,MKSIYKCNCSSCFNLGIVSKLESTAPCILPSLQLSGSNANQNKDILGKPFVETQWGCWSNWRRDDWPCDLKCHAGNIPTTSNSTDTDCLNTYVMVLVMNVFGYVCTCAVVFTLR \n
GLEAN3_21210	SPU_021210		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: GEISCFSVHRLVTFLKVLFFRKCYLGSIKGSPHRHDDDRSSYNHQVNSRKYSVNMLMTSSLMHCGSICRKRTATFLCGLSLNLETSVEPLNVLTYMY,DYVHSNRRGCTPVSRVVCCRDIWQCGTKGSLCCAGDRDVYYRTELSRLVRRNALLCTCMICTLTEMAFLKGFDRFQTVSRLHIVCCSKRFCHFYRSTLYS\n
GLEAN3_26962	SPU_026962		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: CVFHSFVYCKYDKLRKGDDRSQDLGSLQCVSYKDVIWTFTTGHKILSSSIDMILLFSCVTSLVYVCHYFMYVFLTPSPVPFVYPFILIETCSVLLVNSFTSEDT\n
GLEAN3_10404	SPU_010404		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_19268	SPU_019268		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2.\n
GLEAN3_18951	SPU_018951		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,3.\n
GLEAN3_14157	SPU_014157		Transcriptome data indicates that Glean may have falsely predicted the following exons: 8.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LHLPILIYSSLIPSSTYPLVHLTFSFSYSFLLSIFPILLFSPRPASHDCVPRQPRFYPKPHLISCIHPVLHVCQESVLINLPTKLDPSFHSRLSTCMTV\n
GLEAN3_14170	SPU_014170		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: QREEESEKEEKKGEGRKEKRKECRIESNLGFPISRVPILRLELKRGRGKERERERERGRGKERDIITCQKPLEESFRKEYNKTTAPGISDKQKLVYFKTSLSISQGALIHFKHVSQTHAQDNHYF,FLSCRGKLMMAEGNACMIYIRNTRVESSMCMVTNKVVWLDTKQKNVCQNIMITRPPGTVMRPTFFFLMKRRKDMNSCRARSKIWRKWVDGDIKQKVCAYVCVNEENVNSATVVWGQDCIK,LSTMLSTQEYQYRIMFRVTKLPRAVLHFFPVVFDVSISWQLPPPYFLLLSFFLTPPSNIFLPLFISPSLYLSICTMTFNGRRLKLHILKSNHTLPRSYQTPELAEGESRSTRD\n
GLEAN3_19129	SPU_019129		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,3,5,7,8.\n
GLEAN3_12122	SPU_012122		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: CNTFNVFIGILQPINELQGEGDKNRSLTWQKGFCVACDLSVCYDIQISDIHSIFAAVFCRQLLAVMLEHGLILYSRYVLQGTFLLSSPCHKERVVEFLSFSKVNFAAESMLVL\n
GLEAN3_01739	SPU_001739	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. The first scaffold (v2.1_scaffold72707) is unique in that the first 952 base pairs are covered continuously. The second scaffold (v2.1_scaffold64493) however has one small gap from 1313-1252 that is apparent within the BLAST results and the excel data. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed across the graph with most of the scores averaging about 5. 	The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RYKHYAENYFTITAKQENVKRKSCNSIRLTVVQCIVIYNYFCKNQRGFLTPLEQEATKDSRKMRHLYNVPCHSIPSLFVAFNSLYFLFPQKPPNSPLLIESQNY,NPICCENLLFFRSHHPIPCGLARYNLKYGRRNKNLISSTGSRNRKREKRNFVLQRQGGVLERNNRLGDAVTWILERMRVKREFIHLSTA\n
GLEAN3_18366	SPU_018366		Transcriptome data indicates that Glean may have falsely predicted the following exons: 3,4.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: KFTRIWSKLLGRTNICGFITNGSAVVGREGILSRSCFSDQCLKIGFVSPSPHPKGGHGVASGGGGGSKHANCWNLRCDMCNCNWCMLMNK,KFTRIWSKLLGRTNICGFITNGSAVVGREGILSRSCFSDQWSEDRVCFALTSSERGSRSGLGGGGGNLNTRIVGICVVTCVTGIFASSHIYMCMCIY\n
GLEAN3_28827	SPU_028827		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: TDNLQVALHKKGLKNVKPFFAVIIDIFSTFERNNISVPSQKGVNSVLEDMSLKCLSGSTYINACNANEKPTCLRESDGEFPNSNQVGSCSNPLEKILVIVNTTLTLEYS\n
GLEAN3_14802	SPU_014802		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LDNHYVILKFGKEMLKVSRIHHQPRAGPPPPPNILFLLFLLLLLLLLLLLLLLLLLLLLPLPWTWELQQQIIFLSFFLFFFREL\n
GLEAN3_11202	SPU_011202		Transcriptome data indicates that Glean may have falsely predicted the following exons: 6,7.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RENNLQHQSTGNSGLFSASFLHSPFLLSSLPSFIHPTNPKVELIDTKLAQMLLNVKRSAEGIRIRLHIVERWGRNETESLWMQTDQSDMYLDRDR\n
GLEAN3_26877	SPU_026877		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: QNCGCLAYLFTVVVMFYVNVRLIWKYTLKKESITLFESKTSFFSEALCNCENTVCALLNEMSVCLRVRVCVCEAEGGRERGTDRDEEKGDETRGSENVRLAEILEDEDAGWKEEGEPKKRSYDAHVLYWNLVLSSSLSWVPGATRALTK,NVGFRFLSKSNLFSLVAHDSFAIECTKYNYSACKYLNKKKTLFSQDILLFIKGHVSFVIALPPKMLKIFLHRIYFYIHAPTAESYSNDFRIGGKSNCILSPSLYNSSTILP\n
GLEAN3_16168	SPU_016168		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: QEIVRWPALCHTPSLPPFHSATTPPTRVDLSVTPPTFIVYSFPFYTYQTQTNKKQTKNICVKERRTVRPSTLYFSILTFLYFI\n
GLEAN3_26905	SPU_026905		Transcriptome data indicates that Glean may have falsely predicted the following exons: 9.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: MKSSAVFCNSSMASSFISSIISSTSFFTFFFTFFLTFFLTFFLALSLAFFFAFFFTFFFALSLTFFFTFFFAFFLAFFFTCFFTRFFACFFACFLTCFLACFLICFFATFFPVFLAALLPPFLAADPATFIILTVC,FIYGIILHILHHIVYFFFHLLFYFLLDLFLDFLLGLILGLFLRLFLYFLLRFVLDLLFHLLFRLLFSLLLHLLFYSLLCLLLCLLLDLFLGLLFDLFLRHFLSGFLGRPLATFLGR,NLPDDEENNLQNNLDEPWPFISCQRACSGYVLFKDLYGWPSTLILLLAALVFSFLLFFFLPWRHFPVHFFLIIVIRPFACT\n
GLEAN3_17404	SPU_017404		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1,7.\n
GLEAN3_07242	SPU_007242		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: QKVEDEHSSRPESLPDHHTDFQFLGELGSSQSLEYFLGKLGLSQSLEYFPEGQDCSSYEWNSYESIENADELTYNCTWCSIPKPCREGRGKQNNISSKI,LSMKLQCWVKNLRRLGGTRRRHNSRRLKTSTPAVRNRCQITTQTFSFLVNLGRLRALNIFLVNLGCLRALSIFPKVKIVVPTNGIPMRA\n
GLEAN3_10438	SPU_010438		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: LLPTPFQVFLHNRKQMCSIIHFLYVIALNEMQNVENVELIQVRVNFEASVHTALNECRLELHVCLCLLFFMSYMNIIHSPSELH\n
GLEAN3_05831	SPU_005831		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(16-21), CT, TM has 97% identity to another Sp-Tlr gene(GLEAN3_18838). So it could be a member of Toll-like receptor family.\n
GLEAN3_24820	SPU_024820		Blasted to PTPRA and PTPRT, but phylogenetic analysis showed that it is part of a novel clade also containing PTPRLec1, PTPRLec2, PTPRLec3, PTPRLec4, PTPRLec5, PTPRFn1, and PTPRFn2.\n
GLEAN3_09602	SPU_009602	After reviewing the data and performing a BLAST search, it appears that this is the best fit for this particular GLEAN model. When examining the excel data, it is evident that the sequence doesn't begin until 754, however, the rest of the sequence contains an orderly and continuous arrangement until the end. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be distributed into 2 distinct clusters that had overall weak scores. 	GLEAN3_09602 may be a duplicate prediction for GLEAN3_28213.\n
GLEAN3_11286	SPU_011286		This sequence apparently encodes the first exon of Sp p38, the rest of which is contained by GLEAN3_10118. Amino acids 1-47 (approximately) corresond to p38. Correct sequences below: \n \nDNA: \nATGTCTGCTTTTCATTCACTGCCAGAGGACTTCCATCACATTGAACTCAATAAAACGATATGGGAAGTCCCCAATCGGTATGTGCGACTGGAACCTGTGGGCTCAGGAGCGTATGGGCAAGTATGTTCAACAGAA \n \nProtein: \nMSAFHSLPEDFHHIELNKTIWEVPNRYVRLEPVGSGAYGQVCSTE\n
GLEAN3_28213	SPU_028213		GLEAN3_09602 may be a duplicate prediction.\n
GLEAN3_19852	SPU_019852		Blasts to PTPRA, but forms a novel clade in phylogenetic analysis with PTPRFn1, PTPRFn2, and PTPRLec2-6.  \n
GLEAN3_00237	SPU_000237		partial sequence only, internal. Also seems to be missing an exon.\n
GLEAN3_19770	SPU_019770		GLEAN3_19770 codes the first exon(s) for this gene. Rest of the gene is present in GLEAN3_19769.\n
GLEAN3_19769	SPU_019769		GLEAN3_19770 codes the first exon(s) for this gene. Rest of the gene is present in GLEAN3_19769.\n
GLEAN3_17187	SPU_017187		One of 3. This gene is a partial sequence, and is identical to 19022, which is longer and encompasses this gene. GLEAN3_01396 also is a JIP3, but is distinct from 19022 and 17187.  This gene and 19022 also BLAST well to XP_782498.1, sperm-associated antigen 9 isoform 1. \n
GLEAN3_09388	SPU_009388		Similar to phosphohistidine phosphatase 1.\n
GLEAN3_26582	SPU_026582		Partial sequence.  See also GLEAN3_06528, GLEAN3_16411, GLEAN3_22686, and GLEAN3_18743. \n
GLEAN3_28046	SPU_028046		Similar to Protein phosphatase PP2A regulatory subunit A. Partial sequence.\n
GLEAN3_01694	SPU_001694		Similar to Receptor-type tyrosine-protein phosphatase R. Partial sequence.\n
GLEAN3_15535	SPU_015535		Similar to Receptor-type tyrosine-protein phosphatase R. See also GLEAN3_20488.\n
GLEAN3_23889	SPU_023889		homolog: arrestin beta-1 from human, isoform B\n
GLEAN3_20488	SPU_020488		Similar to Receptor-type tyrosine-protein phosphatase R. See also GLEAN3_15535.\n
GLEAN3_03711	SPU_003711	From the BLAST results as well as the excel data, it was determined that for this particular GLEAN model the sequence is distributed onto 2 different scaffolds. When examining the excel data, it is evident that both scaffolds contain an orderly arrangement without any gaps or repeats present. There is however a sequence overlap between the two scaffolds that occurs between 704- 949. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5. 	Missing N-ternimus.  See GLEAN3_06561, _01698, _00076.  \n
GLEAN3_11457	SPU_011457		Missing N-ternimus.  See GLEAN3_14876.  \n
GLEAN3_05942	SPU_005942		See GLEAN3_14876, _11457.  \n
GLEAN3_20542	SPU_020542		Blasts to PTPRK, but didn't clade with these genes in phylogenetic analysis.  Formed a unique clade with Glean3_15923. Partial sequence.\n
GLEAN3_08253	SPU_008253		Similar to R-PTP-alpha.  See also GLEAN3_16053, GLEAN3_16144, GLEAN3_19852, GLEAN3_20604, GLEAN3_24537, GLEAN3_27101, and GLEAN3_22839.  \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537808-25802-57187428179.BLASTQ4\n
GLEAN3_17329	SPU_017329		The ATG (exon 1) probably lies in an unsequenced area.  Other notes for exon 15:  \n- 23 bp missing after 23565 \n- 16 bp mismatch btwn 23566-81 \n- 24 bp missing after 24183 & 24237\n
GLEAN3_06155	SPU_006155		glean modified to correspond to est. \nSeems like it is missing the C-terminal part of the expected protein. It could be contained in the prediction glean3_17460, but the est doesn't line up with the exon sequences there\n
GLEAN3_24537	SPU_024537		Similar to R-PTP-alpha.  See also GLEAN3_08253, GLEAN3_16053, GLEAN3_16144, GLEAN3_19852, GLEAN3_20604, GLEAN3_27101, and GLEAN3_22839.\n
GLEAN3_28020	SPU_028020		it seems that this prediction has two matches to Nef3 in C-terminus and N-terminus.\n
GLEAN3_15941	SPU_015941		It hits to the same query Mouse Rufy3 as GLEAN3_28460. They maybe the same gene,maybe not. It is named as Sp-Rufy4.\n
GLEAN3_28184	SPU_028184		has one kazal and two TY domains - like a splice isoform of SMOC (Q9H4F8)- a SPARC homologue\n
GLEAN3_02025	SPU_002025		Contains single NtA domain like N-terminus of agrin. \nOther GLEAN  predictions contain FOLN and KAZAL repeats and may comprise the next segment (especially GLEAN3_02467 and possibly GLEAN3_24994).  A fourth gene looks like the next piece (GLEAN3_22633)and the adjacent gene (GLEAN3_22634) contains LamG repeats that look like the C-terminus. These five gene predictions may be adjacent and comprise a full agrin gene.\n
GLEAN3_17460	SPU_017460		glean describes c-teminal part of the gene \npotentially could be the c-terminal part of the gene descibed as glean3_06155, but no linking est data is available\n
GLEAN3_15605	SPU_015605		Hh signaling pathway member\n
GLEAN3_19022	SPU_019022		One of 3. This Glean is identical to and encompasses GLEAN3_17187, both of which BLAST to JIP3 as well as sperm-associated antigen 9. In addition, GLEAN3_01396 is also a JIP3, but does not match these others and so probably represents a true duplication.\n
GLEAN3_24688	SPU_024688		See also GLEAN3_05592 and GLEAN3_06723.\n
GLEAN3_05592	SPU_005592		See also GLEAN3_06723 and GLEAN3_24688. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537110-23138-78233751102.BLASTQ4 \n \nAnd this sequence is a duplicate of GLEAN3_06723\n
GLEAN3_01396	SPU_001396		One of 3. GLEAN3_17187 and 19022 also encode a JIP3, distinct from this gene.\n
GLEAN3_25413	SPU_025413		Similar to c-myc binding protein.\n
GLEAN3_18908	SPU_018908		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: FTFTCYLSGQKKDPTILVLNRRSLLINIHEVPHPRAIWFYCYPPLHLNSSQPPDEYYKYSRSLCCVFVSCLCLYQAARSGL\n
GLEAN3_07911	SPU_007911		Using SpADAM cDNA, two predictions align GLEAN3_0911 and GLEAN3_20545.  Both are nearly identical from 1010 to 3072.  Missing exons are mostly on Scaffold_317, but some are only on Scaffold_663 (2 exons encoding 681-865).  I have corrected gene features of GLEAN3_20545.\n
GLEAN3_20545	SPU_020545		Using SpADAM cDNA, two predictions align GLEAN3_0911 and GLEAN3_20545.  Both are nearly identical from 1010 to 3072.  Missing exons are mostly on Scaffold_317, but some are only on Scaffold_663 (2 exons encoding 681-865).  I have corrected gene features of GLEAN3_20545.\n
GLEAN3_10565	SPU_010565		looks like part of slit - the C-terminal half - perhaps the last few domains are artefacts/duplication - missing N-terminal half with more LRR repeats \n \nADJACENT GENE (GLEAN3_10564) - LOOKS LIKE THE N-TERMINAL HALF - could be one or two exons encoding LRR repeats missing at junction\n
GLEAN3_16527	SPU_016527		novel architecture - TSP1 plus LamG x2 - no homologs known\n
GLEAN3_18348	SPU_018348		FA58C-LamG-LamG structure defines this as relative of CASPR - probably missing C-terminal half that should contain other domains (maybe FBG,more LamGs and EGFs, TM and/or 4.1m) \n \nThe prediction from GLEAN3_07341 looks like a likely candidate for C-terminus\n
GLEAN3_06742	SPU_006742	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 4 different scaffolds. After reviewing the excel data in comparison with the BLAST results it is clear that there are numerous gaps and internal repeats present within the 4 different scaffolds, resulting in poor overall coverage. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 5.5.	Hh signaling pathway regulator\n
GLEAN3_23409	SPU_023409		Has EGF and BNR repeats but no N-terminal reeler domain - probably a fragment. \n \nGLEAN3_08268 is very similar in structure\n
GLEAN3_08268	SPU_008268	After reviewing the data and performing a BLAST search it appears that no good GLEAN model fits sufficiently. The sequence is distributed onto 3 different scaffolds. There is some Est information available on GBrowse V0.5 and the transcriptome intensity scores appear to be somewhat strong with numerous values ranging from 3-15. Additional information from Baylor gene information (comments): Has EGF and BNR repeats but no N-terminal reeler domain - probably a fragment. GLEAN3_23409 is very similar in structure	Has EGF and BNR repeats but no N-terminal reeler domain - probably a fragment. \n \nGLEAN3_23409 is very similar in structure\n
GLEAN3_23757	SPU_023757		An cDNA containing everything but the 5' end of the gene was used in a BLAST search of the contig database, which resulted in the identification of an overlapping sequence that starts with a signal peptide.  The first exon appears to be on the - strand whereas the other exons are + strand, suggesting an assembly error. \n
GLEAN3_27145	SPU_027145		The full-length cDNA was assembled from several overlapping cDNA fragments and ESTs and confirmed by PCR of a full length ORF.  Gene features have been altered to comply with cDNA.\n
GLEAN3_28236	SPU_028236		Likely the unique ortholog of human CDC2L5 and CrkRS \n"Additional evidences of the existence of the gene" have been obtained in Sphaerechinus granularis\n
GLEAN3_28463	SPU_028463	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. There are several sequence overlaps and internal repeats present within the first scaffold that ranges from 770-857. However, the overall sequence does contain an orderly arrangement.  There was some Est support available from GBrowse assembly V0.5 and the trancsriptome intensity scores appeared to be some what strong with most of the values being greater than 5. 	The C-terminal portion of this GLEAN is clearly MAPKAPK5, whereas the N-terminal portion appears to be CG12134-PA (BLAST XP_788954.1). The sequence below has been modified to correspond only to MAPKAPK5. The MAPKAPK part is partially duplicated by GLEAN3_21161. \n \nMWSLGVIIYIMLCGYPPFYPDTPSRQLSKDMRHKIMAGQYEFPTEEWSLISDEAKDVVKRLLRVDPTERLTIEELCSHPWLRENSAPNTELHSPAIMLDKNMLDDAKQIHSEQLTAMRIPDKKVMLKPVAKANNPIVRKRILTRGQSIDNKIGEEQPPKKQNRENSEGVTCLRNIIAHCIVPPKDANGEDALCELMKRACQYNRDCPSLDKALNNLSWNGEQFCDKVDRSELALLLKDIVDQKERHEKC\n
GLEAN3_23951	SPU_023951		GLEAN3_23951 lacks C-terminal SH3 domain present in homologs.\n
GLEAN3_21161	SPU_021161		partial sequence,  identical to GLEAN3_28463 from aa 35-249 (with one exception). Mismatched short ends do not appear to result from frame shifts. Also BLASTs to XP_781571.1 (MAPKAPK5) with high e value.\n
GLEAN3_13910	SPU_013910		Duplicate gene (non-identical) to other MAPKAPK5s: GLEAN3_28463 and _21161 (these later 2 appear to be the same gene). The termini of this gene appear to be incorrect.\n
GLEAN3_23676	SPU_023676		Also BLASTs strongly to XP_789413. Identical and internal to GLEAN3_06513\n
GLEAN3_20611	SPU_020611		This gene appears to be missing an exon encoding SLLHLITQYLNPRTLSKDFQGK (aas 213-234).  \nThis is an overlapping identical duplicate of GLEAN3_25452.\n
GLEAN3_25452	SPU_025452		Also BLASTs strongly to XP_791076.1. Appears to be an overlapping identical duplicate of GLEAN3_20611\n
GLEAN3_11714	SPU_011714		sequence is only partial. GLEAN3_20782 is an internal identical duplicate of this gene. \n
GLEAN3_20782	SPU_020782		internal identical duplicate of GLEAN3_11714. Also BLASTs strongly to XP_797035.1\n
GLEAN3_26498	SPU_026498		This is the C terminal part of the protein; the N terminal portion is encoded by GLEAN3_27848. These 2 gleans overlap (nucleotide level): bases 1-353 (this  glean).  \n
GLEAN3_17694	SPU_017694		duplicate, non-identical to GLEAN3_10805. Partial sequence.\n
GLEAN3_07222	SPU_007222		appears to be missing the start codon, but 3rd aa is present\n
GLEAN3_27370	SPU_027370		The N-terminal sequence (exons from nt 8030 to 19367) is not part of Sp-CDK7. This sequence is similar to the sequence NP_000327.1 encoding sodium channel, nonvoltage-gated 1, beta [Homo sapiens].  \nLikely due to a problem of contig assembly. \nThe N terminus of Sp-CDK7 is missing and the C-terminus (two ultimate exons)is conflictiv.\n
GLEAN3_04845	SPU_004845		adhesion protein or cell surface receptor - novel architecture - FBG, an N-terminal  MNNL Notch ligand domain and multiple EGF-Ca repeats. Good Blast match with Notch homolog but that may be spurious (EGFs). \nCould be a Notch or Notch ligand but does not have ankyrin repeats characteristic of Notch or DSL characteristic of Notch ligands\n
GLEAN3_24020	SPU_024020	From the BLAST results as well as the excel data, it appears that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that there were several base pair duplicates and sequence overlaps within both scaffolds. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values ranging less than 5	novel architecture - C-terminal FBG domain - might be related to a role in immune defense. Has pfam:Nacht domain - NTP-binding?? \n \nBlast match to tenascin is probably misleading\n
GLEAN3_11551	SPU_011551		Based on best blast hit data, this protein is closely related to tolloid but lacks the C-terminal EGF, CUB and CUB domains.  One C-terminal predicted exon and two N-terminal exons do not encode conserved sequence and may not be part of this gene. \n
GLEAN3_07341	SPU_007341		LamG-EGF-LamG-4.1m - looks like C-terminus of CASPR or neurexin. \nNeurexin gene (GLEAN3_24416) has its C-terminus - CASPR gene (GLEAN3_18348) does not. Suggests this is part or CASPR gene.\n
GLEAN3_14828	SPU_014828		All predicted exons supported by EST data.\n
GLEAN3_12138	SPU_012138		EGF-LAMG-LAMG-EGF \n \nthese two domains occur in intermingled fashion in quite a few known proteins - neurexins, perlecans, CASPR \n \nURCHINS APPEAR ALSO TO HAVE NOVEL EGF/LAMG PROTEINS\n
GLEAN3_15404	SPU_015404		membrane-proximal portion of an adhesion receptor - a bit like Crumbs - has several LamG/EGF pairs and a TM domain \n \nthese two domains (or other EGF variants) occur in intermingled fashion in quite a few known proteins - neurexins, perlecans, agrin, crumbs, CASPR, some cadherins \n \nurchins appear also to have novel EGF/LAMG proteins \n
GLEAN3_24257	SPU_024257		#\nEGF-EGF-LAMG-LAMG-EGF-EGF-LAMG \n \nthese two domains (or other EGF variants) occur in intermingled fashion in quite a few known proteins - neurexins, perlecans, agrin, crumbs, CASPR, some cadherins \n \nurchins appear also to have novel EGF/LAMG proteins \n
GLEAN3_04895	SPU_004895		Multiple EGFCa repeats and C-terminal Lamg/EGFCa modules \nNo obvious TM domain \nNovel architecture \nEssentially same structure as >GLEAN3_16555 \nLook a bit like Crumbs but much larger and not the same domain organization\n
GLEAN3_16555	SPU_016555		Multiple EGFCa repeats and C-terminal Lamg/EGFCa modules \nNo obvious TM domain \nNovel architecture \nEssentially same structure as >GLEAN3_04895 \nLook a bit like Crumbs but much larger and not the same domain organization\n
GLEAN3_16807	SPU_016807		Multiple EGFCa repeats and three LamG domains interspersed before a TM segment \nLooks rather like Crumbs in overall organization but larger. \n \nGLEAN3_20365 is similar\n
GLEAN3_09927	SPU_009927		This model is on a short scaffold and is probably lacking Both N and C-terminal exons.  One predicted exon, given below, cannot be validated by sequence similarity to members of the M12A class of proteases. \n>GLEAN3_09927|Scaffold82736|3265|3440| DNA_SRC: Scaffold82736 START: 3265 STOP: 3440 STRAND: +  \nGAGAAGAAGAAGAAGAAAAAGATGATGAAGAAGAAGATGAGGAGGAGGAGGATGATGAAGAAGAAGAAGA \nAGGAGAAGATGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAAAAGAA \nGAAGACAAACAAGAAGAGGAGGAGGAAGACGAAGAA \n
GLEAN3_20365	SPU_020365		#\nMultiple EGFCa repeats and three LamG domains interspersed before a TM segment \nLooks rather like Crumbs in overall organization but larger. \n \nGLEAN3_16807 is similar\n
GLEAN3_21812	SPU_021812		Likely unique ortholog of human cyclin T1 and T2\n
GLEAN3_28742	SPU_028742		This model contains exons encoding a protein most similar to SpAN, a sea urchin astacin protease. but lacks a C-terminal domain found in SpAN.\n
GLEAN3_17070	SPU_017070	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. The first stretch of the sequence is on v2.1_scaffold14511 until 1080 is reached. The sequence is continued on v2.1_scaffold77083 until 1299 is reached. And the sequence is completed on v2.1_scaffold74943. When comparing the BLAST results with the data from excel, it is apparent that if the three scaffolds were to be combined the sequence would have a continuous and orderly arrangement. There is Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be strong with most of the values being greater than 10.	Comparison to best blast hit show that the following exons are not conserved with other M12A class proteins: \n>GLEAN3_17070|Scaffold1519|80742|80817| DNA_SRC: Scaffold1519 START: 80742 STOP: 80817 STRAND: +  \nATTTCTGCAGCCTGTCGCGAGAGATATTGGTTGCGGTTGCAATCTCGACGAGACTCTTCCAACATATGAA \nGAGCAT \n>GLEAN3_17070|Scaffold1519|82021|82153| DNA_SRC: Scaffold1519 START: 82021 STOP: 82153 STRAND: +  \nGATGACGCCGAATTTTGAAAATCAACTCTGTATGTCTGCCCATCGATTGGTACCGAGTCGTCATGTAGAA \nAATTGATACACGATCTGTTTGATATGTACTGCATAGTTTCCTCAATTACAGTTCTGAATGATT \n>GLEAN3_17070|Scaffold1519|83467|83734| DNA_SRC: Scaffold1519 START: 83467 STOP: 83734 STRAND: +  \nCTAACTCGCTATCGATCTCATACGGTAGTGTTGCGTCGGGCCATGTCGCTCCAGTTTCCACATTCCTCTT \nGGTCCTACTCCCGTTGCCATTGTGACCATCCTCTTCCATGAACTTCTTCTGCTCTTCAGTAAGGCGGATA \nTCTCCCAGGATGACGTCACCTGGATTCAGATTGTCCATTGGTTTGCTATGCTGTTCGGACTCAGCTTCCG \nCATTGTGAGGGCGCGCCAACACCGTATCGTCAACGTCTTTCTTGAATGGTGGCAGAGA \n>GLEAN3_17070|Scaffold1519|85911|86121| DNA_SRC: Scaffold1519 START: 85911 STOP: 86121 STRAND: +  \nTGATTTTCCCTTGTCATCGACGAAATCATGGTCGTGATCACTGTTGAGAGGAAGACGTTCGTCATCGACT \nGTCGTCGTGAATACGGCGACAGCTAGGCAGAGTAACAGTACAGAGCTCAGGCAAATCCTTTTCATCATTT \nTTTCGTTCCGGCGTTTCGGCAAAGCGACGAGATTCTCCAAACCAACGGAGTGACAGTAGTAAGCAGCAGT \nC \n
GLEAN3_09203	SPU_009203		Model probably contains partial CDS because it is at the end of a scaffold.  Of the 5 predicted exons, only 3 and 4 contain conserved sequences.\n
GLEAN3_14989	SPU_014989		The N-terminal sequence of this cyclin is probably the one encountered in GLEAN3_11295.    \nThree GLEAN: GLEAN3_00328,14989 and 0011295 encode the cyclin L protein. They differ in the N-terminal end. \n
GLEAN3_00218	SPU_000218		#\nThe model contains exons encoding CUB domains similar to those found in tolloid-like proteins within he M12A metalloprotease subfamily.  This sequences are very similar those in GLEAN3_17070, but are two divergent to be allelic.  This is very likely to be partial CDS.\n
GLEAN3_27114	SPU_027114		Comparison to best blast hit sequence suggests that the gene model contains exons encoding a tolloid/BMP-1 like protein.  The model is likely to be partial because there are only 3 cub domains instead of the 5 normally associated with this subclass of astacin proteases.  Note that this model is adjacent to a very similar gene, GLEAN3_27115.\n
GLEAN3_27115	SPU_027115		Comparison to best blast sequences suggests that this model contains exons encoding a protein related to tolloid and bmp-1.  It is like to be partial because there is only 1 cub domain rather than the 5 characteristic of proteins in this subclass of M12A.  Note that it is adjacent to a very similar gene, GLEAN3_27114\n
GLEAN3_19518	SPU_019518		identical to glean3_00129 over >200 aa; possible mis-assembly\n
GLEAN3_26758	SPU_026758		strong identity to HIF2a through first 239 aa\n
GLEAN3_08353	SPU_008353		needs to be split \naa 1-~900 = similar to Biotin protein ligase \naa ~900-1676 = similar to SIM but missing N-term. \nN-terminal likely found in GLEAN3_13962\n
GLEAN3_21277	SPU_021277		gi|72046985|ref|XP_786603.1|  PREDICTED: similar to ataxin 2 [Strongylocentrotus purpuratus]Length=898 \n
GLEAN3_24739	SPU_024739		Comparison to best blast sequence suggests that all but the first and last exons in this model are conserved with tolloid-like proteins.  While the first predicted exon may be part of this gene, the last one ( >GLEAN3_24739|Scaffold97632|8420|9724| ) encodes peptide sequence similar to other proteins.\n
GLEAN3_03612	SPU_003612		This model contains exons encoding an astacin protease of the tolloid family most closely related to the sea urchin protein SpAN, but lacks other exons characteristic of this group of proteins, such as CUB domains probably because they are on other scaffolds and this model is located at the end of scaffold 72693.\n
GLEAN3_04586	SPU_004586	From the excel data and the BLAST results, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it was discovered that there was one base duplicate (399-672) that may be apart of the sequence. There was also a sequence overlap between the 2 scaffolds that spanned from 19-90 within the second scaffold that was not apart of the orderly arrangement of the rest of the sequence. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak all of the values being less than 5.5. 	short with TM/LDLA/TY domains\n
GLEAN3_05781	SPU_005781		NOVEL ARCHITECTURE - TY-TY-WAPx5-VWCx4-WAP domains\n
GLEAN3_12001	SPU_012001		NOVEL ARCHITECTURE - EGFCa interspersed with 3 TY repeats\n
GLEAN3_19601	SPU_019601		NOVEL ARCHITECTURE - WAPx4-TY-x3-EGFx3  domains\n
GLEAN3_27371	SPU_027371	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, there appears to be several sequence overlaps present on v2.1_scaffold44545 resulting in no orderly arrangement within the last portion of the sequence. These sequence overlaps and repeats were also apparent from the BLAST results.  There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores were difficult to determine due to the scores being in close proximity to each other and the scale of the graph. This is an un-annotated gene so no additional gene information (comments) was available from Baylor annotations.	LNB-7TM receptor - lots of calx-B repeats \n \nHOMOLOGOUS WITH "VERY LARGE G PROTEIN-COUPLED RECEPTOR 1, VLGR1/MASS1/GPR98 - mutated in Usher syndrome 2C \n \nPREVIOUSLY CHORDATE RESTRICTED\n
GLEAN3_00957	SPU_000957		probably an ECM protein given its domain composition - TSPN - many VWC -VWD at C-terminus - novel architecture \nGLEAN3_04940 has very similar strucure minus the TSPN \nthe FN1 predictions are likely alternative predictions for the VWC repeats\n
GLEAN3_04940	SPU_004940		probably an ECM protein given its domain composition -   \nmany VWC and VWD at C-terminus - novel architecture \nGLEAN3_00957 has very similar strucure plus a TSPN domain at N-terminus \nthe FN1 predictions are likely alternative predictions for the VWC repeats\n
GLEAN3_27525	SPU_027525		C-terninus of this gene is GLEAN3_27526 and should be combined.  \n
GLEAN3_26798	SPU_026798		This is part of a sea urchin specific group of ADAM-TS genes.  There are two worm ADAMTS genes with some sequence similarity (wormbase# F08C6.1a.1 and C02B4.1)\n
GLEAN3_12296	SPU_012296		aa 1-235 strong id to ss and AHRs \nSee also Glean3_13788 and Glean3_05022 for AHR-like sequences. \nGlean3_05022 may be the bHLH domain of this model or of Glean3_13788.\n
GLEAN3_16604	SPU_016604		GLEAN3_15419 model on scaffold 76434 is also part of predicted Sp-Pask.   FgeneshAB prediction S.P_Scaffold70175 may have additional exons, based on alignment with mammalian PAS-K.\n
GLEAN3_13296	SPU_013296		only domain is the reprolysin domain\n
GLEAN3_13297	SPU_013297		only domain it contains is a part of the reprolysin domain\n
GLEAN3_25061	SPU_025061		contains TSP1 repeats found in ADAM-TS sequences\n
GLEAN3_01545	SPU_001545		contains a TSP1 and calcium-binding domain \n
GLEAN3_05275	SPU_005275		#\nPREDICTED: Strongylocentrotus purpuratus similar to Machado-Joseph disease protein 1 (Ataxin-3) (LOC581652), mRNA \n
GLEAN3_05234	SPU_005234		Comparison to best blast sequence suggests that the model contains exons encoding an astacin protease.  It lacks domains characteristic of the closely related astacins, tolloid and BMP1.  One of the predicted exons (>GLEAN3_05234|Scaffold6580|26133|26420) probably belongs to another gene.  The inferred amino acid sequence from the last two predicted exons (>GLEAN3_05234|Scaffold6580|26133|26420; >GLEAN3_05234|Scaffold6580|30923|31151| ) is not conserved.\n
GLEAN3_10948	SPU_010948		This model encodes a protein with the same domain architecture as the sea urchin protein SpAN; the primary sequence indicates that it is a different gene.\n
GLEAN3_01560	SPU_001560		Comparison to best blast sequence suggests that the model contains some but not all of the domains characteristic of tolloid/bmp1 proteins.  The last 4 predicted exons given below contain sequences similar to other kinds of proteins and therefore may not be part of this model. \n>GLEAN3_01560|Scaffold70398|20995|21158| DNA_SRC: Scaffold70398 START: 20995 STOP: 21158 STRAND: +  \nCCTTGGGCCTTGAGAGTTATGTCATCCCAGATTCAAGTCTGACAGCTTCCAGTGAATTTAATGCTGACCA \nTGGTGCAAAGAGAGGTCGTCTTAACCTGGCCAGAGTCGGGGATCTGCGTGGAGGCTGGAATCCAATGGAC \nAACGATGCAAACCCGTGGATCCAG (no blasts to something else; maybe EGF) \n \n>GLEAN3_01560|Scaffold70398|23209|23361| DNA_SRC: Scaffold70398 START: 23209 STOP: 23361 STRAND: +  \nGTGGATCTTCTGGACCTTTACCGTATCATTTCAGTTGCGACTCAAGGGCGACAAGATCTTGACCAGTGGG \nTTAATAGCTACAAGCTTGCTTGGAGTACTGATGGCACGACCTTTCGCACAGTGCAGGACATTCCCGGGCC \nAGGAGCTGACAAG (blasts as previous exon) \n \n>GLEAN3_01560|Scaffold70398|23720|23890| DNA_SRC: Scaffold70398 START: 23720 STOP: 23890 STRAND: +  \nATCTTCATCGGTAATGTTGACCGCAACACCATCATGACCAACACTCTGCCTGTGTCCCAGGTTTGCCGCT \nATTTCCGCTTGATGCCTGTCAGCTGGTATAAACACATTAGTGTTCGTATGGAGATATATGGATATGGTGA \nAGGCCCTGTCACAGGTCAGTATGAAAACTAG (blasts as previous two exons. \n \n>01560 \nMSRTLLLSGLVAMLMAYSLAKPLRKQKGYTKTKVPQIKKVEFNGEILEIAVEEDDPFHRPIPADEGYSPNAYETDMMLNPEQEAALSDPKNSRNKRKASKDTTKYWPKKIIDQATSQHVINVPYEFGLGVDRTAIKAAMAHWQDQTCVRFEIHDRSVSSLWQHRLKFIKSDGCYSYLGLQSKIGFQDVSIGKGCTRLGTVSHEIGHALGFWHEQSRPDRDEFVTVNFANIIQDKMNAFRKHTTDDVMTNVPYDYNSVMHYGAYGFGIDAKVPTLIPKDPLSMGEIGQRLGLSYLDVKLANFMYECDSHCPGASSCHSGFRDMNCKCRCPESHKGDYCEVVALNFPGNLGNPDEQIRLKFDALDMEPFDTSSKKCLDYINIRAGGNLYYEGTDFCGNTLPPEIIADEIILSFHSDETNTNKGFHGTYTREKISALGLESYVIPDSSLTASSEFNADHGAKRGRLNLARVGDLRGGWNPMDNDANPWIQVDLLDLYRIISVATQGRQDLDQWVNSYKLAWSTDGTTFRTVQDIPGPGADKIFIGNVDRNTIMTNTLPVSQVCRYFRLMPVSWYKHISVRMEIYGYGEGPVTGQYEN \n
GLEAN3_26547	SPU_026547		#\nThere may be an assembly problem with this model since part of the protease domain is repeated.  IN this model the order of domains is partial astacin protease, cub, cub, then what is probably the beginning of the protein - N-terminal signal peptide, activation domain, astacin protease, cub, cub.  \n
GLEAN3_18198	SPU_018198		Matches the Lysosomal trafficking regulator from rat along the entire coding sequence.  Conservation very high at 3' end.  Tiling experiment indicates high expression in embryos.\n
GLEAN3_22164	SPU_022164		an exon may be missing\n
GLEAN3_16045	SPU_016045		One of 2. GLEAN3_04024 is an exact duplicate of this protein, although 04024 is shorter and missing the N terminus.\n
GLEAN3_15349	SPU_015349		the N terminal region in particular matches AMPK-like, while the C terminus does not BLAST strongly\n
GLEAN3_17949	SPU_017949		One of 2. An almost perfect duplicate of GLEAN3_09559. This protein is longer, appears to contain the true N terminus and an exon missing from 09559.\n
GLEAN3_09878	SPU_009878		one of 2. Non-identical duplicate of GLEAN3_03844\n
GLEAN3_23875	SPU_023875		One of 4. Non-identical duplicate of GLEAN3_00442, 23876, 08085.\n
GLEAN3_05676	SPU_005676		GLEAN3_04836 is a partial duplcate prediction.\n
GLEAN3_04836	SPU_004836		GLEAN3_04836 is a partial duplcate prediction for GLEAN3_05676.\n
GLEAN3_09559	SPU_009559		one of 2. THis is an almost-perfect duplicate of GLEAN3_17949. This protein is missing the N terminus and an internal exon, but otherwise is an exact match.\n
GLEAN3_26779	SPU_026779		Partial sequence identical and included in GLEAN3_24526\n
GLEAN3_00442	SPU_000442		One of 5. Non-identical duplicate of GLEAN3_23875, 23876, 08085, 19751\n
GLEAN3_17487	SPU_017487		One of 2. This protein appears to be a shortened version of GLEAN3_05613, which has a much longer N terminus. This protein (17487) has a slightly longer C terminus.\n
GLEAN3_28480	SPU_028480		First half completely predicted. Last half of the gene missing. GLEAN3_17903 is a partial duplicate prediction.\n
GLEAN3_17903	SPU_017903		First half completely predicted. Last half of the gene missing. GLEAN3_28480 is a partial duplicate prediction.\n
GLEAN3_05613	SPU_005613		One of 2. GLEAN3_17487 overlaps C-terminus and is nearly identical. 17487 is slightly longer at C-terminus, but this protein (05613) is considerably longer in the N terminus.\n
GLEAN3_09980	SPU_009980		- shows comparable homology to vertebrate terminal deoxyribonucleotidyltransferase (TdT) and vertebrate polymerase mu \n- one intron in was skipped by GLEAN3 prediction\n
GLEAN3_26447	SPU_026447		PREDICTED: similar to fragile X mental retardation gene 1, \nautosomal homolog [Strongylocentrotus purpuratus].\n
GLEAN3_23876	SPU_023876		One of 3. This one is the longest, and is non-identical to either GLEAN3_00442 or GLEAN3_23875\n
GLEAN3_08085	SPU_008085		One of 5. Non-identical duplicates of 00442, 19751 and 23875. Nearly identical (and internal) to 23876, although the C terminus of this protein diverges. \n
GLEAN3_28711	SPU_028711		Possibly missing an exon in the middle.\n
GLEAN3_19009	SPU_019009		Strongylocentrotus purpuratus mRNA for SuDp98 protein Length=3650 \n
SpRag2L	SPU_030091		This gene has been verified by Race and RT-PCR.  It is expressed at low levels in early gastrula embryos, adult coelomocytes, and other adult tissues.  Though it has only low sequence identity with vertebrate Rag2 it is predicted to have the same structure and is encoded in reverse  orientation downstream of SpRag1L (a Rag1-like gene). \n
GLEAN3_01138	SPU_001138		Pfam00194 match.  \n \nTranscriptome data indicate that it is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_10623	SPU_010623		GLEAN3_10623 is a partial duplicate prediction for GLEAN3_27004.\n
GLEAN3_27004	SPU_027004		GLEAN3_10623 is a partial duplicate prediction for GLEAN3_27004.\n
GLEAN3_12518	SPU_012518		pfam00194 match.  \n \nTranscriptome data indicate that it is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_24809	SPU_024809		Pfam00194 match.   \n \nTranscriptome data indicates that it is expressed in embryo. \n \nA family of carbonic anhydrase-like proteins exists in sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_14262	SPU_014262		these other Glean3 sequences had high similarity to endonuclease-reverse transcriptase: Glean3_24197, Glean3_26145, Glean3_02879 \n
GLEAN3_24197	SPU_024197		these other Glean3 sequences also had high similarity to endonuclease-reverse transcriptase: Glean3_14262, Glean3_26145, Glean3_02879\n
GLEAN3_26145	SPU_026145	After reviewing the data and performing a BLAST search, it appears that the there is no good GLEAN model fit for GLEAN3_26145. The sequence is distributed onto two different scaffolds. There are numerous repeats within both scaffolds as well as gaps and sequence overlaps. When compared to the other scaffolds the BLAST results indicated that >v2.1_scaffold34698 had a relatively low bit score and a high e-value. There was Est. information available from GBrowse assembly V0.5 and the transcriptome score intensity appeared to be somewhat weak with most of the values in below 5. This is an un-annotated gene so no additional gene information (comments) was available from Baylor annotations. 	these other Glean3 sequences also have high similarity to endonuclease-reverse transcriptase: Glean3_14262, Glean3_24197, 02879\n
GLEAN3_02879	SPU_002879		these other Glean3 sequences also have high similarity to endonuclease-reverse transcriptase: Glean3_14262, Glean3_24197, Glean3_26145\n
GLEAN3_08844	SPU_008844		This blasts to PPEF1, but phylogenetic analysis showed that it was a homologue of human PPEF2.  Glean3_11860 is likely the identical protein.\n
GLEAN3_19367	SPU_019367		Glean3_22254 is a partial sequence of this entry\n
GLEAN3_22254	SPU_022254		This sequence was a partial sequence of Glean3_19367 and has been modified to include the sequence from Glean3_19367\n
GLEAN3_11655	SPU_011655		domains LDLa - CCP x4 - EGFCa x3 - Ig - SEA 7TM_2 \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs\n
GLEAN3_12383	SPU_012383	After reviewing the data it appears that there is no good GLEAN model that fits sufficiently to GLEAN3_12383 due to the number of gaps and internal repeats present. For the first part of the sequence (from approximately 1-2000 base pairs) it appears that there's very poor sequence coverage which was indicated by the BLAST results. There were very low bit scores and e-value results for the first part of the sequence. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be somewhat strong with values ranging from about 3-26 with a bell shaped (curve) distribution of scores. This is an un-annotated gene so no additional comments were available.	KR-FA58Cx3-CLECT-EGFCa-CUB X3-LDLa x6-LRR 7TM_1 \nNo GPS but looks like a bit like a member of the LNB-7TM family of adhesion domain GPCRs or like a glycoprotein hormone receptor - 7TM_1 favors latter \nNo known LDLa, KR or FA58C members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_15198	SPU_015198		Lyx4- EGFCa- Lyx2-LDLa- EGFCax2-Ig 7TM_2 \n \nNo GPS but otherwise looks like a bit like a member of the LNB-7TM family of adhesion domain GPCRs \nNo known LDLa or LY members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_23185	SPU_023185		EGFCa x13-Ig 7TM_2 \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs\n
GLEAN3_26721	SPU_026721		EGFCax3-Ig 7TM_2 \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs\n
GLEAN3_07507	SPU_007507		NIDO and VWD domains characteristic of mucins\n
GLEAN3_07661	SPU_007661		NIDO and VWD domains characteristic of mucins\n
GLEAN3_18197	SPU_018197		NIDO, IPT, AMOP, VWD,CCP - LOOKS LIKE A MUCIN but also has hyalin repeats\n
GLEAN3_20968	SPU_020968		NIDO and VWD domains characteristic of mucins\n
GLEAN3_25062	SPU_025062		NIDO, VWD AND EGF_CA TM - very similar structure to mucin4d of chickens\n
GLEAN3_05955	SPU_005955		DSL and EGFCa domains characteristic of Notch ligands \nGene structure looks like partial duplication - assembly problems?\n
GLEAN3_11976	SPU_011976		DSL and EGFCa domains characteristic of Notch ligands \nGene structure looks like partial duplication - assembly problems?\n
GLEAN3_13510	SPU_013510		DSL and EGFCa domains characteristic of Notch ligands \nGene structure looks like partial duplication - assembly problems? \nAlso very short - fragment?\n
GLEAN3_13646	SPU_013646		DSL and EGFCa domains characteristic of Notch ligands \nRather short - fragment?\n
GLEAN3_16194	SPU_016194		contains no domains\n
GLEAN3_10547	SPU_010547		DSL and EGFCa domains characteristic of Notch ligands\n
GLEAN3_21193	SPU_021193		 This sequence roots the veritbrate clade containing both ADAM-TS16 and ADAM_TS18 genes\n
GLEAN3_16016	SPU_016016		DSL and EGFCa domains characteristic of Notch ligands\n
GLEAN3_06268	SPU_006268		only domain it contains is an ADAMs spacer\n
GLEAN3_21044	SPU_021044		DSL and EGFCa domains characteristic of Notch ligands \n \nGene structure looks like a duplication - assembly problems? \n
GLEAN3_25985	SPU_025985		#\nDSL and EGFCa domains characteristic of Notch ligands\n
GLEAN3_00680	SPU_000680		the sequence roots the vertibrate clade containing both ADAM-TS6 and ADAM-TS10\n
GLEAN3_18098	SPU_018098		contains a reprolysin domain and adams spacer\n
GLEAN3_16423	SPU_016423		similar to gamma-interferon inducible lysosomal thiol reductase (GILT) - vertebrate GILT cleaves disulfide bonds in proteins and is involved in MHC class II-restricted antigen processing.   \n
GLEAN3_27456	SPU_027456		contains only part of the reprolysin domain\n
GLEAN3_08756	SPU_008756		This gene roost the vertibrate clade containing both ADAM-TS7 and ADAM-TS12.  \n
GLEAN3_14521	SPU_014521		appears to be a hapoltype, but is lacking a portion of the sequence.\n
GLEAN3_03170	SPU_003170		This is part of a sea urchin specific group of ADAM-TS genes.  There are two worm ADAMTS genes with some sequence similarity (wormbase# F08C6.1a.1 and C02B4.1). may be a haplotype.\n
GLEAN3_04710	SPU_004710		This gene roots the clade of AdamTS2 and ADAMTS3\n
GLEAN3_10597	SPU_010597		contains metalloprotease and reprolysis domains which are the first parts of an ADAM-TS gene.  \n
GLEAN3_04088	SPU_004088		only domain it contains is the reprolysin domain\n
GLEAN3_18171	SPU_018171		haplotype found\n
GLEAN3_18228	SPU_018228		Similar to Tyrosine-protein phosphatase 10D precursor.  Receptor-linked protein-tyrosine phosphatase 10D.  Partial Sequence\n
GLEAN3_26428	SPU_026428		Blasts to Ppm1h, but is not homologous to human Ppm1h in PP2C subfamily tree.\n
GLEAN3_03844	SPU_003844		One of 2. Non-identical duplicate of GLEAN3_09878\n
GLEAN3_14936	SPU_014936		Also BLASTs to XP_783422.1 (it's a tie). One of 2, identical duplicate of GLEAN3_22395\n
GLEAN3_22395	SPU_022395		Also BLASTs to XP_783422.1 (it's a tie). One of 2, identical duplicate of GLEAN3_14936\n
GLEAN3_04024	SPU_004024		One of 2. This is a shorter, identical duplicate of GLEAN3_16045. This lacks the N terminus\n
GLEAN3_25169	SPU_025169		One of 2. GLEAN3_24676 is an identical duplicate in most of the N terminal portion (although the extreme N termini diverge); however the C terminal portions are divergent. The divergences are not due to frame shifts. This GLEAN appears to contain a start codon, unlike 23676.\n
GLEAN3_24676	SPU_024676		One of 2. GLEAN3_25169 is an identical duplicate in most of the N terminal portion (although the extreme N termini diverge); however the C terminal portions are divergent. The divergences are not due to frame shifts. This GLEAN does not appear to contain a start codon, unlike 25169.\n
GLEAN3_01928	SPU_001928		This is the closest genbank match to the published hyalin clone. That sequence was incomplete, and the hyalin repeats appear in many genes so it is unclear whether this is the "authentic" hyalin, or whether there is a family of matrix proteins expressed in embryos.  The identity between the cloned gene and this glean model covers much of the model with missing Hyalin repeats at the N terminus.\n
GLEAN3_14069	SPU_014069		Similar to SpRag1L (GLEAN3_27600), a sea urchin Rag1-like gene.  One of the more complete matches.  Probably a pseudogene.  Matches Rag1 core region, but c-terminal matching (SpRag1L 789-879) is attached to N-terminal. Region of match is SpRag1L: 380-874, ~39% AA identity. Siminlar to GLEAN3_09909.  \n
GLEAN3_16839	SPU_016839		One of 3. Partially overlaps with 08255, which is identical in the N terminal part of the overlapping sequence, but divergent in the C terminal part. Also overlaps (in a distinct region) with 08254, which is identical and entirely contained in 16839. Note that 08254 and 08255 do NOT overlap.\n
GLEAN3_08254	SPU_008254		This protein is identical and completely internal to GLEAN3_16839.\n
GLEAN3_08255	SPU_008255		Partially duplicated by GLEAN3_16839. The overlapping region is identical in the N terminal part, but divergent in the C terminal part, protein and nucleotide.\n
GLEAN3_04671	SPU_004671		Non-identical to other IKKs: GLEANs 16839, 08254, 08255. GLEAN3_11356 is a shorter, internal identical duplicate, although 11356 seems to contain a spurious stretch of amino acids (see that sequence).\n
GLEAN3_07638	SPU_007638		One of 2. GLEAN3_27909 is an almost identical duplicate \n
GLEAN3_27909	SPU_027909		One of 2. GLEAN3_07638 is an almost identical duplicate\n
GLEAN3_00053	SPU_000053		52 hyalin repeats, 10 EGF repeats plus 6 other exons.  incomplete gene at end of scaffold 1258.  Scores at high level in tiling experiment against embryos.  Placed 5th in hyalin family because its match against hyalin1 is in HYR domains and is significant but intermittant.\n
GLEAN3_08700	SPU_008700		Glean3_11110 is a partial sequence with an  exact match to this sequence\n
GLEAN3_11110	SPU_011110		This is a partial sequence that is an exact match with Glean3_08700 \n
GLEAN3_02435	SPU_002435		Similar to Sidekick 2. Partial sequence. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137536477-1679-116181527840.BLASTQ4\n
GLEAN3_19632	SPU_019632		This is the closest match to Sp-hyalin1. It matches at HYR's.  It has 18 hyalin repeats and 9 EGF repeats in 27 exons.  It spans the complete scaffold and is incomplete\n
GLEAN3_28412	SPU_028412		One of 2. GLEAN3_16505 is a nearly identical duplicate\n
GLEAN3_00806	SPU_000806		possibly an FGF-R, probably a RTK\n
GLEAN3_03870	SPU_003870		This gene model doesn't have a TIR domain. But the nucleotides encoding SP, NT, LRR(12-22), CT, TM has 90% identity to another Sp-Tlr gene(GLEAN3_20741) and it is located at the end of a contig.  So it could be a member of Toll-like receptor family. \n
GLEAN3_02779	SPU_002779		#\nsee also Sp-HSP70(3)A \nFirst described by, \nAUTHORS   Foltz,K.R., Partin,J.S. and Lennarz,W.J. \n  TITLE     Sea urchin egg receptor for sperm: sequence similarity of binding \n            domain and hsp70 \n  JOURNAL   Science 259 (5100), 1421-1425 (1993)\n
GLEAN3_22798	SPU_022798		The BLAST hit is quite weak and only picks up the STK domain. \n
GLEAN3_02418	SPU_002418		One of 2. GLEAN3_06947 is an identical duplicate, but is missing the N term. THis protein appears to contain the start codon.\n
GLEAN3_25210	SPU_025210		Duplicate prediction for GLEAN3_07944.\n
GLEAN3_25819	SPU_025819		Duplicate prediction for GLEAN3_26605\n
GLEAN3_09909	SPU_009909		Similar to SpRag1L (GLEAN3_27600), a sea urchin Rag1-like gene.  One of the more complete matches.  Probably a pseudogene.  Matches Rag1 core region, but c-terminal matching (SpRag1L 753-862) is attached to N-terminal (As for GLEAN3_14069). Region of match is SpRag1L: 399-862, ~41% AA identity.   \n
GLEAN3_23673	SPU_023673		This Glean3 sequence appears to be a duplication; Glean3_23674 and Glean3_23672 are on the same scaffold and also match to endonuclease reverse transcriptase\n
GLEAN3_23674	SPU_023674		This sequence appears to be a duplication; Glean3_23672 and Glean3_23673 are on the same scaffold and also match to endonuclease reverse transcriptase\n
GLEAN3_23672	SPU_023672		This sequence appears to be a duplication; Glean3_23674 and Glean3_23673 are on the same scaffold and also match to endonuclease reverse transcriptase\n
GLEAN3_15315	SPU_015315		may be trucated at c terminus\n
GLEAN3_15136	SPU_015136		Similar to SpRag1L (GLEAN3_27600), a sea urchin Rag1-like gene.  One of the more complete matches.  Probably a pseudogene.  Matches Rag1 core region. Region of match is SpRag1L: 607-977, 38% AA identity.   \n
GLEAN3_09908	SPU_009908		Similar to SpRag1L (GLEAN3_27600), a sea urchin Rag1-like gene. Probably a pseudogene.  Matches Rag1 core region. Region of match is SpRag1L: 557-647, 37% AA identity.   \n
GLEAN3_26698	SPU_026698		Similar to portion of Sp-Rag1L (GLEAN3_27600). Probably a pseudogene in cobination with transposase.  Matches Sp-Rag1L N-terminal putative Zn-binding region: AA 7-105, 32%.  \n
GLEAN3_08886	SPU_008886		Similar to UBXD2.\n
GLEAN3_24394	SPU_024394		One of 2. GLEAN3_24395 is almost identical, but slightly shorter.\n
GLEAN3_20853	SPU_020853		Similar to UBXD1.\n
GLEAN3_12427	SPU_012427		Similar to UBXD1.\n
GLEAN3_24395	SPU_024395		One of 2, GLEAN_24394 is almost identical but is slightly longer\n
GLEAN3_19658	SPU_019658	After reviewing the data and performing a BLAST search, it appears that there is no sufficient fit for this particular GLEAN model. The sequence is distributed onto 3 different scaffolds. Within the first scaffold, there is an internal repeat (one duplicate) present from 1-82. Besides this repeat, the sequence has an orderly and continuous arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be	Similar to Receptor-type tyrosine-protein phosphatase delta precursor (Protein-tyrosine phosphatase delta) (R-PTP-delta), Partial sequence\n
GLEAN3_07508	SPU_007508		possibly an SNF-1 like serine threonine kinase\n
GLEAN3_14978	SPU_014978		Similar to VEGF.\n
GLEAN3_27566	SPU_027566		similar to protein tyrosine phosphatase, receptor type, D isoform 2 precursor\n
GLEAN3_20281	SPU_020281		Similar to Tyrosine-protein phosphatase, non-receptor type 1/2 (Protein-tyrosine phosphatase 1B) (PTP-1B), partial \n
GLEAN3_18356	SPU_018356		has 1 cub domain and 25 HYR domains each as a distinct exon.  Low to no expression in embryos.  Likely to be a complete gene.  Similar to Sp-hyalin1 due to homology of hyalin repeats.  \n
GLEAN3_19751	SPU_019751		One of 5. Non-identical duplicate of GLEAN3_00442, 08085, 23876. Identical to and completely internal to GLEAN3_23875\n
GLEAN3_25766	SPU_025766		Similar to Receptor-type tyrosine-protein phosphatase mu precursor.  Duplicates\n
GLEAN3_16500	SPU_016500		blastp shows 50% alignment to actin domain not seen in other urchin hsp70s\n
GLEAN3_24947	SPU_024947		3' partial  \nGLEAN3_24946 belongs to 5' end of SpWntA\n
GLEAN3_24669	SPU_024669		3' partial \nGLEAN3_23065 is an identical duplicated fragment \nGLEAN3_23463 belongs to 5' end of SpWnt4  \n \nReference: \nFerkowicz,M.J., Stander,M.C. and Raff,R.A. \nPhylogenetic relationships and developmental expression of threesea urchin Wnt genes \nMol. Biol. Evol. 15 (7), 809-819 (1998)\n
GLEAN3_23065	SPU_023065		3' partial \nGLEAN3_24669 is an identical duplicated fragment \nGLEAN3_23463 belongs to 5' end of SpWnt4  \n \nReference: \nFerkowicz,M.J., Stander,M.C. and Raff,R.A. \nPhylogenetic relationships and developmental expression of threesea urchin Wnt genes \nMol. Biol. Evol. 15 (7), 809-819 (1998)\n
GLEAN3_05686	SPU_005686		Incomplete KH-domain in the Nter end of glean3_05686 predition. \n3 additional exons found in scaffold20583. \nOnly 2 KH domains are present on the scaffold, PCBPs family ususally contains 3. \n
GLEAN3_21590	SPU_021590		Bucentaur contains a LINE repeat sequence in some species\n
GLEAN3_23298	SPU_023298		Partial sequence\n
Sp-Tlr001	SPU_030092		Partial Toll-like receptor. The nucleotids encoding CT, TM, TIR have 96% identity to another Sp-Tlr gene(08963). This gene model occupies entire sequence of a short scaffold. \n
Sp-Tlr002	SPU_030093		Partial Toll-like receptor. The nucleotids encoding CT, TM and TIR have 98% identity to another Sp-Tlr gene (24205). This gene model is located at the end of a short scaffold. \n
GLEAN3_23993	SPU_023993		#\nDuplicate prediction for GLEAN3_06932\n
Sp-Tlr210	SPU_030094		Partial Toll-like receptor. The nucleotids encoding CT, TM and partial TIR have 98% identity to another Sp-Tlr gene(21936). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_23091	SPU_023091		Partial sequence of a prickle protein. The sequence presents homology with the GLEAN3_23090 but is not identical.\n
GLEAN3_05447	SPU_005447		there's an internal duplication in the predicted protein, which is most likely the assembly problem\n
GLEAN3_01586	SPU_001586		probably missing part of carboxy end\n
Sp-Tlr183	SPU_030095		Partial Toll-like receptor. The nucleotids encoding CT, TM and partial TIR have 95% identity to another Sp-Tlr gene (19834). This gene model is located at the end of a short scaffold. \n
GLEAN3_14864	SPU_014864	According to the BLAST results in comparison with the excel data, it appears that this is the best match for this particular GLEAN model. When reviewing the excel data, it was apparent that there were several internal repeats present that occurred between 2358-2689. The scaffold was also truncated at 3325.  There was some Est support available from GBrowse V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5 (excluding an outlier value of 43.)	#\nthis hsp has a speract domain on the carboxy terminus. assigned to hsp70(3) family along with other hsps containing hsp70 domain and some gamete domain. needs additional verification.\n
GLEAN3_18199	SPU_018199		Except the first 50 aa, this sequence is also contained in GLEAN3_05021 however with further interspersed sequences. \n
GLEAN3_04983	SPU_004983		5 prime and 3 prime wrongly predicted \n5 prime is on Scaffold80207, no Glean model, my own genscan analysis identifies a 169_aa gene (exon) (pred. 8 on Scaffold80207): \n 8.00 Prom +  96623  96662   40                              -5.75 \n 8.01 Init +  96914  97145  232  1  1   71  111   260 0.725  25.07 \n 8.02 Intr +  98822  98985  164  0  2   28   25   110 0.308  -2.53 \n 8.03 Term + 101288 101401  114  1  0   79   42   218 0.969  13.89 \n 8.04 PlyA + 101438 101443    6                               1.05 \n \n>Scaffold80207|GENSCAN_predicted_peptide_8|169_aa \nMYRAVIYTIFVGLVCLDSVVEYGVEARRNGRKRNRNPGAGDVLSASGGDVVKVRPTPRRP \nQIPLKAEVQPPHSRGVPGVQNWAQCQRLVVQLQVDAEAMRNSSNLSRQKYHFVEINLIRK \nTYGTEQGDNHLVIIYFIVLSRLIIESIRFDDRMRSNNAERCDEQCRAGR \n \n>Scaffold80207|GENSCAN_predicted_CDS_8|510_bp \natgtaccgtgcagtaatttacaccatcttcgtgggcctggtgtgcctggacagcgtggtt \ngagtacggagtcgaagctcgcaggaatggaagaaagaggaacaggaatcctggagcaggg \ngatgttttatctgcatccggtggtgatgttgtcaaggtgagaccgacaccaagaaggcct \ncagattccactcaaagccgaggtacagcccccacattcaagaggtgttccaggggtgcaa \naattgggctcaatgtcaacgactggtagtgcaattacaagttgacgccgaggctatgcgt \naattcgagcaatttgtcgcgtcaaaaatatcactttgtcgaaataaatttgataaggaaa \nacttatggtaccgagcaaggggataaccacttagtgataatctactttattgtcctttca \ncgactaatcatcgagtctatacgatttgatgaccgaatgcggtcgaacaatgccgagcgg \ntgtgacgaacaatgccgagcgggccgctaa \n \n \n
GLEAN3_05021	SPU_005021		#\nThis sequence contains most of the GLEAN3_18199 \n
Sp-TlrP41	SPU_030096		Partial Toll-like receptor. The nucleotides encoding LRR(9-19), CT, TM and partial TIR have 87% identity to another Sp-Tlr gene(GLEAN3_07850). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP42	SPU_030097		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, LRR(3-9) have 97% identity to another Sp-Tlr gene(11537). This gene model at the end of a short scaffold. \n
Sp-TlrP43	SPU_030098		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(11-22), CT, TM have 87% identity to another Sp-Tlr gene(07850). This gene model at the end of a scaffold. \n
GLEAN3_22562	SPU_022562		Missing one (or more) exons at the beginning.\n
Sp-TlrP44	SPU_030099		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(0-2) have 87% identity to another Sp-Tlr gene(15066). This gene model at the end of a scaffold. \n
Sp-TlrP45	SPU_030100		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR(5-9) have 84% identity to another Sp-Tlr gene(25312). This gene model at the end of a short scaffold. \n
Sp-TlrP46	SPU_030101		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (0-10) have 87% identity to another Sp-Tlr gene(15066). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP47	SPU_030102		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (1-6) have 89% identity to another Sp-Tlr gene(18519). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_20306	SPU_020306		May have one exon too many at the 3'-end.\n
GLEAN3_13377	SPU_013377		Using HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
Sp-TlrP48	SPU_030103		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (6-10) have 87% identity to another Sp-Tlr gene(11541). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_13378	SPU_013378		#\nUsing HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_18632	SPU_018632		Using HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
Sp-Tlr114	SPU_030104		This gene model could be caused by assembly error. The nucleotids encoding LRR (9-17) have more than 99.5% identity to another Sp-Tlr gene(21936). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP49	SPU_030105		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (5-9), CT, TM have 91% identity to another Sp-Tlr gene(15066). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_24263	SPU_024263		#\nUsing HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
Sp-TlrP50	SPU_030106		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(5-15) have 98% identity to another Sp-Tlr gene(05950). This gene model is located at the end of a contig.\n
Sp-TlrP51	SPU_030107		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (12-20) have 86% identity to another Sp-Tlr gene(11537). This gene model is located at the end of a scaffold.\n
GLEAN3_14670	SPU_014670		The GLEAN3_14670 prediction apparently missed exons 5-18, present in other gene models, coding for the highly conserved catalytic domain of synaptojanin.\n
Sp-TlrP52	SPU_030108		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (5-12) have 93% identity to another Sp-Tlr gene(00199). This gene model is located at the end of a scaffold.. \n
Sp-TlrP53	SPU_030109		Partial Toll-like receptor. The nucleotides encoding SP, NT, LRR(16-24), CT, TM and partial TIR have 90% identity to another Sp-Tlr gene(06164). This gene model is located at the end of a scaffold. \n
Sp-TlrP54	SPU_030110		#\nPartial Toll-like receptor. The nucleotides encoding CT, TM and partial TIR have 93% identity to another Sp-Tlr gene (13536). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP55	SPU_030111		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR (3) have 87% identity to another Sp-Tlr gene(20741). This gene model is located at the end of a scaffold. And it may represent a pseudogene or contain stop codons. \n
Sp-TlrP56	SPU_030112		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR (3) have 87% identity to another Sp-Tlr gene(24960). This gene model is located at the end of a scaffold. \n
Sp-TlrP57	SPU_030113		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding  LRR (8-16), CT, TM have 88% identity to another Sp-Tlr gene(16536). This gene model is located at the end of a scaffold. \n
Sp-TlrP58	SPU_030114		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding  SP, NT, LRR (13-22), CT, TM have 92% identity to another Sp-Tlr gene(20741). This gene model is located at the end of a scaffold. \n
GLEAN3_08658	SPU_008658		Pfam00194 match.  Partial gene.  \n \nTranscriptome data indicate that it is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_13459	SPU_013459		Pfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_25722	SPU_025722		Pfam00194 match.   \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_13458	SPU_013458		Partial gene. Pfam00194 match.  \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_10471	SPU_010471		Pfam00194 match. \n \nPretty strong similarity to PMC EST (accession no.DN577792). \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_00337	SPU_000337		SigPep-SRCR(4).  Possibly incomplete.  \n
GLEAN3_00492	SPU_000492		SRCR(4) + 1 partial.  Gene probably partial.\n
GLEAN3_28422	SPU_028422	For this particular GLEAN model there was no Cds information available from Baylor annotations or SpBase. However, there was mRNA information from Spbase. When reviewing the excel data it appears that the sequence is distributed onto 2 different scaffolds based on the sequence starting on subject gb|DS012727| and continuing on gb|DS003972|. \nAdditional gene information from Baylor annotations (comments):\nThere wasn't a good checkbox for the problem here.  This looks to be an assembly error, where 2 contigs were inappropriately joined, cramming together to unrelated proteins into one model.  The first several exons are a copy of the more properly assembled glean3_XXXXX.  Then there is a short repeated region (unmerged alleles), and finally the gene in question.	There wasn't a good checkbox for the problem here.  This looks to be an assembly error, where 2 contigs were inappropriately joined, cramming together to unrelated proteins into one model.  The first several exons are a copy of the more properly assembled glean3_XXXXX.  Then there is a short repeated region (unmerged alleles), and finally the gene in question.\n
GLEAN3_07587	SPU_007587		not full length\n
GLEAN3_24316	SPU_024316		Not complete\n
Sp-TlrP59	SPU_030115		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding  SP, NT, LRR (13-21), CT, TM have 88% identity to another Sp-Tlr gene(20741). This gene model is located at the end of a scaffold. \n
Sp-TlrP60	SPU_030116		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding  LRR (2), CT, TM have 96% identity to another Sp-Tlr gene(08278). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP61	SPU_030117		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, LRR (5) have 98% identity to another Sp-Tlr gene(24208). This gene model occupies entire sequence of a short scaffold and may represent pseudogene or cotain a sequence error. \n
Sp-TlrP62	SPU_030118		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR (0-7) have 89% identity to another Sp-Tlr gene(14352). This gene model is located at the end of a contig. \n
Sp-TlrP63	SPU_030119		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (5-6) have 89% identity to another Sp-Tlr gene(03419). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP64	SPU_030120		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding NT, LRR (4-7) have 89% identity to another Sp-Tlr gene(09435). This gene model is at the end of a scaffold. \n
Sp-TlrP65	SPU_030121		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR (5) have 87% identity to another Sp-Tlr gene(14352). This gene model is at the end of a short scaffold. \n
Sp-TlrP66	SPU_030122		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (5), CT, TM have 89% identity to another Sp-Tlr gene(03419). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP67	SPU_030123		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR (5-6), CT, TM have 94% identity to another Sp-Tlr gene(09435). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP68	SPU_030124		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(8-14) have 90% identity to another Sp-Tlr gene(14352). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_25548	SPU_025548		C terminus is missing\n
Sp-TlrP69	SPU_030125		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding NT, LRR(4-9) have 89% identity to another Sp-Tlr gene(14352). This gene model occupies entire sequence of a short scaffold. \n
Sp-TlrP70	SPU_030126		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(12-21) have 90% identity to another Sp-Tlr gene(14352). This gene model is located at the end of a contig and may represent a pseudogene or contain sequence error. \n
Sp-TlrP71	SPU_030127		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR(6-7), CT, TM, TIR(partial) have 91% identity to another Sp-Tlr gene(09435). This gene model occupies entire sequence of a short scaffold and may represent a pseudogene or contain sequence error. \n
Sp-TlrP72	SPU_030128		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(14-23) have 87% identity to another Sp-Tlr gene(09435). This gene model is located at the end of a short scaffold. \n
Sp-TlrP73	SPU_030129		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR(5), CT, TM have 91% identity to another Sp-Tlr gene(03419). This gene model is located at the end of a short scaffold.\n
Sp-TlrP74	SPU_030130		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR(4-6), CT, TM have 92% identity to another Sp-Tlr gene(21225). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_02306	SPU_002306		#\nEGF/CCP/EGF/ZP\n
GLEAN3_03171	SPU_003171		LY4/CUB/SR/ZP\n
GLEAN3_04061	SPU_004061		NIDO/EGF/ZP\n
GLEAN3_04611	SPU_004611		LY2/MAM/ZP\n
GLEAN3_05270	SPU_005270		ZP/CCP3\n
GLEAN3_13342	SPU_013342		SR/ZP\n
GLEAN3_14213	SPU_014213		EGFCa3/ZP\n
GLEAN3_16300	SPU_016300		CCP14/ZP\n
GLEAN3_16840	SPU_016840		EGF2/ZP\n
GLEAN3_18648	SPU_018648		CUB4/ZP\n
GLEAN3_22873	SPU_022873		EGFCa/ZP\n
GLEAN3_22889	SPU_022889		EGFCa/ZP\n
GLEAN3_24217	SPU_024217		CCP/CLECT/CCP2/ZP\n
GLEAN3_26587	SPU_026587		EGF/ZP\n
GLEAN3_27535	SPU_027535		LY/ZP\n
GLEAN3_28276	SPU_028276		CUB6/ZP\n
GLEAN3_28843	SPU_028843		CCP11/ZP\n
GLEAN3_28844	SPU_028844		CCP3/ZP\n
Sp-TlrP75	SPU_030131		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding LRR(5-7), CT, TM have 91% identity to another Sp-Tlr gene(21225). This gene model occupies entire sequence of a short scaffold. \n
GLEAN3_20943	SPU_020943		This sequence is identical to the one deposited in GenBank as NP_999657 (derived from RefSeq NM_214492), except for an 18-amino acid (54 nucleotide) gap between the predicted initiator methionine (i.e. residue 1) and the second predicted amino acid residue in the GLEAN sequence.  This sequence (CCTCGAGAAATTATTACCTTACAGCTAGGACAATGTGGGAACCAGATTGGGATG in the RefSeq entry) is not detected in the Baylor DNA sequence dataset by BLASTP or TBLASTN. \n \nAnnotation entered by Bob Obar (robar@scientist.com).\n
Sp-TlrP76	SPU_030132		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(15-23) have 90% identity to another Sp-Tlr gene(14548). This gene model is located at the end of a short scaffold. \n
Sp-TlrP77	SPU_030133		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(15-23) have 91% identity to another Sp-Tlr gene(14548). This gene model is located at the end of a short scaffold. \n
Sp-TlrP78	SPU_030134		Partial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, NT, LRR(4-6) have 90% identity to another Sp-Tlr gene(27162). This gene model is located at the end of a short scaffold. \n
Sp-TlrP79	SPU_030135		#\nPartial Toll-like receptor does not have a TIR domain. The nucleotides encoding SP, LRR(2-4) have 93% identity to another Sp-Tlr gene(26274). This gene model is located at the end of a short scaffold. \n
GLEAN3_00323	SPU_000323		 fragment\n
GLEAN3_00359	SPU_000359		 fragment\n
GLEAN3_00444	SPU_000444		 fragment\n
GLEAN3_00472	SPU_000472		 fragment\n
GLEAN3_00591	SPU_000591		 fragment\n
GLEAN3_00592	SPU_000592		 fragment\n
GLEAN3_00769	SPU_000769		 fragment\n
GLEAN3_00777	SPU_000777		 fragment\n
GLEAN3_00791	SPU_000791		 fragment\n
GLEAN3_00820	SPU_000820		 fragment\n
GLEAN3_00859	SPU_000859		 fragment\n
GLEAN3_00901	SPU_000901		 fragment\n
GLEAN3_01023	SPU_001023		 fragment\n
GLEAN3_01036	SPU_001036		 fragment\n
GLEAN3_01064	SPU_001064		 fragment\n
GLEAN3_01286	SPU_001286		 fragment\n
GLEAN3_01350	SPU_001350		 fragment\n
GLEAN3_01364	SPU_001364		 fragment\n
GLEAN3_01417	SPU_001417		 fragment\n
GLEAN3_01453	SPU_001453		 fragment\n
GLEAN3_01766	SPU_001766		 fragment, should join with GLEAN3_01767, still incomplete gene\n
GLEAN3_01767	SPU_001767		 fragment, should join with GLEAN3_01766, still incomplete gene\n
GLEAN3_01770	SPU_001770		 fragment\n
GLEAN3_01828	SPU_001828		 partial\n
GLEAN3_01895	SPU_001895	After reviewing the data and performing a BLAST search, it appears that this is the best results for this particular GLEAN model. When comparing the BLAST results to the excel data, it was evident that the sequence had poor overall coverage as well as a low bit score and a high e-value. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 5. 	 fragment\n
GLEAN3_01910	SPU_001910		 fragment\n
GLEAN3_01973	SPU_001973		 fragment\n
GLEAN3_02544	SPU_002544		 fragment\n
GLEAN3_02741	SPU_002741		 partial\n
GLEAN3_02804	SPU_002804		 fragment\n
GLEAN3_02907	SPU_002907		 fragment\n
GLEAN3_02910	SPU_002910		 fragment\n
GLEAN3_02996	SPU_002996		 fragment\n
GLEAN3_03311	SPU_003311		 fragment\n
GLEAN3_03331	SPU_003331		 fragment\n
GLEAN3_03374	SPU_003374		 fragment, should join GLEAN3_03375, still incomplete gene\n
GLEAN3_03375	SPU_003375		 fragment, should join GLEAN3_03374, still incomplete genes\n
GLEAN3_03431	SPU_003431		 fragment\n
GLEAN3_03443	SPU_003443		 fragment\n
GLEAN3_03465	SPU_003465		 fragment\n
GLEAN3_03510	SPU_003510		 fragment\n
GLEAN3_03623	SPU_003623		 fragment\n
GLEAN3_03625	SPU_003625		 partial; missing C-terminus\n
GLEAN3_03631	SPU_003631		 fragment\n
GLEAN3_03663	SPU_003663		 fragment\n
GLEAN3_03729	SPU_003729		 fragment\n
GLEAN3_03784	SPU_003784		 fragment\n
GLEAN3_03886	SPU_003886		 partial, missing N- and C-terminus\n
GLEAN3_03977	SPU_003977		 fragment\n
GLEAN3_03983	SPU_003983		 fragment\n
GLEAN3_04141	SPU_004141		 fragment\n
GLEAN3_04142	SPU_004142		 fragment \nidentical to GLEAN3_00281\n
GLEAN3_04226	SPU_004226		 fragment\n
GLEAN3_04450	SPU_004450		 fragment\n
GLEAN3_04499	SPU_004499		 fragment\n
GLEAN3_04567	SPU_004567		 fragment\n
GLEAN3_04621	SPU_004621		 fragment\n
GLEAN3_04633	SPU_004633		 fragment\n
GLEAN3_04708	SPU_004708		 fragment\n
GLEAN3_04725	SPU_004725		 extra C-terminus\n
GLEAN3_04743	SPU_004743		 fragment\n
GLEAN3_04815	SPU_004815		 insertions\n
GLEAN3_04816	SPU_004816		 insertions\n
GLEAN3_04911	SPU_004911		 fragment\n
GLEAN3_04930	SPU_004930		 fragment\n
GLEAN3_04935	SPU_004935		 fragment\n
GLEAN3_05047	SPU_005047		 fragment\n
GLEAN3_05183	SPU_005183		 partial, missing C-terminus region\n
GLEAN3_05249	SPU_005249		 fragment\n
GLEAN3_05344	SPU_005344		 fragment\n
GLEAN3_05357	SPU_005357		 fragment\n
GLEAN3_05370	SPU_005370		 fragment, has extra residues on C-terminus\n
GLEAN3_05625	SPU_005625		 fragment\n
GLEAN3_05717	SPU_005717		 fragment\n
GLEAN3_05754	SPU_005754		 fragment, missing stretch in middle\n
GLEAN3_05803	SPU_005803		 small fragment\n
GLEAN3_05995	SPU_005995		 fragment\n
GLEAN3_06257	SPU_006257		 fragment\n
GLEAN3_06323	SPU_006323		 small fragment\n
GLEAN3_06521	SPU_006521		 fragment\n
GLEAN3_06556	SPU_006556		 fragment\n
GLEAN3_06698	SPU_006698		 small fragment\n
GLEAN3_06779	SPU_006779		 fragment\n
GLEAN3_06870	SPU_006870		 fragment\n
GLEAN3_06887	SPU_006887		 small fragment\n
GLEAN3_07045	SPU_007045		 fragment\n
GLEAN3_07321	SPU_007321		 fragment, extra N-terminus residues\n
GLEAN3_07657	SPU_007657		 fragment\n
GLEAN3_07739	SPU_007739		 fragment\n
GLEAN3_07748	SPU_007748		 fragment\n
GLEAN3_07853	SPU_007853		 fragment\n
GLEAN3_08032	SPU_008032		 fragment\n
GLEAN3_08069	SPU_008069		 fragment\n
GLEAN3_08130	SPU_008130		 fragment\n
GLEAN3_08288	SPU_008288		 fragment\n
GLEAN3_08308	SPU_008308		 fragment\n
GLEAN3_08320	SPU_008320		 fragment\n
GLEAN3_08372	SPU_008372		 fragment\n
GLEAN3_08459	SPU_008459		 fragment\n
GLEAN3_08685	SPU_008685		 fragment\n
GLEAN3_08734	SPU_008734		 fragment\n
GLEAN3_08748	SPU_008748		 fragment\n
GLEAN3_08759	SPU_008759		 fragment\n
GLEAN3_09056	SPU_009056		 fragment, extra mismiatch stretch on C-terminus\n
GLEAN3_09158	SPU_009158		 fragment\n
GLEAN3_09281	SPU_009281		 small fragment\n
GLEAN3_09625	SPU_009625		 small fragment\n
GLEAN3_09697	SPU_009697		 fragment\n
GLEAN3_09733	SPU_009733		 fragment\n
GLEAN3_09844	SPU_009844		 fragment\n
GLEAN3_10234	SPU_010234		 fragment\n
GLEAN3_10289	SPU_010289		 fragment\n
GLEAN3_11052	SPU_011052		 fragment\n
GLEAN3_11215	SPU_011215		 fragment\n
GLEAN3_11390	SPU_011390		 partial, missing C-terminus half\n
GLEAN3_11435	SPU_011435		 fragment\n
GLEAN3_11587	SPU_011587		 fragment\n
GLEAN3_11791	SPU_011791		 fragment\n
GLEAN3_12316	SPU_012316		 fragment\n
GLEAN3_12600	SPU_012600		 partial, missing C-terminus half\n
GLEAN3_12832	SPU_012832		 fragment\n
GLEAN3_12941	SPU_012941		 fragment\n
GLEAN3_12945	SPU_012945		 fragment\n
GLEAN3_12947	SPU_012947		 fragment\n
GLEAN3_13036	SPU_013036		 fragment\n
GLEAN3_13051	SPU_013051		 fragment\n
GLEAN3_13215	SPU_013215		 fragment\n
GLEAN3_13473	SPU_013473		 fragment\n
GLEAN3_13529	SPU_013529		 fragment\n
GLEAN3_13546	SPU_013546		 small fragment\n
GLEAN3_13637	SPU_013637		 fragment\n
GLEAN3_13678	SPU_013678		 fragment\n
GLEAN3_14043	SPU_014043		 fragment\n
GLEAN3_14512	SPU_014512		 fragment\n
GLEAN3_14648	SPU_014648		 fragment\n
GLEAN3_14856	SPU_014856		 fragment\n
GLEAN3_15031	SPU_015031		 fragment\n
GLEAN3_15060	SPU_015060		 fragment\n
GLEAN3_15061	SPU_015061		 fragment\n
GLEAN3_15095	SPU_015095		 fragment\n
GLEAN3_15100	SPU_015100		 fragment\n
GLEAN3_15101	SPU_015101		 fragment\n
GLEAN3_15194	SPU_015194		 missing N-terminus, extra C-terminus\n
GLEAN3_15476	SPU_015476		 partial, missing most of the C-terminus\n
GLEAN3_15492	SPU_015492		 fragment\n
GLEAN3_15571	SPU_015571		 fragment\n
GLEAN3_16020	SPU_016020		 fragment\n
GLEAN3_16176	SPU_016176		 partial, missing C-terminus\n
GLEAN3_16633	SPU_016633		 fragment\n
GLEAN3_16636	SPU_016636		 fragment\n
GLEAN3_16901	SPU_016901		 fragment\n
GLEAN3_16917	SPU_016917		 small fragment\n
GLEAN3_17139	SPU_017139		 fragment, unmatched residues on C-terminus\n
GLEAN3_17143	SPU_017143		 fragment\n
GLEAN3_17170	SPU_017170		 fragment\n
GLEAN3_17174	SPU_017174		 missing N- and C-terminus residues\n
GLEAN3_17608	SPU_017608		 fragment\n
GLEAN3_17696	SPU_017696		 partial, missing C-terminus half\n
GLEAN3_17703	SPU_017703		 fragment\n
GLEAN3_18027	SPU_018027		 fragment, unmatched stretch of aminoacids on N-terminus\n
GLEAN3_18092	SPU_018092		 fragment\n
GLEAN3_18766	SPU_018766		 missing some aminoacid stretches in middle\n
GLEAN3_18767	SPU_018767		 extra stretch of aminoacids in middle\n
GLEAN3_18929	SPU_018929		 fragment\n
GLEAN3_19020	SPU_019020		 fragment, should join GLEAN3_19027, still missing the N-terminus region\n
GLEAN3_19027	SPU_019027		 fragment; should join GLEAN3_19020, still missing the N-terminus region\n
GLEAN3_19298	SPU_019298		 fragment\n
GLEAN3_19402	SPU_019402		 fragment\n
GLEAN3_19780	SPU_019780		 fragment\n
GLEAN3_20014	SPU_020014	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the two scaffolds were combined, the sequence would have an orderly and continuous arrangement without any repeats or gaps present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall with all of the values being <10	 fragment\n
GLEAN3_20174	SPU_020174		 fragment\n
GLEAN3_22715	SPU_022715		 fragment\n
GLEAN3_22752	SPU_022752		 fragment\n
GLEAN3_22980	SPU_022980		 fragment\n
GLEAN3_22989	SPU_022989	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it is apparent that both scaffolds contain an orderly arrangement without any internal repeats or gaps present. If the two scaffolds were to be combined, the overall sequence would have good coverage. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values ranging >7.	 fragment\n
GLEAN3_23129	SPU_023129		 small fragment\n
GLEAN3_23227	SPU_023227		 small fragment\n
GLEAN3_23266	SPU_023266		 small fragment\n
GLEAN3_23323	SPU_023323		 fragment\n
GLEAN3_23515	SPU_023515		 fragment\n
GLEAN3_23680	SPU_023680		 fragment\n
GLEAN3_23772	SPU_023772		 small fragment\n
GLEAN3_23841	SPU_023841		 small fargment\n
GLEAN3_23849	SPU_023849		 fragment\n
GLEAN3_24227	SPU_024227		 fragment\n
GLEAN3_24229	SPU_024229		 fragment\n
GLEAN3_24313	SPU_024313		 fragment\n
GLEAN3_24396	SPU_024396		 fragment\n
GLEAN3_24430	SPU_024430		 fragment\n
GLEAN3_24541	SPU_024541		 fragment\n
GLEAN3_24557	SPU_024557		 fragment\n
GLEAN3_25087	SPU_025087		 fragment\n
GLEAN3_25224	SPU_025224		 fragment\n
GLEAN3_25596	SPU_025596		 fragment\n
GLEAN3_26372	SPU_026372		 fragment\n
GLEAN3_26682	SPU_026682		 fragment\n
GLEAN3_26700	SPU_026700		 fragment\n
GLEAN3_26839	SPU_026839		 fragment\n
GLEAN3_26988	SPU_026988		 fragment\n
GLEAN3_26994	SPU_026994		 fragment\n
GLEAN3_27071	SPU_027071		 fragment\n
GLEAN3_27495	SPU_027495		 fragment\n
GLEAN3_27591	SPU_027591		 partial, missing C-terminus half\n
GLEAN3_27901	SPU_027901		 fragment\n
GLEAN3_28013	SPU_028013		 fragment\n
GLEAN3_28083	SPU_028083		 fragment\n
GLEAN3_28129	SPU_028129		 fragment\n
GLEAN3_28237	SPU_028237		 fragment\n
GLEAN3_28307	SPU_028307		 fragment, missing stretch in middle\n
GLEAN3_28454	SPU_028454		 fragment\n
GLEAN3_28651	SPU_028651		 fragment\n
GLEAN3_28709	SPU_028709		 fragment\n
GLEAN3_04266	SPU_004266		Annotation entered by Bob Obar (robar@scientist.com).\n
GLEAN3_05500	SPU_005500		C1q-related\n
GLEAN3_06578	SPU_006578		C1q-related\n
GLEAN3_09020	SPU_009020		C1q-related\n
GLEAN3_00433	SPU_000433		Single CADH domain-nothing else-obviously a cadherin fragment\n
GLEAN3_01454	SPU_001454		21 CADH repeats but no TM or Cyto domain-probably a partial-could it be hitched up to GLEAN3_01452-CLEARLY A CADHERIN BUT CLASS UNCLEAR\n
GLEAN3_02742	SPU_002742		11 cadh REPEATS BUT NO TM OR CYTO DOMAIN-CLEARLY A CADHERIN BUT CLASS UNCLEAR\n
GLEAN3_03730	SPU_003730		4 CADH + TM AND CYTO BUT NO CAT-BD-NON-CLASSICAL CADHERIN of the vertebrate type\n
GLEAN3_04074	SPU_004074		Single CADH domain-nothing else-obviously a cadherin fragment.\n
GLEAN3_04556	SPU_004556		2 CADH domains-nothing else-obviously a cadherin fragment.\n
GLEAN3_05228	SPU_005228		Single CADH domain followed by a LAMG domain-looks like a fly cadherin\n
GLEAN3_08299	SPU_008299		5 CADH domains and a TM domain-probable cadherin fragment of vertebrate type\n
GLEAN3_08380	SPU_008380		Single CADH domain and a TM domain-probably a cadherin fragment\n
GLEAN3_09073	SPU_009073		12 CADH domains and a set of EGF and LAMG repeats before a TM domain.  Cytoplasmic domain present-but no catenin-binding domain-probable non-classical cadherin of the fly type.\n
GLEAN3_10840	SPU_010840		13 CADH domains and EGF/LAMG/EGF_ probable cadherin fragment of the fly type.\n
GLEAN3_11375	SPU_011375		8 CADH domains-no TM-probable cadherin fragment.\n
GLEAN3_13323	SPU_013323		8 CADH domains-no TM-probable cadherin fragment.\n
GLEAN3_13476	SPU_013476	When a BLAST search was done on this particular GLEAN model, an error message was received indicating that he server encountered an internal error or misconfiguration and was unable to complete the request. When examining the excel data, it was evident that there were numerous repeats throughout the sequence which was probably the reason for the error message received.  The transcriptome intensity scores had an unusual distribution with three clusters of information dispersed along the graph. The scores did in fact appear to be strong, with most of the values being greater than 10. There was also some Est support available from GBrowse assembly V0.5. 	5 CADH repeats-no TM-also N-terminal ANK repeats-possible fragment/possible concatenation\n
GLEAN3_14606	SPU_014606		6 CADH repeats and TM-cyto domain present but w/o catenin-binding domain-probable non-classical cadherin of the vertebrate type\n
GLEAN3_15210	SPU_015210		6 CADH domains-no TM-probable cadherin fragment\n
GLEAN3_16980	SPU_016980		"20 CADH repeats, single EGF and TM-non-classical cadherin of the fly type"\n
GLEAN3_17039	SPU_017039		"3 CADH domains, 2 EGF and LAMG-no TM or cytoplasmic domain-probable cadherin fragment of the fly type"\n
GLEAN3_19394	SPU_019394		11 CADH repeats and TM-cyto domain present but w/o catenin-binding domain-probable non-classical cadherin\n
GLEAN3_19783	SPU_019783		"partial classical cadherin of the fly type, no CADH repeats but has classical cadherin cyto domain"\n
GLEAN3_21086	SPU_021086	After reviewing the data from the excel file and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The bit score and e-value for v2.1_scaffold28739 were better than v2.1_scaffold20830 and the percentage of identities (base pair matching) was better as well. This resulted in v2.1_scaffold28739 coming up as the better match according to BLAST even though v2.1_scaffold20830 has more sequence coverage. If the two scaffolds were combined, it appears that they would have an orderly continuous arrangement. There was no Est information available on GBrowse V0.5 and the transcriptome intensity scores appear to be somewhat strong with most of the values being greater than 5. 	2 CADH domains and TM-short cytoplasmic domain-probable partial cadherin fragment of vertebrate type\n
GLEAN3_23005	SPU_023005		Single CADH domain-nothing else-obviously a cadherin fragment.\n
GLEAN3_25622	SPU_025622		7 CADHs plus TM and cytoplasmic domain but no catenin-binding site-probable non-classical cadherin of the vertebrate type-previously reported as homolog of protocadherin 9\n
GLEAN3_26595	SPU_026595		Single CADH domain-nothing else-obviously a cadherin fragment\n
GLEAN3_27133	SPU_027133		"LOOKS A BIT LIKE FLAMINGO (SEVERAL TM DOMAINS-BUT NO GPS DOMAIN), FLAMINGO/CELSR subfamily"\n
GLEAN3_27356	SPU_027356		25 CADH PLUS EGF/LAMG/EGF AND TM-cyto domain present but w/o catenin-binding domain-probable non-classical cadherin of the fly type\n
GLEAN3_28328	SPU_028328		3 CADH domains-nothing else-obviously a cadherin fragment.\n
GLEAN3_00958	SPU_000958		collagen fragment\n
GLEAN3_04531	SPU_004531		collagen fragment\n
GLEAN3_05187	SPU_005187		collagen fragment\n
GLEAN3_06067	SPU_006067		collagen fragment\n
GLEAN3_07582	SPU_007582		collagen fragment\n
GLEAN3_11736	SPU_011736		collagen fragment\n
GLEAN3_12707	SPU_012707		collagen fragment\n
GLEAN3_13354	SPU_013354		collagen fragment\n
GLEAN3_14619	SPU_014619		adjacent fragment of a fibrillar collagen\n
GLEAN3_17571	SPU_017571		large fragment with a few collagen repeats-could be an N-terminal pro piece\n
GLEAN3_21235	SPU_021235		NOVEL COLLAGEN ARCHITECTURE-FUSION??\n
GLEAN3_22116	SPU_022116		possible relative of human col24a1\n
GLEAN3_22882	SPU_022882		collagen fragment\n
GLEAN3_22896	SPU_022896		collagen fragment\n
GLEAN3_22936	SPU_022936		possible relative of col3a1\n
GLEAN3_23283	SPU_023283		possible relative of col9a3\n
GLEAN3_25369	SPU_025369		collagen fragment\n
GLEAN3_26786	SPU_026786		collagen fragment\n
GLEAN3_27250	SPU_027250		collagen fragment\n
GLEAN3_13557	SPU_013557		"C-terminal fragment of a fibrillar collagen, possible relative of col27a1"\n
GLEAN3_14618	SPU_014618		C-terminal fragment of a fibrillar collagen-adjacent gene 14619 contains another fragment\n
GLEAN3_17791	SPU_017791		N-terminal fragment of a fibrillar collagen\n
GLEAN3_28613	SPU_028613		C terminus of fibrillar collagen\n
GLEAN3_05167	SPU_005167		fibrillar collagen of the I/II/III subclass-partial_lacks C-terminus\n
GLEAN3_26008	SPU_026008		fibrillar collagen of the I/II/III subclass -appears complete\n
GLEAN3_26009	SPU_026009		fibrillar collagen of the I/II/III subclass -appears complete\n
GLEAN3_09076	SPU_009076		Fibrillar collagen of the V/XI type-PROBABLY COMPLETE\n
GLEAN3_11016	SPU_011016		olfactomedin-related collagen\n
GLEAN3_03768	SPU_003768		type IV collagen-could be complete\n
GLEAN3_15708	SPU_015708		C-terminal fragment of a type IV collagen\n
GLEAN3_00142	SPU_000142		collagen XV/XVIII-partial-lacks N-terminal TSPN/LamG domain\n
GLEAN3_00691	SPU_000691		NOVEL ARCHITECTURE - CLECT and TSP1 domains alternating plus FA58C and FTP-"LINK" at C-terminus \n \n- a bit similar to GLEAN3_19437 and GLEAN3_05426\n
GLEAN3_05426	SPU_005426		NOVEL ARCHITECTURE - CLECT and TSP1 domains alternating plus EGFCA repeats at N-terminus and FA58C and "LINK"/PANAP at C-terminus \n \n- a bit similar to GLEAN3_19437 and GLEAN3_00691\n
GLEAN3_01768	SPU_001768		"PUTATIVE LAM G CHAIN; has LamNT, LamB"\n
GLEAN3_06118	SPU_006118		"partial LAM A OR G CHAIN, has LamB"\n
GLEAN3_06558	SPU_006558		"Looks complete-has lamB domain-so most like lam g chain has LamNT, LamB"\n
GLEAN3_07555	SPU_007555		"partial LAM A OR G CHAIN, has LamB"\n
GLEAN3_09846	SPU_009846		"partial LAM A OR G CHAIN, has LamB"\n
GLEAN3_14257	SPU_014257		"putative laminin fragment, LamB only, best blast hits are proteoglycan"\n
GLEAN3_15482	SPU_015482		Looks complete-has NO lamB domain-so most like lam b chain has LamNT\n
GLEAN3_20192	SPU_020192		"PUTATIVE LAM A CHAIN; has LamNT, LamB, missing C-terminus"\n
GLEAN3_22929	SPU_022929		"LAM A1/2 CHAIN; has LamNT, LamB, LamG, missing C-terminus"\n
GLEAN3_26039	SPU_026039		"LAM A3/5 CHAIN; has LamNT, LamB, LamG, looks complete  "\n
GLEAN3_27389	SPU_027389		"putative laminin fragment-lam a or g, has LamB"\n
GLEAN3_13481	SPU_013481	From the BLAST results as well as the excel data, it was evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that both scaffolds contained an orderly and continuous arrangement without any gaps or repeats present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5.	has LamNT-looks incomplete-could be a laminin or a netrin\n
GLEAN3_25057	SPU_025057		has LamNT-looks incomplete-could be a laminin or a netrin\n
GLEAN3_26322	SPU_026322		has LamNT-looks incomplete-could be a laminin or a netrin\n
GLEAN3_01769	SPU_001769		has LamNT-Looks incomplete-could be a laminin or a netrin\n
GLEAN3_28368	SPU_028368		has LamNT-Looks incomplete-could be a laminin or a netrin\n
GLEAN3_06557	SPU_006557		has LamNT-looks incomplete-could be a laminin or a netrin\n
GLEAN3_00176	SPU_000176		novel architecture-no known proteins with this composition/organization\n
GLEAN3_01328	SPU_001328		"has a reeler domain and an EGF, may belong with reelin genes"\n
GLEAN3_02367	SPU_002367		"just a reeler domain, may belong with reelin genes"\n
GLEAN3_07070	SPU_007070		novel membrane protein with reeler domain and multiple EGF repeats-no known proteins with this composition/organization\n
GLEAN3_11038	SPU_011038		"just a reeler domain, may belong with reelin genes"\n
GLEAN3_12092	SPU_012092		"just a reeler domain, may belong with reelin genes"\n
GLEAN3_13071	SPU_013071		"just a reeler domain, may belong with reelin genes"\n
GLEAN3_14572	SPU_014572		Enormous protein with multiple EGF and EGFCA domains-N-terminal Reeler and a CUB domain near C-terminus. No known proteins with this composition/organization\n
GLEAN3_15603	SPU_015603		"has a reeler domain and two EGFs, may belong with reelin genes"\n
GLEAN3_15604	SPU_015604		"has a reeler domain and an EGF, may belong with reelin genes"\n
GLEAN3_16091	SPU_016091		reeler domain plus DoH catecholamine-binding domain\n
GLEAN3_16188	SPU_016188		"has a reeler domain and an EGF, may belong with reelin genes"\n
GLEAN3_16612	SPU_016612		"just a reeler domain, may belong with reelin genes"\n
GLEAN3_24087	SPU_024087		has a reeler domain but also a RING domain-probably not really related to reeler-novel domain combination\n
GLEAN3_26165	SPU_026165		has a reeler domain but also a large block of repetitive simple sequencs and two CCP domains-probably not really related to reeler\n
GLEAN3_26222	SPU_026222		"just a reeler domain, may belong with reelin genes-note that adjacent gene very similar"\n
GLEAN3_26223	SPU_026223		"just a reeler domain, may belong with reelin genes-not that adjacent gene very similar"\n
GLEAN3_26550	SPU_026550		"enormous protein with reeler domain, CUB domain and multiple internal EGF repeats-looks like assembly problem"\n
GLEAN3_13829	SPU_013829		large protein with few defined domains-2CCP at one end-SEA/EGF at the other\n
GLEAN3_16171	SPU_016171		"SEA, LDLa and CUB domains-novel architecture"\n
GLEAN3_21919	SPU_021919		CCP and SEA domains- novel architecture\n
GLEAN3_22753	SPU_022753		CCP and SEA domains- novel architecture\n
GLEAN3_27729	SPU_027729		SEA and EGF\n
GLEAN3_19795	SPU_019795		two sea domains-there is a human protein-interphotoreceptor proteoglycan-with a similar structure\n
GLEAN3_01785	SPU_001785		hyalin protein with SEA domain\n
GLEAN3_02355	SPU_002355		very large protein with many hyalin repeats and SEA domain and EGF repeats near the C-terminus \n \nexpressed during embryonic development\n
GLEAN3_03635	SPU_003635		"hyalin repeat protein with some other domains interspersed-SEA, VWD, CUB, LDLa"\n
GLEAN3_04945	SPU_004945		SEA/HYR/CLECT - short hyalin protein\n
GLEAN3_09594	SPU_009594		has Spond_N and TSP1 domains-realted to vertebrate spondins\n
GLEAN3_20379	SPU_020379		has Spond_N and TSP1 domains-realted to vertebrate spondins\n
GLEAN3_13393	SPU_013393		Annotation entered by Bob Obar (robar@scientist.com). \nThe epsilon-tubulin protein family is not yet a coherent one, and it is impossible at this time to determine whether the lack of similarity observed between the amino-terminal ~100 amino acids of this sequence and the corresponding segments of other epsilon-tubulin database entries is due to divergence or error.\n
GLEAN3_03119	SPU_003119		See putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137536744-28277-70510656271.BLASTQ4\n
GLEAN3_02663	SPU_002663		Annotation entered by Bob Obar (robar@scientist.com). \nThis GLEAN originally represented an amino-terminal segment of an epsilon-tubulin encoded by Scaffold498.  The gene model was later manually extended using sequence from Scaffoldi3903, which appears to encode the entire epsilon-tubulin polypeptide.\n
GLEAN3_12141	SPU_012141		Annotation entered by Bob Obar (robar@scientist.com). \nThis GLEAN represents an amino-terminal segment of an epsilon-tubulin.  Except for 6 codons near the amino terminus of each, it is identical to GLEAN3_02663.\n
GLEAN3_00178	SPU_000178		See putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137535187-7613-25495531689.BLASTQ4\n
GLEAN3_22346	SPU_022346		Pfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_26483	SPU_026483		Pfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_16740	SPU_016740		Pfam00194 match.  Transcriptome data indicate that it is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined. \n \n
GLEAN3_09509	SPU_009509		#\nPfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_26747	SPU_026747		Pfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_12995	SPU_012995		Pfam00194 match. Transcriptome data indicate it is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_00702	SPU_000702		Pfam00194 match. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_08894	SPU_008894		Pfam00194 match.  Transcriptome data indicate that gene is expressed in the embryo. \n \nA family of carbonic anhydrase-like proteins exists in the sea urchin. Orthologous relationships with vertebrate carbonic anhydrases and related proteins have not been carefully examined.\n
GLEAN3_06053	SPU_006053		gene model spans at least 3 glean predictions  \nGLEAN3_06053 (N-ter) \nGLEAN3_06054 (middle) located on the same scaffold but on opposite strand (assembly problem?) \nGLEAN3_24289 (end) \n \n2 glean predictions are duplicates of some exons GLEAN3_06612 (middle) and Glean3_01692 (end)\n
GLEAN3_06054	SPU_006054		cf Glean3_06053\n
GLEAN3_24289	SPU_024289		cf Glean3_06053\n
GLEAN3_16272	SPU_016272		There seems to be some extra predicted exons in the GLEAN prediction. NCBI GNOMON prediction seems to be more accurate (XP_797469) \n \nBest homology with vertebrate Alk6, but as the closely related vertebrate Alk3, doesn't seems to have its counterpart in the sea urchin, I called him Alk3-6.\n
GLEAN3_14830	SPU_014830		This model contains exons identical to GLEAN3_28742 and is either a duplication or an allele.\n
GLEAN3_04951	SPU_004951		Partial Toll-like receptor. This gene model is located at the end of a short scaffold. The nucleotide sequence has 95% identity to another Sp-Tlr gene, so it could be a member of Toll-like receptor.\n
GLEAN3_12005	SPU_012005		Possible Gene Prediction issue - it could be several concatenated proteins\n
GLEAN3_25902	SPU_025902		Possible Gene Prediction issue - it could be several concatenated proteins\n
GLEAN3_15305	SPU_015305	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. There is one sequence overlap within the first scaffold from 3-91, but if this overlap is discarded the overall sequence has an orderly arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	Unknown architecture-possible gene prediction issue\n
GLEAN3_18488	SPU_018488		highly conserved 69% identical to the human protein\n
GLEAN3_10750	SPU_010750		trypsin-like protease with SR and several FN3 repeats - novel architecture\n
GLEAN3_00406	SPU_000406		Ig domains (9) and a few FN3 - may be a fragment\n
GLEAN3_10007	SPU_010007		Ig3/FN3/Ig2 - may be a fragment\n
GLEAN3_10124	SPU_010124		Ig3/FN3-2/Ig7 - may be a fragment\n
GLEAN3_10125	SPU_010125		Ig13/FN3 - may be a fragment\n
GLEAN3_12992	SPU_012992		Igc2-9/FN3-2 - may be a fragment \ndomain structure  (9-2) is consistent with Ds-CAM (9-4-1-2) missing C-terminus. \nBlast match to DCC is fifth after other better matches and domain structure not right for DCC (4-6)- probably not DCC/neogenin but similar molecule\n
GLEAN3_25374	SPU_025374		novel architecture -SEMA and FN3\n
GLEAN3_20387	SPU_020387		ROS-RELATED -single LY and several FN3 BUT lacks TM and kinase domains\n
GLEAN3_27290	SPU_027290		FN3-TM-PPase - TM NOT PREDICTED - overlap with hits is limited\n
GLEAN3_15923	SPU_015923		FN3-TM-Ppase - overlap with hits is limited\n
GLEAN3_23943	SPU_023943		has Ig - EGF - SEVERAL FN3-GPS-7TM\n
GLEAN3_25788	SPU_025788		has  9 FN3s - 4 EGFCas - 3 more FN3-GPS-TM \n \nApart from lack of 7tm_2 domain this matches to overall pattern of LNB-7TM-GPCRs \nMay be missing C-terminus\n
GLEAN3_01609	SPU_001609		has FN3-GPS-7TM\n
GLEAN3_16363	SPU_016363		has mixture of FN3/EGFCa-GPS-7TM_2 \n \nmatches to overall pattern of LNB-7TM-GPCRs\n
GLEAN3_21832	SPU_021832		has CLECT-FN3-GPS-7TM\n
GLEAN3_09018	SPU_009018		novel architecture - LIPOXYGENASE (LH2) HOMOLOGIES FLANKING MULTIPLE FN3 REPEATS\n
GLEAN3_21340	SPU_021340		LEUCINE-RICH/FN3 PROTEIN -NOVEL ARCHITECTURE - HAS TBC DOMAIN\n
GLEAN3_00435	SPU_000435		myosin light chain kinase - structure predicted may suggest duplication or assembly problems\n
GLEAN3_19190	SPU_019190		Best matches are with the Rho-GEF kinases, Duet,  but that gene has an additional N-terminal segment - missing RhoGEF, PH and SH3 domains\n
GLEAN3_13917	SPU_013917		lots of Ig domains and a few FN3 plus a mixed-function kinase - good matches to projectin and twitchin - clearly a muscle-specifc structural kinase\n
GLEAN3_28470	SPU_028470		Kallmann syndrome 1 homolog - WAP/FN3n/LDLa\n
GLEAN3_04282	SPU_004282		Ig-FN3-2-BTB\n
GLEAN3_26161	SPU_026161		Ig/FN3/Ig/FN3 alternating\n
GLEAN3_00159	SPU_000159		hyalin protein - prediction is longer than cDNA sequences\n
GLEAN3_17178	SPU_017178		FN3-SPRY - there are a couple of classes of proteins with that combination sometimes plus other domains no good Blast hits domain structure is very similar to GL3_28216 - a good homolog of tripartite motif proteins - suspect this is one of those\n
GLEAN3_28216	SPU_028216		FN3-SPRY - best matches are to tripartite motif-contaning proteins from several species\n
GLEAN3_23539	SPU_023539		FN3-SPRY - ALSO RING and BBOX - there are several chordate proteins with that combination - probably transcription factors - matches with five best hits do not include the RING domain but the domain structure of the matching segemnt is very similar to GL3_28216 - a good homolog of tripartite motif proteins - suspect this is one of those\n
GLEAN3_27121	SPU_027121		"ECM or membrane adhesion protein - probably a fragment - many VWD/EGF/FN3 domains - somewhat homologous with human Fc gamma Ig-binding protein [AAD39266.1] and with zonaadhesins of pig and rabbit [NP_999548.1, AAF63342.2] but there are gaps"\n
GLEAN3_22706	SPU_022706		"ECM or membrane adhesion protein - probably a fragment - many VWD and FN3 domains - somewhat homologous with human Fc gamma Ig-binding protein [AAD39266.1] and with human zonaadhesins [AAL04410.1, AAL04412.1, AAL04413.1] - but there are gaps"\n
GLEAN3_03992	SPU_003992		CLECT/FN3/TM - MEMBER OF A LINKED CLUSTER OF THREE\n
GLEAN3_03993	SPU_003993		CLECT/FN3-2/TM - MEMBER OF A LINKED CLUSTER OF THREE\n
GLEAN3_03994	SPU_003994		CLECT-FN3-TM - MEMBER OF A LINKED CLUSTER OF THREE\n
GLEAN3_02394	SPU_002394		CLECT-FN3-TM\n
GLEAN3_05115	SPU_005115		CLECT-FN3-TM\n
GLEAN3_12215	SPU_012215		CLECT-FN3-TM\n
GLEAN3_23488	SPU_023488		CLECT-FN3-PANAP-TM -novel architecture\n
GLEAN3_21126	SPU_021126		CLECT-FN3- noTM\n
GLEAN3_08066	SPU_008066		CLECT-FN3- no TM\n
GLEAN3_23487	SPU_023487		CLECT2-FBG-FN3-PANAP-TM -novel architecture\n
GLEAN3_13668	SPU_013668		CLECT2-EGF3-FN3-TM\n
GLEAN3_14375	SPU_014375		CLECT2-EGF3-FN3-TM\n
GLEAN3_24360	SPU_024360		CLECT-EGF-FN3-TM\n
GLEAN3_28703	SPU_028703		CLECT-EGF3-FN3-TM\n
GLEAN3_19135	SPU_019135		CLECT-EGF2-FN3 - noTM\n
GLEAN3_12463	SPU_012463		CLECT-EGF2-FN3 - no TM\n
GLEAN3_00346	SPU_000346		CLECT/EGF2/FN3/TM - ALSO POSSIBLE PHOSPHATASE although PPase domain looks to be outside??\n
GLEAN3_12855	SPU_012855		CLECT/FN3 alternating WITH BLOCK OF CUB DOMAINS IN MIDDLE -novel architecture\n
GLEAN3_10216	SPU_010216		CLECT/FN3 alternating\n
GLEAN3_27967	SPU_027967		ANK2/FN3/RA RA domains - looks like some sort of intracellular adaptor\n
GLEAN3_18475	SPU_018475		Ig/FN3/TM - may be fragment - see adjacent gene\n
GLEAN3_27951	SPU_027951		Ig3/FN3-2/TM - may be fragment - see adjacent gene\n
GLEAN3_05899	SPU_005899		Ig2/FN3/TM - fragment\n
GLEAN3_06614	SPU_006614		Ig2/FN3-3/TM -N-term half matches with Robo - Cterm half does not\n
GLEAN3_08663	SPU_008663		Ig-4/FN3-4/TM -N-term half matches with Robo - Cterm half does not\n
GLEAN3_10851	SPU_010851		Ig5/FN3-2/TM -looks like quite a good domain match for CDO\n
GLEAN3_12530	SPU_012530		Ig/FN3/TM-fragment\n
GLEAN3_15844	SPU_015844		Ig4/FN3/Ig/TM\n
GLEAN3_15846	SPU_015846		Ig2/FN3/TM - fragment\n
GLEAN3_24482	SPU_024482		Ig2/FN3/TM - fragment\n
GLEAN3_24708	SPU_024708		Ig4/FN3-2/TM\n
GLEAN3_03460	SPU_003460		Ig/FN3-3/Ig/FN3-2 /TM -may be fragment\n
GLEAN3_09818	SPU_009818		Igc2-4/FN3-4/Igc2/FN3-2/TM \n \nthis does look like a Ds-CAM homolog - Ds-CAM has 9-4-1-2 arrangement of Ig/FN3 domains - this has 4-4-1-2 - suggests its missing the N-terminal 5 Ig domains \n \nNB - there are quite a few genes with 5 Ig repeats and nothing else - that might comprise the N-terminus \n(GLEAN3_12351, GLEAN3_15273, GLEAN3_08772, GLEAN3_07532, GLEAN3_10002, GLEAN3_13487, GLEAN3_16889, GLEAN3_00469, GLEAN3_25431, GLEAN3_25430, GLEAN3_15584, GLEAN3_09608, GLEAN3_09024) \nNone of these is obviously adjacent - nor are any other Ig only genes from the numbering - needs browser work.\n
GLEAN3_09318	SPU_009318		Ig6/FN3/TM -partial overlap only\n
GLEAN3_15705	SPU_015705		Ig2/FN3/TM - MAY BE fragment\n
GLEAN3_17022	SPU_017022		IG5/FN3/TM\n
GLEAN3_18030	SPU_018030		Ig2/FN3/TM - may be a fragment\n
GLEAN3_21725	SPU_021725		Ig5/FN3/TM - may be a fragment\n
GLEAN3_24019	SPU_024019		Ig8/FN3/TM - may be a fragment\n
GLEAN3_28401	SPU_028401		Ig5/FN3/TM - may be a fragment\n
GLEAN3_25975	SPU_025975		Ig3/FN6/TM - partial overlap only - does not have predicted TM and C-terminus (putative cyto domain) does not have Neogenin_C \nand is not homologous with DCC in Blast \n \nDomain sequence (3-6) is consistent with 4-6 arrangement in DCC/neogenin - would suggest it's missing N-terminal Ig domain \n
GLEAN3_26725	SPU_026725		Ig4/FN3-2/TM\n
GLEAN3_27415	SPU_027415		Ig2/FN3-3/TM\n
GLEAN3_25966	SPU_025966		LRR repeats-Ig-EGF-FN3-TM\n
GLEAN3_01538	SPU_001538		FN3/TM - probably a fragment\n
GLEAN3_01957	SPU_001957		FN3/TM - probably a fragment\n
GLEAN3_04072	SPU_004072		FN3-2/TM - probably a fragment\n
GLEAN3_04128	SPU_004128		FN3-2/TM - probably a fragment\n
GLEAN3_13954	SPU_013954		FN3/TM - probably a fragment\n
GLEAN3_19460	SPU_019460		FN3/TM - probably a fragment\n
GLEAN3_20590	SPU_020590		FN3-2/TM - probably a fragment\n
GLEAN3_22063	SPU_022063		FN3-2/TM - probably a fragment\n
GLEAN3_02657	SPU_002657		FN3-3/TM - probably a fragment\n
GLEAN3_02763	SPU_002763		EGF3FN3-3/TM - no Ig (?) - probably missing C-terminus\n
GLEAN3_04758	SPU_004758		FN3-2/TM - probably a fragment\n
GLEAN3_07789	SPU_007789		FN3-3/TM - probably a fragment\n
GLEAN3_11473	SPU_011473		FN3-8/TM\n
GLEAN3_12998	SPU_012998		FN3-6/TM\n
GLEAN3_15746	SPU_015746		FN3-5/TM\n
GLEAN3_16936	SPU_016936		FN3-31/TM\n
GLEAN3_17208	SPU_017208		FN3-14/TM\n
GLEAN3_18070	SPU_018070		FN3-4/TM- probably a fragment\n
GLEAN3_20335	SPU_020335		FN3-3/TM - probably a fragment\n
GLEAN3_21244	SPU_021244		FN3-3/TM - probably a fragment\n
GLEAN3_22585	SPU_022585		FN3-4/TM- probably a fragment\n
GLEAN3_26349	SPU_026349		FN3-4/TM- probably a fragment\n
GLEAN3_26991	SPU_026991		FN3-3/TM - probably a fragment\n
GLEAN3_22586	SPU_022586		EGF/Ig/FN3-2/TM - probably missing both ends\n
GLEAN3_19051	SPU_019051		EGF/FN3-2/TM - see adjacent gene\n
GLEAN3_27686	SPU_027686		EGF/FN3/TM - see adjacent gene - missing kinase\n
GLEAN3_27687	SPU_027687		EGF/FN3-3/TM - see adjacent gene- missing kinase\n
GLEAN3_00673	SPU_000673		EGF5/FN3-3/TM - probably fragment\n
GLEAN3_02870	SPU_002870		EGF-3/FN3-2/TM - missing kinase\n
GLEAN3_07983	SPU_007983		EGF2/FN3/TM - missing kinase\n
GLEAN3_10587	SPU_010587		EGF/FN3/TM\n
GLEAN3_11180	SPU_011180		EGF/FN3/TM\n
GLEAN3_12810	SPU_012810		EGF/FN3-2/TM\n
GLEAN3_13172	SPU_013172		EGF2/FN3/TM -partial overlap only - missing kinase\n
GLEAN3_18153	SPU_018153		EGF2/FN3/TM- partial overlap only\n
GLEAN3_23923	SPU_023923		EGF-2/FN3-3/TM - partial overlap only\n
GLEAN3_04347	SPU_004347		EGF-2/FN3-3/TM - very partial overlap\n
GLEAN3_09654	SPU_009654		EGF-3/FN3-2/TM - very partial overlap\n
GLEAN3_11804	SPU_011804		EGF/FN3-3/TM - very partial overlap\n
GLEAN3_14858	SPU_014858		EGF-2/FN3-3/TM -partial overlap - missing kinase\n
GLEAN3_15381	SPU_015381		EGF/FN3-4/TM - partial overlap\n
GLEAN3_16530	SPU_016530		EGF-2/FN3-3/TM - very partial overlap\n
GLEAN3_21253	SPU_021253		adhesion receptor - FN3/EGF_Ca intermingled - single CCP - a bit similar in composition to FLJ00133 protein but domain organisation different - see adjacent gene - pfam gives some hyalin repeats\n
GLEAN3_23398	SPU_023398		two FN3 - could be ECM or receptor - see adjacent genes\n
GLEAN3_23400	SPU_023400		two FN3 - could be ECM or receptor - see adjacent genes\n
GLEAN3_22518	SPU_022518		two FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_02470	SPU_002470		two FN3 - could be ECM or receptor\n
GLEAN3_02531	SPU_002531		two FN3 - could be ECM or receptor\n
GLEAN3_02801	SPU_002801		two FN3 - could be ECM or receptor\n
GLEAN3_03777	SPU_003777		two FN3 - could be ECM or receptor\n
GLEAN3_04756	SPU_004756		two FN3 - could be ECM or receptor\n
GLEAN3_06052	SPU_006052		two FN3 - could be ECM or receptor\n
GLEAN3_11337	SPU_011337		two FN3 - could be ECM or receptor\n
GLEAN3_16143	SPU_016143		two FN3 - could be ECM or receptor\n
GLEAN3_17760	SPU_017760		two FN3 - could be ECM or receptor\n
GLEAN3_17979	SPU_017979		two FN3 - could be ECM or receptor\n
GLEAN3_21019	SPU_021019		two FN3 - could be ECM or receptor\n
GLEAN3_21268	SPU_021268		two FN3 - could be ECM or receptor\n
GLEAN3_22616	SPU_022616		two FN3 - could be ECM or receptor\n
GLEAN3_22764	SPU_022764		two FN3 - could be ECM or receptor\n
GLEAN3_23304	SPU_023304		two FN3 - could be ECM or receptor\n
GLEAN3_23502	SPU_023502		two FN3 - could be ECM or receptor\n
GLEAN3_22519	SPU_022519		three FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_03995	SPU_003995		three FN3 - could be ECM or receptor\n
GLEAN3_04950	SPU_004950		three FN3 - could be ECM or receptor\n
GLEAN3_11397	SPU_011397		three FN3 - could be ECM or receptor\n
GLEAN3_11648	SPU_011648		three FN3 - could be ECM or receptor\n
GLEAN3_13667	SPU_013667		three FN3 - could be ECM or receptor\n
GLEAN3_17929	SPU_017929		three FN3 - could be ECM or receptor\n
GLEAN3_19818	SPU_019818		three FN3 - could be ECM or receptor\n
GLEAN3_22314	SPU_022314		three FN3 - could be ECM or receptor\n
GLEAN3_24061	SPU_024061		three FN3 - could be ECM or receptor\n
GLEAN3_25664	SPU_025664		three FN3 - could be ECM or receptor\n
GLEAN3_28131	SPU_028131		three FN3 - could be ECM or receptor\n
GLEAN3_28665	SPU_028665		three FN3 - could be ECM or receptor\n
GLEAN3_23737	SPU_023737		Ig2/FN3-3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_22705	SPU_022705		four FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_23738	SPU_023738		four FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_22580	SPU_022580		four FN3 - could be ECM or receptor\n
GLEAN3_26676	SPU_026676		four FN3 - could be ECM or receptor\n
GLEAN3_22535	SPU_022535		five FN3 - could be ECM or receptor\n
GLEAN3_04309	SPU_004309		five FN3 - could be ECM or receptor\n
GLEAN3_14850	SPU_014850		four FN3 - could be ECM or receptor\n
GLEAN3_17765	SPU_017765		five FN3 - could be ECM or receptor\n
GLEAN3_21252	SPU_021252		12 FN3 - could be ECM or receptor\n
GLEAN3_18476	SPU_018476		three FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_21082	SPU_021082		three FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_17766	SPU_017766		three FN3 - could be ECM or receptor - last of several similar adjacent fragments\n
GLEAN3_15008	SPU_015008		lots of EGF - a few FN3 repeats - one CUB domain - Blast hits with Notch but does not really look like Notch\n
GLEAN3_00454	SPU_000454		Ig/FN3-4 - weak match with Nr-CAM\n
GLEAN3_00554	SPU_000554		Ig4/FN3-2 - weak match with NCAM\n
GLEAN3_01159	SPU_001159		EGF/FN3-2\n
GLEAN3_05508	SPU_005508		Ig6/DISIN/FN3-2/IG3/FN3-4 - patchy match with contactin\n
GLEAN3_09039	SPU_009039		Ig/FN3-5 - weak match with neogenein\n
GLEAN3_09757	SPU_009757		Ig5/FN3-4 - no TM - incomplete?\n
GLEAN3_13497	SPU_013497		Ig/FN3-3\n
GLEAN3_13927	SPU_013927		Ig/FN3-3\n
GLEAN3_26307	SPU_026307		Ig4/FN3-2\n
GLEAN3_28098	SPU_028098		Ig/FN3-4\n
GLEAN3_28100	SPU_028100		Ig/FN3-5\n
GLEAN3_24882	SPU_024882		Ig3/FN3  - see adjacent gene\n
GLEAN3_27952	SPU_027952		Ig4/FN3  - see adjacent gene\n
GLEAN3_04501	SPU_004501		Ig5/FN3- weak match with nephrin\n
GLEAN3_04746	SPU_004746		Ig3/FN3 - weak match with NCAM\n
GLEAN3_05900	SPU_005900		Ig3/FN3 - weak match with nephrin\n
GLEAN3_06087	SPU_006087		Ig4/FN3\n
GLEAN3_07323	SPU_007323		Ig4/FN3\n
GLEAN3_08387	SPU_008387		Ig/FN3\n
GLEAN3_08771	SPU_008771		Ig2/FN3\n
GLEAN3_09571	SPU_009571		Ig/FN3\n
GLEAN3_10291	SPU_010291		Ig2/FN3-2\n
GLEAN3_14759	SPU_014759		Ig2/FN3\n
GLEAN3_17488	SPU_017488		Ig5/FN3\n
GLEAN3_17889	SPU_017889		Ig/FN3-2\n
GLEAN3_21745	SPU_021745		Ig3/FN3\n
GLEAN3_23843	SPU_023843		Ig2/FN3 - weak match with sidekick\n
GLEAN3_24986	SPU_024986		Ig/FN3-2\n
GLEAN3_26399	SPU_026399		EGF/Ig/EGF3/FN3-4\n
GLEAN3_01092	SPU_001092		Ig4/FN3\n
GLEAN3_25433	SPU_025433		Ig8/FN3\n
GLEAN3_21083	SPU_021083		four FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_22641	SPU_022641		four FN3 - could be ECM or receptor\n
GLEAN3_15007	SPU_015007		FN3-3/EGF/FN3-7/EGF2 - possible TENASCIN - BUT NO FBG\n
GLEAN3_01682	SPU_001682		FN3 domain\n
GLEAN3_02469	SPU_002469		FN3 domain\n
GLEAN3_09658	SPU_009658		FN3 domain\n
GLEAN3_14031	SPU_014031		FN3 domain\n
GLEAN3_14854	SPU_014854		FN3 domain\n
GLEAN3_19478	SPU_019478		FN3 domain\n
GLEAN3_20676	SPU_020676		FN3 domain\n
GLEAN3_21011	SPU_021011		FN3 domain\n
GLEAN3_25558	SPU_025558		FN3 domain\n
GLEAN3_27203	SPU_027203		FN3 domain\n
GLEAN3_02112	SPU_002112		FN3-18/EGF12 - possible TENASCIN - BUT no FBG\n
GLEAN3_03659	SPU_003659		FN3-7/EGF2 - conceivably TENASCIN - BUT NO FBG\n
GLEAN3_01994	SPU_001994		EGF/Ig/FN3\n
GLEAN3_03814	SPU_003814		EGF/Ig/FN3\n
GLEAN3_04830	SPU_004830		Ig/EGF2/FN3-4\n
GLEAN3_08166	SPU_008166		EGF5/Ig5/FN3-2\n
GLEAN3_16493	SPU_016493		Ig/EGF-2/FN3-2\n
GLEAN3_23397	SPU_023397		FN3/EGF/FN3-2 - see adjacent genes\n
GLEAN3_19052	SPU_019052		EGF/FN3 - see adjacent gene\n
GLEAN3_24881	SPU_024881		EGF/FN3  - see adjacent gene\n
GLEAN3_01907	SPU_001907		EGF3/FN3-3\n
GLEAN3_03293	SPU_003293		EGF/FN3\n
GLEAN3_03931	SPU_003931		EGF/FN3\n
GLEAN3_13220	SPU_013220		EGF/FN3-3\n
GLEAN3_19659	SPU_019659		EGF/FN3\n
GLEAN3_21589	SPU_021589		EGF/FN3\n
GLEAN3_25304	SPU_025304		EGF7/FN3\n
GLEAN3_26416	SPU_026416		EGF2/FN3\n
GLEAN3_03418	SPU_003418		EGF/FN3-2\n
GLEAN3_04018	SPU_004018		EGF/FN3-3/EGF\n
GLEAN3_05820	SPU_005820		EGF2/FN3\n
GLEAN3_06514	SPU_006514		EGF/FN3\n
GLEAN3_07791	SPU_007791		EGF2/FN3\n
GLEAN3_09637	SPU_009637		EGF/FN3-3\n
GLEAN3_11344	SPU_011344		EGF/FN3\n
GLEAN3_23165	SPU_023165		EGF2/FN3-3\n
GLEAN3_01463	SPU_001463		all FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_24778	SPU_024778		all FN3 - could be ECM or receptor - see adjacent gene\n
GLEAN3_24777	SPU_024777		all FN3 - could be ECM or receptor  - see adjacent gene\n
GLEAN3_00736	SPU_000736	After reviewing the data and performing a BLAST search it appears that this is the best match for this particular GLEAN model based on orderly arrangement, no repeats, and sequence coverage. There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values greater than 5 and several greater than 10.	all FN3 - could be ECM or receptor\n
GLEAN3_00748	SPU_000748		all FN3 - could be ECM or receptor\n
GLEAN3_00797	SPU_000797		all FN3 - could be ECM or receptor\n
GLEAN3_00925	SPU_000925	From the BLAST results as well as the excel data, it is evident that this is the best results for this particular GLEAN model. When examining the excel data, there appears to be several gaps present within the sequence. These gaps however can be filled in with base pairing information from:\n                                                          Score    E\nSequences producing significant alignments:              (bits) Value\nv2.1_scaffold4976                                         634   e-179\nHowever, v2.1_scaffold70594 has the best overall sequence coverage compared to the rest of the scaffold results provided from BLAST. There was no Est suppot available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be separated into 2 distinct clusters that had overall strong scores (<10.)	all FN3 - could be ECM or receptor\n
GLEAN3_01610	SPU_001610		all FN3 - could be ECM or receptor\n
GLEAN3_01658	SPU_001658		all FN3 - could be ECM or receptor\n
GLEAN3_02802	SPU_002802		all FN3 - could be ECM or receptor\n
GLEAN3_02820	SPU_002820		all FN3 - could be ECM or receptor\n
GLEAN3_03086	SPU_003086		all FN3 - could be ECM or receptor\n
GLEAN3_03583	SPU_003583		all FN3 - could be ECM or receptor\n
GLEAN3_05060	SPU_005060		all FN3 - could be ECM or receptor\n
GLEAN3_05960	SPU_005960		all FN3 - could be ECM or receptor\n
GLEAN3_07364	SPU_007364		all FN3 - could be ECM or receptor\n
GLEAN3_09767	SPU_009767		all FN3 - could be ECM or receptor\n
GLEAN3_09943	SPU_009943		all FN3 - could be ECM or receptor\n
GLEAN3_10144	SPU_010144		all FN3 - could be ECM or receptor\n
GLEAN3_10788	SPU_010788		all FN3 - could be ECM or receptor\n
GLEAN3_11113	SPU_011113		all FN3 - could be ECM or receptor\n
GLEAN3_13800	SPU_013800		all FN3 - could be ECM or receptor\n
GLEAN3_14469	SPU_014469		all FN3 - could be ECM or receptor\n
GLEAN3_15918	SPU_015918		all FN3 - could be ECM or receptor\n
GLEAN3_15919	SPU_015919		all FN3 - could be ECM or receptor\n
GLEAN3_17207	SPU_017207		all FN3 - could be ECM or receptor\n
GLEAN3_17762	SPU_017762		all FN3 - could be ECM or receptor\n
GLEAN3_17937	SPU_017937		all FN3 - could be ECM or receptor\n
GLEAN3_17938	SPU_017938	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data several internal repeats were present that were also apparent from the BLAST results (on v2.1_scaffold54801). There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores were very widely dispersed, with most of the values being less than 5 with exception to a few outliers. 	all FN3 - could be ECM or receptor\n
GLEAN3_28716	SPU_028716		all FN3 - could be ECM or receptor\n
GLEAN3_10429	SPU_010429		VWC AND FN3 DOMAINS INTERMINGLED - NO TM PREDICTED BUT THERE ARE TM RECPTORS KNOWN WITH THESE TWO DOMAINS\n
GLEAN3_09576	SPU_009576		all Ig - could be ECM or receptor\n
GLEAN3_01222	SPU_001222		all FN3 - could be ECM or receptor\n
GLEAN3_05015	SPU_005015		all FN3 - could be ECM or receptor\n
GLEAN3_26321	SPU_026321		all FN3 - could be ECM or receptor\n
GLEAN3_00997	SPU_000997		This is a second C3 gene in the sea urchin.  The encoded protein has a conserved thioester site and a single cleavage site to generate alpha and beta chains.  There is a histidine in the C-terminal direction that functions in substrate binding choice. \n \nThe gene model is missing the 5' end - about the first 130 amino acids.  However, Genboree shows an overlap with NCBI:prediction XM-775838.1 (scaffold 1499) that may contain the missing part of the gene.\n
GLEAN3_05193	SPU_005193		The encoded protein has a thioester site and two cleavage sites.  The first to cleave the alpha and beta chains and the second to cleave the alpha and gamma chains.  This structure is typical of C4 proteins in mammals but also the C3 proteins in the cyclostomes.  \n
GLEAN3_28445	SPU_028445		likely missing two exons on the C-terminus.\n
GLEAN3_22988	SPU_022988	From the BLAST results and the excel data, it appears that the sequence is distributed onto 3 different scaffolds. 	The encoded protein has a thioester site and histidine in the N terminal direction that may function in target choice.  The sequence has a cleavage site between the alpha and beta chains, however, the beta chain is very short and the alpha chain is long.  \n \nGLEAN3-22988 overlaps with GLEAN3-19601.  See alignment with this annotation.  It is not clear why there is an overlap, but it may be an assembly problem. \n \nThe sequence overlaps with GLEAN3_19601\n
GLEAN3_21668	SPU_021668		Annotation entered by Bob Obar (robar@scientist.com). \nThis is one of 4 tandem alpha-tubulin Gene Models (GLEAN3_21667 - 21770).\n
GLEAN3_21669	SPU_021669		Annotation entered by Bob Obar (robar@scientist.com). \nThis is one of 4 tandem alpha-tubulin Gene Models (GLEAN3_21667 - 21770).\n
GLEAN3_16746	SPU_016746		Annotation entered by Bob Obar (robar@scientist.com).\n
GLEAN3_28221	SPU_028221		Annotation entered by Bob Obar (robar@scientist.com).\n
GLEAN3_24615	SPU_024615		Annotation entered by Bob Obar (robar@scientist.com).  This Gene Model contains a full-length alpha-tubulin with a duplication of 125 amino acids near the amino terminus of the predicted protein.\n
GLEAN3_12679	SPU_012679		Annotation entered by Bob Obar (robar@scientist.com).  This Gene Model contains a nearly full-length alpha-tubulin that is missing 15 amino acids at the amino terminus of the predicted protein.\n
GLEAN3_27848	SPU_027848		This is the N terminal part of the protein; the C terminal portion is encoded by GLEAN3_26498. These 2 gleans overlap (nucleotide level): bases 724-1066 (this  glean).  Probably the sequence after 1066 {TAGGATTATTGAGAAATCTTTAA} probably does not really belong to this gene\n
GLEAN3_27380	SPU_027380		3' of CDS missing (TM domain)\n
GLEAN3_05205	SPU_005205		similar to Phosphatidyl Serine Receptor (PSR), involved in phosphatidylserine-specific apoptotic cell clearance (e.g. macrophages engulfing apoptotic T cells) \n \n-model assembled/modified from GLEAN3_05204 and GLEAN3_05205\n
GLEAN3_05885	SPU_005885		Partial sequence. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537340-17614-146655645239.BLASTQ1\n
GLEAN3_16937	SPU_016937		The 5' end of the sequence annotated here is GLEAN3_16936.  GLEAN3_16936 sequences have been pasted in front of GLEAN3_16937 sequences.  A portion of GLEAN3_16936 at the boundaries of contig AAGJ01178647 and contig AAJ01178648 contains an identical repeat.  This is probably a genome assembly error and the duplicate sequence has been removed from the peptide sequence reported here (but not from the DNA sequence).   The deleted sequence in GLEAN3_16936 is: \nITTGLYNDEMVTSSTTRNCSTTDCESFTVDFDTLNSGTLYTLYAGVVQSSGREVVPLLAKAATIPESAVDLQFTSIGRNYVVLTWDNPAGMIDSYNISYYPVNDITKLMFEVVQAAAESNVLRVDDLNEGMNYSFTVVSLLEVEADLQEMGAPVEVFAVVGVLGSLNITAFDETTMNIEWEQVDVED.\n
GLEAN3_05602	SPU_005602		Partial sequence. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537204-5277-205811584566.BLASTQ4\n
GLEAN3_12073	SPU_012073		Partial sequence.\n
GLEAN3_06699	SPU_006699		Similarity to Chlamydomonas reinhardtii Inner_Dynein_Arm_1_Intermediate Chain IC140 (C_530081|166736|IA1-IC140|) and Homo sapiens axonemal dynein intermediate polypeptide 2.  Contains a WD-40 motifs.  The protein is also essentially identical to the Anthocidaris crassispina (gi|2494216|sp|Q16960|DYI3_ANTCR Dynein intermediate chain 3, ciliary).\n
GLEAN3_12809	SPU_012809		Similarity to Chlamydomonas reinhardtii Inner_Dynein_Arm_1_Intermediate Chain IC140 (C_530081|166736|IA1-IC140|) and Homo sapiens testis development protein NYD-SP29 (NP_660155).\n
GLEAN3_08502	SPU_008502		Similar to protein serine/threonine phosphatase 4 regulatory subunit 1.\n
GLEAN3_26101	SPU_026101		Similar to PP4R1\n
GLEAN3_15320	SPU_015320		This Gene Model is a close homolog of Chlamydomonas reinhardtii Inner Dynein Arm Light Chain p28 (IA-IC28, C_740003) and Homo sapiens axonemal dynein light chain (NP_003453).  It has a GenBank ID gi|1354084|gb|AAC47111.1| (axonemal dynein light chain p33).\n
GLEAN3_18432	SPU_018432		Similar to Dentin sialophosphoprotein precursor (DMP-3).  Partial sequence.\n
GLEAN3_17261	SPU_017261		Similar to Neurabin 1\n
GLEAN3_10093	SPU_010093	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first part of the sequence has an overlap that is apparent from both the BLAST result and the excel data. Other than this overlap within the first scaffold, if v2.1_scaffold32363 and v2.1_scaffold21524 were combined, the sequence would have an orderly continuous arrangement. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5.	this piece does not match well with any of the perlecan gene segments - so it is not from there.\n
GLEAN3_00937	SPU_000937		This looks like the N-terminus of perlecan - with other parts of the gene in different gene predictions. \n \nGLEAN3_12324/GLEAN3_28620 BOTH look like the middle of the gene and they are duplications - GLEAN3_12324 is longer   \n \nGLEAN3_26338 looks like the C-terminus\n
GLEAN3_00974	SPU_000974		This gene matches predicted gene products for coronin 1A   \n(gi|72123047|ref|XP_791382.1| PREDICTED: similar to coronin, actin binding protein, 1A [Strongylocentrotus purpuratus])\n
GLEAN3_02812	SPU_002812		EGF,EGF_Lam and FAS1 domains and a TM domain \nClosest match in human is stabilin but the urchin gene lacks the LINK domain and has a somewhat different pattern of EGF/FAS1 domains\n
GLEAN3_15419	SPU_015419		GLEAN3_15419 model is part of predicted Sp-Pask. Other, non-overlapping part is in GLEAN3_16604 on scaffold 70175.  FgeneshAB prediction S.P_Scaffold70175 may have additional exons, based on alignment with mammalian PAS-K.\n
GLEAN3_03676	SPU_003676		two Fas1 domains - member of a family of genes with similar structures and sequences \n \nNote that GLEAN3_03678 has same structure\n
GLEAN3_03678	SPU_003678		two FAS1 domains - member of a family of genes with similar structures and sequences \n \nVirtually identical to GLEAN3_20485 - latter has an additional  short segment in the middle(Exon??) \n \nAlso virtually identical to GLEAN3_03676 - but there are some minor sequence differences\n
GLEAN3_20485	SPU_020485		two FAS1 domains - probably a fragment \n \nVirtually identical to GLEAN3_03678 - latter is missing a short segment in the middle \n
GLEAN3_05198	SPU_005198		two FAS1 domains \n \nmember of a family of genes with similar structures and sequences\n
GLEAN3_06345	SPU_006345		two Fas1 domains - member of a family of genes with similar structures and sequences \n \nNote that GLEAN3_06346 has same structure and virtually same sequence\n
GLEAN3_06346	SPU_006346		two Fas1 domains - member of a family of genes with similar structures and sequences \n \nNote that GLEAN3_06345 has same structure and virtually same sequence\n
GLEAN3_15670	SPU_015670		two Fas1 domains - member of a family of genes with similar structures and sequences\n
GLEAN3_00041	SPU_000041	For this particular GLEAN model, an error message occurred indicating that the server encountered an internal error or misconfiguration and was unable to complete the request. When examining the excel data, it was evident that the sequence had numerous repeats and sequence overlaps present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with scores ranging from 5-15. 	enormous protein with three VWD/TIL, three VWC, lots of segments of low complexity and a CT at C-terminus \n \ngeneral domain composition looks mucin-like\n
GLEAN3_00070	SPU_000070		NIDO, AMOP, VWD,CCP - LOOKS LIKE A MUCIN\n
GLEAN3_01942	SPU_001942		VWCs, two VWDs and a couple of EGFs - novel architecture\n
GLEAN3_03277	SPU_003277		FOUR VWD -also TILand VWC -looks like a mucin\n
GLEAN3_03376	SPU_003376		AMOP, VWD,CCP - LOOKS LIKE A MUCIN - MISSING N-TERMINAL NIDO\n
GLEAN3_05406	SPU_005406		NIDO, AMOP, VWD AND EGF_CA TM - rather similar structure to mucin4d of chickens\n
GLEAN3_09378	SPU_009378		LPD_N, VWD only - common structure for vitellogenin\n
GLEAN3_09395	SPU_009395		NIDO/VWD - RATHER MUCIN-LIKE\n
GLEAN3_13189	SPU_013189		three VWD domains, also TIL and VWC - looks like a mucin\n
GLEAN3_13334	SPU_013334		NIDO, AMOP, VWD,CCP - LOOKS LIKE A MUCIN\n
GLEAN3_15633	SPU_015633		NIDO, AMOP, VWD,CCP - LOOKS LIKE A MUCIN\n
GLEAN3_16052	SPU_016052		LPD_N, VWD only - common structure for vitellogenin\n
GLEAN3_17361	SPU_017361	The BLAST results displayed a blank page	enormous protein with lots of segments of low complexity (including GLTT repeats) as well as a cluster of VWD/TIL/EGFs/CCPs/EGFs near C-terminus\n
GLEAN3_20247	SPU_020247		NIDO, VWD AND EGF_CA TM - very similar structure to mucin4d of chickens\n
GLEAN3_20744	SPU_020744		enormous protein - his-rich domain at N-terminus, 3-4 VWD or TIL/VWC domains,  a run of LDLa, a long segment of low complexity and LDLa/FA58C at C-terminus - some compositional similarity with SCO-spondin but not same\n
GLEAN3_21744	SPU_021744		FA58C and VWD with a couple of EGFs - novel architecture\n
GLEAN3_24352	SPU_024352	After reviewing data and performing a BLAST search it appears that this is the best fit for this GLEAN model. The sequence doesn't begin until 327 and there is one internal repeat (one duplicate) of 2182. The sequence does however have an orderly arrangement.  There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity score appeared to be weak with most of the values ranging from 2-8	NIDO, AMOP, VWD - LOOKS LIKE A MUCIN\n
GLEAN3_27118	SPU_027118		VWC/VWD - two repeats -  note that adjacent gene (27119) has multiple repeats of the same kind\n
GLEAN3_27119	SPU_027119		VWC/VWD - in multiple repeats - novel architecture - note that adjacent gene (27118) has similar composition\n
GLEAN3_28683	SPU_028683		LPD_N, VWD, VWA only - vitellogenin in Anopheles contains extra VWA also\n
GLEAN3_00538	SPU_000538		EGF x 5 - CCP - TM  \nNOVEL ARCHITECTURE\n
GLEAN3_02986	SPU_002986		CCP-CLECT-CCP-EGF-EGF-TM \nNOVEL ARCHITECTURE \n
GLEAN3_09610	SPU_009610		EGF-EGF-VWD - probably a fragment\n
GLEAN3_00782	SPU_000782		VWD only - probably a fragment\n
GLEAN3_16089	SPU_016089		EGF-VWD-EGF - probably a fragment\n
GLEAN3_18155	SPU_018155		VWD-EGF - probably a fragment\n
GLEAN3_20181	SPU_020181		VWF only - almost certainly a fragment\n
GLEAN3_16222	SPU_016222		VWF only - almost certainly a fragment\n
GLEAN3_28685	SPU_028685		large protein with a single VWD towards the C-terminus\n
GLEAN3_17171	SPU_017171		FBG - EGF x2 - CLECT  - novel architecture \nno particularly informative Blast hits\n
GLEAN3_23671	SPU_023671		novel architecture - several domains characteristic of adhesion proteins - FA58C, MAM, FBG, SR \nNo homologues but this is a known St.purp cDNA \nPancer,Z. Dynamic expression of multiple scavenger receptor cysteine-rich genes in coelomocytes of the purple sea urchin \nProc. Natl. Acad. Sci. U.S.A. 97 (24), 13156-13161 (2000)\n
GLEAN3_21993	SPU_021993		EGF-FBG - similar to C-terminus of GLEAN3_24020 which also has  a pfam:Nacht domain \n \nNovel architecture - FBG may imply role in innate immunity.\n
GLEAN3_06004	SPU_006004	From the BLAST results and the excel data, it is evident that the sequence is distributed onto three different scaffolds. After examining the excel data, it was apparent that the first scaffold didn't begin until the 33rd base pair, however, there were no other gaps present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with half of the values ranging >5 and the other half <5. 	has 7 TSP1 repeats, a TM domain and its long putative cytoplasmic domain contains a dual function kinase domain (could it be aPI kinase?) \n \nAnyway, it's a novel architecture - if the gene prediction is correct.\n
GLEAN3_06084	SPU_006084		large protein with mixture of multiple CCP, EGFCa and HYR domains and one TSP1 domain at the C-terminus \nNovel architecture - shared with GLEAN3_10445 and GLEAN3_19944\n
GLEAN3_10445	SPU_010445		large protein with mixture of multiple CCP, EGFCa and HYR domains and one TSP1 domain at the C-terminus \nNovel architecture - shared with GLEAN3_06084 and GLEAN3_19944\n
GLEAN3_19944	SPU_019944		large protein with mixture of multiple CCP, EGFCa and HYR domains and one TSP1 domain at the C-terminus \nNovel architecture - shared with GLEAN3_10445 and GLEAN3_06084\n
GLEAN3_19437	SPU_019437	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 3 different scaffolds.  The first part of the sequence is distributed onto >v2.1_scaffold81583, while the end of the sequence is distributed onto >v2.1_scaffold54361. If the 3 scaffolds were combined, there would be several sequence overlaps and repeats present. There was Est information available from GBrowse assembly V0.5 and the transcriptome information appeared to be somewhat strong with most of the values greater than 5. 	large protein with multiple TSP1, FA58C, gal-lectin and CLECT domains intermingled \n \nnovel architecture - similar to GLEAN3_00691 and GLEAN3_05426\n
GLEAN3_04017	SPU_004017		NOVEL ARCHITECTURE - WAP-IG-KU-KU-C345C\n
GLEAN3_06068	SPU_006068		novel architecture - intermingled SR and FU repeats followed by a series of Ig domains and a TM \nlooks like a fragment of the predicted gene UPI0000583F83 - similar to deleted in malignant brain tumors 1 isoform c precursor \n \ncompare GLEAN3_24528 which is a more complete version of this gene\n
GLEAN3_17202	SPU_017202		novel architecture - intermingled EGF/EGFCa/Igv/Igc2 domains followed by a set of VWC domains\n
GLEAN3_22250	SPU_022250		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Igc2 and TM - there are quite a few receptors of this type in humans \n \nBEST MATCH IS LRIG receptors - some homology with Gp-V of platelets\n
GLEAN3_00425	SPU_000425		looks like a pretty good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Igc2 - no TM - there are quite a few receptors of this type in humans \n \n
GLEAN3_05538	SPU_005538		#\nlooks like a pretty good model of an LRR/Ig membrane receptor - has a set of 7 LRR repeats (no NT) and CT domain followed by Igc2 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_00186	SPU_000186		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Igc2 FN3 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_12819	SPU_012819		looks like a pretty good model of an LRR/Ig membrane receptor - has a set of 5 LRR (no NT) and a CT domain followed by Igc2 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_20790	SPU_020790		looks like a partial model of an LRR/Ig membrane receptor - has 4 LRR repeats followed by a CT domain and an Igc2 -  \nlacks LR_NT at N-terminus and TM at C-terminus - there are quite a few receptors of this type in humans\n
GLEAN3_02757	SPU_002757		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by 2 Igc2 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_15612	SPU_015612		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Ig and TM - there are quite a few receptors of this type in humans\n
GLEAN3_17564	SPU_017564		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Ig but no TM - there are quite a few receptors of this type in humans\n
GLEAN3_18080	SPU_018080		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Ig but no TM - there are quite a few receptors of this type in humans\n
GLEAN3_01129	SPU_001129		looks like a pretty good model of an LRR/Ig membrane receptor - has a set of 12 LRR repeats (no NT) and a CT domain followed by Igc2 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_18608	SPU_018608		looks like a partial model of an LRR/Ig membrane receptor - has aset of 4 LRR repeats (no NT) and a CT domain followed by Ig but no TM - there are quite a few receptors of this type in humans\n
GLEAN3_11759	SPU_011759		looks like a pretty good model of an LRR/Ig membrane receptor - has a set of LRR repeats (no NT) and a CT domain followed by Igc2 but no TM - there are quite a few receptors of this type in humans\n
GLEAN3_25731	SPU_025731		looks like a pretty good model of an LRR/Ig membrane receptor - has a set of 6 LRR repeats (no NT) and CT domains followed by Ig and TM - there are quite a few receptors of this type in humans\n
GLEAN3_11637	SPU_011637		looks like a pretty good model of an LRR/Ig membrane receptor - has a set of  6 LRR repeats (no NT) and a CT domain followed by Ig and TM - there are quite a few receptors of this type in humans\n
GLEAN3_04660	SPU_004660		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by six Ig domains and TM - there are quite a few receptors of this type in humans\n
GLEAN3_14240	SPU_014240		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Igc2 and TM - there are quite a few receptors of this type in humans\n
GLEAN3_21853	SPU_021853		multiple TSP1 repeats with one Igc2 domain in the middle  \n \npossibly an ADAM-TS\n
GLEAN3_23512	SPU_023512		four TSP1 repetas and an Ig domain - probably a fragment\n
GLEAN3_10954	SPU_010954		e val = 0.0 to NP_999823 from S. purpuratus, but Glean3_10954 is missing aa's at its start and has an insertion.  Gene model still to be modified to agree with cloning data. \ne val = 0.0 to NP_055785 from Homo sapiens. \ne val = e-125 to C_620048 from Chlamydomonas. \nAnnotated by RL Morris, A Musante, K Judkins, B Rossetti, A Rawson.\n
GLEAN3_19990	SPU_019990		The automated GLEAN prediction for this Gene Model contained a duplication of 76 amino acids near the amino terminus of the predicted protein. On the assumption that this apparent duplication is an assembly error rather than a true sequence duplication, it waqs edited out of the GLEAN sequence.\n
GLEAN3_17705	SPU_017705		A sequence containing the conserved motiv PSSALRE, characteristic of this kinase in other species, is missing in GLEAN3_17705.\n
GLEAN3_04406	SPU_004406		Similiar to Homo sapiens neuroglobin (NGB) mRNA, complete cds\n
GLEAN3_10004	SPU_010004		This is a gene with 19 perfect hyalin repeats in tandem, each present as a separate exon,each with a high pfam score as a hyalin repeat.  Expressed in the embryo at a modest level.\n
GLEAN3_17620	SPU_017620		This hyalin-like gene may be incomplete, is expressed in embryos at very low levels, if at all based on the tiling experiment.  It has an EGF - 11 hyalin repeats - EGF.  \n
GLEAN3_12490	SPU_012490		The GLEAN3 prediction doesn't include the C-terminus of the protein\n
GLEAN3_28266	SPU_028266		Gene accepted as is.  Not expressed in embryo but has 6 EGF repeats, 49 hyalin repeats \n
GLEAN3_28066	SPU_028066		Most likely artifactual (haplotype ?) duplication of GLEAN3_16079 (based on proximity of all other identified Type I Activin Like Receptor  to glean3_16079) \n
GLEAN3_16079	SPU_016079		GLEAN3_28066 seems a artifactual duplication of this gene \nIt seems also that the first exon of this prediction is an extra one that is not predicyed in the Angerer genescan.\n
GLEAN3_14949	SPU_014949		This hyalin-like gene is not expressed in the embryo, has 23 hyalin repeats, each with a low pfam score.\n
GLEAN3_19338	SPU_019338		Sequence discrepancies with cDNA data. \nGLEAN3 predicts a longer protein.\n
GLEAN3_01500	SPU_001500		Expressed in the embryo based on tiling experiment.  Highly conserved at N terminus with rat.  Exons predicted to be correct in glean model\n
GLEAN3_21497	SPU_021497		unclear duplication GLEAN3_00669\n
GLEAN3_08552	SPU_008552		Gene model corrected (2 exons modified) after careful comparison with vertebrate ortthologs and domain analysis. The 3' part of the gene is supported by EST evidence (CD323084 StrPu537.001446 from 20hr blastula stage library)\n
GLEAN3_22049	SPU_022049		GLEAN3 prediction corresponds to the NP_999778.1 cDNA sequence only for the first 2/3 of the sequence\n
GLEAN3_22153	SPU_022153		Highly expressed in embryos,  has high homology to rat casein kinase, gene looks to be complete as modeled\n
GLEAN3_04723	SPU_004723		It's possible exon 4 is larger than indicated here.\n
GLEAN3_13435	SPU_013435		The BRCA2 repeats are highly conserved and the gene is similar to the length of\n
GLEAN3_23599	SPU_023599		It is very similar to the P.l Dnmt1, with almost the same number of introns\n
GLEAN3_06612	SPU_006612		duplicated exons of Sp-FRAP, see glean3_06053 for gene model\n
GLEAN3_01692	SPU_001692		duplication, see glean3_06053 for  gene model\n
GLEAN3_09520	SPU_009520		Annotated using P.lividus(AM179826 and CAJ47350) and S.purpuratus cDNA and protein sequences. \nGlean 09520 contains the first exons.The C terminal exons have to be taken from Glean 03704 (Meredith Ashby). Protein sequences from the last exon of 09520 and the first exon of 03704 are not encoded in the cDNAs, and conversely a sequence present in both cDNAs is not present in the models. Alternative splicing or erroneous models. \nWarning :possible assembly problem. The fragment of the protein sequence missing is coded for by a short sequence from scaffold 2003. This sequence is incorporated in other Glean and NCBI models, within an intron in one case, as an exon read in a different frame, in the other. \nReconstructed protein sequence is given. \nIndicated highest blast hit is for non sea urchin sequences.     \n
GLEAN3_06406	SPU_006406		The protein encoded by this gene has a thioester site but no cleavage site to separate alpha and beta chains as in Sp-C3.  The protein sequence is too short to be either a complement protien or alpha 2 macroglobulin. \n \nGLEAN3-06406 overlaps with GLEAN3-26313.  An alignment showing this overlap is attached to this annotation.\n
GLEAN3_19422	SPU_019422		The protein encoded by this gene has a thioester site, but no beta chain and no histidine to regulate thioester attack on the target sequence.  The sequence is too short to be complement or alpha 2 macroglobulin.  The sequence does not overlap with any other GLEAN sequences.\n
GLEAN3_19612	SPU_019612		Similiar to gi|72041679|ref|XM_793147.1|PREDICTED: Strongylocentrotus purpuratus similar to ataxin 7-like  \n2 (LOC593678), partial mRNA \n
Sp-Tlr170	SPU_030136		Parital Toll-like receptor predicted by FgeneshAB and ++. The nuleotides have 89% identity to a typical Sp-Tlr (GLEAN3_23035). This gene model is located at the end of a contig, making the gene model incomplete. \n
GLEAN3_25786	SPU_025786		#\nhigh expression after UV-B irradiation of embryos\n
Sp-Tlr215	SPU_030137		Parital Toll-like receptor predicted by FgeneshAB and Genscan. The nuleotides have 98% identity to a typical Sp-Tlr (GLEAN3_13751). There are 5 LRRs and no more LRRs were found in the 5' upstream region. This gene model is located at the end of a short scaffold. \n
GLEAN3_13654	SPU_013654	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it is apparent that there are several internal repeats as well as sequence over laps present in both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	The protein encoded by this gene has an N-terminal alpha 2 macroglobulin domain.  However, it does not have a thioester site.  There is an alpha/beta chain cleavage site and perhpas another for cleavage between alpha and gamma chains, but the site is in the wrong place.  The sequence is too short to be either a complement protein or alpha 2 macroblobulin.\n
GLEAN3_05856	SPU_005856		Exon 2 was missing in the original Glean3 model.  mRNA sequence was obtained from Bill Marzluff.\n
GLEAN3_27840	SPU_027840		The protein encoded by this gene matches to an EST with similarities to FKBP-12.\n
GLEAN3_16548	SPU_016548		Significant partial overlap with GLEAN3_16547\n
GLEAN3_18378	SPU_018378		   e val for NP_999777 = 0.0; KRP85 [Strongylocentrotus purpuratus].   \nUpdated sequence by replacing peptide seq with NP_999777 and nucleotide sequence with "NM_214612, 2213 bp, mRNA, linear, INV 12-AUG-2005" \n   e val = e-135 for C_1880008 (http://genome.jgi-psf.org/Chlre3/Chlre3.home.html), Cr-Fla10 which is same as NCBI P46869 \n   e val = e-140 for NP_004789, KIF3B [Homo sapiens]. \n   Annotation by RL Morris, B Rossetti, and A Rawson. \n
GLEAN3_26281	SPU_026281		#\nSegment of KRP95, annotated fully in GLEAN3_26280. \nAnnotation by RL Morris, R.A.Obar, AP Rawson, and B Rossetti.\n
GLEAN3_27012	SPU_027012		This glean model encodes the N-terminal part of the protein.  The C-terminus of the protein is contained in Glean3_14185.\n
GLEAN3_14185	SPU_014185		Encodes the C-terminus of Wee1; see annotation to Glean3_27012.\n
GLEAN3_11239	SPU_011239		e val to Q9P2H3 [Homo sapiens] = 0.0 \ne val to C_120075 (FAP167, IFT80, Intraflagellar Transport protein 80, http://genome.jgi-psf.org/Chlre3/Chlre3.home.html) = 3e-57 \nWD repeats. \nAnnotated by RL Morris.\n
GLEAN3_10564	SPU_010564		three and a bit complete LRRNT-LRRn-LRRCT repeats \n \nTogether with adjacent gene (GLEAN3_10564) comprises a complete Slit gene - could be one or two exons encoding LRR repeats missing at junction\n
GLEAN3_21581	SPU_021581		The start methionine of the ORF described here aligns with the query sequence (Mouse dystrophin) after the first 300 N-terminal amino acids.  GLEAN3 prediction 21580 lies immediately upstream on the same scaffold and encodes  a partial spectrin motif, it is possible that this should be included, however this entry constitutes a well aligned ORF against known homologs. \n
GLEAN3_21228	SPU_021228		This model was annotated based on a manual inspection of multiple protein sequence and domain structure comparisons. \n \nThis and a very similar adjacent model (GLEAN3_21229) predict proteins with a domain structure very similar to that of coagulation factors 5 and 8 (long N-terminus of little complexity and a C-terminal Pfam F5_F8_type_C domain). Their C-terminal F5_F8_type_C blasts best to coagulation factor 8. It should be noted, however, that there is a predicted EGF domain at the N-terminus of this model, which is absent from coagulation factors 5 and 8. \n \nIts adjacent model (GLEAN3_21229) is similar in sequence but far from identical, which suggests that these models might represent a true gene duplication event.\n
GLEAN3_21229	SPU_021229		This model was annotated based on a manual inspection of multiple protein sequence and domain structure comparisons. \n \nThis and a very similar adjacent model (GLEAN3_21228) predict proteins with a domain structure very similar to that of coagulation factors 5 and 8 (long N-terminus of little complexity and a C-terminal Pfam F5_F8_type_C domain). Their C-terminal F5_F8_type_C blasts best to coagulation factor 8. It should be noted, however, that there is a predicted Cadherin domain at the N-terminus of this model, which is absent from coagulation factors 5 and 8. \n \nIts adjacent model (GLEAN3_21228) is similar in sequence but far from identical, which suggests that these models might represent a true gene duplication event.\n
GLEAN3_12598	SPU_012598		e val to XP_787973 = e-122. \ne val to Q9P2H3 [Homo sapiens] = e-73 \nSome sililarity to FAP167, IFT80, Intraflagellar Transport protein 80, http://genome.jgi-psf.org/Chlre3/Chlre3.home.html). \nAnnotated by RL Morris.\n
GLEAN3_26298	SPU_026298		one complete LRR unit LRR-NT/17LRR/LRR-CT \n \nadjacent gene (GLEAN3_26299) has very similar structure - probably part of the same LRR protein\n
GLEAN3_26299	SPU_026299		one complete LRR unit LRR-NT/11LRR/LRR-CT \n \nadjacent gene (GLEAN3_26298) has very similar structure - probably part of the same LRR protein\n
GLEAN3_00831	SPU_000831		Similar to R-PTP-delta.  Partial sequence. May be a portion of a duplicate gene.  Another Sp-R-PTP-delta, GLEAN3_13607, is not on the same scaffold.  GLEAN3_13607 is probably a duplicate. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137535563-11783-53029041881.BLASTQ1\n
GLEAN3_00164	SPU_000164		One of 59 models with only one clectin motif and no others\n
GLEAN3_00294	SPU_000294		One of 59 models with only one clectin motif and no others\n
GLEAN3_00743	SPU_000743		One of 59 models with only one clectin motif and no others\n
GLEAN3_01182	SPU_001182		One of 59 models with only one clectin motif and no others\n
GLEAN3_01274	SPU_001274		One of 59 models with only one clectin motif and no others\n
GLEAN3_02144	SPU_002144		One of 59 models with only one clectin motif and no others\n
GLEAN3_02697	SPU_002697		One of 59 models with only one clectin motif and no others\n
GLEAN3_03227	SPU_003227		One of 59 models with only one clectin motif and no others\n
GLEAN3_03618	SPU_003618		One of 59 models with only one clectin motif and no others\n
GLEAN3_03774	SPU_003774		One of 59 models with only one clectin motif and no others\n
GLEAN3_04015	SPU_004015		One of 59 models with only one clectin motif and no others\n
GLEAN3_05111	SPU_005111		One of 59 models with only one clectin motif and no others\n
GLEAN3_05594	SPU_005594		One of 59 models with only one clectin motif and no others\n
GLEAN3_05989	SPU_005989		One of 59 models with only one clectin motif and no others\n
GLEAN3_05991	SPU_005991		One of 59 models with only one clectin motif and no others\n
GLEAN3_06508	SPU_006508		One of 59 models with only one clectin motif and no others\n
GLEAN3_07040	SPU_007040		One of 59 models with only one clectin motif and no others\n
GLEAN3_07576	SPU_007576		One of 59 models with only one clectin motif and no others\n
GLEAN3_07766	SPU_007766		One of 59 models with only one clectin motif and no others\n
GLEAN3_08393	SPU_008393		One of 59 models with only one clectin motif and no others\n
GLEAN3_09504	SPU_009504		One of 59 models with only one clectin motif and no others\n
GLEAN3_10100	SPU_010100		One of 59 models with only one clectin motif and no others\n
GLEAN3_10101	SPU_010101		One of 59 models with only one clectin motif and no others\n
GLEAN3_10212	SPU_010212		One of 59 models with only one clectin motif and no others\n
GLEAN3_10297	SPU_010297		One of 59 models with only one clectin motif and no others\n
GLEAN3_11163	SPU_011163		One of 59 models with only one clectin motif and no others\n
GLEAN3_12314	SPU_012314		One of 59 models with only one clectin motif and no others\n
GLEAN3_12942	SPU_012942		One of 59 models with only one clectin motif and no others\n
GLEAN3_13649	SPU_013649		One of 59 models with only one clectin motif and no others\n
GLEAN3_14081	SPU_014081		One of 59 models with only one clectin motif and no others\n
GLEAN3_14082	SPU_014082		One of 59 models with only one clectin motif and no others\n
GLEAN3_14184	SPU_014184		One of 59 models with only one clectin motif and no others\n
GLEAN3_14222	SPU_014222		One of 59 models with only one clectin motif and no others\n
GLEAN3_15079	SPU_015079		One of 59 models with only one clectin motif and no others\n
GLEAN3_15211	SPU_015211		One of 59 models with only one clectin motif and no others\n
GLEAN3_17295	SPU_017295		One of 59 models with only one clectin motif and no others\n
GLEAN3_17810	SPU_017810	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data, it was apparent that the two subject hits for this GLEAN model had numerous repeats present throughout both scaffolds. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	One of 59 models with only one clectin motif and no others\n
GLEAN3_17860	SPU_017860		One of 59 models with only one clectin motif and no others\n
GLEAN3_19213	SPU_019213		One of 59 models with only one clectin motif and no others\n
GLEAN3_19575	SPU_019575	After reviewing the data in the Excel file and performing a BLAST search it appears that the sequence is dispersed onto two different scaffolds. The first alignment score appears to be continuous but when the BLAST results data is examined there is no data for the first 156 bases on the first scaffold. The data for the first 156 bases appears to be on the second scaffold (>v2.1_scaffold34122). There was no Est information available on GBrowse assembly V0.5 and the transcrpitome intensity scores appear to be weak as well (>5). This is an un-annotated gene so no additional information was available in Baylor gene information (comments).	One of 59 models with only one clectin motif and no others\n
GLEAN3_20517	SPU_020517		One of 59 models with only one clectin motif and no others\n
GLEAN3_22152	SPU_022152		One of 59 models with only one clectin motif and no others\n
GLEAN3_22396	SPU_022396		One of 59 models with only one clectin motif and no others\n
GLEAN3_22861	SPU_022861		One of 59 models with only one clectin motif and no others\n
GLEAN3_23797	SPU_023797		One of 59 models with only one clectin motif and no others\n
GLEAN3_24127	SPU_024127		One of 59 models with only one clectin motif and no others\n
GLEAN3_24382	SPU_024382		One of 59 models with only one clectin motif and no others\n
GLEAN3_25181	SPU_025181		One of 59 models with only one clectin motif and no others\n
GLEAN3_25184	SPU_025184		One of 59 models with only one clectin motif and no others\n
GLEAN3_25248	SPU_025248		One of 59 models with only one clectin motif and no others\n
GLEAN3_25874	SPU_025874		One of 59 models with only one clectin motif and no others\n
GLEAN3_25875	SPU_025875		One of 59 models with only one clectin motif and no others\n
GLEAN3_25892	SPU_025892		One of 59 models with only one clectin motif and no others\n
GLEAN3_26524	SPU_026524		One of 59 models with only one clectin motif and no others\n
GLEAN3_27079	SPU_027079		One of 59 models with only one clectin motif and no others\n
GLEAN3_28229	SPU_028229		One of 59 models with only one clectin motif and no others\n
GLEAN3_28432	SPU_028432		One of 59 models with only one clectin motif and no others\n
GLEAN3_28538	SPU_028538		One of 59 models with only one clectin motif and no others\n
GLEAN3_28564	SPU_028564		One of 59 models with only one clectin motif and no others\n
Sp-Tfpi-like	SPU_030138		This model was created based on a Fgenesh++ prediction on scaffold98422 and a manual inspection and comparison of the predicted protein to similar genes in other groups. \n \nBased on our analysis thus far, this model likely corresponds to a partial prediction, as indicated by the similarity of the available sequence from this model to Tissue factor pathway inhibitor genes in vertebrates.  \n
GLEAN3_00786	SPU_000786		It could be a partial sequence!\n
GLEAN3_15878	SPU_015878		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure analyses. \n \nThe structure of this model is supported by the fact that other gene prediction protocols generated identical models. \n \nThe domain structure of this model differs slightly from that of vertebrate kallikrein B1 genes, in that no Pfam PAN domain is predicted in its N-terminus. Otherwise, the size and structure of this prediction is similar to that of kallikrein B1.\n
GLEAN3_06723	SPU_006723		See also GLEAN3_05592 and GLEAN3_24688. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537488-29699-97084085825.BLASTQ4\n
GLEAN3_08466	SPU_008466		Blasts to PTPRT, but doesn't clade with the PTPR K/M/T/U group in phylogenetic analysis using PTPc domains 1 or 2.  Renamed PTPRorph2. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137537867-27665-146362170620.BLASTQ4\n
GLEAN3_08878	SPU_008878		Partial sequence.  Similar to Survivin 2. \nSee putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137538166-23506-35039912798.BLASTQ4\n
GLEAN3_00276	SPU_000276		Transcriptome data indicates that Glean may have falsely predicted the following exons: 3.The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: RERERERERWRDGRGEREIATERERERERERERERERKRNMERERERERERERERIGNKSEYGIVRYVXXXXXXXXXXXXXXXXXGGGRDSIPFIENP\n
GLEAN3_14528	SPU_014528		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2.\n
GLEAN3_12586	SPU_012586		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2,5,6,7.\n
GLEAN3_13962	SPU_013962		Transcriptome data indicates that Glean may have falsely predicted the following exons: 2. \n \nC-terminal has likely been wrongly attached to glean3_08353.\n
GLEAN3_04028	SPU_004028		Matches_GLEAN3_04028. Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_02815	SPU_002815		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: KMEYILRERQLRLGRRLQCHHIHPQMSKATKSNQLQWHHIPMGTHNRLVLTAKLSSRRNGNFSVTIRGILFVTIILLLVIVVVTFSVGFVMVFLTDKWPRRYQSYPVIGDDWIKSGFRKRTESRINLPTYTYYGY,FVPSEHLLNARLSAVEKSVGKALISGVSSIHGQNQPLTDQAKPQDLNQEDNQTTAQPTQPTQQDEGDDSGITHQPLNVTTDSIEDGVHTEGTTTQVGQETAMPPHTSANVKGDQKQPTTMAPHTNGDSQPPSADGEVVIKAKREFLGDHPRYSFRDNNPFVGDRRGDVLGRLRDGLPHRQVASQVPVLPRHRRRLDQKWFQKKDGIEN\n
GLEAN3_22816	SPU_022816		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: HPSSSSPLYTPNFSSPSSSSLPSVYLFPKSFIQPLLYLQHRPLLLNNIVLLLLSFLTPSTTSSSSSSPSLISFFFSSSIFSKIVFPSSSSPPQPPPKLPPPPPPPLLPNPFHNHILQYFLPPPLPCLLCIPFPQSFIQPLLYLHLKLRLFHSSSSPLFSYLL,FCSSSSSSSSCSSSFSSSTTSPPPPLLPNPLHNFLLSLFLLDILLLLLLYILQIFLPPPPPPCLQYISFLRVSSNRSCTCNIAPSS,EFHPTAPVPATSPPPPKQHRPPPPLLPNPLHNFVLLFIPLLDILLLLLLHILQNCLPLLLLSSPTPSKTSSSSSSSSPSQPLPQPYSPIFSSPSSPLPSVYPLSSEFHPTAPVPALKAPPFPLLLLSPFLLSP\n
GLEAN3_23386	SPU_023386		The tiling data indicates that in the vicinity of the gene are significant ORFs which might represent additional exons. ORFs: AKMLAFITSHRTKARGKKEYDIVFPALFSWLYRVCFLAKRFIYAKQPQAMKPGGIGCLNEEGRKVLYKFLSFSFSPSTPILHAQINSRH,MFFVSPTNSTPNFSLATIRPRVVSAIALAAGLHPSNSIQHKSVKPSLGECRFEAEASSLTSFDLCLFALRFPLSLHSDIA,IRAYITVYNINIYVRHLEYDSSTNTLCVYVLSPSLYYSPSPSYLPLPISLSPSPFPKYFYVAYNALVISYRTLVDIFLGFCLNVFCFAN,LIHHPDMPAEIFTTPTSNTDTLLVQPRHFVYTIISIRPHAYKLMNPPPPSTSLFLSPSPSSTPFYYFFEVFPLAFRALVR,ERPNICPLASPDLSVLTKEAPVRINRTLLFLSLSPDPGPDNIQGISKSHSLFSLYFLSFFFQSPIPPPPPSHHPPQISSLP,SFTSIKYICLKDCSTISAHYPKHAHYQADTLLWFSLHSRMRRRWAIYCTRKIITRIPNDRAATLPDIHLFSLPASLPPTRYSPVIVH,IYLFKRLFYNICSLSQARTLSGRYPTVVLTPFTHEEKMGNLLHEENHNENSKRQGSNLARHTPVLLTCLIASYQVFTCNRPLRRDNSLPLSTVTDALFDAYAEVY\n
GLEAN3_07882	SPU_007882		Matches c-type lectin domain (cd00037).\n
GLEAN3_22405	SPU_022405		Ig-EGF-FN3 - a domain sequence unique to Tie1/2 in chordates \nLacks TM and kinase domains - suspect this is missing 3' end of gene.\n
GLEAN3_10943	SPU_010943		The GLEAN model directly corresponds to the previously cloned sea urchin (purpuratus) fascin 1 gene.\n
GLEAN3_23200	SPU_023200		variant b, 2 DSRM domains, the other gene has just one. \n
GLEAN3_10068	SPU_010068		one DSRM domain\n
GLEAN3_17592	SPU_017592		partial CDS containing an exon encoding part of the protease domain.  Nucleic acid sequence is very close to that of GLEAN3_11551.\n
GLEAN3_18708	SPU_018708		partial cds on a short scaffold.  Nucleic acid sequence is identical to that of GLEAN3_28742.\n
GLEAN3_19655	SPU_019655		partial CDS; predicted exons supported by weak signals on the transcriptome.\n
GLEAN3_06214	SPU_006214		PREDICTED: Strongylocentrotus purpuratus similar to CG11793-PA (LOC579361), mRNA Length=893 \n
GLEAN3_07151	SPU_007151		PREDICTED: Strongylocentrotus purpuratus similar to CG1548-PA (LOC575021), mRNA \n
GLEAN3_15385	SPU_015385		#\nPREDICTED: Strongylocentrotus purpuratus similar to lipocalin 7 (LOC576823), mRNA \n
GLEAN3_21839	SPU_021839		GLEAN3_21839 encodes a partial 3'-terminal sequence of the Cdk2 mRNA \nThe sequence is entirely contained in GLEAN3_07655, Refer to it for further annotation\n
GLEAN3_01005	SPU_001005		4 SRCR domains. Hits DMBT1 (probably superficial).  Probably incomplete model.  Possibly part of GLEAN3_01004.\n
GLEAN3_01004	SPU_001004		TM-SRCR(8).  Probably partial model.  Possiby part of gene that includes GLEAN3_01005.  \n
GLEAN3_00984	SPU_000984		SRCR(5).  Probably incomplete.\n
GLEAN3_00672	SPU_000672		Two gene models seem to be fused together. The first 7 exons appear to belong to the NLR gene, although the first exon is also questionable. \nDomains: DEATH-NACHT-LRRs \n
GLEAN3_13206	SPU_013206		#\nThe Genscan model contains additional sequences at the 5' end and seems more accurate. The gene features and sequences were annotated according to this model. \nDomains: DEATH-NACHT-PYD-LRRs \n
GLEAN3_24528	SPU_024528		novel architecture - intermingled SR and FU repeats followed by a series of Ig domains and a TM \nlooks like a more complete version of the predicted gene UPI0000583F83 - similar to deleted in malignant brain tumors 1 isoform c precursor \n \ncompare GLEAN3_06068 - a duplication of the C-terminal half of this gene\n
GLEAN3_06952	SPU_006952		has all the extracellular features of a toll receptor - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor \n \nNote - adjacent gene (GLEAN3_06951) is very similar\n
GLEAN3_06951	SPU_006951		has all the extracellular features of a toll receptor - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor \n \nNote - adjacent gene (GLEAN3_06952) is very similar\n
GLEAN3_21178	SPU_021178		has all the extracellular features of a toll receptor - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_01106	SPU_001106		has all the extracellular features of a toll receptor - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_21751	SPU_021751		has all the extracellular features of a toll receptor - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_14039	SPU_014039		has the extracellular features of a toll receptor ( a bit spaced out) - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_14845	SPU_014845		has the extracellular features of a toll receptor ( a bit spaced out) - no TIR in cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_07065	SPU_007065		has the extracellular features of a toll receptor BUT no predicted TM and no TIR in putative cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_22463	SPU_022463		has the extracellular features of a toll receptor ( a bit spaced out) BUT no predicted TIR in putative cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_18237	SPU_018237		has the extracellular features of a toll receptor ( maybe fewere LRRs - could be missing N-terminus) BUT no predicted TIR in putative cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_01436	SPU_001436		has the extracellular features of a toll receptor ( maybe fewer LRR repeats)) BUT no predicted TIR in putative cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_18238	SPU_018238		has the extracellular features of a toll receptor ( maybe fewer LRR repeats)) BUT no predicted TIR in putative cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_20527	SPU_020527		has the extracellular features of a toll receptor BUT no predicted TM or TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_19916	SPU_019916		has the extracellular features of a toll receptor ( maybe fewer LRR repeats)) BUT no predicted TM or TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_26114	SPU_026114		has the extracellular features of a toll receptor ( maybe a bit spaced out) BUT no predicted TM or TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_08477	SPU_008477		has the extracellular features of a toll receptor ( maybe a few more than usual LRRs) BUT no predicted TM or TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_07476	SPU_007476		looks rather like  an LRR/Ig membrane receptor - a set of LRR with NT a CT domain followed by Ig-like and TM - lacks an LRR-NT domain - could be incomplete \n \nBEST MATCH IS LRIG receptors \n
GLEAN3_24251	SPU_024251		looks like a good model of an LRR/Ig membrane receptor - has complete set of LRR with NT and CT domains followed by Ig-like and TM - there are quite a few receptors of this type in humans \n \nBEST MATCH IS LRIG receptors - some homology with Gp-V of platelets\n
GLEAN3_01749	SPU_001749		has the extracellular features of a toll receptor (plus an LRR-NT domain at N-terminus) BUT no predicted TM or TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_26296	SPU_026296		has the extracellular features of a toll receptor ( maybe a few more than usual LRRs and has an N-terminal LRR-NT domain) BUT no predicted TIR cyto domain - could be incomplete gene or simply an LRR receptor\n
GLEAN3_00740	SPU_000740		Signal Peptide-SRCR(4)-TM.\n
GLEAN3_00654	SPU_000654		Signal Peptide-SRCR(2).  Possibly partial\n
GLEAN3_02925	SPU_002925		has partial LRR unit - LRR4/LRRCT and TM - could be fragment of a toll receptor or of another type of LRR receptor\n
GLEAN3_00646	SPU_000646		Sig Pep - SRCR(5).  Possibly partial.\n
GLEAN3_07463	SPU_007463		one complete LRR unit LRR-NT/21LRR/LRR-CT\n
GLEAN3_01172	SPU_001172		SRCR(3)-TM.  Probably partial. (DMBT1)\n
GLEAN3_01177	SPU_001177		SRCR(2).  Probably Partial.(DMBT1)\n
GLEAN3_01229	SPU_001229		SRCR(5). Probaly partial. (DMBT1)\n
GLEAN3_01266	SPU_001266		SRCR(5)-TM. Probably partial.  (DMBT1)\n
GLEAN3_01601	SPU_001601		F5_F8_type_C(1)-SRCR(3). Probably partial.\n
GLEAN3_08669	SPU_008669		Putative conserved TAFII28 domain detected in BlastP search. \n \nBest Genbank hit was a predicted protein similar to TFIID subunit 11 in S. purpuratus (XP_789830; 477 bits, 3e-133. \n \nBest Genbank empirical support for TFIID subunit 11 is TAF11 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 28kDa [Homo sapiens], accession AAV38212, score 170 bits, 1e-40. \n \nAll exons are supported by tiling path array data. \nExonerate and Splign both support expression of this protein. \nHowever, query of Poustka's database does not retrieve significant hits. \n \nModified glean model by accepting Davidson's 3' UTR. No other changes were made.\n
GLEAN3_08435	SPU_008435		fragment of an Igc2/FN3 protein - 3 Ig and 1 FN3 - no TM\n
GLEAN3_01727	SPU_001727		SRCR(9). Probably partial. (DMBT1)\n
GLEAN3_01763	SPU_001763		SigPep-SRCR(2). Probably partial.(Hensin/DMBT1)\n
GLEAN3_07715	SPU_007715		EGF-Ca x3 - Igc2 x2 - FN3 - probably a fragment - this domain organisation is not particularly informative as to identity \n \nlooks a little like a fragment of Tie 1/2 but the exact domains are not quite right\n
GLEAN3_07629	SPU_007629		Ig-EGF-FN3 - probably a fragment - this domain organisation is not particularly informative as to identity\n
GLEAN3_01863	SPU_001863		SRCR(2). Probably partial. (DMBT1)\n
GLEAN3_14732	SPU_014732		Blasted protein sequence of human gene NM_005645 against Baylor to obtain Glean3_14732. \n \nBlasted glean gene against NCBI. Putative conserved domains detected: TFIID-18kDa (Pfam). Best Genbank hit: XP_796890. Best empirical data support (provisional acceptance at NCBI): TAF13 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 18kDa [Accession: NP_001016066, cDNA, Xenopus tropicalis; bits 145, 4e-34]. and an unknown protein sequence [Accession: AAH74456, cDNA, Xenopus laevis; score, bits 145, 5e-34]. \n \nExonerate and Splign data exist that support all four exons of the gene. Poustka's database lacks support for any exon. Tiling path array data support exons 2-4, but are inconclusive for the first exon. \n \nI omitted Davidson's 3' UTR because it was HUGE: longer than all the exons of the CDS combined. If the 3'UTR modification were accepted, the resultant gene would contain both Glean3_14732 (this gene) and Glean3_14733 (the next gene on the scaffold in the 3' direction). \n \n \n \n
GLEAN3_02028	SPU_002028		SRCR(2). Probably partial. (DMBT1)\n
GLEAN3_02350	SPU_002350	From the results of the BLAST search as well as the excel data, it appears that the sequence is distributed onto 2 different scaffolds.  Throughout the sequence, there appears to be several small gaps as well as sequence overlaps. There is Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be widely distributed with half of the values being weak and the other half being strong.	SRCR(13). Probably partial. (DMBT1)\n
GLEAN3_24857	SPU_024857		six Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_08622	SPU_008622		eight Ig and Igv repeats - probably a fragment of some adhesion protein or receptor \n \nC-terminal DEATH domain could be an artefact - no known proteins with this structure\n
GLEAN3_03127	SPU_003127		SRCR(4)-TM. Probably partial. (DMBT1)\n
GLEAN3_13889	SPU_013889		seven Ig-like repeats and a heme peroxidase domain - homolog of peroxidasin, although missing LRR repeats at N-terminus\n
GLEAN3_03384	SPU_003384		SRCR(4). Probably incomplete. \n
GLEAN3_12354	SPU_012354		nine Igc2 and Ig repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03526	SPU_003526		SRCR(6). Probably partial. (hensin/dmbt1)\n
GLEAN3_20291	SPU_020291		seven Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_06024	SPU_006024		fifteen Ig Igc2 and Igv repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03778	SPU_003778		SigPep-SRCR(6). Probably incomplete. (DMBT1)\n
GLEAN3_03930	SPU_003930		SRCR(10). Probably partial. (DMBT1)\n
GLEAN3_12352	SPU_012352		nine Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03963	SPU_003963		SRCR(6)-TM. Probably partial. (DMBT1). Maybe continuous with GLEAN3_03964, GLEAN3_03965, GLEAN3_03966\n
GLEAN3_22647	SPU_022647		eight Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_27956	SPU_027956	After reviewing the data and performing a BLAST search, it appears that this is the best results for this particular GLEAN model. When examining the excel data, it appears that there are several internal repeats present throughout the scaffold. There is also a small gap within the sequence that ranges from 2647-2860. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values ranging below 10. 	six Ig and Ig-like repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_25432	SPU_025432		fourteen Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03964	SPU_003964		SRCR(3)-TM. Probably partial. (DMBT1). Maybe continuous with GLEAN3_03963, GLEAN3_03965, GLEAN3_03966\n
GLEAN3_10123	SPU_010123		eight Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03965	SPU_003965		SRCR(2). Probably partial. (DMBT1). Maybe continuous with GLEAN3_03963, GLEAN3_03964, GLEAN3_03966\n
GLEAN3_15581	SPU_015581	For this particular GLEAN model there was an error message that was reiceived when a BLAST search was done. The message indicated that the server encountered an internal error or misconfiguration and was unable to complete the request. This may have occurred due to numerous internal repeats present within the sequence. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak (besides 2 outliers at about 30). 	eight Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_03966	SPU_003966		SRCR(3)-TM. Probably partial. (DMBT1). Maybe continuous with GLEAN3_03963, GLEAN3_03964, GLEAN3_03965\n
GLEAN3_22021	SPU_022021		six Igc2 repeats - probably a fragment of some adhesion protein or receptor \n \nlong low-complexity sequence preceding Ig domains is suspicious \n
GLEAN3_00532	SPU_000532		eight Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_12350	SPU_012350		ten Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_06022	SPU_006022		eleven Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_12355	SPU_012355		six Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_00453	SPU_000453		ten Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_13978	SPU_013978		six Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_21326	SPU_021326		six Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_21747	SPU_021747		fourteen Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_00533	SPU_000533		eight Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_21904	SPU_021904	From the BLAST results as well as the excel data, it is evident that this is the best fit for this particular GLEAN model. When reviewing he excel data, it was apparent that there were several internal repeats as well as gaps present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall, with most of the values being >5 (excluding one outlier present at approximately 47.)	seven Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor\n
GLEAN3_08653	SPU_008653		seven Ig and Igc2 repeats - probably a fragment of some adhesion protein or receptor \n \nlong low-complexity sequence surrounding Ig domains is suspicious \n
GLEAN3_27927	SPU_027927		Exon 2 of the Glean model does not Blast to NLR proteins. May be erroneous. There are unresolved (NNN) sequences within this model as well. \nDomains: NACHT, DEATH, LRR \n
GLEAN3_03942	SPU_003942		#\nHomo sapiens mRNA for polyglutamine binding protein variant 14(PQBP1 gene).\n
GLEAN3_27335	SPU_027335		PREDICTED: Strongylocentrotus purpuratus similar to Galanin receptor type 2 (GAL2-R) (GALR2) (LOC580125), mRNA \nLength=1029 \n
GLEAN3_11732	SPU_011732		#\nStrongylocentrotus purpuratus similar to gamma-aminobutyric acid  A receptor, epsilon (LOC585340), mRNA\n
GLEAN3_08212	SPU_008212		PREDICTED: Strongylocentrotus purpuratus similar to Somatostatin receptor type 2 (SS2R) (SRIF-1) (LOC581556), mRNA\n
GLEAN3_16241	SPU_016241		PREDICTED: Strongylocentrotus purpuratus similar to Glycogenin-1 (LOC589298), partial mRNA \n
GLEAN3_27885	SPU_027885		PREDICTED: Strongylocentrotus purpuratus similar to ceruloplasmin (LOC586705), mRNA \n
GLEAN3_20534	SPU_020534		#\nPREDICTED: Strongylocentrotus purpuratus similar to Surfeit locus protein 1 (LOC592318), mRNA \n
GLEAN3_19373	SPU_019373		PREDICTED: Strongylocentrotus purpuratus similar to surfeit 5 isoform b (LOC583077), mRNA. \n
GLEAN3_18029	SPU_018029		#\nTSPN and TSP1 domain combination. This domain combination is usually found in subgroup A thrombospondins. One fly protein has just these 2 domains. \nThe TSPN domain is most closely related to the one found in human collagen11a1. \nThe gene prediction probably includes some repetative sequence elements\n
GLEAN3_24181	SPU_024181	After reviewing the data and performing a BLAST search, it appears that there is no good GLEAN model that fits GLEAN3_24181 sufficiently. When reviewing the excel data it appears that the sequence is distributed onto 3 different scaffolds. However, within these scaffolds there are several gaps and internal repeats present. In particular within subject gb|DS015741| there is a large gap that spans from 1113 to 1489. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the values being 5 and below. 	Likely C-terminal truncation due to end of contig\n
GLEAN3_25559	SPU_025559		Model contains only a TSPN domain but blast suggests that this domain is related to the Col15/18 family. The gene model is probably partial.\n
GLEAN3_04595	SPU_004595		Partial glean model for Nek8, duplicate of a longer glean model GLEAN3_05411\n
GLEAN3_28861	SPU_028861		Two overlapping glean predictions match DAP5 on scaffold 22799 (glean3_23932) and scaffold 105341 (glean3_28861).  \nNew gene model proposed: \nExon 1 Scaffold105341|5215|5219|+ \nExon 2 Scaffold105341|6813|6935|+ \nExon 3 Scaffold105341|8718|8820|+  \nExon 4 Scaffold22799|7457|7567|+ \nExon 5 Scaffold22799|7779|7805|+ \nExon 6 Scaffold22799|8433|8499|+ \nExon 7 Scaffold22799|8977|9069|+ \nExon 8 Scaffold22799|9731|9789|+ \nExon 9 Scaffold22799|11308|11490|+ \nExon 10 Scaffold22799|11835|11961|+ \nExon 11 Scaffold22799|13490|13579|+ \nExon 12 Scaffold22799|15302|15734|+ \nExon 13 Scaffold22799|16419|16569|+ \nExon 14 Scaffold22799|18222|18533|+ \nExon 15 Scaffold22799|19645|19860|+ \nExon 16 Scaffold22799|20736|21041|+ \nExon 17 Scaffold22799|21914|22071|+ \nExon 18 Scaffold22799|23074|23282|+ \nExon 19 Scaffold22799|23856|23977|+ \nExon 20 Scaffold22799|24686|24757|+ \nExon 4 to 8 are present in both saffolds.  Homologous sequences to exon 5 are found twice nearby (eg. on scaffold22799, 7657-7683 and 8050-8076), multiple isoforms? \nExon 15 is also duplicated in GLEAN3_04171.\n
GLEAN3_11191	SPU_011191		The encoded aa sequence is entirey contained in GLEAN3_03528.  \nIn GLEAN3_11191 the sequence encoded by exons 3 and 4 of GLEAN3_03528 are missing. \n
GLEAN3_23932	SPU_023932		Two overlapping glean predictions match DAP5 on scaffold 22799 (glean3_23932) and scaffold 105341 (glean3_28861).  \nsee glean3_28861 for gene model\n
GLEAN3_15935	SPU_015935		There is only one gene representing SNRPA and U2B" in Urchin.\n
GLEAN3_05411	SPU_005411		Partial gene model. The Cter part of the protein is on glean 04595 (duplicated exons identical). The Nterminal part of the protein is missing (not found on this genome assembly), the kinase domain is therefore incomplete. beginning of Glean3_05411 is located at the border of the scaffold and do not reflect the real Nter.\n
GLEAN3_28068	SPU_028068		PREDICTED: Strongylocentrotus purpuratus similar to large conductance calcium-activated potassium channel subfamily M alpha member 1 isoform b (LOC578468), mRNA \n
GLEAN3_10354	SPU_010354		#\nGLEAN3_10354 lies on minus strand of Scaffold90906. Protein is probably missing N-ter (comparative analysis of transmembrane helices in the prot family).  \nAnalysis of genomic region 5' of glean prediction (used Genescan and GeneMark to look for additional exons): not conclusive. \nScaffold90906 seq is incomplete 5' to glean_10354, additional exons may lie there. Alternatively, N-ter may possibly lie on GLEAN3_06608 (Scaffold109591). \n \n
GLEAN3_22934	SPU_022934		This model was annotated based on multiple protein sequence alignments and a manual analysis of predicted domain structures. \n \nThe annotation of this model is supported by reciprocal blasting and the observation that the domain structure of this model is similar to that of plasminogen genes. It should be noted, however, that given the position of this model in the current assembly (in a scaffold containing various sequence gaps), this model may be only partial and could be significantly improved as updated versions of the assembly become available.\n
GLEAN3_00633	SPU_000633		This model was annotated based on multiple protein sequence alignments and a manual analysis of predicted domain structures. \n \nThe annotation of this model is supported by reciprocal blasting and the observation that the domain structure of this model is similar to that of plasminogen genes. It should be noted, however, that the protein predicted by this model resembles a partial plasminogen. The position of this model in the current assembly is inconclusive with regards to the possibility that this model may be incomplete or that it may represent a novel protein related in structure to plasminogen.\n
GLEAN3_04292	SPU_004292		This gene is split between two scaffolds.  This model encodes the N-terminal portion of the protein.  The C-terminal portion, with some overlap, is found in Glean3_25615.\n
GLEAN3_25615	SPU_025615		This model encodes the C-terminal portion of the Rbl-1 protein.  The N-terminal portion (with some overlap) is found in Glean3_04292.\n
GLEAN3_04011	SPU_004011		SRCR(3). Probably partial. (DMBT1)\n
GLEAN3_14731	SPU_014731		This model may be duplicated in GLEAN3_14730. Please refer to GLEAN3_14730 for details and comments.\n
GLEAN3_04086	SPU_004086		SRCR(4). Probably partial. (DMBT1)\n
GLEAN3_04100	SPU_004100		SigPep-SRCR(2)-TM. Possibly part of gene that includes GLEAN3_101(and 102?). (Brain Ser prot/hensin/DMBT1)\n
GLEAN3_06242	SPU_006242		Gene correctly predicted\n
GLEAN3_04101	SPU_004101		SigPep-SRCR(7)-TM. Possibly part of gene that includes GLEAN3_100(and 102?). (DMBT1)\n
GLEAN3_14730	SPU_014730		This model was annotated based on multiple protein sequence alignments and a manual analysis of predicted domain structures. \n \nThe annotation of this model is supported by reciprocal blasting and the observation that the domain structure of this model is similar to that of plasminogen genes. It should be noted, however, that given the position of this model in the current assembly (close to the end of a scaffold), this model may be only partial and could be significantly improved as updated versions of the assembly become available. \n \nIt should also be noted that this model may be duplicated in an adjacent model (GLEAN3_14731). They're 98% identical at the protein level and they are located on separate contigs, which might indicate a very recent true gen duplication event or an assembly problem (haplotypes?).\n
GLEAN3_04160	SPU_004160		SRCR(8). possible partial. (DMBT1)\n
GLEAN3_02088	SPU_002088		GLEAN3_13821 appears to be identical.\n
GLEAN3_04642	SPU_004642		SRCR(2)-Sushi(1?).  Possibly partial. (DMBT1)\n
GLEAN3_05000	SPU_005000		SigPep-SRCR(5).  Possibly partial.  (DMBT1)\n
GLEAN3_05154	SPU_005154		SRCR(7). Possibly partial.\n
GLEAN3_05420	SPU_005420		SRCR(5)-Sushi(1)-TM.  Possibly partial.\n
GLEAN3_05464	SPU_005464		SRCR(6)-TM. Possibly partial.\n
GLEAN3_05414	SPU_005414		This model was annotated based on multiple protein sequence alignments and a manual analysis of predicted domain structures. \n \nThe annotation of this model is supported by reciprocal blasting and the observation that the domain structure of this model is similar to that of plasminogen genes. It should be noted, however, that given the position of this model in the current assembly (close to an end of a small scaffold), this model may be only partial and could be significantly improved as updated versions of the assembly become available.\n
GLEAN3_06608	SPU_006608		Partial sequence \nGLEAN3_06608 (Scaffold109591) probably contains the N-ter of Rh50, C-ter to be found on GLEAN3_10354 (minus strand of Scaffold90906): Or else these are two different genes. \n \nProbably only the first two exons code for Rh50 protein \nThird exon (pos.18813-18900) probably not real:deleted \n \n
GLEAN3_05556	SPU_005556		#\nSRCR(9). Possibly partial.\n
GLEAN3_27802	SPU_027802		PREDICTED: Strongylocentrotus purpuratus similar to Ubiquitin ligase protein RNF8 (RING finger protein 8) (LOC583203), mRNA \n
GLEAN3_16506	SPU_016506		Model inaccurately predicts N-terminus (>100 amino acids have been omitted!) \n \n \nGLEAN3_21385 appears to be identical.\n
GLEAN3_05860	SPU_005860		SRCR(3)-TM-PTPc. Unique domain structure.  Possibly partial.\n
GLEAN3_09188	SPU_009188		Pfam PF00909\n
GLEAN3_18141	SPU_018141		Pfam PF00909\n
GLEAN3_00856	SPU_000856		PREDICTED: Strongylocentrotus purpuratus similar to potassium voltage gated channel, Shab-related subfamily, member 2 (LOC593326), mRNA Length=3318 \n
GLEAN3_13823	SPU_013823		#\nOne EST (CD295368) appears to include the N-terminus of the protein and suggests the true start methionine is slightly upstream of the predicted start methionine in the GLEAN model. Based on this EST the model has been modified to include these additional amino acids at the N-terminus (MFCFRAILVLSACVVYGQKKEKTNVFTIKPVSSIYLPHYVAGKKSWGINKDAAVKTAYD...) Note that this added sequence contains a predicted signal sequence (a feature of other MSP130 proteins). \n \nGLEAN3_06387 appears to be identical over much of its sequence but contains additional proline/glutamine-rich repeats.\n
GLEAN3_01726	SPU_001726	When comparing the excel data with the BLAST results, it appears that the two sets of data do not coincide with each other. From the excel data it appears that the sequence begins on subject gb|DS008761| (until 1673 is reached) and continues on subject gb|DS011195| until the end of the sequence. The BLAST results display an entirely different sequence of base pairs that are not in the excel file. The BLAST results also do not have the amount of sequence coverage when compared to the data in the excel file. The sequence in the excel file ends at 2586 bases pairs while the BLAST results sequence ends at only 1016 base pairs. 	Partial sequence \nchimera: MelB N-ter + Amt3 C-ter \n \nMelB N-ter is given by: \n>GLEAN3_01726|Scaffold620|154587|154719| DNA_SRC: Scaffold620 START: 154587 STOP: 154719 STRAND: -  \n>GLEAN3_01726|Scaffold620|155996|156169| DNA_SRC: Scaffold620 START: 155996 STOP: 156169 STRAND: -  \n>GLEAN3_01726|Scaffold620|157562|157827| DNA_SRC: Scaffold620 START: 157562 STOP: 157827 STRAND: -  \n>GLEAN3_01726|Scaffold620|161305|162403| DNA_SRC: Scaffold620 START: 161305 STOP: 162403 STRAND: -\n
GLEAN3_06386	SPU_006386		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 30.71% over 368 BLAST alignment positions. 759 of 1189 Muscle alignment positions masked (63.800 %; 430 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_11265	SPU_011265		Partial sequence: N-ter only \n \nCDS at the end of scaffold \n \nPfam PF00909\n
GLEAN3_26576	SPU_026576		The sequence coded by this GLEAN is entirely contained in GLEAN3_15285; refer to this one for further annotation \nIn GLEAN3_26576, exon 1 and part of exon 2 are missing.\n
GLEAN3_14492	SPU_014492		Very similar at both the amino acid and nucleotide levels to 3 other GLEAN models: 21242, 15326, and 12567. It seems likely that at least some of these are haplotypes or improperly assembled genes. \n
GLEAN3_05008	SPU_005008		Partial CDS based on alignment with best blast hit sequence.  Transcriptome data strongly suggests that the following exons belong to this gene.  The first of the following predicted exons blasts to the same metalloprotease. \n>Supertig106616_1|Scaffold106616|20708|20789| DNA_SRC: Scaffold106616 START: 20708 STOP: 20789 STRAND: +  \nGGCTATTGAATTTACGTGGAGAAGATATTCCATACAACCCTCTCTTTATATCGTTTGCTATCGTCGGGAC \nCCAGTACATCAA \n>Supertig106616_1|Scaffold106616|21416|21513| DNA_SRC: Scaffold106616 START: 21416 STOP: 21513 STRAND: +  \nCCTATACCTCTATGATGCTAGCCGCCGGTTGGACACGGAGAGATATTCGGACTTGAGAGAACATTTGGAG \nCTGGACTCGTCTAGGTGCTCAAGCTACA \n>Supertig106616_1|Scaffold106616|23148|23305| DNA_SRC: Scaffold106616 START: 23148 STOP: 23305 STRAND: +  \nCTTCGTTCCCGACTACTTGCTTGAGAGCAAGGGATTTATCAGCCTTCACCGATGATCTCAGTACCGTTTC \nGTTTAGTAAGAAGGTCTGGTTTAGTAACTCATCTAACTACTTCGTCTACAAAACTATCACTGAGAGGCCT \nGATGTCAACAATGATAGG \n>Supertig106616_1|Scaffold106616|24152|24238| DNA_SRC: Scaffold106616 START: 24152 STOP: 24238 STRAND: +  \nAGTAAATATATCATGGAGCCCTCTCCCGTCCTGCTCATGAAAGCTGTCAAAAATGACGTCGAAGTGGAGG \nCGATGAACCAGGCATTC \n>Supertig106616_1|Scaffold106616|25692|25762| DNA_SRC: Scaffold106616 START: 25692 STOP: 25762 STRAND: +  \nGATCCAAAAGAAGGAGACGATCGATCGTTGACGGAATGGCTGGTTGCTCAGAAGACTGAAACATTCAGAG \nA \n>Supertig106616_1|Scaffold106616|26458|26535| DNA_SRC: Scaffold106616 START: 26458 STOP: 26535 STRAND: +  \nATCACATAGCAGTTATCAATACCCGAGTTACGAGACGATAGCCGCCGTAGGATACCACAGTGCCGACTAT \nTATTACCA \n>Supertig106616_1|Scaffold106616|27073|27144| DNA_SRC: Scaffold106616 START: 27073 STOP: 27144 STRAND: +  \nCCCTATAGAGGATGACCGGTTTGCCATACCTACTGGTAAGATGTTCCTCTATGACATGGGAGGACAGTAT \nAG \n>Supertig106616_1|Scaffold106616|27693|27813| DNA_SRC: Scaffold106616 START: 27693 STOP: 27813 STRAND: +  \nAGAAGGGACGACTACCCTCGCCCGAACCTTCTTCTTTGCCAAGGAATGGTATGAGGAGAATGAGAACCGT \nTATGAGTTTGATCGCACTTATGATCCTGCAAGGCCAACTGAATTTCAGCAG \n>Supertig106616_1|Scaffold106616|27868|27960| DNA_SRC: Scaffold106616 START: 27868 STOP: 27960 STRAND: +  \nGCTCTGACGCTCATGGCGCTTTACATATTTTTACCGTCACTGGGCACTGGTCATAGCCGAGCTGCCAAAG \nTAACTAGCGCTCAAGCATATCAG \n>Supertig106616_1|Scaffold106616|28493|28607| DNA_SRC: Scaffold106616 START: 28493 STOP: 28607 STRAND: +  \nTTTTCAGTGGACGAGGAGGGGGTGGCTAGTTGGGTTGGTCAACTTGACCGCTTTGGACGTCGTCGGTCGT \nTTATCGAAGTGATATATCGCGAGGATAAGGGAACCAATGTAGGGG \n\t\t \n \n \n \n
GLEAN3_07203	SPU_007203		Analysis indicates typical cysteines positioned for post-translational processing, protein folding and disulphide bonding in the mature peptide.An additional cysteine residue in a B domain is similar to the sequence of Ciona INS-L3. The B domain is very long in Sp-IGF1. Two dibasic sites in the sequence corresponding to a short C-peptide makes it vertebrate insulin- and relaxin-like. All true IGFs in vertebrates have lost dibasic sites in C-peptide.They are cleaved at the far end of the long C terminus, in an E domain. Two aromatic residues (YY) in the end of the putative B domain are crucial for biological activity of IGFs. \nUsually the D domain is short in length. Dibasic residues (RASR) at the beginning of the long E domain mark the C-terminal of D domain in IGFs. This makes the D domain of Sp-IGF 1 very long in contrast with other IGF sequences. \nAnnotated with the help of Robert Olinski (Robert.Olinski@neuro.uu.se) and Mohammed Idris (Idris@szn.it)  \n
GLEAN3_04690	SPU_004690		#\nNek11 is split on two overlapping glean models with 100%identity (exon 3 and 4). glean3_04690 and glean3_14659 \n
GLEAN3_14659	SPU_014659		Nek11 is split on two overlapping glean models with 100%identity. glean3_04690 and glean3_14659 \nsee glean3_04690 for gene model\n
GLEAN3_15736	SPU_015736		Probably has an extra exon predicted towards beginning.\n
GLEAN3_00811	SPU_000811		FIRST 170 AA corresponds to RAB3.  The rest-contains lots of repeated sequences - probably artifact\n
GLEAN3_15145	SPU_015145		Missing an exon at the beginning?\n
GLEAN3_00253	SPU_000253		contains rab domain \nNo signal on tiling array and no EST.  May be pseudogene or expressed in adult only.\n
GLEAN3_23334	SPU_023334		GLEAN3_16582 and GLEAN3_17442 have near exact match to c-terminal portions of the corrected gene model.  Exon 2 is present in the scaffold at 2 different positions.  \n \nQuery: 34   GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT 74 \n            GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT \nSbjct: 5190 GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT 5068 \n \nand \n \nQuery: 34   GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT 74 \n            GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT \nSbjct: 8917 GISMQGMDPSHVALTVLMLQNDLFDQFRCDRNVTLGLNHAT 8795 \n \nExon 6 is out of the scaffold on separate small scaffold as listed in gene features \n \n
GLEAN3_00503	SPU_000503		contians rab domain\n
GLEAN3_01288	SPU_001288		missing n-term\n
GLEAN3_13132	SPU_013132		GLEAN3_03272 is a partially similar prediction.\n
GLEAN3_03272	SPU_003272		GLEAN3_13132 is partially overlapping similar prediction.\n
GLEAN3_01818	SPU_001818		appears to be partial\n
GLEAN3_22309	SPU_022309		Part of sequence of this gene model is also found on another scaffold in GLEAN3_01872 (sequence almost identical for part; 3' end missing).  Is this a duplication or part of the genome that has been included in the assembly twice?\n
GLEAN3_06055	SPU_006055		SigPep-SRCR(4)-TM.\n
GLEAN3_06254	SPU_006254		SRCR(4)-TM. Possibly incomplete.\n
GLEAN3_06264	SPU_006264		SRCR(4). Probably incomplete.\n
GLEAN3_06531	SPU_006531		SRCR(4). Probably incomplete.\n
GLEAN3_02507	SPU_002507		rab and DNAj domain\n
GLEAN3_06538	SPU_006538		SRCR(2). Probably incomplete. Check in reference to GLEAN3_06539.\n
GLEAN3_06539	SPU_006539		SigPep-SRCR(12). Probably incomplete. Check GLEAN3_06538.\n
GLEAN3_21493	SPU_021493		AA 193-340 in prediction does not match human homologue; in fact matches nothing (except for another predicted S. purp sequence [XP_791406])when blasted by itself.  AA 497-682 of prediction matches a P. liv EST in the MaxPlank database.\n
GLEAN3_06659	SPU_006659		SRCR(6)-LRR(3). Possiby incomplete. Unique domain orgaization.\n
GLEAN3_06731	SPU_006731		SigPep-SRCR(6)-TM.\n
GLEAN3_07110	SPU_007110		SRCR(7). Probably incomplete.\n
GLEAN3_21121	SPU_021121		Overlap with Glean3_19174. This model appears to encode the C-terminal part of SpCul-3.\n
GLEAN3_19174	SPU_019174	After reviewing the data and performing a BLAST search, it appears that the data is distributed onto 2 different scaffolds. If the two scaffolds were combined the sequence would have a continuous, orderly arrangement without any gaps or repeats present. There was no Est. information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	Overlap with Glean3_21121.  This model appears to encode the N-terminus of SpCul-3.\n
GLEAN3_07349	SPU_007349		WSC(2)-SRCR-WSC-SRCR-WSC-TM. Possibly incomplete. Unique domain composition.\n
GLEAN3_01926	SPU_001926		Encodes sequences in middle of protein, corresponding to amino acids 447-659 of human cullin 4A.\n
GLEAN3_28294	SPU_028294		This protein contains the following domains: \nDEATH,NACHT and LRRs\n
GLEAN3_09731	SPU_009731		Appears to encode C-terminal portion of the protein, corresponding to amino acids downstream of residue number 704 of human cullin 4.\n
GLEAN3_18555	SPU_018555		Appears to encode the N-terminal portion of the protein, corresponding to amino acids 27-346 of human cullin 4.  Glean3_18556 also appears to have portions of this gene, assembled with parts of various other genes including PEX1.\n
GLEAN3_07370	SPU_007370		SigPep-SRCR(2)-HYR. Possibly incomplete.\n
GLEAN3_00832	SPU_000832		Previously cloned.  GLEAN prediction contains GAPS that should be rectified.  \n
GLEAN3_07372	SPU_007372		SRCR(2)-HYR(2)-IgC2-GPS-SRCR(2). Probably partial. See GLEAN3_07370.\n
GLEAN3_07618	SPU_007618		SigPep-SRCR(2)-TM.\n
GLEAN3_07660	SPU_007660		SRCR(3)-TM. Possibly partial.\n
GLEAN3_07718	SPU_007718		SRCR(6)-TM. Possibly partial.\n
GLEAN3_01439	SPU_001439		GLEAN3_17618 was another high scoring blast hit for POLR2C in the glean database.\n
GLEAN3_07781	SPU_007781		SRCR(10)-TM. Possibly partial. See GLEAN3_07782.\n
GLEAN3_11776	SPU_011776		This gene contains 2 NACHT domains which is very unusual. Also, it is located at the end of a Scaffold and could be incomplete. \nDomains: DEATH,NACHT,LRR,NACHT,LRRs\n
GLEAN3_13170	SPU_013170		The protein encoded by this model appears to be a duplication of the 5' and 3' halves.  The sequence shows two thioester sites, which is unheard of, and no cleavage sites for alpha and beta chains.  \n
GLEAN3_13169	SPU_013169		GLEAN3_13169 appears to be a duplication of the 5' end of GLEAN3_13170.\n
GLEAN3_07782	SPU_007782		SRCR(5). Probably partial.  See GLEAN3_07781.\n
GLEAN3_07893	SPU_007893		SRCR(6). Probably partial. See GLEAN3_07894, 07895, 07896, 07897, 07899, 07900.\n
GLEAN3_07840	SPU_007840		cub(4)-SRCR(2). Possibly partial.\n
GLEAN3_07894	SPU_007894		SRCR(4). Probably partial. See GLEAN3_07893, 07895, 07896, 07897, 07899, 07900.\n
GLEAN3_07895	SPU_007895		SRCR(2). Probably partial. See GLEAN3_07893, 07894, 07896, 07897, 07899, 07900.\n
GLEAN3_07896	SPU_007896		SRCR(7)-TM. Probably partial. See GLEAN3_07893, 07894, 07895, 07897, 07899, 07900.\n
GLEAN3_07897	SPU_007897		SRCR(2)-TM. Probably partial. See GLEAN3_07893, 07894, 07895, 07896, 07899, 07900.\n
GLEAN3_07899	SPU_007899		SigPep-SRCR(5)-TM. Probably partial. See GLEAN3_07893, 07894, 07895, 07896, 07897, 07900.\n
GLEAN3_07900	SPU_007900		SRCR(4)-TM. Probably partial. See GLEAN3_07893, 07894, 07895, 07896, 07897, 07899.\n
GLEAN3_12678	SPU_012678		This GLEAN is missing the N-terminal amino acid sequence of an alpha-tubulin, and is adjacent to another alpha-tubulin Gene Model, GLEAN3_12679.\n
GLEAN3_10277	SPU_010277		Only exons 3-22 are present on this GLEAN model on scaffold464. There is a gap of ~100bp where exons 15 (assuming that this part represnts only one exon) could be located. Exons 1-2 are present on scaffold35149, which has no GLEAN prediction.\n
GLEAN3_11107	SPU_011107		matches to the 3' end of GLEAN3_11106\n
GLEAN3_06756	SPU_006756		This GLEAN contains an insertion at its amino terminus that is inconsistent with assignment as a conventional alpha-tubulin.\n
GLEAN3_07984	SPU_007984		This GLEAN contains an insertion at its amino terminus that is inconsistent with assignment as a conventional alpha-tubulin.  There is a "T" missing, probably from sequencing error.\n
GLEAN3_27579	SPU_027579		This GLEAN has a good full-length match with Chlamydomonas reinhardtii RIB43A (E = 2.00E-42).\n
GLEAN3_25241	SPU_025241		One of 2. GLEAN3_22153 is a non-identical duplicate\n
GLEAN3_08432	SPU_008432		SRCR(4). Probably incomplete. \n
GLEAN3_28690	SPU_028690		This gene model has one thrombospondin domain and does not show high scoring matches to other sequences on GenGank.  \n
GLEAN3_08504	SPU_008504		SRCR(16). Probably incomplete.\n
GLEAN3_08514	SPU_008514		SRCR(2)-TM. Possibly incomplete.\n
GLEAN3_08598	SPU_008598		SRCR(2). Probably incomplete. \n
GLEAN3_11197	SPU_011197		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure analyses. \n \nThis annotation is supported by reciprocal blasting to Inhibitor of NFkappaB genes from various taxa and an identical domain structure. Its structure also strongly correlates with genome-wide tiling array hybridization data.\n
GLEAN3_08836	SPU_008836		SigPep-SRCR(2). Possibly incomplete.\n
GLEAN3_08885	SPU_008885		SRCR(3). probably incomplete.\n
GLEAN3_09145	SPU_009145	After reviewing the data it appears that there is no sufficient GLEAN model that fits model_GLEAN3_09145. The sequence appears to be distributed onto 2 different scaffolds. The BLAST results also displayed a large repeat within 1-165 region of the sequence that coincided with the discarded repeated portion of the excel data. There was Est information from GBrowse assembly V0.5 and the transcriptome intensity scores are widely distributed with values ranging from about 2-95.  This is an un-annotated gene so no additional comments were available from Baylor gene information. 	SRCR(2). Probably incomplete.\n
GLEAN3_05782	SPU_005782		contains rab, ras and trunk domains\n
GLEAN3_06362	SPU_006362		PARTIAL--MISSING N-TERMINUS \nNo signal in tiling array or EST.  May be pseudogene or adult only expression.\n
GLEAN3_24635	SPU_024635		 partial, missing C-terminus\n
GLEAN3_27868	SPU_027868		 missing central stretch\n
GLEAN3_28499	SPU_028499		 extra N- and C-terminus\n
GLEAN3_04006	SPU_004006		One of 2. GLEAN_21174 is a shorter internal version of this GLEAN. 21174 is mostly identical (protein level), but does not match on either predicted end. \n
GLEAN3_08777	SPU_008777		This GLEAN shares ~50% sequence identity over nearly its entire length with NP_653306 (Homo sapiens tektin-1).  Other than a discrepancy of a 39-amino acid insertion at the amino terminus of GLEAN3_19591, the coding regions of GLEAN3_08777 and GLEAN3_19591 are identical.\n
GLEAN3_19591	SPU_019591		This GLEAN shares ~50% sequence identity over nearly its entire length with NP_653306 (Homo sapiens tektin-1).  But the first 39 predicted amino acids after the initiator M (DAGATLLSRSYAPTIPVYPTQTTVGTKTDQALSQDLAKM) look like they don't belong.  Other than this discrepancy, the coding regions of GLEAN3_08777 and GLEAN3_19591 are identical.\n
GLEAN3_23618	SPU_023618		This GLEAN shares >50% sequence identity over nearly its entire length with NP_114104 (Homo sapiens tektin-3).\n
GLEAN3_06388	SPU_006388		IDENTICAL TO GLEAN3_06389 except at 3'end\n
GLEAN3_06453	SPU_006453		This GLEAN shares ~50% sequence identity over nearly its entire length with NP_444515.1 (Homo sapiens tektin-1).  The coding regions of GLEAN3_13841 and GLEAN3_06453 are identical.\n
GLEAN3_06392	SPU_006392		contains ADP-ribosyl-GH domain and rab domain\n
GLEAN3_20728	SPU_020728		This GLEAN shares ~52% sequence identity over ~400 amino acids with NP_055281 (Homo sapiens tektin-2).\n
GLEAN3_00049	SPU_000049		 missing N-terminus\n
GLEAN3_07237	SPU_007237		 fragment\n
GLEAN3_21063	SPU_021063		 half-molecule\n
GLEAN3_05461	SPU_005461		 fragment\n
GLEAN3_07285	SPU_007285	After reviewing the data, it appears that the sequence is on two different scaffolds. Scaffold >v2.1_scaffold73334 had the lowest e-value and highest bit score results, however, the continuation of the sequence on scaffold >v2.1_scaffold22829 was about 16 down on the BLAST results list (due to higher e-value and lower bit score). The transcriptome intensity scores were low (about a 5) and the there was no EST information available. This was an un-annotated gene and no additional gene information (comments) was available on Baylor.  	contains endo/exonuclease domains and phosphatase domains and ras/rho domain\n
GLEAN3_04038	SPU_004038		This is a short fragment that appears to encode a protein identical to the Sp-betaL Integrin. It is an incomplete sequence at the end of a short scaffold.  \n
GLEAN3_00348	SPU_000348		 fragment, should join with GLEAN3_00348, still incomplete gene\n
GLEAN3_00349	SPU_000349		 fragment, should join with GLEAN3_00348, still incomplete gene\n
GLEAN3_21469	SPU_021469		 extra N-terminus half\n
GLEAN3_24384	SPU_024384		 extra C-terminus half\n
GLEAN3_01041	SPU_001041		 contains 2 repeats matching a similar stretch\n
GLEAN3_03580	SPU_003580		 extra N- and C-terminus stretches\n
GLEAN3_04561	SPU_004561		 extra N-terminus region\n
GLEAN3_04242	SPU_004242		 unrelated stretch on N-terminus, partial match to the gene\n
GLEAN3_07951	SPU_007951		 fragment\n
GLEAN3_14894	SPU_014894		 partial, missing C-terminus half, some extra stretches in middle\n
GLEAN3_16354	SPU_016354		 unrelated N-terminus half, only the C-terminus half matches the gene\n
GLEAN3_14094	SPU_014094		 fragment\n
GLEAN3_11806	SPU_011806		 fragment\n
GLEAN3_11846	SPU_011846		 fragment\n
GLEAN3_11894	SPU_011894		 fragment\n
GLEAN3_11897	SPU_011897		 fragment\n
GLEAN3_12522	SPU_012522		 fragment\n
GLEAN3_13365	SPU_013365		 fragment\n
GLEAN3_13505	SPU_013505		 fragment\n
GLEAN3_13566	SPU_013566		 partial, missing N-terminus\n
GLEAN3_14209	SPU_014209		 fragment\n
GLEAN3_14873	SPU_014873		 fragment\n
GLEAN3_14903	SPU_014903		 fragment\n
GLEAN3_15530	SPU_015530		 fragment\n
GLEAN3_16239	SPU_016239		 fragment\n
GLEAN3_16280	SPU_016280		 fragment\n
GLEAN3_16701	SPU_016701		 fragment\n
GLEAN3_17893	SPU_017893		 extra N-terminus\n
GLEAN3_18043	SPU_018043		 fragment\n
GLEAN3_18288	SPU_018288		 fragment\n
GLEAN3_19705	SPU_019705		 fragment\n
GLEAN3_19912	SPU_019912		 fragment\n
GLEAN3_08410	SPU_008410		IDENTICAL TO 15580\n
GLEAN3_14036	SPU_014036		This gene is on three scaffolds (544, 81593 and 56300). On scaffold544 (exon 1) there is one GLEAN model (GLEAN3_25514) which covers exon1-3 for this gene. On scaffold81593 GLEAN_14036 is predicted (exon 2-15). For scaffold 56300, gene prediction is one GLEAN3_17882(exon 8-24) for this gene. Exon 2 and 3 are overlapped between two scaffolds (544 and 81593) and exon 8-15 are overlapped between scaffolds 81593 and 56300. Please refer to GLEAN3_25514 and GLEAN_17882 for gene features for far N-terminal and C-terminl portion respectively.\n
GLEAN3_17882	SPU_017882		This gene is on three scaffolds (544, 81593 and 56300). On scaffold544 there is GLEAN model (GLEAN3_25514) which covers exon1-3 for this gene.. On scaffold81593 GLEAN_14036 is predicted (exon 2-15). For scaffold 56300, gene prediction is GLEAN3_17882(exon 8-24) for this gene. Exon 2 and 3 are overlapped between two scaffolds (544 and 81593) nad exon 8-15 are overlapped between scaffolds 81593 and 56300. Please refer to GLEAN_14036 and GLEAN3_25514 for gene features for N-terminl portion.\n
GLEAN3_09123	SPU_009123		poor conservation \n
GLEAN3_25502	SPU_025502		Ig7/FN5/TM \n \nbest hit is Ds-CAM but the domain organization is not quite right for either Ds-CAM (9-4-1-2) or DCC (4-6) \nDoes not have Neogenin_C cytoplasmic match \n \nprobably neither but some related gene\n
GLEAN3_04328	SPU_004328		Igc2-4/FN3-5/TM \n \nHigh Blast hit (not #1) with DCC \nC-terminus (putative cyto domain) does not have Neogenin_C \nand is not homologous with DCC in Blast \n \nDomain organisation not consisitent with Ds-CAM or DCC (4-6)\n
GLEAN3_09220	SPU_009220	From the BLAST results and the excel data, it is evident that this sequence is distributed onto 2 different scaffolds for this particular GLEAN model. When reviewing the excel data, it is appears that both scaffolds have an orderly arrangement, however, there are several internal repeats and sequence overlaps present in both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with all of the values being greater than 5. 	SigPep-SRCR-WSC-CUB. Possiby incomplete.\n
GLEAN3_09354	SPU_009354		SRCR(4). Possibly partial.\n
GLEAN3_09496	SPU_009496		SigPep-SRCR(6). Possibly incomplete.\n
GLEAN3_09562	SPU_009562		SRCR(2). Probably partial.\n
GLEAN3_09676	SPU_009676		SRCR(3). Probably incomplete. See GLEAN3_09677.\n
GLEAN3_05039	SPU_005039		has GPS and a single TM - probably missing C-terminus. \nfive LDLa and one EGF in exodomain - there are no reported LDLa-LNB7TM GPCRs but St purp has several possible members \n \nNovel domain structure \n
GLEAN3_09677	SPU_009677		SRCR(3). Probably incomplete. See GLEAN3_09676.\n
GLEAN3_09753	SPU_009753		SRCR(3). Probably partial.\n
GLEAN3_05758	SPU_005758		has GPS and four TM - probably missing C-terminus. \nthree LDLa in exodomain - there are no reported LDLa-LNB7TM GPCRs but St purp has several \n \nNovel domain structure \n
GLEAN3_09869	SPU_009869		contains Rab, Arf-GAP, and Ank domains\n
GLEAN3_19158	SPU_019158	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. The beginning of the sequence is covered by   v2.1_scaffold60900 while the remaining of the portion is covered by v2.1_scaffold45932. If the two scaffolds were combined, the sequence would have an orderly continuous, arrangement. There was Est support available from GBrowse V0.5 and the transcriptome intensity scores appeared to be somewhat weak with all of the values being less than 10. 	NO GPS but  7TM-1  \nfive LDLa in exodomain - there are no reported LDLa-LNB7TM GPCRs but St purp has several of this type \n \nNo GPS - looks most like glycoprotein hormone receptors \n
GLEAN3_09988	SPU_009988		#\nIg(2)-SRCR(6). Unique structure. Probably incomplete. See GLEAN3_09989\n
GLEAN3_26060	SPU_026060		has GPS but no TM - probably missing C-terminus. \ntwo LDLa and two EGF in exodomain - there are no reported LDLa-LNB7TM GPCRs but St purp has several \n \nNovel domain structure \n
GLEAN3_09910	SPU_009910		PARTIAL, MISSING N-TERMINUS\n
GLEAN3_09989	SPU_009989		SRCR(6). Probably incomplete. See GLEAN3_09988.\n
GLEAN3_10001	SPU_010001		SRCR(2). Probably incomplete.\n
GLEAN3_10062	SPU_010062		SRCR(6). Possibly incomplete.\n
GLEAN3_10226	SPU_010226		SRCR(4). Possibly incomplete. See GLAEN3_10227.\n
GLEAN3_00002	SPU_000002		CUB-LDLa x2 7TM \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10227	SPU_010227		F5_F8_type_C-SRCR(9). Probably incomplete. See GLEAN3_10226.\n
GLEAN3_23577	SPU_023577		FA58C-CUB-CLECT-LDLa x4-LRR x3 - 7TM \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \nNo known FA58C or LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10232	SPU_010232		SigPep-SRCR(4). Possibly incomplete. \n
GLEAN3_27136	SPU_027136		LDLa-LRRNT-LRRtypx3 - 7TM_1 \n \nNo GPS - looks most like glycoprotein hormone receptors \n
GLEAN3_10240	SPU_010240		SRCR(2). Possibly incomplete. See GLEAN3_10241.\n
GLEAN3_10241	SPU_010241		SRCR(2). Possibly incomplete. See GLEAN3_10240\n
GLEAN3_14777	SPU_014777		LDLa x5-EGFx2-Igc2-GPS - 7tm_2 \nLooks like a member of the LNB-7TM family of adhesion domain GPCRs \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_11062	SPU_011062		CUB - LDLa - 7tm_1 \nNo GPS and 7tm_1 - could be a meber of the LNB-7TM subfamily - more likely a member of glycoprotein hormane receptor family. \nNo known LDLa members of LNB7TM GPCR family \n
GLEAN3_10330	SPU_010330		#\nSigPep-SRCR(2)-HYR\n
GLEAN3_20284	SPU_020284		identical to parts of GLEAN3_27935\n
GLEAN3_10409	SPU_010409		SRCR(5)\n
GLEAN3_10501	SPU_010501		SigPep-SRCR(4). Possibly incomplete.\n
GLEAN3_22714	SPU_022714		LDLa x9-EGFCa x4-Ig 7TM_2 \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \n \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10523	SPU_010523		SRCR(2). Possibly incomplete.\n
GLEAN3_05239	SPU_005239		LDLa x10-LRRtyp x5 7TM \n \n \nNo GPS - looks most like glycoprotein hormone receptors \n \n
GLEAN3_10832	SPU_010832		SRCR(3)-TM. Possibly incomplete.\n
GLEAN3_10909	SPU_010909		SRCR(3)-TM. Possibly incomplete.\n
GLEAN3_04837	SPU_004837		LDLa x2-LRRtyp x5 7TM \n \nNo GPS - looks most like glycoprotein hormone receptors \n \nNo known LDLa members of LNB7TM GPCR family \n
GLEAN3_10953	SPU_010953		#\nSRCR(5). Possibly incomplete.\n
GLEAN3_09242	SPU_009242		CUBx5-FA58C-CUB-LDLa x3 5TM \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \n \nNo known FA58C or LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10991	SPU_010991		SRCR(6)-TM. Possibly incomplete. See GLEAN3_10992. 10993, 10994.\n
GLEAN3_05132	SPU_005132		CUB-CLECT-LDLa-EGF-LDLa x3 7TM \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \n \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10992	SPU_010992		SigPep-SRCR(2)-TM. Possibly incomplete. See GLEAN3_10991. 10993, 10994.\n
GLEAN3_10993	SPU_010993		SigPep-SRCR(5). Possibly incomplete. See GLEAN3_10991. 10992, 10994.\n
GLEAN3_12382	SPU_012382		CLECT-LDLa x6-LRRNT-LRRtyp x4 7TM-1 \n \nNo GPS and 7tm-1 rather than 7tm_2 but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \n \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_10994	SPU_010994		SRCR(2)-TM. Possibly incomplete. See GLEAN3_10991, 10992, 10993.\n
GLEAN3_07191	SPU_007191		fragment; see Glean3_07191 for larger Sp-Trh\n
GLEAN3_15872	SPU_015872		CUB-LDLa x2-LRRtyp x2 7TM_1 \n \nNo GPS but otherwise looks a bit like a member of the LNB-7TM family of adhesion domain GPCRs or, perhaps more likely, a glycoproteinhormone receptor \n \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_11101	SPU_011101		SigPep-SRCR(2). Possibly incomplete.\n
GLEAN3_11146	SPU_011146		SRCR(3). Possibly incomplete.\n
GLEAN3_15161	SPU_015161		LDLa-LRRtyp x5 7TM \n \nNo GPS - looks most like glycoprotein hormone receptors \n
GLEAN3_28049	SPU_028049		CUB-CLECT-LDLa x5/6-LRRtyp x 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_08926	SPU_008926		LDLa-Ig-GPS-7TM \nLooks like a member of the LNB-7TM family of adhesion domain GPCRs \n \nNo known LDLa members of LNB7TM GPCR family \nNovel architecture\n
GLEAN3_11222	SPU_011222	For this particular GLEAN model, when a BLAST search was done an error message came up indicating that there was a misconfiguration. When examining the excel data, it appears that there are similar sequences between the different subject queries and if the data would have been able to have been mapped onto V2.1 it seems like there may be some difficultly distinguishing between the scaffolds.  There also appears to be overlaps within the sequences resulting in no clear orderly arrangement. 	SigPep-SRCR(13)-TM.\n
GLEAN3_11752	SPU_011752		SRCR(2). Probably incomplete.\n
GLEAN3_13248	SPU_013248		LRRtyp x3 - 7TM \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_11977	SPU_011977		SigPep-SRCR(4). Possibly incomplete\n
GLEAN3_12039	SPU_012039		SigPep-SRCR(7). possibly incomplete.\n
GLEAN3_27096	SPU_027096		SR x4 7TM \n \nNo GPS but otherwise looks a bit like a member of the LNB-7TM family of adhesion domain GPCRs\n
GLEAN3_12159	SPU_012159		SRCR(3)-TM. Probably incomplete.\n
GLEAN3_19239	SPU_019239		SR x4 - LRRtyp x7 - 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_12230	SPU_012230		SigPep-SRCR(4)-EGF\n
GLEAN3_22189	SPU_022189		CUB-LRRtyp 4 - 7TM_1 \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_12410	SPU_012410		SigPep-SRCR(6). Possibly incomplete.\n
GLEAN3_03206	SPU_003206	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. However, there are several gaps within each scaffold and there are sequence overlaps between the scaffolds as well. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	LRRtyp x3 - 7TM \n \nNo GPS - looks most like glycoprotein hormone receptors \n
GLEAN3_12888	SPU_012888		#\nSRCR(3)-TM. Possibly incomplete.\n
GLEAN3_07937	SPU_007937		LDLa x2 - 7TM \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs \n
GLEAN3_13650	SPU_013650		SigPep-SRCR(3). possibly incomplete.\n
GLEAN3_13831	SPU_013831		SRCR(3). probably incomplete.\n
GLEAN3_16033	SPU_016033		LRRtyp x2 - 7TM-1 \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_19238	SPU_019238		SR x3 - LRRtyp x7 - 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_13958	SPU_013958	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. The first scaffold contains a large gap that spans from about 370-1050.this missing information is distributed onto the second scaffold. There are also several internal repeats present within both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 6.5.	SRCR(2)-TM. Possibly incomplete.\n
GLEAN3_14079	SPU_014079		SRCR(4)-TM. probably incomplete. See GLEAN3_14080.\n
GLEAN3_14080	SPU_014080		SigPep-SRCR(2). Probably incomplete. See GLEAN3_14079.\n
GLEAN3_14095	SPU_014095		SigPep-SRCR(4). Possibly incomplete.\n
GLEAN3_14769	SPU_014769		LRRtyp x10 - 7TM_1 \n \nNo GPS - looks most like glycoprotein hormone receptors \n \nNOTE GLEAN3_14765 contains a very similar gene fused with a Cathepsin gene \n
GLEAN3_14602	SPU_014602		SigPep-SRCR(6). Possibly incomplete.\n
GLEAN3_14829	SPU_014829		SRCR(7). Probably incomplete.\n
GLEAN3_12610	SPU_012610		LRRtyp x3 7TM \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_18295	SPU_018295		LRRtyp x2 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_08259	SPU_008259		LRRtyp x3 7TM  ShKT domain at N-terminus may be an artefact \n \nNo GPS but otherwise looks like a member of the LNB-7TM family of adhesion domain GPCRs or like glycoprotein hormane receptors\n
GLEAN3_18294	SPU_018294		LRRtyp x2 - 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
Sp-HIF-1a	SPU_030140		This is the last 18 exons of Glean3_01262. The rest of Glean3_01262 has been annotated as a separate gene, Sp-Birc6\n
GLEAN3_18887	SPU_018887		This is part of a sea urchin specific group of ADAM-TS metalloproteinase genes.  There are two worm ADAMTS genes with some sequence similarity (wormbase# F08C6.1a.1 and C02B4.1).   \n \nThe homeobox at the N-terminal is clearly a prediction/ assembly error.\n
GLEAN3_10561	SPU_010561		LRRtyp x3 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_21131	SPU_021131		 This is part of a sea urchin specific group of ADAM-TS genes.  There are two worm ADAMTS genes with some sequence similarity (wormbase# F08C6.1a.1 and C02B4.1).  This sequence also appears to be a haplotype but is missing a portion.\n
GLEAN3_24052	SPU_024052		LRRtyp x2 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_24290	SPU_024290		LRRtyp x2 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_15464	SPU_015464		LRRtyp x6 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_12494	SPU_012494		LRRNT/LRRtyp/LRRCT-GPS- 7TM-2 \n \nlooks like a member of the LNB-7TM subfamily of GPCRs\n
GLEAN3_03202	SPU_003202		only domain it contains is a part of the reprolysin domain\n
GLEAN3_07355	SPU_007355		only domain it contains is the reprolysin domain\n
GLEAN3_04726	SPU_004726		SR x4 - LRRtyp x3 - 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_06020	SPU_006020		may be part of the sea urchin only family of genes, but it contains only the metalloprotease and reprollysin domains which are the first parts of an ADAM-TS gene.   There are two worm ADAMTS genes with some sequence similarity (wormbase# F08C6.1a.1 and C02B4.1)\n
GLEAN3_18297	SPU_018297		SR x3 - LRRtyp x4 - 7TM_1 \n \n \nNo GPS - looks most like glycoprotein hormone receptors\n
GLEAN3_23004	SPU_023004		contains only the reprolysin domain\n
GLEAN3_15133	SPU_015133		LRRtyp x3 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_09503	SPU_009503		the sequence is similar to both ADAM-TS6 and ADAM-TS10, but it looks like it is a particial sequence containing only a TSP1 and ADAMs spacer.  \n
GLEAN3_11913	SPU_011913		roots the clade that contains vertibrate ADAM-TS2 and ADAM-TS3\n
GLEAN3_20686	SPU_020686		LRRtyp x3 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_02876	SPU_002876		LRRtyp x5 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_06540	SPU_006540		LRRtyp x2 - 7TM-1 \n \nNo GPS - looks most like glycoprotein/thyrotropin hormone receptors\n
GLEAN3_23903	SPU_023903		This gene is more common in arthropods than in vertebrates.  \n \nit has some domains that are characteristic of ADAM-TS proteins including TSP-1 and N-term ADAM spacer domain.  However, it has TY and KU domains which are novel for an ADAM-TS gene found in vertebrates, but normal for Papilin found in arthropods.  \n
GLEAN3_02031	SPU_002031		This gene is more common in arthropods than in vertebrates.  \n \nit has some domains that are characteristic of ADAM-TS proteins including TSP-1 and N-term ADAM spacer domain.  However, it has TY and KU domains which are novel for an ADAM-TS gene found in vertebrates, but normal for Papilin found in arthropods.   \n
GLEAN3_20547	SPU_020547		#\nsimilar to ADAM15, and to the ADAM15-like alleles, but distince enough to likely be a seperate gene. \n \nDisintegrin/ACR/TM - domain structure characteristic of an ADAM. \nNote that the gene is near to another gene with a very similar  \nstructure (GLEAN3_20545) \n
GLEAN3_22131	SPU_022131		PREDICTED: Strongylocentrotus purpuratus similar to 106 kDa O-GlcNAc transferase-interacting protein (LOC589805),\n
GLEAN3_23613	SPU_023613		PREDICTED: Strongylocentrotus purpuratus similar to Ubiquitin-conjugating enzyme E2-17 kDa (Ubiquitin-protein ligase)(Ubiquitin carrier protein) (Effete protein) (LOC586593) \n
GLEAN3_12475	SPU_012475		PREDICTED: Strongylocentrotus purpuratus similar to Separin \n(Separase) (Caspase-like protein ESPL1) (Extra spindle poles-like 1 protein) (LOC576865), mRNA. \n
GLEAN3_13841	SPU_013841		#\nThis GLEAN shares ~50% sequence identity over nearly its entire length with NP_444515.1 (Homo sapiens tektin-1).  The coding regions of GLEAN3_13841 and GLEAN3_06453 are identical.\n
GLEAN3_03152	SPU_003152		This F-box protein is most similar to Fbw7 (drosophila archipeligo), the F-box protein that targets cyclin E for destruction.  However, it is significantly less similar to human Fbw7 than is Glean3_19951, which is likely the Fbw7 ortholog, and I have thus named it "Fbw7-like".\n
GLEAN3_00883	SPU_000883		Likely to be incomplete.   Has 1 EGF, 1HYR, and 1 complete C lectin domain\n
GLEAN3_06781	SPU_006781		See also Glean3_07933.\n
GLEAN3_01854	SPU_001854		multiple EGFCa - GPS -7tm_2 \n \ngood match to overall pattern of LNB-7TM-GPCRs\n
GLEAN3_07778	SPU_007778		See also Glean3_13579, which encodes a nearly identical protein.\n
GLEAN3_07916	SPU_007916		CUB plus multiple EGFCa - GPS NO TMs predicted \n \nApart from lack of -7tm_2 this is a good match to overall pattern of LNB-7TM-GPCRs \nProbably missing C-terminus\n
GLEAN3_21402	SPU_021402		CUB x2 -multiple EGFCa - HormR-GPS x2 - NOTMs predicted \n \nApart from lack of 7tm_2 a good match to overall pattern of LNB-7TM-GPCRs \n \nC-terminus may be off\n
GLEAN3_07933	SPU_007933		Note that the protein sequence of this model is identical to that of Glean3_06781.\n
GLEAN3_07915	SPU_007915		multiple EGFCa - GPS -7tm_2 \n \ngood match to overall pattern of LNB-7TM-GPCRs \n \nNote adjacent gene is similar\n
GLEAN3_18381	SPU_018381		PREDICTED: Strongylocentrotus purpuratus similar to Huntingtin (Huntingtons disease protein) (HD protein) (LOC590871), partial mRNA. \n
GLEAN3_22093	SPU_022093		LDLa x2 - EGF x2- Ig - GPS - NO TMs predicted \n \nApart from lack of 7tm_2 domain this matches to overall pattern of LNB-7TM-GPCRs \nMay be missing C-terminus\n
GLEAN3_26092	SPU_026092		multiple EGF_Ca - Ig - GPS - 7tm_2 \nGood match to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_19231	SPU_019231		#\nCUB - multiple EGF_Ca - HormR - GPS - 7tm_2 \nGood match to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_08005	SPU_008005		2 EGF/EGFCas -GPS-7TM_2 \nmatches to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_03304	SPU_003304		#\nEGFCa -GPS-7TM_2 \nmatches to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_21254	SPU_021254	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When examining the first portion of the sequence it is evident that this scaffold has an orderly arrangement without any gaps or internal repeats present. The rest of the sequence is continued on the second scaffold which does contain several internal repeats present but no gaps. There was no Est. support available from GBrowse assembly V0.5. The transcriptome intensity scores were difficult to determine due to the scale of the graph.	2 EGFCa-GPS-7TM_2 \nmatches to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_07742	SPU_007742		#\nmultiple EGFCa - Ig -GPS-  three TM segments \nApart from absence of complete 7tm_2 domain matches to overall pattern of LNB-7TM-GPCRs \nMay be missing C-terminus\n
GLEAN3_13572	SPU_013572		multiple EGFCa - Ig -GPS-  single TM segment \nApart from absence of complete 7tm_2 domain matches to overall pattern of LNB-7TM-GPCRs \nMay be missing C-terminus\n
GLEAN3_13579	SPU_013579		The amino acid sequence of this gene model is nearly identical to that of Glean3_07778, although they differ at their C-termini.\n
GLEAN3_23715	SPU_023715		The sequence coded by this GLEAN is entirely contained in GLEAN3_05762; refer to this one for further annotation\n
GLEAN3_12362	SPU_012362		EGF - Ig -GPS- 7tm_2 \nmatches to overall pattern of LNB-7TM-GPCRs \n
GLEAN3_22320	SPU_022320		LDLa - EGF - Ig -GPS- single TM \nApart from lack of full 7tm_1 or 7tm_2 domain \nmatches to overall pattern of LNB-7TM-GPCRs or to hormone receptors\n
GLEAN3_14667	SPU_014667		CUB - SO - NO GPS- 7tm_2 \nBecause of complete 7tm_2 domain matches to overall pattern of LNB-7TM-GPCRs but no GPS - could also be a member of the glycoprotein hormone receptor subfamily \n
GLEAN3_24566	SPU_024566		#\nCUB - NO GPS - 7tm_2 \nBecause of complete 7tm_2 domain matches to overall pattern of LNB-7TM-GPCRs but no GPS - could also be a member of the glycoprotein hormone receptor subfamily \n
GLEAN3_22019	SPU_022019		CUB - SO - one pfam:collagen repeat - NO GPS- 7tm_2 \nBecause of complete 7tm_2 domain matches to overall pattern of LNB-7TM-GPCRs but no GPS - could also be a member of the glycoprotein hormone receptor subfamily \n
GLEAN3_09826	SPU_009826		#\nNOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_23643	SPU_023643		NOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_13269	SPU_013269		NOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_28112	SPU_028112		NOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_04496	SPU_004496		IDENTICAL sequence to GLEAN3_04096   \nNOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_14844	SPU_014844		SigPep-SRCR(3). Possibly partial.\n
GLEAN3_04085	SPU_004085		Gene fragment.\n
GLEAN3_14859	SPU_014859		SRCR(4). Possibly partial. See GLEAN3_14860.\n
GLEAN3_15163	SPU_015163		CCP- 4XHYR - GPS-7TM_2 \n \nmatches general pattern for LNB-7TM-GPCR receptors\n
GLEAN3_25755	SPU_025755		Scaffoldi4711 has both ends of what appears to be a single beta integrin subunit - BetaD (GLEAN3_12985). Amino acids 1-420 appear to be a novel integrin beta subunit.  The scaffold has jointed two unrelated genes and I have corrected the model to remove the 3 exons that are not part of this subunit.  The model is incomplete.  \n
GLEAN3_14860	SPU_014860		SRCR(2). Possibly partial. See GLEAN3_14859.\n
GLEAN3_16723	SPU_016723		3 CCP - GPS-7TM_2 \n \nmatches general pattern for LNB-7TM-GPCR receptors\n
GLEAN3_14992	SPU_014992		SRCR(8)-TM. Possibly incomplete. See GLEAN3_14993, 14994.\n
GLEAN3_20525	SPU_020525		FA58C- HYR - GPS-7TM_2 \n \nmatches general pattern for LNB-7TM-GPCR receptors \n \nSIMILAR DOMAIN COMPOSITION TO GLEAN3_13084 \nNOVEL ARCHITECTURE\n
GLEAN3_14993	SPU_014993	After reviewing the Excel data and performing a BLAST search it appears that the gene sequence is on two different scaffolds. There appears to be several short internal repeats within the second scaffold however, the repeats are in fact continuous with the rest sequence. Besides the repeats, it appears that the overall sequence between the two scaffolds is orderly and has good coverage. The BLAST search indicated that both scaffolds had low e-values and high bit scores as well. There was also no Est information available on the GBrowse assembly V0.5 and the transcriptome intensity scores appear to be weak (most are >5) as well. This is an un-annotated gene so no additional information was available from Baylor under gene information (comments)	RVT_1(probable prediction error?)-SRCR(4)-TM. Possibly incomplete. See GLEAN3_14992, 14994. \n
GLEAN3_05260	SPU_005260		Gene fragment\n
GLEAN3_17226	SPU_017226		2x HYR -EGF - GPS-7TM_2 \n \nmatches general pattern for LNB-7TM-GPCR receptors\n
GLEAN3_04759	SPU_004759		NOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_14994	SPU_014994		SRCR(9). Possibly incomplete. See GLEAN3_14992, 14993.\n
GLEAN3_15123	SPU_015123		SRCR(3). Possibly incomplete. \n
GLEAN3_15325	SPU_015325		SRCR(2). Probably incomplete.\n
GLEAN3_20144	SPU_020144		NOTE only last 170-ish aa of this GLEAN match to Rheb\n
GLEAN3_15387	SPU_015387		SigPep-SRCR(4). Possibly incomplete.\n
GLEAN3_15539	SPU_015539		SigPep-SRCR(2)-WSC. \n
GLEAN3_15548	SPU_015548		SRCR(4)-TM. Possibly incomplete.\n
GLEAN3_15937	SPU_015937		SRCR(2)-TM. Possibly incomplete. See GLEAN3_15938.\n
GLEAN3_15938	SPU_015938		SigPep-SRCR(3). Possibly incomplete. See GLEAN3_15937.\n
GLEAN3_15989	SPU_015989		SRCR(2). Possibly incomplete.\n
GLEAN3_07557	SPU_007557		Missing 5' segment, likely to be on a different scaffold\n
GLEAN3_15991	SPU_015991		SigPep-SRCR(2)-TM. See GLEAN3_15989.\n
GLEAN3_16195	SPU_016195		SRCR(3). Possibly incomplete.\n
GLEAN3_16373	SPU_016373		SRCR(2)-TM. Possibly incomplete. See GLEAN3_16374.\n
GLEAN3_16374	SPU_016374		SRCR(9). Possibly incomplete. See GLEAN3_16373.\n
GLEAN3_08979	SPU_008979		This gene model predicts a protein with two separate domains that are normally on different genes: a p53 DNA binding domain at its N-terminus, and the SPOC domain (from the spen transcriptional regulator) at its C-terminus.  There is a large gap in the sequence of the scaffold that lies between these domains however, which leads me to suspect that these are actually two separate genes that have been artificially fused in the computational predictions.  If this is the case then the C-terminal gene with the SPOC domain would be a homologue of Drosophila transcriptional regulator split ends (spen), and should be named "Sp-Spen".\n
GLEAN3_16531	SPU_016531		SRCR(2). Possibly incomplete.\n
GLEAN3_16880	SPU_016880		SRCR(4). Possibly incomplete.\n
GLEAN3_28905	SPU_028905		PREDICTED: similar to Prostaglandin E2 receptor, EP4 subtype  \n(Prostanoid EP4 receptor) (PGE receptor, EP4 subtype)\n
GLEAN3_15849	SPU_015849		poor homology (BLAST score = 5e-05)\n
GLEAN3_10562	SPU_010562		NOT Embryonically Expressed, maybe be psuedo-gene or adult-only gene\n
GLEAN3_03857	SPU_003857		PREDICTED: Strongylocentrotus purpuratus similar to thyroid hormone receptor interactor 12 (LOC578329), mRNA. \n
GLEAN3_25176	SPU_025176	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it was apparent that both scaffolds had an orderly and continuous arrangement without any internal repeats or gaps present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	Significant overlap with Glean3_11072, which encodes the C-terminal portion of ATM.  The last 7 exons of this model are from a completely different gene (an oxidoreductase), and were artifactually fused to this gene as a result of incomplete sequence on the scaffold between them.\n
GLEAN3_14345	SPU_014345		IDENTICAL to 01459 except at very 3' end\n
GLEAN3_01459	SPU_001459		Identical to 14345 except at very 3' end.   \nNOTE no tiling data or EST.  May be pseudogene or expressed in adult only.\n
GLEAN3_26783	SPU_026783		This model the majority of the ATR protein, but is missing C-terminus, which is encoded in Glean3_11017.  The N-terminus (~200 amino acids) predicted by this model is quite different from vertebrate ATR.\n
GLEAN3_11017	SPU_011017		This model appears to encode the C-terminus of the ATR protein.  There is significant overlap with Glean3_26783, which encodes the N-terminal portion (minus the N-terminus).\n
GLEAN3_09522	SPU_009522		GLEAN3_23882 and GLEAN3_03142 are PTEN hits\n
GLEAN3_24659	SPU_024659		Partial sequence.\n
GLEAN3_22223	SPU_022223		Contains C-lectin and PAN- Apple domains. Aligns to carboxy end of human versican-like protein.\n
GLEAN3_26772	SPU_026772		Partial sequence.\n
GLEAN3_14959	SPU_014959		GLEAN3_26257 is a high score hit with less coverage of the mouse sequence used for Blast Query.\n
GLEAN3_19405	SPU_019405		Base pairs 528-885 of this glean sequence are PNKP-like.\n
GLEAN3_10324	SPU_010324		GLEAN3_21010 is a significant hit limited to N-terminal coverage of the human xrcc1 sequence as Query.\n
GLEAN3_20872	SPU_020872		Partial sequence.\n
GLEAN3_03158	SPU_003158		Similar to Slingshot 1 and 2.\n
GLEAN3_00287	SPU_000287		N-term of protein similar to human vPARP. C-term has no similarity\n
GLEAN3_00991	SPU_000991		N-term of gene\n
GLEAN3_00992	SPU_000992		C-term of gene (see GLEAN3_00991)\n
GLEAN3_02082	SPU_002082		similar to C elegans unnamed gene\n
GLEAN3_22638	SPU_022638		Possible haplotype pair of GLEAN3_13171\n
GLEAN3_26339	SPU_026339		N-terminal portion of this gene might be GLEAN3_18354\n
GLEAN3_17320	SPU_017320		Possibly the N-term of GLEAN3_17321\n
GLEAN3_17321	SPU_017321		Possibly the C-term of GLEAN3_17320\n
GLEAN3_22501	SPU_022501		Partial sequence.\n
GLEAN3_19984	SPU_019984		partial sequence. Also see GLEAN3_00776.\n
GLEAN3_04377	SPU_004377		This GLEAN represents the sea urchin Outer Arm Dynein Light Chain 4, as defined by the Anthocidaris crassispina cDNA (gi|2754612|dbj|BAA24152.1| outer arm dynein light chain 4 [Anthocidaris crassispina]) and the RefSeq gi|72093505|ref|XP_794465.1| PREDICTED: similar to dynein, axonemal, light chain 4 [Strongylocentrotus purpuratus]. \n
GLEAN3_18404	SPU_018404		looks identical/very similar to Glean3_16657\n
GLEAN3_11682	SPU_011682		The predicted amino-terminal 50 amino acids of GLEAN3_11682 do not agree with the comparable regions of other Dynein Light Chain-2 sequences and are almost certainly incorrect.  GLEAN3_11682 is essentially identical to GLEAN3_11683, and identical for an extended length with GLEAN3_11681 and GLEAN3_11684. \nBecause there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_11682: Sp-Dynein Light Chain-2-4a \nGLEAN3_11681: Sp-Dynein Light Chain-2-4b \nGLEAN3_11683: Sp-Dynein Light Chain-2-4c \nGLEAN3_11684: Sp-Dynein Light Chain-2-4d.\n
GLEAN3_24497	SPU_024497		GLEAN3_24497 is essentially identical to GLEAN3_24498, GLEAN3_24499, and GLEAN3_24500.  Because there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_24497: Sp-Dynein Light Chain-2-5a \nGLEAN3_24498: Sp-Dynein Light Chain-2-5b \nGLEAN3_24499: Sp-Dynein Light Chain-2-5c \nGLEAN3_24500: Sp-Dynein Light Chain-2-5d.\n
GLEAN3_24498	SPU_024498		GLEAN3_24498 is essentially identical to GLEAN3_24497, GLEAN3_24499, and GLEAN3_24500.  Because there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_24497: Sp-Dynein Light Chain-2-5a \nGLEAN3_24498: Sp-Dynein Light Chain-2-5b \nGLEAN3_24499: Sp-Dynein Light Chain-2-5c \nGLEAN3_24500: Sp-Dynein Light Chain-2-5d. \n \n
GLEAN3_24499	SPU_024499		#\nGLEAN3_24499 is essentially identical to GLEAN3_24497, GLEAN3_24498, and GLEAN3_24500.  Because there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_24497: Sp-Dynein Light Chain-2-5a \nGLEAN3_24498: Sp-Dynein Light Chain-2-5b \nGLEAN3_24499: Sp-Dynein Light Chain-2-5c \nGLEAN3_24500: Sp-Dynein Light Chain-2-5d.\n
GLEAN3_24500	SPU_024500		GLEAN3_24500 is essentially identical to GLEAN3_24497, GLEAN3_24498, and GLEAN3_24499.  Because there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_24497: Sp-Dynein Light Chain-2-5a \nGLEAN3_24498: Sp-Dynein Light Chain-2-5b \nGLEAN3_24499: Sp-Dynein Light Chain-2-5c \nGLEAN3_24500: Sp-Dynein Light Chain-2-5d.\n
GLEAN3_11684	SPU_011684		Most of the predicted amino acid sequence of GLEAN3_11684 does not agree with the comparable regions of other Dynein Light Chain-2 sequences and is almost certainly incorrect.  GLEAN3_11684 is very similar to the neighboring gene models GLEAN3_11681, GLEAN3_11682, and GLEAN3_11683. \nBecause there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_11682: Sp-Dynein Light Chain-2-4a \nGLEAN3_11681: Sp-Dynein Light Chain-2-4b \nGLEAN3_11683: Sp-Dynein Light Chain-2-4c \nGLEAN3_11684: Sp-Dynein Light Chain-2-4d.\n
GLEAN3_11681	SPU_011681		The predicted amino-terminal 150 amino acids of GLEAN3_11681 do not agree with the comparable regions of other Dynein Light Chain-2 sequences and may be incorrect.  However, GLEAN3_11681 is identical with RefSeq XP_795373.1. GLEAN3_116821 is essentially identical for an extended length with GLEAN3_11682, GLEAN3_11683, and GLEAN3_11684. \nBecause there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_11682: Sp-Dynein Light Chain-2-4a \nGLEAN3_11681: Sp-Dynein Light Chain-2-4b \nGLEAN3_11683: Sp-Dynein Light Chain-2-4c \nGLEAN3_11684: Sp-Dynein Light Chain-2-4d.\n
GLEAN3_11683	SPU_011683		The predicted amino-terminal 50 amino acids of GLEAN3_11683 do not agree with the comparable regions of other Dynein Light Chain-2 sequences and are almost certainly incorrect.  GLEAN3_11683 is essentially identical to GLEAN3_11682, and identical for an extended length with GLEAN3_11681 and GLEAN3_11684. \nBecause there is a good chance that they do not represent four distinct gene products, they were named as follows: \nGLEAN3_11682: Sp-Dynein Light Chain-2-4a \nGLEAN3_11681: Sp-Dynein Light Chain-2-4b \nGLEAN3_11683: Sp-Dynein Light Chain-2-4c \nGLEAN3_11684: Sp-Dynein Light Chain-2-4d.\n
GLEAN3_08800	SPU_008800		The predicted amino acid sequence of GLEAN3_08800 is identical to those of GLEAN3_08799 and GLEAN3_08800. \nBecause there is a good chance that they do not represent three distinct gene products, they were named as follows: \nGLEAN3_08799: Sp-Dynein Light Chain-2-3a \nGLEAN3_08800: Sp-Dynein Light Chain-2-3b \nGLEAN3_08801: Sp-Dynein Light Chain-2-3c\n
GLEAN3_11875	SPU_011875		Partial duplicate prediction for GLEAN3_11837\n
GLEAN3_17987	SPU_017987		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_11970	SPU_011970		IDENTICAL TO 12085\n
GLEAN3_11134	SPU_011134		Duplicate partial prediction for GLEAN3_01605\n
GLEAN3_10689	SPU_010689		Incomplete prediction? GLEAN3_09456 is almost complete duplicate prediction for GLEAN3_10689.\n
GLEAN3_28471	SPU_028471		No signal in tiling array or EST.  May be pseudogene or adult only expression.\n
GLEAN3_21593	SPU_021593		This model appears to encode the middle part of the ATM protein.  The N- and C-terminal parts are encoded by Glean3_05652 and Glean3_11072, respectively.  There is sequence overlap between all three models.\n
GLEAN3_11072	SPU_011072		This model encodes the C-terminus of the ATM protein.  The other parts are encoded by Glean3_05652 (N-terminus) and Glean3_21593 (middle).  There is sequence overlap between all three models.  This model also has significant sequence overlap with Glean3_25176.\n
GLEAN3_05652	SPU_005652		This model appears to encode the N-terminal portion of ATM.  The rest of the protein falls on Glean3_11072 (C-terminus) and Glean3_21593 (middle).  There is sequence overlap between all three models.\n
GLEAN3_27613	SPU_027613		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 17, subfamily A, polypeptide 1 [Danio rerio]" (NP_997971.1) is 25.41% over 488 BLAST alignment positions. 383 of 817 Muscle alignment positions masked (46.800 %; 434 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_27796	SPU_027796		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 3, subfamily a, polypeptide 13 [Rattus norvegicus]" (NP_671739.1) is 40.64% over 342 BLAST alignment positions. 178 of 677 Muscle alignment positions masked (26.200 %; 499 positions used for tree generation) with a Muscle scorefile cutoff of 25. GLEAN3_27795 may be N terminus. Not very CYP3 like.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_28152	SPU_028152		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 4, subfamily v [Gallus gallus]" (NP_001001879.1) is 49.20% over 502 BLAST alignment positions. 531 of 956 Muscle alignment positions masked (55.500 %; 425 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_28699	SPU_028699		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 38.24% over 421 BLAST alignment positions. 271 of 732 Muscle alignment positions masked (37.000 %; 461 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_28922	SPU_028922		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " Cyp2f2 protein [Xenopus tropicalis]" (NP_001010999.1) is 31.80% over 217 BLAST alignment positions. 138 of 589 Muscle alignment positions masked (23.400 %; 451 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing C-terminus\n
GLEAN3_28934	SPU_028934		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 38.64% over 471 BLAST alignment positions. 238 of 714 Muscle alignment positions masked (33.300 %; 476 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_06272	SPU_006272		incomplete sequence\n
GLEAN3_16605	SPU_016605		Identical to 11157 (11157 is missing aa 1-36)\n
GLEAN3_11157	SPU_011157		IDENTICAL TO 16605 (EXCEPT 11157 is missing aa 1-36 of 16605)\n
GLEAN3_01094	SPU_001094		PARTIAL, MISSING N-TERMINUS\n
GLEAN3_12205	SPU_012205		Partial gene, missing the N-terminal half.\n
GLEAN3_08978	SPU_008978		This Glean3 model probably encodes the N-terminus of the p53 homologue predicted by the 5' end of Glean3_08979, and should be fused therewith.  The third exon is probably not real, as there is no evidence for expression in the tiling data.  At the same time, Glean3_08979 probably artifactually fuses two different genes, and needs to be broken up (see annotation to that gene model).\n
GLEAN3_15880	SPU_015880		IDENTICAL TO 08410\n
GLEAN3_13863	SPU_013863		IDENTICAL TO 18282\n
GLEAN3_04692	SPU_004692		Partial gene.  Missing N-terminus.\n
GLEAN3_12404	SPU_012404		Prediction is incomplete.\n
GLEAN3_13183	SPU_013183		Genescan predicted as sea squirt Halocynthia roretzi troponin 1\n
GLEAN3_24320	SPU_024320		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 46.11% over 193 BLAST alignment positions. 445 of 886 Muscle alignment positions masked (50.200 %; 441 positions used for tree generation) with a Muscle scorefile cutoff of 25. partial   COMMENTS from arm@stowers-institute.org:  missing stretch in middle\n
GLEAN3_25299	SPU_025299		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393108 [Danio rerio]" (NP_956433.1) is 41.15% over 384 BLAST alignment positions. 225 of 696 Muscle alignment positions masked (32.300 %; 471 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_25595	SPU_025595		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 51 [Strongylocentrotus purpuratus]" (NP_001001906.1) is 99.66% over 297 BLAST alignment positions. 436 of 902 Muscle alignment positions masked (48.300 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N&C-terminus\n
GLEAN3_25829	SPU_025829		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 46, subfamily a, polypeptide 1 [Mus musculus]" (NP_034140.1) is 40.65% over 492 BLAST alignment positions. 239 of 633 Muscle alignment positions masked (37.700 %; 394 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_25830	SPU_025830		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC613109 [Xenopus tropicalis]" (NP_001027517.1) is 39.67% over 489 BLAST alignment positions. 241 of 639 Muscle alignment positions masked (37.700 %; 398 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing some N-terminus residues\n
GLEAN3_25863	SPU_025863		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393108 [Danio rerio]" (NP_956433.1) is 38.95% over 439 BLAST alignment positions. 235 of 699 Muscle alignment positions masked (33.600 %; 464 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_25956	SPU_025956		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 33.50% over 397 BLAST alignment positions. 670 of 1128 Muscle alignment positions masked (59.300 %; 458 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_25969	SPU_025969		Missing N terminus due to scaffold truncation. \nSingle exon! \nBLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily A, polypeptide 1 [Gallus gallus]" (NP_990477.1) is 36.00% over 300 BLAST alignment positions. 184 of 658 Muscle alignment positions masked (27.900 %; 474 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_26188	SPU_026188		Incomplete - runs off end of scaffold missing last 2 exons \nBLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " thromboxane A synthase 1 [Danio rerio]" (NP_991172.1) is 27.33% over 311 BLAST alignment positions. 89 of 573 Muscle alignment positions masked (15.500 %; 484 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_26360	SPU_026360		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily j, polypeptide 9 [Rattus norvegicus]" (NP_786942.1) is 40.98% over 388 BLAST alignment positions. 254 of 733 Muscle alignment positions masked (34.600 %; 479 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_26373	SPU_026373		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily u, polypeptide 1 [Rattus norvegicus]" (NP_001019950.1) is 35.43% over 446 BLAST alignment positions. 358 of 810 Muscle alignment positions masked (44.100 %; 452 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_26477	SPU_026477		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC572000 [Danio rerio]" (NP_001020728.1) is 33.95% over 433 BLAST alignment positions. 269 of 726 Muscle alignment positions masked (37.000 %; 457 positions used for tree generation) with a Muscle scorefile cutoff of 25.    \nExon 2 duplicated as exon3: misassembly problem \nCOMMENTS from arm@stowers-institute.org:  extra N-terminus\n
GLEAN3_27153	SPU_027153		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393204 [Danio rerio]" (NP_956529.1) is 30.21% over 480 BLAST alignment positions. 1574 of 1945 Muscle alignment positions masked (80.900 %; 371 positions used for tree generation) with a Muscle scorefile cutoff of 25. GLEAN3_27152 may be N terminus   COMMENTS from arm@stowers-institute.org:  missing central stretch and C-terminus\n
GLEAN3_20876	SPU_020876		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, subfamily XXVIA, polypeptide 1 [Danio rerio]" (NP_571221.2) is 35.74% over 484 BLAST alignment positions. 246 of 704 Muscle alignment positions masked (34.900 %; 458 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_21087	SPU_021087		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC334097 [Danio rerio]" (NP_997884.1) is 34.84% over 442 BLAST alignment positions. 256 of 679 Muscle alignment positions masked (37.700 %; 423 positions used for tree generation) with a Muscle scorefile cutoff of 25. Early diverging CYP1?\n
GLEAN3_21185	SPU_021185		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 17, subfamily A, polypeptide 1 [Danio rerio]" (NP_997971.1) is 32.89% over 377 BLAST alignment positions. 202 of 609 Muscle alignment positions masked (33.100 %; 407 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N- and C-terminus\n
GLEAN3_21251	SPU_021251		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, subfamily 3A, polypeptide 62 [Rattus norvegicus]" (NP_001019403.1) is 31.63% over 332 BLAST alignment positions. 83 of 580 Muscle alignment positions masked (14.300 %; 497 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing C-terminus and stretch in middle\n
GLEAN3_22109	SPU_022109		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 35.99% over 389 BLAST alignment positions. 229 of 671 Muscle alignment positions masked (34.100 %; 442 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_22110	SPU_022110		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 36.19% over 409 BLAST alignment positions. 211 of 667 Muscle alignment positions masked (31.600 %; 456 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_22432	SPU_022432		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393108 [Danio rerio]" (NP_956433.1) is 38.48% over 434 BLAST alignment positions. 268 of 728 Muscle alignment positions masked (36.800 %; 460 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_22590	SPU_022590		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 37.72% over 501 BLAST alignment positions. 299 of 770 Muscle alignment positions masked (38.800 %; 471 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_22593	SPU_022593		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 38.52% over 514 BLAST alignment positions. 392 of 833 Muscle alignment positions masked (47.000 %; 441 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_23067	SPU_023067		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily u, polypeptide 1 [Rattus norvegicus]" (NP_001019950.1) is 34.01% over 444 BLAST alignment positions. 334 of 773 Muscle alignment positions masked (43.200 %; 439 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_23068	SPU_023068		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily u, polypeptide 1 [Rattus norvegicus]" (NP_001019950.1) is 32.14% over 448 BLAST alignment positions. 242 of 682 Muscle alignment positions masked (35.400 %; 440 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_24078	SPU_024078		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393108 [Danio rerio]" (NP_956433.1) is 33.23% over 310 BLAST alignment positions. 251 of 718 Muscle alignment positions masked (34.900 %; 467 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_24113	SPU_024113		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 41.63% over 430 BLAST alignment positions. 248 of 726 Muscle alignment positions masked (34.100 %; 478 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_18515	SPU_018515		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 37.69% over 459 BLAST alignment positions. 212 of 662 Muscle alignment positions masked (32.000 %; 450 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_19082	SPU_019082		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 43.40% over 235 BLAST alignment positions. 479 of 889 Muscle alignment positions masked (53.800 %; 410 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_19464	SPU_019464		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 A 37 [Gallus gallus]" (NP_001001751.1) is 39.19% over 296 BLAST alignment positions. 41 of 534 Muscle alignment positions masked (7.600 %; 493 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_19883	SPU_019883		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily B, polypeptide 1 [Danio rerio]" (NP_001013285.2) is 33.33% over 423 BLAST alignment positions. 264 of 693 Muscle alignment positions masked (38.000 %; 429 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_19898	SPU_019898		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2 [Homo sapiens]" (NP_000766.2) is 32.91% over 468 BLAST alignment positions. 170 of 636 Muscle alignment positions masked (26.700 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_19899	SPU_019899		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 34.40% over 436 BLAST alignment positions. 223 of 693 Muscle alignment positions masked (32.100 %; 470 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_20229	SPU_020229		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 4, subfamily v [Gallus gallus]" (NP_001001879.1) is 52.84% over 388 BLAST alignment positions. 738 of 1149 Muscle alignment positions masked (64.200 %; 411 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_20233	SPU_020233		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily d, polypeptide 22 [Rattus norvegicus]" (NP_612524.1) is 38.37% over 258 BLAST alignment positions. 201 of 633 Muscle alignment positions masked (31.700 %; 432 positions used for tree generation) with a Muscle scorefile cutoff of 25. N terminus is GLEAN3_20232   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_20234	SPU_020234		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily j, polypeptide 11 [Mus musculus]" (NP_001004141.1) is 33.48% over 466 BLAST alignment positions. 80 of 538 Muscle alignment positions masked (14.800 %; 458 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_20752	SPU_020752		Fragment of CYP2. Possible missasembly with GLEAN3_20751 \n \nBLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 46.67% over 225 BLAST alignment positions. 457 of 896 Muscle alignment positions masked (51.000 %; 439 positions used for tree generation) with a Muscle scorefile cutoff of 25. \n
GLEAN3_20753	SPU_020753		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 37.15% over 463 BLAST alignment positions. 214 of 667 Muscle alignment positions masked (32.000 %; 453 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_20756	SPU_020756		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 39.10% over 445 BLAST alignment positions. 193 of 642 Muscle alignment positions masked (30.000 %; 449 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_14843	SPU_014843		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily A, polypeptide 1 [Gallus gallus]" (NP_990477.1) is 34.70% over 464 BLAST alignment positions. 173 of 659 Muscle alignment positions masked (26.200 %; 486 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_15096	SPU_015096		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily j, polypeptide 11 [Mus musculus]" (NP_001004141.1) is 29.45% over 292 BLAST alignment positions. 51 of 521 Muscle alignment positions masked (9.700 %; 470 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N- and C-terminus\n
GLEAN3_15120	SPU_015120	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. There was however mRNA information available from SpBase. There were only 2 subject hits for this query. When reviewing the excel data, it appears that there is a gap within one of the subjects (gb|DS010423|) and if the two scaffolds were to be combined, there would be an overlap between the 2 sequences (at 262-432). There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5.\nAdditional gene information from Baylor annotation (comments):\nPartial sequence.     BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC406641 [Danio rerio]" (NP_998497.1) is 39.04% over 146 BLAST alignment positions. 3167 of 3510 Muscle alignment positions masked (90.200 %; 343 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus	Partial sequence.     BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC406641 [Danio rerio]" (NP_998497.1) is 39.04% over 146 BLAST alignment positions. 3167 of 3510 Muscle alignment positions masked (90.200 %; 343 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_15256	SPU_015256		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 36.93% over 501 BLAST alignment positions. 179 of 644 Muscle alignment positions masked (27.700 %; 465 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_15442	SPU_015442		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 28.47% over 439 BLAST alignment positions. 351 of 798 Muscle alignment positions masked (43.900 %; 447 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_16251	SPU_016251		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 39.87% over 449 BLAST alignment positions. 406 of 858 Muscle alignment positions masked (47.300 %; 452 positions used for tree generation) with a Muscle scorefile cutoff of 25. GLEAN3_16152 may be C terminus   COMMENTS from arm@stowers-institute.org:  missing N-terminus residues\n
GLEAN3_16442	SPU_016442		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily u, polypeptide 1 [Rattus norvegicus]" (NP_001019950.1) is 36.18% over 398 BLAST alignment positions. 353 of 803 Muscle alignment positions masked (43.900 %; 450 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing C-terminus\n
GLEAN3_16816	SPU_016816		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily u, polypeptide 1 [Rattus norvegicus]" (NP_001019950.1) is 38.73% over 377 BLAST alignment positions. 194 of 646 Muscle alignment positions masked (30.000 %; 452 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_17582	SPU_017582		Missing middle exon - part of I helix - due to missing contig data on scaffold. \nBLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily B, polypeptide 1 [Danio rerio]" (NP_001013285.2) is 27.59% over 464 BLAST alignment positions. 1535 of 1933 Muscle alignment positions masked (79.400 %; 398 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing C-terminus\n
GLEAN3_17986	SPU_017986		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 36.96% over 514 BLAST alignment positions. 328 of 785 Muscle alignment positions masked (41.700 %; 457 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_18242	SPU_018242		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393433 [Danio rerio]" (NP_956755.1) is 39.11% over 427 BLAST alignment positions. 308 of 696 Muscle alignment positions masked (44.200 %; 388 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_18372	SPU_018372		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 37.80% over 508 BLAST alignment positions. 398 of 845 Muscle alignment positions masked (47.100 %; 447 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_18468	SPU_018468		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC393108 [Danio rerio]" (NP_956433.1) is 35.41% over 466 BLAST alignment positions. 35 of 514 Muscle alignment positions masked (6.800 %; 479 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_00279	SPU_000279		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 17, subfamily A, polypeptide 1 [Danio rerio]" (NP_997971.1) is 44.06% over 202 BLAST alignment positions. 1620 of 2057 Muscle alignment positions masked (78.700 %; 437 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_14896	SPU_014896		No signal in tiling array or EST.  May be pseudogene or adult only expression.\n
GLEAN3_00645	SPU_000645		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 27, subfamily B, polypeptide 1 [Xenopus tropicalis]" (NP_001006907.1) is 39.75% over 483 BLAST alignment positions. 494 of 938 Muscle alignment positions masked (52.600 %; 444 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_00746	SPU_000746		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 35.25% over 278 BLAST alignment positions. 372 of 798 Muscle alignment positions masked (46.600 %; 426 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N- and C-terminus\n
GLEAN3_01622	SPU_001622		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, subfamily IID (debrisoquine, sparteine, etc., -metabolizing), polypeptide 6 [Bos taurus]" (NP_776954.1) is 24.95% over 477 BLAST alignment positions. 65 of 554 Muscle alignment positions masked (11.700 %; 489 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_01792	SPU_001792		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 39.95% over 398 BLAST alignment positions. 262 of 726 Muscle alignment positions masked (36.000 %; 464 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing some N-terminus residues\n
GLEAN3_02371	SPU_002371		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily j, polypeptide 6 [Mus musculus]" (NP_034138.2) is 30.14% over 554 BLAST alignment positions. 416 of 879 Muscle alignment positions masked (47.300 %; 463 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_02380	SPU_002380		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 4, subfamily v [Gallus gallus]" (NP_001001879.1) is 53.59% over 362 BLAST alignment positions. 753 of 1162 Muscle alignment positions masked (64.800 %; 409 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_02590	SPU_002590		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC613109 [Xenopus tropicalis]" (NP_001027517.1) is 43.27% over 342 BLAST alignment positions. 576 of 968 Muscle alignment positions masked (59.500 %; 392 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_02656	SPU_002656		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 38.43% over 458 BLAST alignment positions. 202 of 653 Muscle alignment positions masked (30.900 %; 451 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_02658	SPU_002658		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " CYP2A13 protein [Xenopus tropicalis]" (NP_001010998.1) is 34.52% over 478 BLAST alignment positions. 117 of 569 Muscle alignment positions masked (20.500 %; 452 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_02660	SPU_002660		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC613109 [Xenopus tropicalis]" (NP_001027517.1) is 42.62% over 298 BLAST alignment positions. 533 of 935 Muscle alignment positions masked (57.000 %; 402 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_02831	SPU_002831		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily b, polypeptide 19 [Mus musculus]" (NP_031840.1) is 35.96% over 178 BLAST alignment positions. 84 of 548 Muscle alignment positions masked (15.300 %; 464 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial\n
GLEAN3_02832	SPU_002832		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 4, subfamily v [Gallus gallus]" (NP_001001879.1) is 54.14% over 423 BLAST alignment positions. 512 of 927 Muscle alignment positions masked (55.200 %; 415 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_02884	SPU_002884		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 26, subfamily b, polypeptide 1 [Danio rerio]" (NP_997831.1) is 37.76% over 241 BLAST alignment positions. 267 of 709 Muscle alignment positions masked (37.600 %; 442 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_02898	SPU_002898		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily d, polypeptide 9 [Rattus norvegicus]" (NP_695225.1) is 25.47% over 475 BLAST alignment positions. 287 of 759 Muscle alignment positions masked (37.800 %; 472 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_03038	SPU_003038		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily A, polypeptide 1 [Gallus gallus]" (NP_990477.1) is 36.04% over 455 BLAST alignment positions. 949 of 1435 Muscle alignment positions masked (66.100 %; 486 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_03231	SPU_003231		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 1A1 [Sus scrofa]" (NP_999577.1) is 33.88% over 487 BLAST alignment positions. 163 of 595 Muscle alignment positions masked (27.300 %; 432 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_03232	SPU_003232		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 1, subfamily a, polypeptide 1 [Rattus norvegicus]" (NP_036672.2) is 34.65% over 456 BLAST alignment positions. 183 of 617 Muscle alignment positions masked (29.600 %; 434 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_03606	SPU_003606		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 38.82% over 492 BLAST alignment positions. 281 of 755 Muscle alignment positions masked (37.200 %; 474 positions used for tree generation) with a Muscle scorefile cutoff of 25.   Original GLEAN3 predictiona interrupted by non-LTR retrotransposon (GLEAN3_03604) and AP1 (GLEAN3_03603) possibly also from phage event. \nCOMMENTS from arm@stowers-institute.org:  extra N-terminus in original GLEAN model\n
GLEAN3_03607	SPU_003607		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily j, polypeptide 9 [Rattus norvegicus]" (NP_786942.1) is 40.05% over 382 BLAST alignment positions. 269 of 736 Muscle alignment positions masked (36.500 %; 467 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_03687	SPU_003687		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC613109 [Xenopus tropicalis]" (NP_001027517.1) is 33.94% over 330 BLAST alignment positions. 317 of 697 Muscle alignment positions masked (45.400 %; 380 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial\n
GLEAN3_05160	SPU_005160		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC572000 [Danio rerio]" (NP_001020728.1) is 33.69% over 466 BLAST alignment positions. 200 of 651 Muscle alignment positions masked (30.700 %; 451 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_05439	SPU_005439		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC613109 [Xenopus tropicalis]" (NP_001027517.1) is 42.06% over 504 BLAST alignment positions. 244 of 646 Muscle alignment positions masked (37.700 %; 402 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_05655	SPU_005655		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 31.26% over 531 BLAST alignment positions. 192 of 658 Muscle alignment positions masked (29.100 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_05668	SPU_005668		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 34.34% over 530 BLAST alignment positions. 230 of 693 Muscle alignment positions masked (33.100 %; 463 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_05931	SPU_005931		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 4F6 [Rattus norvegicus]" (NP_695230.1) is 36.55% over 435 BLAST alignment positions. 681 of 1195 Muscle alignment positions masked (56.900 %; 514 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  extra C-terminus\n
GLEAN3_06574	SPU_006574		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 39.67% over 484 BLAST alignment positions. 308 of 772 Muscle alignment positions masked (39.800 %; 464 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_07306	SPU_007306		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 37.73% over 440 BLAST alignment positions. 182 of 638 Muscle alignment positions masked (28.500 %; 456 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_07335	SPU_007335		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 CYP4F18 [Mus musculus]" (NP_077764.1) is 45.73% over 199 BLAST alignment positions. 658 of 1166 Muscle alignment positions masked (56.400 %; 508 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_07409	SPU_007409		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 17, subfamily A, polypeptide 1 [Danio rerio]" (NP_997971.1) is 35.33% over 450 BLAST alignment positions. 127 of 571 Muscle alignment positions masked (22.200 %; 444 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_07558	SPU_007558		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 4F6 [Rattus norvegicus]" (NP_695230.1) is 30.44% over 450 BLAST alignment positions. 579 of 1091 Muscle alignment positions masked (53.000 %; 512 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing C-terminus\n
GLEAN3_08152	SPU_008152		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 40.68% over 440 BLAST alignment positions. 273 of 739 Muscle alignment positions masked (36.900 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_08632	SPU_008632		Most CYP2-like urchin sequence. BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 41.78% over 450 BLAST alignment positions. 403 of 859 Muscle alignment positions masked (46.900 %; 456 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_09118	SPU_009118		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450 2AD3 [Danio rerio]" (NP_001020725.1) is 45.61% over 171 BLAST alignment positions. 379 of 822 Muscle alignment positions masked (46.100 %; 443 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_09512	SPU_009512		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 42.69% over 431 BLAST alignment positions. 210 of 690 Muscle alignment positions masked (30.400 %; 480 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_09692	SPU_009692		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 40.48% over 294 BLAST alignment positions. 221 of 692 Muscle alignment positions masked (31.900 %; 471 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing C-terminus\n
GLEAN3_09825	SPU_009825		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 38.85% over 296 BLAST alignment positions. 302 of 737 Muscle alignment positions masked (40.900 %; 435 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_10143	SPU_010143		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 38.30% over 376 BLAST alignment positions. 264 of 730 Muscle alignment positions masked (36.100 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  missing N-terminus\n
GLEAN3_10246	SPU_010246		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily U, polypeptide 1 [Homo sapiens]" (NP_898898.1) is 38.19% over 199 BLAST alignment positions. 187 of 653 Muscle alignment positions masked (28.600 %; 466 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus\n
GLEAN3_10563	SPU_010563		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2 [Homo sapiens]" (NP_000766.2) is 36.23% over 494 BLAST alignment positions. 238 of 714 Muscle alignment positions masked (33.300 %; 476 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_10576	SPU_010576		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 40.34% over 471 BLAST alignment positions. 76 of 553 Muscle alignment positions masked (13.700 %; 477 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_12080	SPU_012080		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 4, subfamily v, polypeptide 2 [Homo sapiens]" (NP_997235.2) is 50.75% over 469 BLAST alignment positions. 541 of 957 Muscle alignment positions masked (56.500 %; 416 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_12081	SPU_012081		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC510156 [Bos taurus]" (NP_001029545.1) is 51.39% over 469 BLAST alignment positions. 520 of 940 Muscle alignment positions masked (55.300 %; 420 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_12148	SPU_012148		Assembly error: see GLEAN3_25595 Percent ID to " cytochrome P450, family 51 [Strongylocentrotus purpuratus]" (NP_001001906.1) is 98.59% over 142 BLAST alignment positions.\n
GLEAN3_12500	SPU_012500		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 2, B [Danio rerio]" (NP_956914.1) is 38.26% over 311 BLAST alignment positions. 128 of 591 Muscle alignment positions masked (21.600 %; 463 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing N-terminus, missing stretch in middle\n
GLEAN3_12926	SPU_012926		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " hypothetical protein LOC553543 [Danio rerio]" (NP_001018358.1) is 33.33% over 312 BLAST alignment positions. 116 of 584 Muscle alignment positions masked (19.800 %; 468 positions used for tree generation) with a Muscle scorefile cutoff of 25.   COMMENTS from arm@stowers-institute.org:  partial, missing C-terminus\n
GLEAN3_13039	SPU_013039		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " MGC107863 protein [Xenopus tropicalis]" (NP_001015719.1) is 41.07% over 431 BLAST alignment positions. 233 of 700 Muscle alignment positions masked (33.200 %; 467 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_13199	SPU_013199		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " cytochrome P450, family 2, subfamily J, polypeptide 4 [Rattus norvegicus]" (NP_075414.2) is 27.34% over 523 BLAST alignment positions. 307 of 771 Muscle alignment positions masked (39.800 %; 464 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_14092	SPU_014092		BLAST identities confined to RefSeq only. Name assigned by phylogenetic methods (ME tree with ML branch lengths or heuristic ML). Percent ID to " thromboxane A synthase 1 [Danio rerio]" (NP_991172.1) is 36.29% over 463 BLAST alignment positions. 80 of 574 Muscle alignment positions masked (13.900 %; 494 positions used for tree generation) with a Muscle scorefile cutoff of 25.\n
GLEAN3_08801	SPU_008801		The predicted amino acid sequence of GLEAN3_08801 is identical to those of GLEAN3_08799 and GLEAN3_08800. \nBecause there is a good chance that they do not represent three distinct gene products, they were named as follows: \nGLEAN3_08799: Sp-Dynein Light Chain-2-3a \nGLEAN3_08800: Sp-Dynein Light Chain-2-3b \nGLEAN3_08801: Sp-Dynein Light Chain-2-3c\n
GLEAN3_27937	SPU_027937		The predicted amino terminal 43 amino acids of GLEAN3_27937 are inconsistent with the comparable regions of oher Dynein Light Chain Type 2 sequences, and are almost certainly incorrect; otherwise, this GLEAN matches the neighboring GLEAN3_27938 very well. \nBecause these may not represent independent gene products, They were named as follows: \nGLEAN3_27937: Sp-Dynein Light Chain-2-6a \nGLEAN3_27938: Sp-Dynein Light Chain-2-6b\n
GLEAN3_27938	SPU_027938		The predicted amino terminal 50 amino acids of GLEAN3_27938 are inconsistent with the comparable regions of oher Dynein Light Chain Type 2 sequences, and are almost certainly incorrect; otherwise, this GLEAN matches the neighboring GLEAN3_27937 very well. \nBecause these may not represent independent gene products, They were named as follows: \nGLEAN3_27937: Sp-Dynein Light Chain-2-6a \nGLEAN3_27938: Sp-Dynein Light Chain-2-6b\n
GLEAN3_26507	SPU_026507		five EGF - GPS - one TM segment \nApart from absence of a complete tm_2 domain looks like a member of the LNB-7TM-GPCR subfamily\n
GLEAN3_18579	SPU_018579		LDLa - 4-5 EGF - Igc2 - GPS x2 - NO TM predicted \n \nApart form lack of 7tm_2 domain looks like meber of LNB-TM7-GPCR subfamily \nC-terminal end probably misassembled\n
GLEAN3_13084	SPU_013084		SR - FA58C X3 - GPS - 7tm_2 \nmatches well with LNB-7TM-GPCR subfamily \n \nNOVEL ARCHITECTURE \nSIMILAR DOMAIN COMPOSITION TO GLEAN3_20525\n
GLEAN3_01671	SPU_001671		SR x4 - EGF - GPS - three TM segments \nApart from lack of a complet 7tm_2 domain, looks like a member of LNB-7TM-GPCR subfamily \n \nNovel architecture \n
GLEAN3_01056	SPU_001056		Gal-lectin/GPS/7tm_2 \n \na bit like latrophilin but lacks OLF domain\n
GLEAN3_03929	SPU_003929		Gal-lectin/HormR/GPS/7tm_2 \n \na bit like latrophilin but lacks OLF domain\n
GLEAN3_20012	SPU_020012		LamNT/EGF_Lam-8/FN3 x3 \n \nlikely N-terminus of usherin2A - mammalian forms have N-terminal LamGL domain in addition \n \nC-terminus encoded in GLEAN3_17733 \n \nPreviously vertebrate-restricted\n
GLEAN3_17733	SPU_017733		Large ECM or surface membrane protein with LamG and lots of FN3 repeats - homologous with the Usherin2A protein of vertebrates. \nMissing N-terminal part - LamNT AND EGF-Lam domains - probably encoded in GLEAN3_20012 \n \nPreviously vertebrate-restricted\n
GLEAN3_04543	SPU_004543		Using HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits. \nThis predicted protein had 2 serpin domains. This glean model could be a fusion of two genes. The Fgenesh predictions show two separate genes. There are very few serpins with 2 serpin domains. Some were found in ciona, rat and chicken.\n
GLEAN3_05904	SPU_005904		SR/SO/7TM_2 \n \nNovel architecture\n
GLEAN3_14551	SPU_014551		SO - 7tm_2 \n \nNovel architecture\n
GLEAN3_13142	SPU_013142		four SO repeats\n
GLEAN3_16249	SPU_016249		two SO repeats\n
GLEAN3_18658	SPU_018658		three SO repeats\n
GLEAN3_27632	SPU_027632		two SO repeats\n
GLEAN3_02711	SPU_002711		The gene prediction could be incomplete. It is at the end of a sort scaffold. \nUsing HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_07225	SPU_007225		homolog of Tubulointerstitial nephritis antigen -  \nSO-cysteine-type endopeptidase - peptidase domain scores as inactive as in human homolog \n \nWidely distributed phylogenetically\n
GLEAN3_13734	SPU_013734		FA58C/CLECT/EGF-Ca/CUB x3\n
GLEAN3_15863	SPU_015863		SEMA/PSI/IPT x2 \n \nlooks like a plexin - could be a c-met-related RTK\n
GLEAN3_26514	SPU_026514		SEMA/PSI/IPT x2 \n \nlooks like a plexin - could be a c-met-related RTK\n
GLEAN3_19919	SPU_019919		SEMA/PSI \ncould be a semaphorin or plexin or a c-met relative\n
GLEAN3_26431	SPU_026431		SEMA/PSI \ncould be a semaphorin or plexin or a c-met relative\n
GLEAN3_24673	SPU_024673		SEMA/PSI \ncould be a semaphorin or plexin or a c-met relative\n
GLEAN3_18779	SPU_018779		SEMA/PSI \ncould be a semaphorin or plexin or a c-met relative\n
GLEAN3_06483	SPU_006483		SEMA/PSI \ncould be a semaphorin or plexin or a c-met relative\n
GLEAN3_12034	SPU_012034		PSI/IPT x3 \n \nlooks like a fragment of plexin - maybe c-met family\n
GLEAN3_22072	SPU_022072		same as GLEAN3_12389 \n \nadjacent to cleavage histone H2a  GLEAN3_22075\n
GLEAN3_12389	SPU_012389		THis is a duplicate of GLEAN3_22072 and likely should be deleted\n
GLEAN3_25514	SPU_025514		This gene is on three scaffolds (544, 81593 and 56300). On scaffold544 (exon 1) there is one GLEAN model (GLEAN3_25514) which covers exon1-3 for this gene. On scaffold81593 GLEAN_14036 is predicted (exon 2-15)for this gene. For scaffold 56300, gene prediction is GLEAN3_17882(exon 8-24) for this gene. Exon 2 and 3 are overlapped between two scaffolds (544 and 81593) and exon 8-15 are overlapped between scaffolds 81593 and 56300. Please refer to GLEAN_14036 and GLEAN_17882 for gene features for C-terminl portion.\n
GLEAN3_22075	SPU_022075		Adjacent to Glean3_22072 cleavage stage histone H4\n
GLEAN3_24567	SPU_024567		simialr to oocyte specific human and mouse H1oo;  there are no other known cleavage stage orthologues of the histones in mammals.\n
GLEAN3_26501	SPU_026501		This is a fragment of the H2aZ/x gene or a pseudogene; should be deleted\n
GLEAN3_24890	SPU_024890		histone H2aZ of mammals  \n
GLEAN3_14432	SPU_014432		There are two macro histone H2a isoforms in the urchin genome.  The other one is GLEAN3_15435\n
GLEAN3_15435	SPU_015435		there is a second isoform or macro H2a in the urchin genome  GLEAN3_14432\n
GLEAN3_11356	SPU_011356		This is an internal duplicate of GLEAN3_04671; this sequence contains a stretch of possibly spurious amino acids, absent from 04671.  \nSee protein: \nMFNHLSYMRPRDVQQREFEVLLKLNHKNIIHLDQIEEDQISKQPVIVMELCTGGSLYTYLDTPENLYGLKEKEFLQVLNDVSAGMKHLRDKGIVHRDIKPGNIMMVKGEDGVIVYKLADFGAARELEDDEAFKSLYGTEEYL \n \n{LADFGAARELEAEAFKSLYGTEKYL} spurious? \n \nHPDLYERAVLGKRNRTEFTAQTDLWSLGVTFFHTATGSLPFRAHGGRNNREVMHNITTTKKSSMISGVQEVPNGPIIWSEDLPKTCHFSPDMRHQVKTLLSHTMECNIQKMWTFDIFFDFVQRLTSMRPVDVFVAPTCECHIIYVEPQRKVVDFQDKMAHLTGIQPDTQSLLWDKEVFDLKKCLKCEDLPQTSPDNPLILVRMGAEVFPSVKVPSLR\n
GLEAN3_03438	SPU_003438		this is likely to be part of the same gene as 03534\n
GLEAN3_12899	SPU_012899		Overlaps with GLEAN3_22035: the N terminal part of this glean is identical to 22035, then the C terminus diverges and is longer than the C term of 22035.\n
GLEAN3_22035	SPU_022035		Overlaps with GLEAN3_12899. The N term of this protein is longer. 12899 is an internal identical match over its N terminal portion, but the proteins diverge in their C termini.\n
GLEAN3_05514	SPU_005514		overlapping part of GLEAN3_27573\n
GLEAN3_08172	SPU_008172		contains RRM (RNA Recognition motif)\n
GLEAN3_14468	SPU_014468		LY4-LDLA5\n
GLEAN3_06165	SPU_006165		LDLA/EGF2/LY5/EGF/MULTIPLE LDLA/EGF2/LY2/EGF2\n
GLEAN3_05642	SPU_005642		LY/LDLA/LY4/EGF4/CCP - likely a fragment \n \nnovel architecture - LRPs in other species do not have CCP\n
GLEAN3_20978	SPU_020978		LDLA2/EGF2/LY3/EGF/LY6/EGF/MULTIPLE LDLA/EGF2/LY5/EGF/LY4/EGF. \nLY5/EGF/LY5/EGF/MULTIPLE LDLA/EGF/LY5/EGF/MULTIPLE LDLA/EGF.. \nLY3/EGF6-TM\n
GLEAN3_24306	SPU_024306		very similar to GLEAN3_08172, contains 2 RRM motifs \n
GLEAN3_07072	SPU_007072		multiple LDLA/EGF2/LY3/EGF-TM\n
GLEAN3_22132	SPU_022132		LDLA/EGF2/LY5/EGF/multiple LDLA/EGF2/LY3/EGF2-TM\n
GLEAN3_09930	SPU_009930		LDLA3/EGF2/LY4 - TM\n
GLEAN3_25837	SPU_025837		EGF/LY/EGF/LY2/EGF2/LY/LDLA/LY4/EGF2/CCP - TM \n \nnovel architecture - LRPs in other species do not have CCP\n
GLEAN3_27625	SPU_027625		LY/LDLA/LY4/EGF/CCP - likely a fragment \n \nnovel architecture - LRPs in other species do not have CCP\n
GLEAN3_21270	SPU_021270		highly similar to Glean3_17749\n
GLEAN3_17749	SPU_017749		Highly similar to Glean3_21270\n
GLEAN3_25763	SPU_025763		DNA mismatch repair: The Escherichia coli MutHLS system has been highly conserved throughout evolution. The eukaryotic pathway results in a specialization of MutS homologs that have evolved to play crucial roles in both DNA mismatch repair and meiotic recombination. In Saccharomyces cerevisiae, MSH4 (MutS homolog 4) is a meiosis-specific protein that is not involved in mismatch correction. This protein is required for reciprocal recombination and proper segregation of homologous chromosomes at meiosis I. Paquis-Flucklinger et al identified the human MSH4 homolog gene. The predicted amino acid sequence shows 28.7% identity with the S. cerevisiae MSH4 protein \n
GLEAN3_12547	SPU_012547		contains SAM (sterile alpha motif) repeats; Glean3_26761 is a partial sequence of this one (different scaffolds)\n
GLEAN3_27921	SPU_027921		The S. cerevisiae DMC1 gene is essential for meiotic recombination. Its encoded protein is structurally and evolutionally related to the products of the yeast RAD51 and E. coli RecA genes. The ovary is one of the high-expression sites for this gene.\n
GLEAN3_26761	SPU_026761		partial sequence of Glean3_12547 (different scaffolds)\n
GLEAN3_17127	SPU_017127		SRCR(5). Possibly incomplete.\n
GLEAN3_17194	SPU_017194		SRCR(2). Possibly incomplete.\n
GLEAN3_17453	SPU_017453		SigPep-SRCR(2)-TM.  \n
GLEAN3_17933	SPU_017933		SRCR(4). Possibly incomplete.\n
GLEAN3_18252	SPU_018252		SRCR(8)-TM. Possibly incomplete.\n
GLEAN3_18429	SPU_018429		SRCR(13). Probably incomplete. See GLEAN3_18430.\n
GLEAN3_18430	SPU_018430		SigPep-SRCR(5). Probably incomplete. See GLEAN3_18429.\n
GLEAN3_18508	SPU_018508		SigPep-SRCR(2)-TM.\n
GLEAN3_18737	SPU_018737		SigPep-SRCR(3)-TM. \n
GLEAN3_18939	SPU_018939		SRCR(5)-TM. Possibly incomplete.\n
GLEAN3_18985	SPU_018985		SRCR(3). Possibly incomplete.\n
GLEAN3_20917	SPU_020917		allele: GLEAN3_21818\n
GLEAN3_21818	SPU_021818		Short, 8kb contig containing 6 exons coding for N-terminal 254 aa; allele of full length GLEAN3_20917 (17 exons).\n
GLEAN3_04177	SPU_004177		3 exons in 8 kb contig coding for C-terminal 75 aa.\n
GLEAN3_15969	SPU_015969		GLEAN3_15969 may contain exons from two different genes. The N-terminal cds (182 aa) is homologous to cytochrome P450; the C-terminal cds is homologous to Exoc4. Alleles: GLEAN3_01083, GLEAN3_01084, GLEAN3_22381.\n
GLEAN3_01084	SPU_001084		GLEAN3_01084 contains 15 exons coding for C-terminal 2/3 of Sp-Exoc4; GLEAN3_01083 contains 8 exons coding for the N-terminus. Alleles: GLEAN3_22381, GLEAN3_15969.\n
GLEAN3_01083	SPU_001083		GLEAN3_01083 contains 8 exons coding for N-terminal 1/3 of Sp-Exoc4; GLEAN3_01084 contains 15 exons coding for the C-terminus. Alleles: GLEAN3_22381, GLEAN3_15969. \n
GLEAN3_22381	SPU_022381		#\nGLEAN3_22381 occupies the entire length of a short (13 kb) contig; contains 8 exons coding for the N-terminus of Sp-Exoc4. Alleles: GLEAN3_01083, GLEAN3_01084, GLEAN3_15969.\n
GLEAN3_19241	SPU_019241		SigPep-SRCR(4). Possibly incomplete.  Near 7TM SRCR [GLEAN3_19239].\n
GLEAN3_13045	SPU_013045	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When comparing the results of the BLAST search to that of the excel data, it is evident that if these 2 scaffolds were combined the sequence would have an orderly and continuous arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	allele: GLEAN3_20544\n
GLEAN3_20544	SPU_020544		GLEAN3_20544 appears to be truncated, containing 3 exons located near the edge of the contig, coding for the C-terminal 155 aa of Sp-Exoc6. GLEAN3_13045 is a full-length allele, 19 exons.\n
GLEAN3_19262	SPU_019262		SRCR(2). Probably incomplete. See GLEAN3_19263.\n
GLEAN3_19263	SPU_019263		SRCR(3). Probably incomplete. See GLEAN3_19262.\n
GLEAN3_01461	SPU_001461		Similar to PTPR phi, short or long insert varient...a member of the PTPR BHJOQ superfamily? \n
GLEAN3_19291	SPU_019291		SRCR(2). Probably incomplete.\n
GLEAN3_19370	SPU_019370		SRCR(5)-Sushi-TM. Possibly incomplete. Like gi|8547243|gb|AAF76316.1|AF228824_1 scavenger receptor cysteine-rich protein variant 1 [Strongylocentrotus purpuratus] and >gi|8547245|gb|AAF76317.1|AF228825_1 scavenger receptor cysteine-rich protein variant 2 [Strongylocentrotus purpuratus] from coelomocytes (as published by Z. Pancer).\n
GLEAN3_19374	SPU_019374		SRCR(2)-HYR-SRCR(2). possibly incomplete. See GLEAN3_19370.\n
GLEAN3_19479	SPU_019479		SRCR(7). Possibly incomplete.\n
GLEAN3_19826	SPU_019826		SRCR(5)-TM. Possibly incomplete.\n
GLEAN3_20081	SPU_020081		SRCR(5). Possibly incomplete.\n
GLEAN3_20161	SPU_020161		SRCR(3). Possibly incomplete.\n
GLEAN3_18071	SPU_018071		Part of PTPRB??\n
GLEAN3_20273	SPU_020273		GF(2)-SRCR(2). Possibly incomplete.\n
GLEAN3_20650	SPU_020650		TIL-SRCR(4). Possibly incomplete. Similar to >gi|8547241|gb|AAF76315.1|AF228823_1 scavenger receptor cysteine-rich protein [Strongylocentrotus purpuratus] published by Z. Pancer.\n
GLEAN3_20822	SPU_020822		SigPep-SRCR(4)-TM.\n
GLEAN3_20868	SPU_020868		SigPep-SRCR(2). Possibly incomplete.\n
GLEAN3_21124	SPU_021124		SigPep-SRCR(6)-TM.\n
GLEAN3_21348	SPU_021348		SigPep-SRCR(3). Possibly incomplete.\n
GLEAN3_21457	SPU_021457		SRCR(2). Probably incomplete.\n
GLEAN3_21509	SPU_021509		SigPep-SRCR(6). Possibly incomplete.\n
GLEAN3_21691	SPU_021691		SigPep-SRCR(2). Probably incomplete. See GLEAN3_21692.\n
GLEAN3_04193	SPU_004193		This Glean sequence corresponds to 2 different genes: the complete sequence for Hhat and the 3' portion of PTPRscav, a novel PTPR, containing the PTP domain.\n
GLEAN3_21692	SPU_021692		SRCR(4)-TM. Probably incomplete. See GLEAN3_21691.\n
GLEAN3_21782	SPU_021782		SRCR(3). Possibly incomplete.\n
GLEAN3_21789	SPU_021789		SRCR(7)-TM. Possibly incomplete.\n
GLEAN3_08670	SPU_008670		Partial sequence.  Contains one protein tyrosine phosphatase domain.\n
GLEAN3_21890	SPU_021890		SRCR(3). Probably incomplete.\n
GLEAN3_21987	SPU_021987		SRCR(5)-TM. Probably incomplete. See GLEAN3_21988.\n
GLEAN3_21988	SPU_021988		SigPep-SRCR(5). Probably incomplete. See GLEAN3_21987.\n
GLEAN3_22000	SPU_022000		Somat-SRCR-Somat-SRCR-CUB.\n
GLEAN3_28592	SPU_028592		Homologous to PTPN 14/21.  Also similar to the novel Drosophila protein PTPpez.\n
GLEAN3_22085	SPU_022085		SRCR(5)-TM. Possibly incomplete.\n
GLEAN3_22145	SPU_022145		SigPep-SRCR(6). Probably incomplete. See GLEAN3_22146, 22147, 22148, 22149, 22150, 22151.\n
GLEAN3_22146	SPU_022146		SRCR(5). Probably incomplete. See GLEAN3_22145, 22147, 22148, 22149, 22150, 22151.\n
GLEAN3_22147	SPU_022147		SRCR(3). Probably incomplete. See GLEAN3_22145, 22146, 22148, 22149, 22150, 22151.\n
GLEAN3_22148	SPU_022148		SRCR(4). Probably incomplete. See GLEAN3_22145, 22146, 22147, 22149, 22150, 22151.\n
GLEAN3_22149	SPU_022149		SRCR(8). Probably incomplete. See GLEAN3_22145, 22146, 22147, 22148, 22150, 22151.\n
GLEAN3_22150	SPU_022150		SRCR(5). Probably incomplete. See GLEAN3_22145, 22146, 22147, 22148, 22149, 22151.\n
GLEAN3_22151	SPU_022151	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. However, there was mRNA sequence information available from Spbase. When reviewing the excel data, there appears to be numerous repeats and sequence overlaps within the different subjects for this GLEAN model resulting in an un-orderly arrangement.	SRCR(2). Probably incomplete. See GLEAN3_22145, 22146, 22147, 22148, 22149, 22150.\n
GLEAN3_03149	SPU_003149		Partial sequence.\n
GLEAN3_03479	SPU_003479		Similar to CDKN3\n
GLEAN3_02928	SPU_002928		Partial sequence. Homologous to human TPIP/TPTE.\n
GLEAN3_03142	SPU_003142		Partial sequence. PTENb...See also GLEAN3_23882 (identical gene) and GLEAN3_09522 (PTENa).\n
GLEAN3_22991	SPU_022991		Identical to 11961 (11961 is missing aa 73-109) except for 5'end \n
GLEAN3_11961	SPU_011961	After reviewing the data from the excel file and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The data from the excel file doesn't completely coincide with the data from the BLAST search. The excel data has a sequence overlap between the 2 scaffolds that aren't apparent from the BLAST search. There was Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be some what weak with most of the values being less than 5.	Identical to 22991 except for 5'end (note 11961 is missing aa 73-109 of 22991)\n
GLEAN3_28117	SPU_028117		AA 28-89 IDENTICAL TO GLEAN3_20429, but 5' and 3' end differ\n
GLEAN3_11467	SPU_011467		Identical to 06006 (except 06006 is missing aa 1-69)\n
GLEAN3_06006	SPU_006006		IDENTICAL TO 11467 (except missing aa 1-69 of 11467)\n
GLEAN3_06389	SPU_006389		same as GLEAN3_06388 (IDENTICAL except at 3' END)\n
GLEAN3_03228	SPU_003228		almost IDENTICAL TO 17155\n
GLEAN3_17155	SPU_017155		almost DENTICAL TO 03228\n
GLEAN3_18282	SPU_018282		identical to 13863\n
GLEAN3_24480	SPU_024480	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that there were no large gaps present. However, there were several internal present within both scaffolds. Overall, both scaffolds had an orderly arrangement and if the two were combined, the sequence would have good coverage. There was Est support available from GBRowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of the values being greater than 5. 	PREDICTED: similar to castor homolog 1, zinc finger \n
GLEAN3_18645	SPU_018645		PREDICTED: Strongylocentrotus purpuratus similar to RNA binding motif protein 13 (LOC586606), mRNA\n
GLEAN3_07493	SPU_007493		PREDICTED: Strongylocentrotus purpuratus similar to Beta-amyloid-like protein precursor (LOC585391), mRNA \n
GLEAN3_17861	SPU_017861		involved in meiotic recombination found originally in yeast,  \n \nDomain search \ngnl|CDD|16546 cd00223, Topo6_Spo, DNA topoisomerase VI subunit A. Homologous to type II topoisomerase, meiotic recombination factor, Spo11; generates double stranded breaks that initiate homologous recombination during meiosis. Subunit A forms homodimers which contain a deep groove that spans both protomers; the dimer architecture suggests that DNA is bound in the groove across the A subunit interface, and that the two monomers separate during DNA transport.. \n
GLEAN3_15850	SPU_015850		#\nPREDICTED: Strongylocentrotus purpuratus similar to CUG triplet repeat,RNA-binding protein 2 (LOC576912), partial mRNA.\n
GLEAN3_22717	SPU_022717		non-identical duplicate of GLEAN3_27144, _02836. A portion of the overlap is identical, but these do appear to be distinct genes.\n
GLEAN3_06197	SPU_006197		This is the N terminal part of the protein. The middle portions are in GLEAN3_27144 and _02836\n
GLEAN3_06784	SPU_006784		Non-identical to other PI3K p110 GLEANs: 22717, 06197, 27144, 02836. This gene may be full length.\n
GLEAN3_17593	SPU_017593		PREDICTED: Strongylocentrotus purpuratus similar to baculoviral IAP repeat-containing 2 (LOC583399)\n
GLEAN3_04809	SPU_004809		partial sequence\n
GLEAN3_28521	SPU_028521		PREDICTED: Strongylocentrotus purpuratus similar to Elongation factor G 2, mitochondrial precursor (mEF-G 2) (Elongation factor G2) (LOC584381)\n
GLEAN3_10910	SPU_010910		partial sequence; non-identical to other P13K p110 proteins. This sequence seems to be internal to 27144 and 02836. This sequence may be the C terminus that corresponds to the N terminus in GLEAN3_05073\n
GLEAN3_13194	SPU_013194		contains SINA domains \n \ngnl|CDD|26029 pfam03145, Sina, Seven in absentia protein family. The seven in absentia (sina) gene was first identified in Drosophila. The Drosophila Sina protein is essential for the determination of the R7 pathway in photoreceptor cell development: the loss of functional Sina results in the transformation of the R7 precursor cell to a non- neuronal cell type. The Sina protein contains an N-terminal RING finger domain pfam00097. Through this domain, Sina binds E2 ubiquitin-conjugating enzymes (UbcD1) Sina also interacts with Tramtrack (TTK88) via PHYL. Tramtrack is a transcriptional repressor that blocks photoreceptor determination, while PHYL down-regulates the activity of TTK88. In turn, the activity of PHYL requires the activation of the Sevenless receptor tyrosine kinase, a process essential for R7 determination. It is thought that thus Sina targets TTK88 for degradation, therefore promoting the R7 pathway. Murine and human homologues of Sina have also been identified. The human homologue Siah-1 also binds E2 enzymes (UbcH5) and through a series of physical interactions, targets beta-catenin for ubiquitin degradation. Siah-1 expression is enhanced by p53, itself promoted by DNA damage. Thus this pathway links DNA damage to beta-catenin degradation. Sina proteins, therefore, physically interact with a variety of proteins. The N-terminal RING finger domain that binds ubiquitin conjugating enzymes is described in pfam00097, and does not form part of the alignment for this family. The remainder C-terminal part is involved in interactions with other proteins, and is included in this alignment. In addition to the Drosophila protein and mammalian homologues, whose similarity was noted previously, this family also includes putative homologues from Caenorhabditis elegans, Arabidopsis thaliana..\n
GLEAN3_05073	SPU_005073		This appears to be the N terminus; the C terminus may be on GLEAN3_10910\n
GLEAN3_01697	SPU_001697		This gene prediction is only N-terminus of GAT and should be combined with GLEAN3_01698.  \n
GLEAN3_10777	SPU_010777		Fas Associated death domain.  \n
GLEAN3_00102	SPU_000102		GLEAN3_04413 is a perfect duplicate of this glean (protein level)\n
GLEAN3_04413	SPU_004413		This ia a perfect duplicate of GLEAN3_00102 (protein level)\n
GLEAN3_21153	SPU_021153		partial protein; GLEAN3_19900 is a nearly perfect internal duplicate\n
GLEAN3_07693	SPU_007693		Groups with caspase 9 subfamily by neighbor joining of multiple sequence alignmnent.  Appears to lack an N-terminus.\n
GLEAN3_08540	SPU_008540		Groups with caspase 8 subfamily in neighbor-joining of multiple sequence alignment.\n
GLEAN3_09497	SPU_009497		Very high similarity to GLEAN3_11339, GLEAN3_22941, GLEAN3_26645, GLEAN3_21561, GLEAN3_26743, \n
GLEAN3_11339	SPU_011339		Very high similarity to GLEAN3_09497, GLEAN3_22941, GLEAN3_26645, GLEAN3_21561, and GLEAN3_26743.\n
GLEAN3_11471	SPU_011471		Nearly identical to C-terminus of GLEAN3_17523, GLEAN3_09653, and also with significant similarity to GLEAN3_19822, GLEAN3_09497, GLEAN3_11339.  The tiling expression data indicate that some exons are missing from this model.\n
GLEAN3_11061	SPU_011061	From the BLAST results, there is also differing base pair information compared to the excel data. There was no Est. information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be some what strong with most of the values being greater than 10.	This is the ligand binding domain of gcnf1, and belongs with glean3_00749 (see that model for data).\n
GLEAN3_18137	SPU_018137		Highly similar to GLEAN3_18315, may be a duplication.  See annotation to that gene.  It is not clear where the N-terminus of this model is.\n
GLEAN3_18315	SPU_018315		Most similar to GLEAN3_18137; the latter may be a duplication of the C-terminus of this gene, possibly an artifact.  This model contains two capsase domains that are not identical, and may represent a tandem duplication.  N-terminus is ambiguous (no Methionine).\n
GLEAN3_18506	SPU_018506		Groups with the caspase 9 subfamily by neighbor joining of multiple sequence alignment.\n
GLEAN3_20623	SPU_020623		Groups with caspase 8 subfamily by neighbor joining of multiple sequence alignment.\n
GLEAN3_22177	SPU_022177		Similar to GLEAN3_08540, GLEAN3_19839, GLEAN3_16039.\n
GLEAN3_14473	SPU_014473		partial sequence. \nGLEAN3_26766 is a non-identical duplicate \n
GLEAN3_19900	SPU_019900		GLEAN3_21153 is a nearly identical duplicate that is longer on both ends.\n
GLEAN3_23504	SPU_023504		GLEAN3_08853 is a non-identical duplicate \n
GLEAN3_10284	SPU_010284		#\npartial sequence; this GLEAN is a nearly identical internal duplicate of GLEAN3_00151\n
GLEAN3_08853	SPU_008853		GLEAN3_23504 is a non-identical duplicate\n
GLEAN3_25578	SPU_025578		glean3_16326 is a partial sequence of this one\n
GLEAN3_16326	SPU_016326		glean3_25578 is a partial sequence of this one\n
GLEAN3_15545	SPU_015545		unclear exactly which RTK this is.\n
GLEAN3_26766	SPU_026766		GLEAN3_14473 is a non-identical duplicate\n
GLEAN3_19045	SPU_019045		this is a partial sequence of Glean3_04954\n
GLEAN3_26272	SPU_026272		not clear which RTK this is\n
GLEAN3_18556	SPU_018556		PREDICTED: Strongylocentrotus purpuratus similar to peroxisome biogenesis factor 1 (LOC592476)\n
GLEAN3_00400	SPU_000400		PREDICTED: Strongylocentrotus purpuratus similar to folliculin isoform 1 (LOC587400)\n
GLEAN3_07667	SPU_007667		This GLEAN may code for either the RBM16 or SFRS15 protein. \n
GLEAN3_06990	SPU_006990		GLEAN3_01101 matches the first half of RBM19. GLEAN3_06990 matches the latter half.\n
GLEAN3_03581	SPU_003581		There appear to be 2 gene sequences in this Glean site.  The first one corresponds to a partial sequence for a receptor protein tyrosine phosphatase (type Mu?) and the second one is similar to Alpha-2,6-sialyltransferase.\n
GLEAN3_01558	SPU_001558		DNA polymerase lambda mediates a back-up base excision repair activity by similarity. \n
GLEAN3_03548	SPU_003548		The zf-C4 domain is located in glean3_03547.\n
GLEAN3_08365	SPU_008365	From the BLAST results as well as the excel data, it appears that the sequence is distributed onto two different scaffolds. When examining the excel data, it was apparent that there were several base pair duplicates and sequence overlaps within both scaffolds. However there was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5.	The mouse Ogg1 gene is involved in the repair of 8-hydroxyguanine in DNA damage. \n
GLEAN3_18631	SPU_018631		#\nUsing HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_18630	SPU_018630		Using HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_18196	SPU_018196		Using HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_20278	SPU_020278		This gene model could be incomplete. It is located at the end of a scaffold. \nUsing HsSERPINE2 in BLASTP search of the GLEAN3 models yielded 15 genes. Each was analyzed by SMART and the top 12 hits were found to have a SERPIN domain and were annonated Sp-serpin-like 3 through 12 (2 were already annotated by A.Cameron). BLASTP searches of the GLEAN3 models with other vertebrate Serpins yielded the same top 12 hits.\n
GLEAN3_06446	SPU_006446		GLEAN3_23112 encodes first part of the gene and GLEAN3_06446 the latter half.\n
GLEAN3_02644	SPU_002644		Alternative splicing; DNA damage; DNA repair; Lyase; Nuclear protein by similarity.\n
GLEAN3_22678	SPU_022678		There may be a misassembly - the zinc finger part of this protein is missing.\n
GLEAN3_09715	SPU_009715		DNA damage; DNA repair; DNA replication; DNA synthesis; DNA-binding; DNA-directed DNA polymerase; Magnesium; Metal-binding; Mutator protein; Nuclear protein by similarity.\n
GLEAN3_08719	SPU_008719	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. When examining the 3 subject hits for this particular GLEAN model, it appears that if all three were to be combined, the entire sequence would have an orderly arrangement. However, the overall sequence does not have very good coverage. The end of the third scaffold is truncated at 2628 resulting in a gap that spans until the end of the sequence (2699.) There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with the majority of values being greater than 5.	Alternative splicing; DNA damage; DNA repair; DNA replication; DNA synthesis; DNA-binding; DNA-directed DNA polymerase; Magnesium; Metal-binding; Mutator protein; Nuclear protein by similarity.\n
GLEAN3_12887	SPU_012887		Anti-oncogene; ATP-binding; Cell cycle; Disease mutation; DNA damage; DNA repair; DNA-binding; Nuclear protein; Nucleotide-binding; by similarity.\n
Sp-CD59/Sca2-like1	SPU_030142		This model was created based on alignments and reciprocal BLASTing of a FgeneshAB/++ prediction with CD59 and Sca2 (two closely related genes in sequence and domain structure). Given the simple structure/sequence of these genes, and since this annotation was based purely on bioinformatic evidence, we have decided to name this gene CD59/Sca2-like.\n
Sp-CD59/Sca2-like2	SPU_030143		This model was created based on alignments and reciprocal BLASTing of a FgeneshAB/++ prediction with CD59 and Sca2 (two closely related genes in sequence and domain structure). Given the simple structure/sequence of these genes, and since this annotation was based purely on bioinformatic evidence, we have decided to name this gene CD59/Sca2-like.\n
GLEAN3_19819	SPU_019819		A segment of this model is duplicated on GLEAN3_25718. \n \nNTH1 is a DNA glycosylase that excise 5-formyluracil, \n5-hydroxymethyluracil and Thymine glycol in human cells\n
GLEAN3_06294	SPU_006294		Mitochondrial DNA polymerase, DNA polymerase gamma by similarity. \n
GLEAN3_27975	SPU_027975		Role in DNA mismatch repair by similarity.\n
GLEAN3_17492	SPU_017492		Transcriptome data indicates that Glean may have falsely predicted the following exons: 1.\n
GLEAN3_25239	SPU_025239		also see glean model 18861.\n
GLEAN3_03547	SPU_003547		Ligand binding domain is in glean3_03548\n
GLEAN3_04540	SPU_004540	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it was apparent that there are several internal repeats and gaps within the 2 scaffolds resulting in a disorderly arrangement about the 2 scaffolds. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values being greater than 10. 	May have a couple of extra exons predicted.\n
GLEAN3_22735	SPU_022735		GLEAN3_02789 has the first part of the PPIE gene and GLEAN3_22735 has the latter half. There appears to be a small overlap between the two predictions.\n
GLEAN3_02789	SPU_002789		GLEAN3_02789 has the first part of the PPIE gene and GLEAN3_22735 has the latter half. There appears to be a small overlap between the two predictions.\n
GLEAN3_00875	SPU_000875		e-val for NP_878906 = e-136. \nThis peptide is identical in length (476aas) and sequence to Glean3_10081 on scaffold 6159. \nAnnotated by RA Obar, RL Morris and AM Musante.\n
GLEAN3_02954	SPU_002954		e-val for NP_060111= 8e-123, [Homo sapiens].  \nKinesin-2 family member. \nAnnotated by RA Obar, RL Morris, SC Cummings, EA Kovacs, and EJ Jin.\n
GLEAN3_03717	SPU_003717		E value for NP_659464 = 5e-72 KIF6 [Homo sapiens] \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, and B Rossetti. \n \n
GLEAN3_04939	SPU_004939		#\ne val for NP_085118 = 1e-69. \nProbably incomplete based on its size (390 AA) \nSame length but different sequence as GLEAN3_12452. \nAnnotation by RA Obar, RL Morris, BA Jeffrey,  J Bhatia, AM Musante.\n
GLEAN3_05509	SPU_005509		e val for NP_008985= 6e-42; kinesin family member 3A [Homo sapiens]. \nAnnotation by RA Obar, RL Morris, LE Shorey, SA Tower, and EJ Jin.\n
GLEAN3_05741	SPU_005741		e val = e-148 for NP_006836, KIF2C [Homo sapiens]. \nAnnotated by RA Obar, RL Morris, R Yen,  JM Fess, and AP Rawson.\n
GLEAN3_06602	SPU_006602		The e value was 0 for CAC20443/Q9H193. ?KINESIN-13A2, ACCESSION Q9H193, swissprot locus Q9H193_HUMAN?   \nMotor domain is KIF1-like. \nKinesin-3 family. \nAnnotated by RA Obar, BD Dyer, RL Morris, and B Rossetti. \n \n \n-BDD \n \n  \n \n
GLEAN3_07505	SPU_007505		 e = 0 for NP_659464. \nAnnotated by RA Obar, BD Dyer, RL Morris, and B Rossetti. \n \n
GLEAN3_10570	SPU_010570		e-val for NP_005724= 5e-93, [Homo sapiens]. \nAnnotated by RA Obar, RL Morris, SC Cummings, EA Kovacs, and B Rossetti.\n
GLEAN3_11353	SPU_011353		e val = 3e-61 for NP_071396, KIF13A [Homo sapiens]. \nAnnotated by RA Obar, RL Morris, R Yen, JM Fess, and B Rossetti.\n
GLEAN3_12452	SPU_012452		e val = 1e-49 for NP_085118. \nSame length but different sequence as GLEAN3_04939. \nAnnotated by RA Obar, BD Dyer, RL Morris, AM Musante.\n
GLEAN3_15247	SPU_015247		May play a role in nucleotide excision repair (NER) and RNA polymerase II (POL II) transcription by interacting with ERCC2/XPD and ERCC3/XPB helicases, both subunits of NER-transcription factor TFIIH (by similarity).\n
GLEAN3_14451	SPU_014451		e val for NP_002254 = 3e-34 \nExact match of XP_796037: PREDICTED: similar to kinesin family member C1 [Strongylocentrotus purpuratus]. Explained by Scaffold3656. Contains 8 exons. \nAnnotation by RA Obar, RL Morris, BA Jeffrey, J Bhatia, B Rossetti, EJ Jin, KM Judkins\n
GLEAN3_15354	SPU_015354		e val = 6e-28 for NP_004789, KIF3B [Homo sapiens]. \nContains partial kinesin motor domain, N terminus missing,  when compared to human CENP-E motor domain.  Likely to be a fragment. \nAnnotated by RA Obar, RL Morris, R Yen, JM Fess, and EJ Jin.\n
GLEAN3_16067	SPU_016067		e val for CAI12999 = 3e-84. \ne val for NP_006836 = e-52; kinesin family member 2C [Homo sapiens]. \nAnnotation by RA Obar, RL Morris, SA Tower, and B Rossetti.\n
GLEAN3_16655	SPU_016655	After reviewing the data and doing a BLAST search it appears that there isn't a GLEAN model that fits sufficiently. Scaffold >v2.1_scaffold60391 was the best fit according to the BLAST search; however, the sequence coverage is only 609/2268. The entire sequence is distributed onto 2 different scaffolds. There was no Est information available on GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with values ranging from about 5-30.\nAdditional information from Baylor gene information (comments): \ne val for AAI10990=1e-111.\ne val for NP_112494=5e-81; kinesin family member 18A [Homo sapiens].\nAnnotation by RA Obar, RL Morris, LE Shorey, SA Tower, KM Judkins	e val for AAI10990=1e-111. \ne val for NP_112494=5e-81; kinesin family member 18A [Homo sapiens].   \nAnnotation by RA Obar, RL Morris, LE Shorey, SA Tower, KM Judkins\n
GLEAN3_17809	SPU_017809		accession number (Swiss-Prot): Q02224 \nLikely a fragment based on its short length. \ne val for Q02224 is 3e-25 \nAnnotation by RA Obar, RL Morris, BA Jeffrey, KM Judkins\n
GLEAN3_17289	SPU_017289		e val for AAH73878 = 2e-45. \ne val for NP_002254 = 4e-47. \nAnnotated by RA Obar and RL Morris, KM Judkins\n
GLEAN3_21586	SPU_021586		Hydrolysis of the deoxyribose N-glycosidic bond to excise 3-methyladenine, and 7-methylguanine from the damaged DNA polymer formed by alkylation lesions (by similarity).\n
GLEAN3_28109	SPU_028109		GLEAN3_18262 sequence is a close but not identical match of the model \nalso \nGLEAN3_14629 is a duplicate of the N-terminal region of the model \nalso \nGLEAN3_18263 contains a sequence match to mid region section of the model\n
GLEAN3_18388	SPU_018388		e val = e-109 for NP_904325, KIF1B [Homo sapiens]  \nsee also GLEAN3_18764, also likely ortholog of KIF1B isoform alpha. \nAnnotated by RA Obar, RL Morris, R Yen, and IJ Strachan\n
GLEAN3_18533	SPU_018533		e val = 8e-147 against AAH35896.  \ne val = e val = e-150 against Q8IUN3_HUMAN Q8IUN3 (UniProtKB/TrEMBL Accession Number) \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, and EJ Jin\n
GLEAN3_18764	SPU_018764		e val for NP_904325 is 8e-109. \nAnnotation by RA Obar, RL Morris, BA Jeffrey, and IJ Strachan\n
GLEAN3_19165	SPU_019165		e val 0 against AAG18582/NP_999659, KRP110 [Strongylocentrotus purpuratus]. \ne val for NP_612565=e-161; kinesin family member 23 Isoform 1 [Homo sapiens].   \nAnnotation by RA Obar, RL Morris, LE Shorey, SA Tower, and EJ Jin.\n
GLEAN3_19775	SPU_019775		e value = 5e-37 for NP_612433; kinesin family member 12 [Homo sapiens]. \nAnnotation by R. A. Obar and R.L. Morris\n
GLEAN3_20767	SPU_020767		e val for NP_056069=e-172; kinesin family member 13B [Homo sapiens]. \nAnnotation by RA Obar, RL Morris, SA Tower\n
GLEAN3_24588	SPU_024588	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it was evident that the sequence did not begin until the 9th base pair and continued until 391 was reached. However, there is an internal overlap within this scaffold (v2.1_scaffold824) between 245-391 and 313-388. The rest of the sequence was continued on v2.1_scaffold68414. Besides the repeat within the first scaffold, if the 2 scaffolds were to be combined, there would be an orderly continues arrangement for this GLEAN model. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be	e val = 1e-54 for NP_112494. \nlikely a fragment, based on its short length. \nAnnotated by RA Obar and RL Morris, KM Judkins\n
GLEAN3_23560	SPU_023560		Q4VXC4 is UniProtKB accession number. \ne val for BAE02544 = 8e-106, and for Q4VXC4 = e-104. \nsee also: GLEAN3_18388, GLEAN3_18764, GLEAN3_20634, which are all KIF1B-like. \nAnnotation by: RA Obar, RL Morris, BA Jeffrey, and IJ Strachan\n
GLEAN3_22982	SPU_022982		e val = 8e-29 for NP_004789, KIF3B [Homo sapiens]. \nAnnotated by RA Obar, RL Morris, AM Musante, and EJ Jin\n
GLEAN3_21317	SPU_021317		e val for NP_999656 is 0.0 \nAnnotation by RA Obar, RL Morris, BA Jeffrey, and AP Rawson\n
GLEAN3_22160	SPU_022160		e val for NP_112494 = 3e-120 \nAnnotated by RA Obar and RL Morris, KM Judkins\n
GLEAN3_26503	SPU_026503		See GLEAN3_26503. \nThe KRP170 gene spans GLEAN3_20414 (Scaffold1612/Scaffoldi17703) and GLEAN3_26503 (Scaffold56862/Scaffoldi4507).  The mRNA was published as Chui,K.K., Rogers,G.C., Kashina,A.M., Wedaman,K.P., Sharp,D.J., Nguyen,D.T., Wilt,F. and Scholey,J.M.  "Roles of two homotetrameric kinesins in sea urchin embryonic cell division."   J. Biol. Chem. 275 (48), 38005-38011 (2000).  The GenBank entry for this gene is gi|10697491|gb|AF292395.2|. \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, EJ Jin.\n
GLEAN3_27784	SPU_027784		e val = 2e-128 for CAI95220.  e val = e-130 for NP_055889, KIF1B_beta [Homo sapiens]. \nAnnotation by R.A.Obar R.L. Morris, and AP Rawson 013106\n
GLEAN3_27552	SPU_027552		   e val = e-103 for NP_060046, KIF27 [Homo sapiens]. \n   Annotation by R.A.Obar and R.L. Morris, BJ Rossetti and AP Rawson 013106\n
GLEAN3_21946	SPU_021946		Responsible for preventing misincorporation of 8-oxo-dGTP into DNA thus preventing A:T to C:G transversions (by similarity).\n
GLEAN3_12960	SPU_012960		GLEAN3_12960 is the N-terminal match to MSH6 mouse. \nGLEAN3_28261 is the C-terminal match to MSH6 mouse. \nGLEAN3_15322 is the full match to the Query used.\n
GLEAN3_23112	SPU_023112		GLEAN3_23113 encodes first part of the gene and GLEAN3_06446 the latter half.\n
GLEAN3_09456	SPU_009456		GLEAN3_09456 is almost complete duplicate prediction for GLEAN3_10689 and contains the first half of the gene. GLEAN3_15881 contains the latter half.\n
GLEAN3_15881	SPU_015881		GLEAN3_09456 is almost complete duplicate prediction for GLEAN3_10689 and contains the first half of the gene. GLEAN3_15881 contains the latter half.\n
GLEAN3_03050	SPU_003050		The N-terminal sequence of the mouse RAD52 used for Query of the GLEAN3 models is covered by combining \n \nGLEAN3_03050  \nwith \nGLEAN3_06081  \n \n
GLEAN3_08906	SPU_008906		regions of GLEAN3_08906 are present in GLEAN3_26677 and GLEAN3_28391 also.\n
GLEAN3_26263	SPU_026263		A region of GLEAN3_26263 is present in GLEAN3_23612 also.\n
GLEAN3_10556	SPU_010556		Regions of GLEAN3_10556 are present in GLEAN3_00741 (covers 299-621) and GLEAN3_04597 (covers 1-203) also.\n
GLEAN3_02282	SPU_002282		GLEAN3_02282 covers the c-terminal region of Rev3. \nGLEAN3_02281 covers the n-terminal region of Rev3.\n
GLEAN3_02281	SPU_002281		GLEAN3_02281 covers the n-terminal region of Rev3. \nGLEAN3_02282 covers the c-terminal region of Rev3.\n
GLEAN3_26626	SPU_026626		This prediction includes only N-terminus and should be combined with GLEAN3_26627.  \n
GLEAN3_17523	SPU_017523		Highly similar to GLEAN3_09653, GLEAN3_11471, GLEAN3_13801.  Presence of a sushi domain and patchiness of scaffold sequence suggests that this model might artifactually fuse two different genes.\n
GLEAN3_21133	SPU_021133		Amino acid sequence highly similar but not identical to that of Glean3_01683; may be haplotype based on high similarity of intron sequences.  Model incomplete, lacking many exons both 5' and 3'.  (Or, may be a pseudogene.)\n
GLEAN3_10755	SPU_010755		Partial sequence.\n
GLEAN3_11070	SPU_011070		Partial sequence.  This contains the serine/threonine phosphatase, family 2C, catalytic (Pp2Cc) domain .  See also GLEAN3_11294.\n
GLEAN3_11705	SPU_011705		Identical to Glean3_17161\n
GLEAN3_17161	SPU_017161		See also GLEAN3_11705.\n
GLEAN3_15869	SPU_015869		beginning of the gene is missing (probably in the gap between contings in the scaffold). The end of the gene is probably on another scaffold as well\n
GLEAN3_13821	SPU_013821		GLEAN3_02088 appears to be identical.\n
GLEAN3_05107	SPU_005107		utrs were addad based on est and tiling data\n
GLEAN3_05433	SPU_005433		GLEAN3_05434 is a partial duplicate prediction for GLEAN3_05433. Looking at the Genboree data, it may represent an error in the assembly process. \n
GLEAN3_17434	SPU_017434		Partial sequence containing PTP catalytic domain.\n
GLEAN3_05434	SPU_005434		GLEAN3_05434 is a partial duplicate prediction for GLEAN3_05433. Looking at the Genboree data, it may represent an error in the assembly process.\n
GLEAN3_17234	SPU_017234		Partial sequence. Contains most of the PTP catalytic domain.\n
GLEAN3_14642	SPU_014642		Partial sequence containing the PTP catalytic domain.\n
GLEAN3_00838	SPU_000838		Partial duplication in GLEAN3_14788\n
GLEAN3_06579	SPU_006579		In light of evidence from hybridization array, the Fgeneh gene prediction model goes a better job than the Glean3 model, though the 3' UTR predicted by Glean3 appears accurate and has been included here.\n
GLEAN3_08840	SPU_008840		Shows similar genbank hits to Glean3_09651, though the two sequences show little similarity to each other.  Not highly confident in this annotation.\n
GLEAN3_09651	SPU_009651		Unclear what the exact homolog in non-urchins is.  Clusters tightly with Glean3_08840 (annotated as SpTFIIA alpha).  \n
GLEAN3_05661	SPU_005661		Little expression evidence for exons one, two and six.  But this is likely the correct gene and for at least the other exons shows good evidence for expression.  \n
GLEAN3_02637	SPU_002637		Putative conserved RPB5 domain detected in BlastP search. \nAll exons are supported by tiling path array data. \nHowever, query of Poustka's database does not retrieve significant hits.\n
GLEAN3_14575	SPU_014575		This GLEAN is the C terminal part of DKG-beta; the N terminal part is in GLEAN3_14574. The sequence has been modified below to what is probably correct: the first few amino acids were deleted, while a spurious, non-homologous internal sequence was sectioned off and indicated as not real. \nDNA: \nGATTCAAGGAAAACACTCAAAGATGTTTTAGAAGAATTCCATGGAAATGGAGTATTATCCAAATACAACCCCGAGCAGCCCATCAACTATGAAGGCTTCAAGCTCTTCATGGAGACTTACCTTGACGTTGACATGCCAGAGGATCTCTGTCGACACCTCTTCCTCTCCTTCGTCAAGAAGACACCTTCC \n \n{GTGTCCAGGTCTACAAGTAGAGAGAAGGAGAGGGGCGCCATCTATGATGTCGGCAGTGCCGTCGCTACCGTGACCACCACCGCCGCGTGTGCCGCCATCACAGGGAGCAGCTCGCCGATTGAAATCGCTGGGCAGACGAGCGGCGGCAACAACAACACAGAAGGAAATGAGCTCACAAACGGTCATGGGCAAGAGTCAGGGGTTGTCACAAGAGGGAGCCCCACCCAGTCACGGAGTTCATCCAAGAAGTCCAATGCCTCTAACGGAACCCCTTCAAGTCATTCCATTCGCACCGATGCACTCAGGGTCGAACAAACCACTAATGATCATGGAGATATCAAGAAAGGGAAATCCATGGAGAACAATAGGCGGAAAAAGACAGCACTCTTCACCGCCCTGCGCAAG} NOT REAL! \n \nACTAAAAAGAATACCAAAGACACGGACTCACTAGGGGCATTGGCCGCCTCCCGGGGGTCACTCGCACATCCAGATGTCACCAACCAGGTCGTCTACATGAAAGACATCGTCTGCTACTTGTCGTTGCTAGAGGGCGGCAAACCAGAGGATAAATTAGAATTCATGTTCAGGCTTTATGACACAGATGACAATGGTATCCTTGACAGCAGTGAGCTAGATTGTATAGTAAACCAGATGATGCATGTAGCTGAGTATCTAGGCTGGGATGTCACTGAATTAAGACCAATCTTACAAGATATGATGATAGAGATTGACTTTGATTCAGATGGGACCGTATCGTTAGAAGAATGGATACGTGGCGGCATGACAACCATTCCTCTCTTAGTTCTCCTTGGCCTTGAATCAAATGTGAAAGATGATGGCAGTCATGTATGGAGGTTAAAACACTACAACAAACCAGCCTATTGTAACCTCTGCCTCAATCTCCTTGTAGGGTTTGGAAAACAAGGTCTATCATGTACATTCTGCAAATACACGGTTCATGAACGCTGTGTACAACGTGCCCCAGCCTGTTGCATTAGTACATATGTCAAGTCAAAACGGACAACGAATATCATGAACCATCACTGGGTAGAAGGTAACAGCCCGGGTAAATGTGACCGGTGTAAGAAGTCGATTAAGAGCTACAACGGTATTACTGGCCTTCACTGCAGATGGTGCAAGATAACGCTCCATAACAAATGTGCCTCCCACGTCAAACCAGAGTGTAACATGGGAGAATTCCGAGACCACATTCTACCTCCTACGGCTATCTGCCCAGCAGTTCTGGTAAGTTTAGTCACATTTTAA \n \nprotein: \nDSRKTLKDVLEEFHGNGVLSKYNPEQPINYEGFKLFMETYLDVDMPEDLCRHLFLSFVKKTPS \n \n{VSRSTSREKERGAIYDVGSAVATVTTTAACAAITGSSSPIEIAGQTSGGNNNTEGNELTNGHGQESGVVTRGSPTQSRSSSKKSNASNGTPSSHSIRTDALRVEQTTNDHGDIKKGKSMENNRRKKTALFTALRK} NOT REAL! \n \nTKKNTKDTDSLGALAASRGSLAHPDVTNQVVYMKDIVCYLSLLEGGKPEDKLEFMFRLYDTDDNGILDSSELDCIVNQMMHVAEYLGWDVTELRPILQDMMIEIDFDSDGTVSLEEWIRGGMTTIPLLVLLGLESNVKDDGSHVWRLKHYNKPAYCNLCLNLLVGFGKQGLSCTFCKYTVHERCVQRAPACCISTYVKSKRTTNIMNHHWVEGNSPGKCDRCKKSIKSYNGITGLHCRWCKITLHNKCASHVKPECNMGEFRDHILPPTAICPAVLVSLVTF\n
GLEAN3_23664	SPU_023664		internal exon appears to be incorrect; indicated in sequences below: \nATGCCGAAGACGTACAAGCTGGAGTACTTTAACCTTCGTGGTCGCGCGGAATTGTCCCGTCTGCTTATGGCACAAGCTGATATGAAGTATGAAGATGTTCGGCTCTCATTCGCCGATTGGGGGACAGCAAAGGGAAATCAAGATAAGTACCCCTTGGGATTTCTGCCAGTGCTGGAGGAAGATGGAAAAGTCATTTCACAGAGTATGACCATCGCTCGTCATCTGGCGAGAGAGTTTGGAATGGCGGGACAGAATGAAGAAGAAATGGTAATGATTGATATGATTTGTGAGACGTGTAACGAATTGCTGAGCAAGATGATTGAGATAGCCCTGATGCAAGGCGAAGCC \n \n{AAGCCAAACGCGGTGAAAGAGTTTACGGAAGTCAAATCTCTACTCCCTATGAAAAATATTACAACATGGCTGGAGATGAACGGCAAGGGAAATGGATACTTCGTGGGCGAA} not real \n \nAAGATGTCGGTGGCAGACCTTTTCGTCTTCAGCATCATGGAACACCTCTCTGGAAAATACCCAAATATCCTCACCAAGCAACCCCTTCTCCAAGCCTTCTATGAGAGAATGATGAAGGAACCCAAGTTAGCTGCCTGGATCGTGAAGCGTCCAAATCAAGATATAGATATCTAA \n \nProtein: \nMPKTYKLEYFNLRGRAELSRLLMAQADMKYEDVRLSFADWGTAKGNQDKYPLGFLPVLEEDGKVISQSMTIARHLAREFGMAGQNEEEMVMIDMICETCNELLSKMIEIALMQGEA \n \n{KPNAVKEFTEVKSLLPMKNITTWLEMNGKGNGYFVGE} not real  \n \nKMSVADLFVFSIMEHLSGKYPNILTKQPLLQAFYERMMKEPKLAAWIVKRPNQDIDI\n
GLEAN3_05626	SPU_005626		Only the 5' part of this GLEAN is homologous to this kinase; this is noted in the sequences below. The C terminal protein sequence does not independently produce a significant BLAST hit.  \nATGAACATTGATGAAAAGCTTACAGCAAAGCAACGTGAGGAAAGCAAAACTAAGATTCGGCATAAAGCAGATGGCCGTGTGATGGTGGCAAAGATCGGCAAGCAGAAGATATGGAGAGCCAACCAGCACAAAATGATCCAAGAACTTGAGCTTCTCAATAAACTACAACACCCTAACGTTGTCAGATATATGGGAGCTTGTGTAAAAGATGGCCATATCCATCCTGTACTCGAGTATGTATCTGGTGGATGCTTGACGGACATCTTGGCTGATGAGAGCTTGGCGTTGTCATGGAGACAGAAGGGTGACTTAGCAACAGACATCGCTCGTGGAATGACCTACCTCCACTCACAGAATGTGTGTCATCGAGATCTCACGTCAGCGAATTGCCTCGTTCGTCAAAAGCCGAATAATGTCCTCGAGGCCATACTCACCGACTTCGGCCTCGCTCGTGTGCTCGGCTGCATGCCCGACCCTCCTCCAAACTCTCCCAGAACGCCCGAGTCTCCAGAACCGGACATAATCGACGCACCGAACGGTGGTCCGATGTTGCCTCGGATACCGTCGGCCTGCATGGACGTGCCTCGGAAGATGTCGGTTGTCGGCACCGCGTTCTGGATGGCTCCCGAAGTTTTACGAGGAGAGGAATACACTCGCCAAGTGGATGTTTTCTCGTTTGGTATCGTGGTATGCGAGATTGTAGCAAGAATAACGGCCAATCCAGACGACCTCCCGAGGACTGGGAAGTTCGGTCTCGACCTGCAGCTTTTCAAAGAGAAATGTCCAGGGATACCTGAACCCTTCCTACAGATCGCTGAAGACTGTTGTTCCATGGATCCCAGGGATCGGCCGGTTTTCGCCGAGCTCGTCCGCCGTTTCGAGATCATCCGCGGTACGTTGGACACAGAAACGAGCGACACAACGTGTTACGATGTCAACCTTACGGACATCATCAGGACAAACGATTCGGACGATGATGACGATGATTGTAGCTTTGGGTTTCAGTTTCAAACGGATCTGGATGACAAACGCAGAAGTGGACGAACAAGATTGCGAGAAGTCCTTTGTGGATGTTCTAAAGGA \n \n{TGGATGACAAACGCAGAAGTGGACGAACAAGATTGCGAGAAGTCCTTTGTGGATGTTCTAAAGGAGTACGCTTATGCTGCAAGACAGTATTTCAGGGTTGTTACGGGTTTCATCGTCTGCTTGTTGCCGATGCTTTCACTGTGGATGTATTGTGATTTGAACAATGTCCTCCTGTGGACCTCATTAATATATGCAAGCATAGGGTTGGTCTTAGAGAACTCGACAAGATGTGTAGAAGTGCTTCATTCTTCTCAATCAATCTTCAGGACTTTGTTCAATCTAATATCCAGATTATTCTCCAGACTGTGGAATGTTGTATCGCTTTTGTTGCAAAGAGTGCATTGTACGAGATCCGCAGGCTCGGACCTGAACGAAAACGTGACGTACCCAAATCAAAATGGCGGTCCGACCAAATCGTTGCAACATGGTGATACCCCCAGCGAGGTCCTGAGGAATACATCAACCCGCGTTGCGGACTTGGTCCCAATCTTGAAGAATCGTCAGAAAGGTCCAGGACCCGGCGCCGAGGATCCAGAGTCGCAGGGTGCAAGAAAGAAGACGCTGCTTGCAAACGAATTGGAAGAAACAGAACTGGATAAGTTACATAATCCAAACATTAACAGGGTATTGCTGAATGGCCGCATGTCGCAAAAGCGAGTCAGCTTTTCTTTACAAAACAGTATTGACCGAGATGAAGGTGACCCAAATCCATGA} non-homologous \n \nProtein: \nMNIDEKLTAKQREESKTKIRHKADGRVMVAKIGKQKIWRANQHKMIQELELLNKLQHPNVVRYMGACVKDGHIHPVLEYVSGGCLTDILADESLALSWRQKGDLATDIARGMTYLHSQNVCHRDLTSANCLVRQKPNNVLEAILTDFGLARVLGCMPDPPPNSPRTPESPEPDIIDAPNGGPMLPRIPSACMDVPRKMSVVGTAFWMAPEVLRGEEYTRQVDVFSFGIVVCEIVARITANPDDLPRTGKFGLDLQLFKEKCPGIPEPFLQIAEDCCSMDPRDRPVFAELVRRFEIIRGTLDTETSDTTCYDVNLTDIIRTNDSDDDDDDCSFGFQFQTDLDDKRRSGRTRLREVLCGCSKG \n \n{WMTNAEVDEQDCEKSFVDVLKEYAYAARQYFRVVTGFIVCLLPMLSLWMYCDLNNVLLWTSLIYASIGLVLENSTRCVEVLHSSQSIFRTLFNLISRLFSRLWNVVSLLLQRVHCTRSAGSDLNENVTYPNQNGGPTKSLQHGDTPSEVLRNTSTRVADLVPILKNRQKGPGPGAEDPESQGARKKTLLANELEETELDKLHNPNINRVLLNGRMSQKRVSFSLQNSIDRDEGDPNP} non-homologous  \n
GLEAN3_21174	SPU_021174		#\nOne of 2, duplicate of GLEAN3_04006. This glean is much shorter and is identical (protein level) to 04006, but not on the ends. This is indicated in the sequences below \n \n{ATGGCGGAAGACCTGCTGCATCCAGGCGCCATCGTCAAAGATCGATGGAAAGTTACCAAGAAAATTGGTGGTGGAGGCTTCGGTGAGATCTACGAAGCCCTTGACCAAGTCATTGATGAGTGCGTAGCCATCAAGCTAGAATCTGCTCTCCAACCTAAGCAGGTGCTCAAGATGGAAGTTGCCGTCCTCAAGAAACTTCAGGGG} incorrect? \n \nCGGGATCATATCTGCAAGTTCATAGGCTGCGGTCGCAACGATCAGTTCAACTACGTTGTGATGACCGTCCAGGGCCAGAACCTTGCAGAGCTCCGCCGTGCACAGCCCCGTGGCACGTTCTCCGTCAGCACCATGCTCAGGCTTGGAGTACAGATTCTTGAATCGATAGAAAGCATCCACGAAGTGGGCTTTCTACACAGAGACATCAAACCTAGCAACTTTGCCATTGGGAAAGCTGCTGCTAACACAAGAAAGGTGTACATGTTAGACTTTGGTCTGGCAAGGCAGTATACCAATTCTCAGGGTCAAGTTAGAACGCCAAGGCCAGTTGCTGGGTTTCGTGGAACTGTTCGCTATGCTTCTGTCAATGCTCATAGAAATAGAGAGATGGGTCGTCATGATGATTTGTGGTCGTTGTTCTACATGTTAGTAGAGTTTGTCATTGGTCAACTTCCATGGAGAAAAATCAAAGACAAGGAACAAGTAGGTTTGTTGAAGGAGAAGTACGATCATCGTTTACTACTGAAACACATGCCCATGGAGTTCAAGCAGATATTAGAACAATTCCAGTCCTTAGAATATGCAGACAAACCAGATTACAAGTGCATCCATTCTTTATTAGAGCGATGTATGAACAGGAAGAATATCAAGGAGAATGATGCCTATGATTGGGAAAGACCACCAGTAGATGGAACTCATAACTTACTTCCTTCTTCAACTAGTCCTGCTCGA \n \n{AGGAAGACTATTTTTTCTCCTAAACATCTCTTTTGGACGATTTGTAGGATTTAA} incorrect? \n \n{MAEDLLHPGAIVKDRWKVTKKIGGGGFGEIYEALDQVIDECVAIKLESALQPKQVLKMEVAVLKKLQG} incorrect? \n \nRDHICKFIGCGRNDQFNYVVMTVQGQNLAELRRAQPRGTFSVSTMLRLGVQILESIESIHEVGFLHRDIKPSNFAIGKAAANTRKVYMLDFGLARQYTNSQGQVRTPRPVAGFRGTVRYASVNAHRNREMGRHDDLWSLFYMLVEFVIGQLPWRKIKDKEQVGLLKEKYDHRLLLKHMPMEFKQILEQFQSLEYADKPDYKCIHSLLERCMNRKNIKENDAYDWERPPVDGTHNLLPSSTSPAR \n \n{RKTIFSPKHLFWTICRI} incorrect?\n
GLEAN3_10118	SPU_010118		This sequence does not include the first exon, which is instead present on GLEAN3_11286. The correct sequence is below: \nDNA: \nGCAGAAGATACCCGGCAGAAAGGTTTGAGGGTAGCCATTAAGAAGCTGTCCAGACCGTTCCAAACAGTCATACACGCCAAGAGGACCTACAGAGAACTACGCCTTCTCAAACATATGAGACATGAAAATGTAATCAGCCTGCTTGACTGTTTCACCCCTGACCGAGTCAACTTCTCAGATGTTTACATGGTGACCCATCTCATGGGAGCCGACCTTAACAACATCATCAAGTGTCAGAAACTCTCTGATGACCATGTCCAGTTCCTCATCTATCAGGTTCTCAGGGGCCTCAAGTACATCCATTCTGCTGGTGTGATTCATCGAGATCTCAAGCCCAGTAACATAGCTGTCAATGAAGACTGTGAACTCAGGATCCTGGACTTTGGATTAGCACGTAGCACAGACGATGAGATGACAGGATATGTAGCTACCAGATGGTATAGGGCACCTGAAATCATGCTCAATTGGATGCACTACACTGAGAAAGTTGACATCTGGTCCGTAGGCTGTATCATGGCAGAGCTCCTCACACAGAAAACCCTCTTCCCAGGGTGTGATCACATAGACCAACTGAATAAGATCATTGCTATCACAGGGAAACCAGACGAGACCTTCTTACAGAAGATCGCAAGTGAGAGTGCAAAGACATACCTGATGAGCATGGCTGCCTACCCTAAGAGGGACTTCAGCACTATCTTCCTAGGGGCCAGTCGCAAGGCTGTCGATCTTCTGGAGAAGATGCTACAATTGGATGAGGACAGAAGGCTGAGTGCTGAGCAGGCTCTTCAGCACCCCTATCTGTCTAAGTACCATGATCCAGATGATGAACCAATTGCAGCCATGTTTGATGATAGTCAGGAGAACAGCGACATCGTAATAGATGAATGGAGACAACGCGTTTTGAAAGAAGTAACAGAATTTGTTGCAGACCCAGCTCCGATGGATTGA \nProtein: \nAEDTRQKGLRVAIKKLSRPFQTVIHAKRTYRELRLLKHMRHENVISLLDCFTPDRVNFSDVYMVTHLMGADLNNIIKCQKLSDDHVQFLIYQVLRGLKYIHSAGVIHRDLKPSNIAVNEDCELRILDFGLARSTDDEMTGYVATRWYRAPEIMLNWMHYTEKVDIWSVGCIMAELLTQKTLFPGCDHIDQLNKIIAITGKPDETFLQKIASESAKTYLMSMAAYPKRDFSTIFLGASRKAVDLLEKMLQLDEDRRLSAEQALQHPYLSKYHDPDDEPIAAMFDDSQENSDIVIDEWRQRVLKEVTEFVADPAPMD\n
GLEAN3_24598	SPU_024598		Internal exon may be incorrect. See sequences below. \n \nDNA: \nATGTTTACTACTCACCAGAAATCAGGCCACGCAGGAGGCGGGGGCGTGAAGCGAATCGAGATCTTCTTGACGATGGTGGAACCGACGATACCAGACAGGAGATTCTTGAAGGTTGTCGTGACCAACAACGCTAAGGTCCAGGATCTGATCGGTCTCATCTGCTGGCACTATGTCAACAAGGGTCTTCAACCAGAACTCAATAAATCTTCAGATGGTGAATCTAGGCTTGCGGTGGACTCGACACAAATCACCCTCCGACAGATCTTAACCAAAGCTCTCAGAAAAAGAAAGGGTATCATTCCAACTGCAGGACCTCATTATCTACTGGAGAAGAAGTCTGCACCAGGGGTTCCACTAGACCTGGACCTGAAACTCTGCGAAACAGAATCCATGGACTTCATTATGGTCAGAGAACATAG \n \n{TAGAAGAGACTACCTGAAGAGCGACAGGACGCGACCCTACTCCGGCGATAGAGAGCCCCCTCTAGTGTTGAGTACTCAGTACCGATCGTTCCGAGTCAGCATGCTGCACAAGCTCAGACCTGCCACTGAGATCCAGCTAGGT} incorrect? \n \nATCTCAGGGGATAAGATTGAAATCGACCCGGTAGCCCAACCAAGGAACACGCCGGCCAAGTTCTGGGGCAAGCAGAAAGCCGTCTCCATCGAGTCCGACAGGCTTGCCTTCTGTAACATCACAGATGATAAACCATCAGGGAAGTCCACATTCCGACTGACATTCAAGAGTCCCAACCATGAGTTCAAGCACTACGACTTTGAGACCGGAACCAACCTCACCAAACAGATCGTCAACCGTATCAACCACATTCTCGAGCTTCGAGCAAGCTCCGTACGAAACGACTACACGCTGTGGAGAGAGAGGAGACATAGTAGAAAGGCCCACGATAAATAG \n \nProtein: \nMFTTHQKSGHAGGGGVKRIEIFLTMVEPTIPDRRFLKVVVTNNAKVQDLIGLICWHYVNKGLQPELNKSSDGESRLAVDSTQITLRQILTKALRKRKGIIPTAGPHYLLEKKSAPGVPLDLDLKLCETESMDFIMVREHS \n \n{RRDYLKSDRTRPYSGDREPPLVLSTQYRSFRVSMLHKLRPATEIQLG} incorrect? \n \nISGDKIEIDPVAQPRNTPAKFWGKQKAVSIESDRLAFCNITDDKPSGKSTFRLTFKSPNHEFKHYDFETGTNLTKQIVNRINHILELRASSVRNDYTLWRERRHSRKAHDK\n
GLEAN3_04964	SPU_004964		internal exon that is likely to be erroneous has been deleted in sequences below: \n \nDNA: \n \nATGTTATGCCTTGGGATCGTGTCCAAGCAGGTTATCCGTGATGCCATCCTTCTCAATGACTTCACCAAGAACTTCGACAGCTCACAAACCCGCGAGATAGTGGAGTGCATGTTCCCTATCGACTATAAGAAGGGCCAAATAGTCATCAATGAGGGCGACTCAGGAGCACACTTCTACGTCGGAGCAACGGGTACCCTTGAGGTGAGCCAAGGTGATCGCGTCCTGGCCACTATGGGACCGGGAAAGGTCTTCGGGGAACTGGCCATCCTCTATAACTGCACCAGAACAGCCACCATCACTGCCGTCACTGACGCGCAAGTATGGGCGATCGATAGAAAAGTGTTCCAGCTGATCATGATGAAAACTGGGATGCAGCGCCATGAAGAGTATTTCAACTTCCTTAAGAGTGTGCCTTTGCTCAAAGATTTGTCTTCCGATAACCTCTTCAAGTTGGCGAACAGTTTGGAAGTAGACTTCTTCCATGAAGGTGAATACATTATAGTGGAGGGCTCCAGGGGAGATACCTTCTACATCATTAGTAAGGGGGAGGTCCGGATAACCCAATCCGTCCAAGGACAGAGAGAACCCCAGGAGGTTCGAAGCCTCCAGAAAGGAGACTTCTTCGGTGAGAAAGCGCTCCTTGGTGAGGACGTACGAACAGCGAATGTCTTGGCCAGCAAAGGGGGATGCGAGTGCTTGGCCGTTGATAGACAGTCTTTCAACGAACTGATCGGCAACATGCAGGCACTCCAGGACAAGAATTATGGAGACAAAGAAAGGGGAGCAACCAGGTCGAGCTCGGAGATGGATAATACAGAGATTGCACGAATCAAGCCGATACAAGATGAGCTAGCTGCTATACATCTCAACGATCTGGATATCATCGCTACATTGGGTGTTGGAGGGTTCGGTCGGGTCGAACTGGTTCAACTGGCAGGCGATAAGCGGACATTCGCCCTCAAGTGTTTGAAGAAACATCACATCGTAGAAACTCGGCAACAGGAACATATCTTTTCTGAGAAGAAGATCATGATGGAATCTAGCTCCCCCTTTATTGTCAAATTGTTCAAGACATTCCGTGATCAGAAGTATATCTACATGCTTATGGAAGTCTGCTTAGGAGGAGAGCTCTGGACTATCCTCAGGGACAAGGGTCATTTTGATGACCGGACAGCAAGGTTTTCCACCGCATGCGTAGTTGAAGCTTTCCACTATTTGCACAGTCGCGGCATCGTCTACCGCGATCTCAAGCCTGAGAATCTGCTCCTTGACAACAAAGGCTACGTCAAATTGGTCGACTTTGGTTTCGCGAAGAAGATCGGTTTTGGTCGTAAGACCTGGACCTTTTGTGGCACTCCCGAGTACGTGGCACCGGAAATCATCCTCAACAAAGGTCATGACCTGTCATGTGACTACTGGTCCCTGGGAATCCTCATCTTTGAGCTTTTGACCGGAAATCCGCCATTCACTGCCAATGATCCCATGAAGACATACAACGTTATTCTAAAGGGTATCGACATGGTCGAGTTTCCACGGAAGATTCCTCGTAGTGCTGGTAACCTTATCAAGCGACTCTGTCGGGACAATCCAGGCGAGAGAATCGGCTACCAGAAGAATGGCATTAGTGATATCAAGAAGCACAAATGGTTCCAAGGTTTTGACTGGGAAGGTCTCAGGAAGCAAGAAATTGCCGCCCCTCTTCCTCCAAAGGTGAAAGGCTCAAGCGACTGCAGCAACTTCGACAGCTACCCGAAAGATGTCGATATCCCGGCCGATGAAACGTCGGGATGGGACGAACACTTTTAA \n \nProtein: \nMLCLGIVSKQVIRDAILLNDFTKNFDSSQTREIVECMFPIDYKKGQIVINEGDSGAHFYVGATGTLEVSQGDRVLATMGPGKVFGELAILYNCTRTATITAVTDAQVWAIDRKVFQLIMMKTGMQRHEEYFNFLKSVPLLKDLSSDNLFKLANSLEVDFFHEGEYIIVEGSRGDTFYIISKGEVRITQSVQGQREPQEVRSLQKGDFFGEKALLGEDVRTANVLASKGGCECLAVDRQSFNELIGNMQALQDKNYGDKERGATRSSSEMDNTEIARIKPIQDELAAIHLNDLDIIATLGVGGFGRVELVQLAGDKRTFALKCLKKHHIVETRQQEHIFSEKKIMMESSSPFIVKLFKTFRDQKYIYMLMEVCLGGELWTILRDKGHFDDRTARFSTACVVEAFHYLHSRGIVYRDLKPENLLLDNKGYVKLVDFGFAKKIGFGRKTWTFCGTPEYVAPEIILNKGHDLSCDYWSLGILIFELLTGNPPFTANDPMKTYNVILKGIDMVEFPRKIPRSAGNLIKRLCRDNPGERIGYQKNGISDIKKHKWFQGFDWEGLRKQEIAAPLPPKVKGSSDCSNFDSYPKDVDIPADETSGWDEHF\n
GLEAN3_14574	SPU_014574		This is the N terminal part of DGK-beta; GLEAN3_14575 encodes the C terminal part. The sequence has been modified below, since the 3' part of the original sequence is actually non-coding, as has been indicated below. \nDNA: \nATGCCTGCTTGTAAGGGAACATGGCCTTTCTCTCAGTCCTATGCCTCCCTCGGCCAAACGGACCAAAAGAGACAGAACTCCAAGAAAGAGCGACCGAGCTGGAAGTACCGTCTCTTTCGCAACTCCAAACGAGGCCGGCAGAAAAAGGAAAAGGATAATAGGATTTCGAAACCCCTTTTTGTGCCTTGTACCTCGGCCCTTGTGATGGCGCTGCAGGGCCAAGCTCAGGATGTAGTGGATATCGTCATG \n \n{CCTTGTCCTTTCAAGACCAATGCTGGCACCTTAATGTTGTTGAAGCGTGTCACATCTAATCCCCCGAAGCCGTATGTCCCGCTTCTGCATTTTCCTTCACTGGAATTCGCAGTTTACACGGGACCCCTGACCTTCTTGGTCTTGACACCGATCAATTTAAGCATCACTCCGGGGTAA} Not Real! \n \nProtein: \nMPACKGTWPFSQSYASLGQTDQKRQNSKKERPSWKYRLFRNSKRGRQKKEKDNRISKPLFVPCTSALVMALQGQAQDVVDIVM \n \n \n{PCPFKTNAGTLMLLKRVTSNPPKPYVPLLHFPSLEFAVYTGPLTFLVLTPINLSITPG} Not Real!\n
GLEAN3_23009	SPU_023009		this protein lacks a start codon and has several small regions that are possibly spurious, and one missing short internal sequence. These are indicated in the protein sequence below: \nVPQPYFNLKKRISEEVEVRQKADPPILPIMTKTELADDIVKLIPDSGLRTPIELQEAILFLHEIGSIVHFTDHLNGLNDLYFIDPVWLACTLQRVTALPIGSLKGGKVHVETLRELSKKSSIEEDKFEQYLQLLARFEIVVPISHHWYLVPARLPRDNPGVMLSPHNTDDAPFHYLRRIYKMPYLPPGFWIRLVSRLIADLQMRDKKKKISSAGNERNLGSSKR \n{LTDDEMFPFQR} spurious? \nKSSISEAISFHQTESIYWREGIFFRHNTGQILVRSMVFPSTDKSPGVDILISCQEGHFSAMGCVVDQIEGLIKDWYP \n{GLCTSIHESIQPKVQRLVPCPICVIHGPDDFEPVT} spurious? \nEDLPHCYTVEELAQTYVRGETHITACANSKEKPITISVLIPDMFMKDLSIRHFKQEDFTLHMVSGQSLGQGGFGEVFRAKFRGETVAAKTMLPSRLLKNRMFSSASEGYASCASTSSSTSNRTGESTSTENDSLEAAMLMESFHKLRNEVAIMAKLDHPYIVNLVGVSIRHLCFAMDYAPLGDLRSYLFAEHQSARPHFVKRNIVLEPVLSRMLTYKISLQVASAVGYLHRKDIIYCDLKTDNILLFSSDVNEDVNIKLIDYGISKKYDLMGAMGMAGTPGFCAPEILQGKTFDEKVDWFSYGMFLYHLMTGLVPYYDQHSRIEIELAVNEGRKPTFNFHEYTMPPKQVFPALGALMESCWQNKPGERPHGETTLQLLSEPSFLCLRRVVEVEEEEGVSLAFSQGSQDE \n{LQAFAKQPLAVNANGCKTASLFVI } this is missing from the GLEAN \nDKVVHLIVESGRGTSVRSFQVDEDGCYKSSLLNELQCPMIRTAIATPCGTKIVVGTGGDCVQLYHLPSSHSSHASLLVEARVAGQPTSLHYIQKPSGQEHSLLFVGQANGVLTVLSHETEDSGHHITDDLKLVTRMQLSKHNLPCSSIVAVSRKKNGDSMAEQRRRYEEVVYNGANGVWNRSAKSNGTREERRDETTARKMRGGRNSLEPRERTGGSRDEADGTEVWVGCGNKLRIILLDDITLEPDGIQVAAGMEGIIEGIVQSQGSVWCFTSSALYVYQYSTETRSCLAILDCRESILVPGSFLPLYQEKRQEL \n{VRSWEEKREKEQELASATA} spurious? \nERTVNIIRPRSVGQLSVFAYKLARRPQF\n
GLEAN3_06632	SPU_006632		The expected zinc fingers were not predicted in the GLEAN3 model.\n
GLEAN3_10672	SPU_010672		The N-terminal region (amino acids 1-162) of the GLEAN model show sequence match to heat shock protein.  The presence of pfam H2TH domain and C-terminal sequence similarity are to Neh-2 annotated GLEAN3_06632. \n
GLEAN3_01388	SPU_001388		alpha thalassaemia mental retardation X-linked protein\n
GLEAN3_02329	SPU_002329	After reviewing the information it appears that the sequence is dispersed onto 3 different scaffolds. The first scaffold is continuous for the first 128 bases and the second scaffold is unique in that it is continuous from 127-1129. If the three scaffolds were combined the sequence would appear to have an orderly arrangement without any repeats or gaps within the sequence. There was EST information available from GBrowse assembly V0.5 and the transcriptome information wasn't very strong ranging from 4-16. This is an un-annotated gene and no additional information was found on Baylor gene information (comments)	In addition to Ercc6 homology the GLEAN3 model has an addition N-terminal region with sequence match to similar to Galactosylceramide sulfotransferase (GalCer  \nsulfotransferase) (Cerebroside sulfotransferase) (3-phosphoadenylylsulfate:galactosylceramide  \n3-sulfotransferase) (3-phosphoadenosine-5phosphosulfate:GalCer  \nsulfotransferase)  \n
GLEAN3_20070	SPU_020070		This model was annotated based on a manual inspection of protein alignments and domain structures. The features of this glean model are supported by other predictions and genome-wide tiling array embryonic hybridization data. \n \n
GLEAN3_09913	SPU_009913	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The beginning of the sequence (v2.1_scaffold47466) contains an internal repeat at 2190-2302, however, that may be apart of the gene sequence since it's only one repeat of one base pair. The rest of the sequence is completed on v2.1_scaffold10419, but this scaffold contains a low bit score and a high e-value when compared to v2.1_scaffold47466 (the best GLEAN model fit). There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat weak with most of the values being less than 5. 	#\nThere is notable sequence match to GLEAN3_07783.\n
GLEAN3_10307	SPU_010307		binds to CREB-binding protein (CBP); related to Snf2 family of proteins (by similarity).\n
GLEAN3_12027	SPU_012027		Conserved domain DEAD/H box 1 identified as expected for smarcad homolog \n \ncd00046, DEXDc, DEAD-like helicases superfamily. \nscored 75.9  expectation 1e-14 \n
GLEAN3_02950	SPU_002950		partial sequence of glean3_12238, different scaffolds\n
GLEAN3_15432	SPU_015432		Protein involved in transcription-coupled repair nucleotide excision repair of UV-induced DNA lesions; homolog of human CSB protein; Rad26p [Saccharomyces cerevisiae] --(By Similarity). \n
GLEAN3_19459	SPU_019459		Human Chrom-1 sequence match confined to residues 750 to end of the GLEAN3 model.  The N-terminal region of the model may be more similar to other isoforms of the same family.\n
GLEAN3_19921	SPU_019921		Similar to Saccharomyces cerevisiae RAD26, Homo sapiens ERCC6 and chromodomain helicase proteins of the SNF2 family.\n
GLEAN3_24818	SPU_024818		Related to GLEAN3_28391 with higher coverage of the Query used.\n
GLEAN3_07862	SPU_007862		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThe structure of this model is supported by the genome-wide tiling array embryonic hybridization experiment and by very similar models generated by other gene prediction protocols. \n \nIts structure resembles that of a partial Cbl gene, with most of the N-terminal domains of Cbl genes missing from this model, and we have therefore named this gene "Sp-Cbl-related 1". Sp-Cbl [GLEAN3_07863] is located immediately upstream of this gene and in the opposite orientation. Given that both models map to a large region of uninterrupted sequence, that they are in opposite orientations and the strong correlation with the tiling array hybridization data, we believe it is unlikely these models represent an assembly error but that they may represent a true localized gene rearrangement event. Nonetheless, additional experimental data are needed to confirm these observations.\n
GLEAN3_28332	SPU_028332		SWI/SNF-related matrix-associated actin-dependent  \nregulator of chromatin subfamily A member 3. \n \nTNF-response element binding protein.\n
GLEAN3_28391	SPU_028391		Related to GLEAN3_24818 as a subset.\n
GLEAN3_02003	SPU_002003		Partial prediction. Missing the last 150 AA from human protein.\n
GLEAN3_07863	SPU_007863		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThe structure of this model is supported by the genome-wide tiling array embryonic hybridization experiment and by very similar models generated by other gene prediction protocols. \n \nIts structure resembles that of a partial Cbl gene, with the most N-terminal Pfam Cbl_N domain missing from this model. \n \nA closely related model [GLEAN3_07863] is located immediately upstream of this gene and in the opposite orientation. Given that both models map to a large region of uninterrupted sequence, that they are in opposite orientations and the strong correlation with the tiling array hybridization data, we believe it is unlikely these models represent an assembly error but that they may represent a true localized gene rearrangement event. Nonetheless, additional experimental data are needed to confirm these observations.\n
GLEAN3_11695	SPU_011695		Likely missing a terminal exon (or more).\n
GLEAN3_13568	SPU_013568		Close match to GLEAN3_23027.\n
GLEAN3_23027	SPU_023027		Close match to GLEAN3_13568.\n
GLEAN3_03313	SPU_003313		This gene model has been modified by adding Glean3_03312 to the 5' end.  This sequence blasts to patched most highly,  but is most similar to human Niemann-Pick C2 by phylogenetic analysis.\n
GLEAN3_03312	SPU_003312		This gene model has been modified by adding it to the 5' end of Glean3_03313.  This sequence blasts to patched most highly,  but is most similar to human Niemann-Pick C2 by phylogenetic analysis.\n
GLEAN3_03472	SPU_003472		The GLEAN3 model does not cover the c-terminal helicase region of the mouse Brip-1 Query.\n
GLEAN3_09499	SPU_009499		The GLEAN3 model does not cover the c-terminal region of the Trel-1 Query.\n
GLEAN3_12100	SPU_012100		The GLEAN3_12100 sequence coverage is limited to the N-terminal 250 amino acids of the Query sequence presumably due to the short sequence length of the GLEAN3_12100 model. \n \nThe GLEAN3_12100 sequence is contained within GLEAN3_23149.\n
GLEAN3_23149	SPU_023149		Glean3_23149 is "contained" within this GLEAN3 model.\n
GLEAN3_28874	SPU_028874		Very similar to GLEAN3_13756.\n
GLEAN3_02790	SPU_002790		Appears to be identical to GLEAN3_22735.\n
GLEAN3_22479	SPU_022479		Appears to be identical to GLEAN3_04237\n
GLEAN3_10625	SPU_010625		C-terminal half, probably missing an exon in the middle \nmissing N-terminal half\n
GLEAN3_05083	SPU_005083		ectopic transmembrane domain at the N-terminus\n
GLEAN3_10032	SPU_010032		shorter than expected, missing N-terminus?\n
GLEAN3_22285	SPU_022285		SRCR(3). Probably incomplete. See GLEAN3_22286, 22287, 22288, 22289.\n
GLEAN3_22286	SPU_022286		SRCR(10)-TM. Probably incomplete. See GLEAN3_22285, 22287, 22288, 22289.\n
GLEAN3_22287	SPU_022287		SRCR(4)-TM. Probably incomplete. See GLEAN3_22285, 22286, 22288, 22289.\n
GLEAN3_22288	SPU_022288		SRCR(3). Probably incomplete. See GLEAN3_22285, 22286, 22287, 22289.\n
GLEAN3_22289	SPU_022289		SRCR(4)-TM. Probably incomplete. See GLEAN3_22285, 22286, 22287, 22288.\n
GLEAN3_22339	SPU_022339		SigPep-SRCR(3)-TM.\n
GLEAN3_22423	SPU_022423		SRCR(4). Probably incomplete. See GLEAN3_22424.\n
GLEAN3_22424	SPU_022424		SRCR(8)-Sushi(2). Probably incomplete.  See GLEAN3_22423. Like >gi|8547249|gb|AAF76319.1|AF228827_1 scavenger receptor cysteine-rich protein [Strongylocentrotus purpuratus]. \n
GLEAN3_22528	SPU_022528		SRCR(9)-EGF-SRCR(5). Possibly incomplete.\n
GLEAN3_22567	SPU_022567		SRCR(5)-TM. Probably incomplete.  See GLEAN3_22568, 22569.\n
GLEAN3_22568	SPU_022568		SRCR(5). Probably incomplete.  See GLEAN3_22567, 22569.\n
GLEAN3_22569	SPU_022569		SRCR(3). Probably incomplete.  See GLEAN3_22567, 22568.\n
GLEAN3_22814	SPU_022814		SRCR(4). Probably incomplete.\n
GLEAN3_23641	SPU_023641		SigPep-SRCR(4)-TM.  \n
GLEAN3_23677	SPU_023677		SRCR(5)-TM. Possibly incomplete.\n
GLEAN3_23840	SPU_023840		SRCR(2). Probably incomplete.\n
GLEAN3_23153	SPU_023153		Very similar but not identical in sequence to GLEAN3_23152.\n
GLEAN3_25824	SPU_025824		Groups with caspase 9 subfamily in neighbor joining of multiple sequence alignment.  Model may be missing an exon, as the predicted protein contains a CARD domain, but no capsase (peptidase C14) domain.  Similar but not identical to N-terminus of GLEAN3_00882.\n
GLEAN3_23991	SPU_023991		SRCR(2). Probably incomplete.\n
GLEAN3_24084	SPU_024084		SRCR(4). Probably incomplete. \n
GLEAN3_24390	SPU_024390		SigPep-SRCR(2)-WSC-TM.\n
GLEAN3_24408	SPU_024408		SigPep-SRCR(7)-TM.\n
GLEAN3_24440	SPU_024440	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBarse. However, there was mRNA information available from SpBase. When reviewing the excel data it appears that there are numerous gaps and repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being <5	SigPep-SRCR(4)-TM.\n
GLEAN3_24487	SPU_024487		SRCR(9). Probably incomplete. Like gi|8547243|gb|AAF76316.1|AF228824_1 scavenger receptor cysteine-rich protein variant 1 [Strongylocentrotus purpuratus] and >gi|8547245|gb|AAF76317.1|AF228825_1 scavenger receptor cysteine-rich protein variant 2 [Strongylocentrotus purpuratus]\n
GLEAN3_25862	SPU_025862		SRCR(3). Probably incoplete. See GLEAN3 25865.\n
GLEAN3_25865	SPU_025865		SRCR(3). Probably incomplete. See GLEAN3_25862.\n
GLEAN3_25968	SPU_025968		SigPep-SRCR(3). Probably incomplete.\n
GLEAN3_25983	SPU_025983		SRCR(27). probably incomplete.\n
GLEAN3_26234	SPU_026234		SRCR(3). Probably incomplete.\n
GLEAN3_26241	SPU_026241		SRCR(2). Probaly incomplete.\n
GLEAN3_26408	SPU_026408		SigPep-SRCR(3). Probably incomplete.\n
GLEAN3_26709	SPU_026709		SigPep-SRCR(4). Probably incomplete.\n
GLEAN3_26848	SPU_026848		SRCR(8). Probably incomplete.  See GLEAN3_26849.\n
GLEAN3_26849	SPU_026849	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. There was however mRNA information from SpBase. When reviewing the excel data it appears that the sequence might be distributed onto only one scaffold, however; there were numerous internal repeats represent within the scaffold. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak overall, with most of the values being less than 5. 	SRCR(14)-TM. Probably incomplete. See GLEAN3_26848.\n
GLEAN3_27037	SPU_027037		SigPep-SRCR(2). Possibly incomplete.\n
GLEAN3_27287	SPU_027287		SRCR(4). Probably incomplete.  See GLEAN3_27288.\n
GLEAN3_27288	SPU_027288		SigPep-SRCR(17). Probably incomplete. See GLEAN3_27287. Like >gi|4165053|gb|AAD08654.1| scavenger receptor cysteine-rich protein type 12 precursor [Strongylocentrotus purpuratus].\n
GLEAN3_27379	SPU_027379		SRCR(9). Probably incomplete.\n
GLEAN3_27619	SPU_027619		SRCR(5). Probably incomplete.\n
GLEAN3_28233	SPU_028233		SigPep-SRCR(3). Probably incomplete.\n
GLEAN3_28382	SPU_028382		SRCR(3). Probably incomplete.\n
GLEAN3_28612	SPU_028612		SRCR(6). Probably incomplete.\n
GLEAN3_28669	SPU_028669		SigPep-SRCR(3). Possibly incomplete.\n
GLEAN3_28680	SPU_028680		EGF_CA(6)-EGF-SRCR(2)-EGF(2).\n
GLEAN3_28804	SPU_028804		SRCR(3). Probably incomplete.\n
GLEAN3_17888	SPU_017888		partial\n
GLEAN3_08981	SPU_008981		From Best Accession annotation - \n"OB-fold nucleic acid binding domain. This family contains OB-fold domains that bind to nucleic acids. The family includes the anti-codon binding domain of lysyl, aspartyl, and asparaginyl -tRNA synthetases (See pfam00152). Aminoacyl-tRNA synthetases catalyse the addition of an amino acid to the appropriate tRNA molecule EC:6.1.1.-. This family also includes part of RecG helicase involved in DNA repair. Replication factor A is a heterotrimeric complex, that contains a subunit in this family. This domain is also found at the C-terminus of bacterial DNA polymerase III alpha chain."\n
GLEAN3_02063	SPU_002063		GLEAN3_02063 is a partial duplicate prediction for GLEAN3_04403. \n
GLEAN3_18479	SPU_018479		PARTIAL\n
GLEAN3_10770	SPU_010770		e val for AAH01211 = 1e-75; Kinesin family member C3 [Homo sapiens]. \ne val for NP_005541 = 2e-77; KIFC3 [Homo sapiens]. \nGLEAN3_10770 overlaps entire concensus motor domain when compared to human CENP-E, and has long C-terminal domain. \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, and B Rossetti. \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n
GLEAN3_13729	SPU_013729		#\ne val = 7e-77 for AAH01211, Kinesin family member C3 [Homo sapiens]. \ne val = 1e-78 for NP_005541, KIFC3 [Homo sapiens]. \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, and B Rossetti.\n
GLEAN3_19636	SPU_019636		PARTIAL \n
GLEAN3_15809	SPU_015809		e val for NP_065867 is 8e-45. \nLikely to be a fragment based on its short length \nAnnotation by RA Obar, RL Morris, BA Jeffrey, and B Rossetti.\n
GLEAN3_15437	SPU_015437		e val = 6e-168 against NP_004511, and e-149 for NP_006836; KIF2C [Homo sapiens]. \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, and B Rossetti. \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n
GLEAN3_20634	SPU_020634		#\ne val for NP_004312, "axonal transport of synaptic vesicles" [Hs], is 3e-118.  \ne val for NP_904325, kinesin family member 1B isoform alpha [Hs], is e-116 \nSee also GLEAN3_18764. \nAnnotation by: RA Obar, RL Morris, BA Jeffrey, and IJ Strachan\n
GLEAN3_21656	SPU_021656		CAA40175 is KHC cloned from purp. \nQ66K46_HUMAN Q66K46 (UniProtKB/TrEMBL accession number) \ne val = 0.0 for Q66K46 \ne val = 0.0 for NP_004512 KIF5B [Homo sapiens]. \nThrough comparison with CENP-E (NP_001804.2) as defined by Pfam PF00225, N-terminus of motor domain is likely incomplete.   \nPeptide length=1,077 AA.  \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, SA Tower, LE Shorey, and AP Rawson.   \n
GLEAN3_22296	SPU_022296		e val for AAH28155=7e-115, and against NP_015556 is e-117. \nAnnotation by RA Obar, RL Morris, BA Jeffrey, AM Musante.\n
GLEAN3_12645	SPU_012645		Inspection of the tiling array suggests that glean may have missed the following exons: TSVTQISRPLSLSLPLSSVAIHTFPSGSFPLYSSPYSPSLPLLFRCLHLSLCLLSLTLFSPFSFSPFSNYSFPSLHLPPSQSDT,ARIFHGRFAILLLGKWSLRDSERPKIILLWRGGKTARIIVLLLCFLNPHESYMIYDNLNIDLHEHELQLLRSVGLSLSLSLSPQLQYTPSLLVRSPCIPLRILLLYLFCFAVSISHFVCSV\n
GLEAN3_26237	SPU_026237		e val for CAI43180 and for NM_024704 is 0.0.  \nAnnotation by: RA Obar, RL Morris, BA Jeffrey.\n
GLEAN3_09940	SPU_009940		e val for NP_05541 is 3e-40. \nEval  = 5e-40 against ?CAK04214.1|  novel kinesin motor domain containing protein [Danio rerio], Length=690? \nsee also GLEAN3_13729 and GLEAN3_10770 also KIFC3-like. \nAnnotations by RA Obar, RL Morris, BA Jeffrey, and B Rossetti.\n
GLEAN3_11200	SPU_011200		Strongylocentrotus purpuratus similar to F-box only protein 28 (LOC574995)\n
GLEAN3_04722	SPU_004722		Strongylocentrotus purpuratus similar to WD-repeat  \nprotein 26 (LOC579141), mRNA \n
GLEAN3_06169	SPU_006169		#\ne val = 2e-160 for XP_780214 "PREDICTED: similar to breast cancer metastasis-suppressor 1-like [Strongylocentrotus purpuratus]" \ne val = e-55 for XP_789383 "PREDICTED: similar to kinesin-like motor protein C20orf23 [Strongylocentrotus purpuratus]" \ne val = e-32 for NP_115728, breast cancer metastasis-suppressor 1-like [Homo sapiens] \nAnnotation by R.A.Obar and R.L. Morris 020106\n
GLEAN3_00882	SPU_000882		Very similar to GLEAN3_01683.\n
GLEAN3_01683	SPU_001683		Very similar to GLEAN3_00882 and GLEAN3_13850; may be a duplication or haplotype of the latter.\n
GLEAN3_09653	SPU_009653		Very similar to GLEAN3_17523, may be a duplication or haplotype.  Also significant similarity to C-terminus of GLEAN3_11471.\n
GLEAN3_13850	SPU_013850		Very similar to GLEAN3_01683; may be a duplication or haplotype.\n
GLEAN3_21561	SPU_021561		Very high similarity to C-terminal sequences of GLEAN3_09497, GLEAN3_11339, GLEAN3_26645, GLEAN3_22941, and GLEAN3_26743.  Also has significant sequence similarity to parts of GLEAN3_01472, GLEAN3_09653, and GLEAN3_17523\n
GLEAN3_11916	SPU_011916		This GLEAN MAY be similar to the human nuclear receptor coactivator 5 (NCOA5).\n
GLEAN3_26645	SPU_026645		Very high similarity to C-terminal sequences of GLEAN3_09497, GLEAN3_11339, GLEAN3_21561, GLEAN3_22941, and GLEAN3_26743.  Also has significant sequence similarity to parts of GLEAN3_01472, GLEAN3_09653, and GLEAN3_17523\n
GLEAN3_22941	SPU_022941		Very similar to GLEAN3_09497, GLEAN3_11339, GLEAN3_21561, GLEAN3_26645, and GLEAN3_26743.  Also has significant sequence similarity to parts of GLEAN3_01472, GLEAN3_09653, and GLEAN3_17523.  Missing N-terminus (no methionine).\n
GLEAN3_00205	SPU_000205		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_01505	SPU_001505		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_04604	SPU_004604		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_05393	SPU_005393		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_26743	SPU_026743	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first scaffold consists of an orderly arrangement and continuous arrangement until 565. The rest of the sequence is continued on v2.1_scaffold22981. However, this scaffold contains numerous gaps and repeats present that occur throughout the scaffold. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5. 	Very similar to C-terminal sequences of GLEAN3_09497, GLEAN3_11339, GLEAN3_21561, GLEAN3_22941, and GLEAN3_26645.  Also has significant sequence similarity to parts of GLEAN3_19865, GLEAN3_10466, and GLEAN3_02718....\n
GLEAN3_09555	SPU_009555		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_10114	SPU_010114		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_13859	SPU_013859		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_14831	SPU_014831		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_16886	SPU_016886		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_17772	SPU_017772		Contains MMR_HSR1 domain (GTPase of unknown function domain) \nNote: Identical to GLEAN3_16886 except missing 3'end\n
GLEAN3_20224	SPU_020224		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_20366	SPU_020366		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_21394	SPU_021394		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_24411	SPU_024411		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_25797	SPU_025797		Contains MMR_HSR1 domain (GTPase of unknown function domain)\n
GLEAN3_02178	SPU_002178		Model contains two adjacent astacin protease domains followed by an EGF domain.  This architecture is unique among members of this groups of metalloproteases.\n
GLEAN3_00238	SPU_000238		Portion of the early histone gene repeat\n
GLEAN3_00366	SPU_000366		Portion of the early histone gene repeat\n
GLEAN3_01565	SPU_001565		Portion of the early histone gene repeat\n
GLEAN3_01759	SPU_001759		Portion of the early histone gene repeat\n
GLEAN3_02067	SPU_002067		Portion of the early histone gene repeat\n
GLEAN3_02683	SPU_002683		Portion of the early histone gene repeat\n
GLEAN3_02822	SPU_002822		Portion of the early histone gene repeat\n
GLEAN3_02867	SPU_002867		Portion of the early histone gene repeat\n
GLEAN3_04191	SPU_004191		Portion of the early histone gene repeat\n
GLEAN3_04693	SPU_004693		Portion of the early histone gene repeat\n
GLEAN3_05923	SPU_005923		Portion of the early histone gene repeat\n
GLEAN3_06695	SPU_006695		Portion of the early histone gene repeat\n
GLEAN3_07047	SPU_007047		Portion of the early histone gene repeat\n
GLEAN3_08044	SPU_008044		Portion of the early histone gene repeat\n
GLEAN3_09492	SPU_009492		Portion of the early histone gene repeat\n
GLEAN3_12220	SPU_012220		Portion of the early histone gene repeat\n
GLEAN3_12387	SPU_012387		Portion of the early histone gene repeat\n
GLEAN3_12571	SPU_012571		Portion of the early histone gene repeat\n
GLEAN3_13691	SPU_013691		Portion of the early histone gene repeat\n
GLEAN3_14169	SPU_014169		Portion of the early histone gene repeat\n
GLEAN3_15602	SPU_015602		Portion of the early histone gene repeat\n
GLEAN3_17595	SPU_017595		Portion of the early histone gene repeat\n
GLEAN3_18058	SPU_018058		Portion of the early histone gene repeat\n
GLEAN3_18349	SPU_018349		Portion of the early histone gene repeat\n
GLEAN3_18424	SPU_018424		Portion of the early histone gene repeat\n
GLEAN3_19037	SPU_019037		Portion of the early histone gene repeat\n
GLEAN3_19246	SPU_019246		Portion of the early histone gene repeat\n
GLEAN3_20433	SPU_020433		Portion of the early histone gene repeat\n
GLEAN3_20553	SPU_020553		Portion of the early histone gene repeat\n
GLEAN3_23417	SPU_023417		Portion of the early histone gene repeat\n
GLEAN3_23579	SPU_023579		Portion of the early histone gene repeat\n
GLEAN3_23581	SPU_023581		Portion of the early histone gene repeat\n
GLEAN3_23748	SPU_023748		Portion of the early histone gene repeat\n
GLEAN3_24254	SPU_024254		Portion of the early histone gene repeat\n
GLEAN3_25387	SPU_025387		Portion of the early histone gene repeat\n
GLEAN3_26420	SPU_026420		Portion of the early histone gene repeat\n
GLEAN3_26642	SPU_026642		Portion of the early histone gene repeat\n
GLEAN3_26749	SPU_026749		Portion of the early histone gene repeat\n
GLEAN3_26754	SPU_026754		Portion of the early histone gene repeat\n
GLEAN3_27950	SPU_027950		Portion of the early histone gene repeat\n
GLEAN3_28440	SPU_028440		Portion of the early histone gene repeat\n
GLEAN3_28648	SPU_028648		Portion of the early histone gene repeat\n
GLEAN3_00445	SPU_000445		Portion of the early histone gene repeat\n
GLEAN3_00465	SPU_000465		Portion of the early histone gene repeat\n
GLEAN3_00466	SPU_000466		Portion of the early histone gene repeat\n
GLEAN3_00609	SPU_000609		Portion of the early histone gene repeat\n
GLEAN3_00833	SPU_000833		Portion of the early histone gene repeat\n
GLEAN3_00894	SPU_000894		Portion of the early histone gene repeat\n
GLEAN3_01635	SPU_001635		Portion of the early histone gene repeat\n
GLEAN3_01693	SPU_001693		Portion of the early histone gene repeat\n
GLEAN3_01908	SPU_001908		Portion of the early histone gene repeat\n
GLEAN3_02377	SPU_002377		Portion of the early histone gene repeat\n
GLEAN3_02503	SPU_002503	For this particular GLEAN model there was no Cds information available from Baylor annotations or SpBase. There was however mRNA information available from SpBase. When examing the excel data it was evident that there were numerous repeats and sequence overlaps throughout the data. An Error message was received when doing a transcriptome intensity search. However, there was some Est information available from GBrowse assembly V0.5	Portion of the early histone gene repeat\n
GLEAN3_02577	SPU_002577		Portion of the early histone gene repeat\n
GLEAN3_02621	SPU_002621		Portion of the early histone gene repeat\n
GLEAN3_02721	SPU_002721		Portion of the early histone gene repeat\n
GLEAN3_02805	SPU_002805		Portion of the early histone gene repeat\n
GLEAN3_03694	SPU_003694		Portion of the early histone gene repeat\n
GLEAN3_03790	SPU_003790		Portion of the early histone gene repeat\n
GLEAN3_04339	SPU_004339		Portion of the early histone gene repeat\n
GLEAN3_05077	SPU_005077		Portion of the early histone gene repeat\n
GLEAN3_05245	SPU_005245		Portion of the early histone gene repeat\n
GLEAN3_05535	SPU_005535		Portion of the early histone gene repeat\n
GLEAN3_05934	SPU_005934		Portion of the early histone gene repeat\n
GLEAN3_06734	SPU_006734		Portion of the early histone gene repeat\n
GLEAN3_07281	SPU_007281		Portion of the early histone gene repeat\n
GLEAN3_08232	SPU_008232		Portion of the early histone gene repeat\n
GLEAN3_08425	SPU_008425		Portion of the early histone gene repeat\n
GLEAN3_08451	SPU_008451		Portion of the early histone gene repeat\n
GLEAN3_08765	SPU_008765		Portion of the early histone gene repeat\n
GLEAN3_09051	SPU_009051		Portion of the early histone gene repeat\n
GLEAN3_10863	SPU_010863		Portion of the early histone gene repeat\n
GLEAN3_11373	SPU_011373		Portion of the early histone gene repeat\n
GLEAN3_12033	SPU_012033		Portion of the early histone gene repeat\n
GLEAN3_12737	SPU_012737		Portion of the early histone gene repeat\n
GLEAN3_14033	SPU_014033		Portion of the early histone gene repeat\n
GLEAN3_14153	SPU_014153		Portion of the early histone gene repeat\n
GLEAN3_14506	SPU_014506		Portion of the early histone gene repeat\n
GLEAN3_14870	SPU_014870		Portion of the early histone gene repeat\n
GLEAN3_14988	SPU_014988		Portion of the early histone gene repeat\n
GLEAN3_15532	SPU_015532		Portion of the early histone gene repeat\n
GLEAN3_15646	SPU_015646		Portion of the early histone gene repeat\n
GLEAN3_15686	SPU_015686		Portion of the early histone gene repeat\n
GLEAN3_16297	SPU_016297		Portion of the early histone gene repeat\n
GLEAN3_16809	SPU_016809		Portion of the early histone gene repeat\n
GLEAN3_16905	SPU_016905		Portion of the early histone gene repeat\n
GLEAN3_16951	SPU_016951		Portion of the early histone gene repeat\n
GLEAN3_18034	SPU_018034		Portion of the early histone gene repeat\n
GLEAN3_18161	SPU_018161		Portion of the early histone gene repeat\n
GLEAN3_18700	SPU_018700		Portion of the early histone gene repeat\n
GLEAN3_18896	SPU_018896		Portion of the early histone gene repeat\n
GLEAN3_19122	SPU_019122		Portion of the early histone gene repeat\n
GLEAN3_19321	SPU_019321		Portion of the early histone gene repeat\n
GLEAN3_19619	SPU_019619		Portion of the early histone gene repeat\n
GLEAN3_19867	SPU_019867		Portion of the early histone gene repeat\n
GLEAN3_20189	SPU_020189		Portion of the early histone gene repeat\n
GLEAN3_20285	SPU_020285		Portion of the early histone gene repeat\n
GLEAN3_20438	SPU_020438		Portion of the early histone gene repeat\n
GLEAN3_20439	SPU_020439		Portion of the early histone gene repeat\n
GLEAN3_20521	SPU_020521		Portion of the early histone gene repeat\n
GLEAN3_21756	SPU_021756		Portion of the early histone gene repeat\n
GLEAN3_22660	SPU_022660		Portion of the early histone gene repeat\n
GLEAN3_23533	SPU_023533		Portion of the early histone gene repeat\n
GLEAN3_23891	SPU_023891		Portion of the early histone gene repeat\n
GLEAN3_23940	SPU_023940		Portion of the early histone gene repeat\n
GLEAN3_24477	SPU_024477		Portion of the early histone gene repeat\n
GLEAN3_25050	SPU_025050		Portion of the early histone gene repeat\n
GLEAN3_25461	SPU_025461		Portion of the early histone gene repeat\n
GLEAN3_25562	SPU_025562		Portion of the early histone gene repeat\n
GLEAN3_27043	SPU_027043		Portion of the early histone gene repeat\n
GLEAN3_28654	SPU_028654		Portion of the early histone gene repeat\n
GLEAN3_28929	SPU_028929		Portion of the early histone gene repeat\n
GLEAN3_00093	SPU_000093	For this particular GLEAN model there was no orderly arrangement of the sequence and there were numerous subject hits. When a search was performed using the SpBase search engine there were no gene features or CDS for this model. However, the mRNA sequence was available from SpBase.	Portion of the early histone gene repeat\n
GLEAN3_00306	SPU_000306		Portion of the early histone gene repeat\n
GLEAN3_00467	SPU_000467		Portion of the early histone gene repeat\n
GLEAN3_00846	SPU_000846		Portion of the early histone gene repeat\n
GLEAN3_01111	SPU_001111		Portion of the early histone gene repeat\n
GLEAN3_01234	SPU_001234		Portion of the early histone gene repeat\n
GLEAN3_01375	SPU_001375		Portion of the early histone gene repeat\n
GLEAN3_01553	SPU_001553		Portion of the early histone gene repeat\n
GLEAN3_01637	SPU_001637		Portion of the early histone gene repeat\n
GLEAN3_01661	SPU_001661		Portion of the early histone gene repeat\n
GLEAN3_01961	SPU_001961		Portion of the early histone gene repeat\n
GLEAN3_02318	SPU_002318		Portion of the early histone gene repeat\n
GLEAN3_02354	SPU_002354		Portion of the early histone gene repeat\n
GLEAN3_02362	SPU_002362		Portion of the early histone gene repeat\n
GLEAN3_02502	SPU_002502		Portion of the early histone gene repeat\n
GLEAN3_03081	SPU_003081		Portion of the early histone gene repeat\n
GLEAN3_04281	SPU_004281		Portion of the early histone gene repeat\n
GLEAN3_04696	SPU_004696		Portion of the early histone gene repeat\n
GLEAN3_04847	SPU_004847		Portion of the early histone gene repeat\n
GLEAN3_05943	SPU_005943		Portion of the early histone gene repeat\n
GLEAN3_06062	SPU_006062		Portion of the early histone gene repeat\n
GLEAN3_06184	SPU_006184		Portion of the early histone gene repeat\n
GLEAN3_06480	SPU_006480		Portion of the early histone gene repeat\n
GLEAN3_06827	SPU_006827		Portion of the early histone gene repeat\n
GLEAN3_07002	SPU_007002		Portion of the early histone gene repeat\n
GLEAN3_07525	SPU_007525		Portion of the early histone gene repeat\n
GLEAN3_08092	SPU_008092		Portion of the early histone gene repeat\n
GLEAN3_08706	SPU_008706		Portion of the early histone gene repeat\n
GLEAN3_08729	SPU_008729		Portion of the early histone gene repeat\n
GLEAN3_08925	SPU_008925		Portion of the early histone gene repeat\n
GLEAN3_09102	SPU_009102		Portion of the early histone gene repeat\n
GLEAN3_09779	SPU_009779		Portion of the early histone gene repeat\n
GLEAN3_10037	SPU_010037		Portion of the early histone gene repeat\n
GLEAN3_10166	SPU_010166		Portion of the early histone gene repeat\n
GLEAN3_10251	SPU_010251		Portion of the early histone gene repeat\n
GLEAN3_10558	SPU_010558		Portion of the early histone gene repeat\n
GLEAN3_10732	SPU_010732		Portion of the early histone gene repeat\n
GLEAN3_11590	SPU_011590		Portion of the early histone gene repeat\n
GLEAN3_11627	SPU_011627		Portion of the early histone gene repeat\n
GLEAN3_11788	SPU_011788		Portion of the early histone gene repeat\n
GLEAN3_12030	SPU_012030		Portion of the early histone gene repeat\n
GLEAN3_12346	SPU_012346		Portion of the early histone gene repeat\n
GLEAN3_12718	SPU_012718		Portion of the early histone gene repeat\n
GLEAN3_12796	SPU_012796		Portion of the early histone gene repeat\n
GLEAN3_12836	SPU_012836		Portion of the early histone gene repeat\n
GLEAN3_12916	SPU_012916		Portion of the early histone gene repeat\n
GLEAN3_13518	SPU_013518		Portion of the early histone gene repeat\n
GLEAN3_13548	SPU_013548		Portion of the early histone gene repeat\n
GLEAN3_14356	SPU_014356		Portion of the early histone gene repeat\n
GLEAN3_14675	SPU_014675		Portion of the early histone gene repeat\n
GLEAN3_14825	SPU_014825		Portion of the early histone gene repeat\n
GLEAN3_15193	SPU_015193		Portion of the early histone gene repeat\n
GLEAN3_15236	SPU_015236		Portion of the early histone gene repeat\n
GLEAN3_16072	SPU_016072		Portion of the early histone gene repeat\n
GLEAN3_16281	SPU_016281		Portion of the early histone gene repeat\n
GLEAN3_16620	SPU_016620		Portion of the early histone gene repeat\n
GLEAN3_16985	SPU_016985		Portion of the early histone gene repeat\n
GLEAN3_17565	SPU_017565		Portion of the early histone gene repeat\n
GLEAN3_17576	SPU_017576		Portion of the early histone gene repeat\n
GLEAN3_18089	SPU_018089		Portion of the early histone gene repeat\n
GLEAN3_18398	SPU_018398		Portion of the early histone gene repeat\n
GLEAN3_18699	SPU_018699		Portion of the early histone gene repeat\n
GLEAN3_19093	SPU_019093		Portion of the early histone gene repeat\n
GLEAN3_19342	SPU_019342		Portion of the early histone gene repeat\n
GLEAN3_19660	SPU_019660		Portion of the early histone gene repeat\n
GLEAN3_19832	SPU_019832		Portion of the early histone gene repeat\n
GLEAN3_20352	SPU_020352		Portion of the early histone gene repeat\n
GLEAN3_20583	SPU_020583		Portion of the early histone gene repeat\n
GLEAN3_21066	SPU_021066	For this particular GLEAN model there was no Cds information from either Baylor annotations or SpBase. There was however mRNA information available from SpBase. When examining the excel data it appeared that the sequence may have been distributed onto two different scaffolds. The first portion of the sequence appeared to be on subject gb|DS008717| and the rest of the sequence was on gb|DS008632|. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with several values being greater than 5.	Portion of the early histone gene repeat\n
GLEAN3_21094	SPU_021094		Portion of the early histone gene repeat\n
GLEAN3_21349	SPU_021349		Portion of the early histone gene repeat\n
GLEAN3_21730	SPU_021730		Portion of the early histone gene repeat\n
GLEAN3_21848	SPU_021848		Portion of the early histone gene repeat\n
GLEAN3_22453	SPU_022453		Portion of the early histone gene repeat\n
GLEAN3_22688	SPU_022688		Portion of the early histone gene repeat\n
GLEAN3_22751	SPU_022751		Portion of the early histone gene repeat\n
GLEAN3_23418	SPU_023418		Portion of the early histone gene repeat\n
GLEAN3_23419	SPU_023419		Portion of the early histone gene repeat\n
GLEAN3_23580	SPU_023580		Portion of the early histone gene repeat\n
GLEAN3_23699	SPU_023699		Portion of the early histone gene repeat\n
GLEAN3_23857	SPU_023857		Portion of the early histone gene repeat\n
GLEAN3_24550	SPU_024550		Portion of the early histone gene repeat\n
GLEAN3_24562	SPU_024562		Portion of the early histone gene repeat\n
GLEAN3_25109	SPU_025109		Portion of the early histone gene repeat\n
GLEAN3_26158	SPU_026158		Portion of the early histone gene repeat\n
GLEAN3_26519	SPU_026519		Portion of the early histone gene repeat\n
GLEAN3_26812	SPU_026812		Portion of the early histone gene repeat\n
GLEAN3_26911	SPU_026911		Portion of the early histone gene repeat\n
GLEAN3_27184	SPU_027184		Portion of the early histone gene repeat\n
GLEAN3_27190	SPU_027190		Portion of the early histone gene repeat\n
GLEAN3_27569	SPU_027569		Portion of the early histone gene repeat\n
GLEAN3_28350	SPU_028350		Portion of the early histone gene repeat\n
GLEAN3_28515	SPU_028515		Portion of the early histone gene repeat\n
GLEAN3_28657	SPU_028657		Portion of the early histone gene repeat\n
GLEAN3_28815	SPU_028815		Portion of the early histone gene repeat\n
GLEAN3_00307	SPU_000307		Portion of the early histone gene repeat\n
GLEAN3_00464	SPU_000464		Portion of the early histone gene repeat\n
GLEAN3_00474	SPU_000474		Portion of the early histone gene repeat\n
GLEAN3_00892	SPU_000892		Portion of the early histone gene repeat\n
GLEAN3_01021	SPU_001021		Portion of the early histone gene repeat\n
GLEAN3_02486	SPU_002486		Portion of the early histone gene repeat\n
GLEAN3_02772	SPU_002772		Portion of the early histone gene repeat\n
GLEAN3_02828	SPU_002828		Portion of the early histone gene repeat\n
GLEAN3_02905	SPU_002905		Portion of the early histone gene repeat\n
GLEAN3_03454	SPU_003454		Portion of the early histone gene repeat\n
GLEAN3_03576	SPU_003576		Portion of the early histone gene repeat\n
GLEAN3_03611	SPU_003611		Portion of the early histone gene repeat\n
GLEAN3_04020	SPU_004020		Portion of the early histone gene repeat\n
GLEAN3_05019	SPU_005019		Portion of the early histone gene repeat\n
GLEAN3_05645	SPU_005645		Portion of the early histone gene repeat\n
GLEAN3_05897	SPU_005897		Portion of the early histone gene repeat\n
GLEAN3_06563	SPU_006563		Portion of the early histone gene repeat\n
GLEAN3_07637	SPU_007637		Portion of the early histone gene repeat\n
GLEAN3_07666	SPU_007666		Portion of the early histone gene repeat\n
GLEAN3_07835	SPU_007835		Portion of the early histone gene repeat\n
GLEAN3_08980	SPU_008980		Portion of the early histone gene repeat\n
GLEAN3_08993	SPU_008993		Portion of the early histone gene repeat\n
GLEAN3_09181	SPU_009181		Portion of the early histone gene repeat\n
GLEAN3_11743	SPU_011743		Portion of the early histone gene repeat\n
GLEAN3_12142	SPU_012142		Portion of the early histone gene repeat\n
GLEAN3_13389	SPU_013389		Portion of the early histone gene repeat\n
GLEAN3_13690	SPU_013690		Portion of the early histone gene repeat\n
GLEAN3_14085	SPU_014085		Portion of the early histone gene repeat\n
GLEAN3_14175	SPU_014175		Portion of the early histone gene repeat\n
GLEAN3_15261	SPU_015261		Portion of the early histone gene repeat\n
GLEAN3_15560	SPU_015560		Portion of the early histone gene repeat\n
GLEAN3_15871	SPU_015871		Portion of the early histone gene repeat\n
GLEAN3_16038	SPU_016038		Portion of the early histone gene repeat\n
GLEAN3_16047	SPU_016047		Portion of the early histone gene repeat\n
GLEAN3_16236	SPU_016236		Portion of the early histone gene repeat\n
GLEAN3_16420	SPU_016420		Portion of the early histone gene repeat\n
GLEAN3_17399	SPU_017399		Portion of the early histone gene repeat\n
GLEAN3_17732	SPU_017732		Portion of the early histone gene repeat\n
GLEAN3_18526	SPU_018526		Portion of the early histone gene repeat\n
GLEAN3_18560	SPU_018560		Portion of the early histone gene repeat\n
GLEAN3_18569	SPU_018569		Portion of the early histone gene repeat\n
GLEAN3_19977	SPU_019977		Portion of the early histone gene repeat\n
GLEAN3_20103	SPU_020103		Portion of the early histone gene repeat\n
GLEAN3_20800	SPU_020800		Portion of the early histone gene repeat\n
GLEAN3_20913	SPU_020913		Portion of the early histone gene repeat\n
GLEAN3_22170	SPU_022170		Portion of the early histone gene repeat\n
GLEAN3_22188	SPU_022188		Portion of the early histone gene repeat\n
GLEAN3_23994	SPU_023994		Portion of the early histone gene repeat\n
GLEAN3_25103	SPU_025103		Portion of the early histone gene repeat\n
GLEAN3_25148	SPU_025148		Portion of the early histone gene repeat\n
GLEAN3_25417	SPU_025417		Portion of the early histone gene repeat\n
GLEAN3_25463	SPU_025463		Portion of the early histone gene repeat\n
GLEAN3_25602	SPU_025602		Portion of the early histone gene repeat\n
GLEAN3_25804	SPU_025804		Portion of the early histone gene repeat\n
GLEAN3_26462	SPU_026462		Portion of the early histone gene repeat\n
GLEAN3_26670	SPU_026670		Portion of the early histone gene repeat\n
GLEAN3_27750	SPU_027750		Portion of the early histone gene repeat\n
GLEAN3_28062	SPU_028062		Portion of the early histone gene repeat\n
GLEAN3_28816	SPU_028816		Portion of the early histone gene repeat\n
GLEAN3_00685	SPU_000685		Portion of the early histone gene repeat\n
GLEAN3_00768	SPU_000768		Portion of the early histone gene repeat\n
GLEAN3_01509	SPU_001509		Portion of the early histone gene repeat\n
GLEAN3_02089	SPU_002089		Portion of the early histone gene repeat\n
GLEAN3_02504	SPU_002504	For this particular GLEAN model there was no CDS information available in both the SpBase search engine and in Baylor annotations. There was also no gene information provided in either search engine as well.\nAdditional information found: Portion of the early histone gene repeat.	Portion of the early histone gene repeat\n
GLEAN3_02699	SPU_002699		Portion of the early histone gene repeat\n
GLEAN3_02794	SPU_002794		Portion of the early histone gene repeat\n
GLEAN3_04941	SPU_004941		Portion of the early histone gene repeat\n
GLEAN3_04970	SPU_004970		Portion of the early histone gene repeat\n
GLEAN3_05173	SPU_005173		Portion of the early histone gene repeat\n
GLEAN3_05210	SPU_005210		Portion of the early histone gene repeat\n
GLEAN3_05424	SPU_005424		Portion of the early histone gene repeat\n
GLEAN3_05591	SPU_005591		Portion of the early histone gene repeat\n
GLEAN3_05704	SPU_005704		Portion of the early histone gene repeat\n
GLEAN3_05933	SPU_005933		Portion of the early histone gene repeat\n
GLEAN3_06061	SPU_006061		Portion of the early histone gene repeat\n
GLEAN3_06205	SPU_006205		Portion of the early histone gene repeat\n
GLEAN3_07032	SPU_007032		Portion of the early histone gene repeat\n
GLEAN3_07314	SPU_007314		Portion of the early histone gene repeat\n
GLEAN3_07489	SPU_007489		Portion of the early histone gene repeat\n
GLEAN3_07705	SPU_007705		Portion of the early histone gene repeat\n
GLEAN3_07762	SPU_007762		Portion of the early histone gene repeat\n
GLEAN3_07953	SPU_007953		Portion of the early histone gene repeat\n
GLEAN3_08055	SPU_008055		Portion of the early histone gene repeat\n
GLEAN3_08489	SPU_008489		Portion of the early histone gene repeat\n
GLEAN3_09064	SPU_009064		Portion of the early histone gene repeat\n
GLEAN3_09329	SPU_009329		Portion of the early histone gene repeat\n
GLEAN3_09565	SPU_009565		Portion of the early histone gene repeat\n
GLEAN3_09569	SPU_009569		Portion of the early histone gene repeat\n
GLEAN3_11273	SPU_011273		Portion of the early histone gene repeat\n
GLEAN3_11680	SPU_011680		Portion of the early histone gene repeat\n
GLEAN3_11907	SPU_011907		Portion of the early histone gene repeat\n
GLEAN3_12107	SPU_012107		Portion of the early histone gene repeat\n
GLEAN3_12144	SPU_012144		Portion of the early histone gene repeat\n
GLEAN3_12231	SPU_012231		Portion of the early histone gene repeat\n
GLEAN3_12331	SPU_012331		Portion of the early histone gene repeat\n
GLEAN3_12802	SPU_012802		Portion of the early histone gene repeat\n
GLEAN3_13496	SPU_013496		Portion of the early histone gene repeat\n
GLEAN3_14872	SPU_014872		Portion of the early histone gene repeat\n
GLEAN3_14878	SPU_014878		Portion of the early histone gene repeat\n
GLEAN3_14893	SPU_014893		Portion of the early histone gene repeat\n
GLEAN3_14933	SPU_014933		Portion of the early histone gene repeat\n
GLEAN3_15147	SPU_015147		Portion of the early histone gene repeat\n
GLEAN3_15166	SPU_015166		Portion of the early histone gene repeat\n
GLEAN3_15201	SPU_015201		Portion of the early histone gene repeat\n
GLEAN3_15645	SPU_015645		Portion of the early histone gene repeat\n
GLEAN3_15663	SPU_015663		Portion of the early histone gene repeat\n
GLEAN3_15694	SPU_015694		Portion of the early histone gene repeat\n
GLEAN3_15911	SPU_015911		Portion of the early histone gene repeat\n
GLEAN3_15956	SPU_015956		Portion of the early histone gene repeat\n
GLEAN3_16469	SPU_016469		Portion of the early histone gene repeat\n
GLEAN3_17435	SPU_017435		Portion of the early histone gene repeat\n
GLEAN3_17482	SPU_017482		Portion of the early histone gene repeat\n
GLEAN3_17997	SPU_017997		Portion of the early histone gene repeat\n
GLEAN3_18670	SPU_018670		Portion of the early histone gene repeat\n
GLEAN3_19218	SPU_019218		Portion of the early histone gene repeat\n
GLEAN3_19474	SPU_019474		Portion of the early histone gene repeat\n
GLEAN3_19847	SPU_019847		Portion of the early histone gene repeat\n
GLEAN3_20135	SPU_020135		Portion of the early histone gene repeat\n
GLEAN3_20454	SPU_020454		Portion of the early histone gene repeat\n
GLEAN3_20503	SPU_020503		Portion of the early histone gene repeat\n
GLEAN3_20862	SPU_020862		Portion of the early histone gene repeat\n
GLEAN3_21741	SPU_021741		Portion of the early histone gene repeat\n
GLEAN3_22016	SPU_022016		Portion of the early histone gene repeat\n
GLEAN3_22963	SPU_022963		Portion of the early histone gene repeat\n
GLEAN3_23075	SPU_023075		Portion of the early histone gene repeat\n
GLEAN3_23698	SPU_023698		Portion of the early histone gene repeat\n
GLEAN3_24131	SPU_024131		Portion of the early histone gene repeat\n
GLEAN3_25530	SPU_025530		Portion of the early histone gene repeat\n
GLEAN3_25880	SPU_025880		Portion of the early histone gene repeat\n
GLEAN3_26174	SPU_026174		Portion of the early histone gene repeat\n
GLEAN3_26434	SPU_026434		Portion of the early histone gene repeat\n
GLEAN3_26542	SPU_026542		Portion of the early histone gene repeat\n
GLEAN3_27036	SPU_027036		Portion of the early histone gene repeat\n
GLEAN3_27210	SPU_027210		Portion of the early histone gene repeat\n
GLEAN3_27269	SPU_027269		Portion of the early histone gene repeat\n
GLEAN3_27412	SPU_027412		Portion of the early histone gene repeat\n
GLEAN3_27474	SPU_027474		Portion of the early histone gene repeat\n
GLEAN3_27801	SPU_027801		Portion of the early histone gene repeat\n
GLEAN3_28399	SPU_028399		Portion of the early histone gene repeat\n
GLEAN3_28656	SPU_028656		Portion of the early histone gene repeat\n
GLEAN3_28658	SPU_028658		Portion of the early histone gene repeat\n
GLEAN3_28932	SPU_028932		Portion of the early histone gene repeat\n
GLEAN3_01465	SPU_001465		Model contains exons encoding cub repeats that are nearly identical to glean3_08802.  it probably is a partial CDS of an allele in 08802 or another closely related gene.\n
GLEAN3_11658	SPU_011658		The predicted ORF has a N-terminal sequence longuer than  homologous cyclin H in other species. The first Met of these Cyclin H is conserved in Sp raising the possibility that the      initiation codon predicted in the features is perhaps not the   true one.\n
GLEAN3_00328	SPU_000328		Three GLEAN: GLEAN3_00328,14989 and 0011295 encode the cyclin L protein. They differ in the N-terminal end.\n
GLEAN3_20986	SPU_020986		Unknown protein containing a Cyclin domain\n
GLEAN3_11295	SPU_011295		Potential N-terminal sequence of Sp-Cyclin L found in GLEAN3_14989 \nThree GLEAN: GLEAN3_00328,14989 and 0011295 encode the cyclin L protein. They differ in the N-terminal end.\n
GLEAN3_11190	SPU_011190		This gene was annotated and modified based on bioinformatic evidence (analysis of multiple protein sequence alignments and domain structures). \n \nThe original version of GLEAN3_11190 showed a domain composition/structure very similar but not identical to that of vertebrate and Drosophila Stam genes. Inspection of other predictions revealed that an otherwise almost identical Genscan model incorporates an additional exon (supported by noticeable signal from the genome-wide tiling array hybridization data). When translated, this Genscan model showed an improved alignment to Stam genes and a domain structure now identical to that of vertebrate and fuit fly Stams. We have therefore decided to modify GLEAN3_11190 accordingly.\n
GLEAN3_28525	SPU_028525		This gene may represent a partial duplication of GLEAN3_11190. It is located at the end of a relatively small scaffold, and their sequence identity is 99% at the aminoacid level and >94% at the nucleotide level (including intronic and flanking sequences from the contigs where both models map), which suggests they may reflect an assembly error.\n
GLEAN3_08053	SPU_008053		This Glean sequence correspond to an exact duplication of the N-terminal region of Sp-Faim (GLEAN3_03262).\n
GLEAN3_19983	SPU_019983		>GLEAN3_19983|Scaffold499|161923|162036|  \n>GLEAN3_19983|Scaffold499|162415|162741|  \ncontain sequences conserved in DAN proteins.\n
GLEAN3_03281	SPU_003281		Partial sequence. \n
GLEAN3_05441	SPU_005441		Partial sequence longer than its duplicate GLEAN3_03281 but still partial. N.B.: the two duplicates are not identical\n
GLEAN3_01228	SPU_001228		Unknown CYP. Fragmentary. Last 3 or 4 exons of a P450 with insufficient homology to known proteins to identify. First exon may not be good.\n
GLEAN3_02899	SPU_002899		Partial CYP2-like gene. Near GLEAN3_02898, Sp-Cyp2-like8, suggesting possible tandem duplication as is know for other CYP2s in many species.\n
GLEAN3_01773	SPU_001773		Only the N-terminal region of the Query sequence is covered by the GLEAN3 model. \n \nThe first 42 residues of GLEAN3_01773 are unique whereas the remainder of the sequence is identical to and contained within GLEAN3_01777. \n
GLEAN3_18723	SPU_018723		Possible exta exon in the GLEAN3 model, length extended relative to the Query sequence used. \n \nGLEAN3_18723 is near exact match to GLEAN3_28113 with the exception of an extended N-terminal region.\n
GLEAN3_18795	SPU_018795		Posible missing exon middle region, length reduced relative to the Query sequence used.\n
GLEAN3_21197	SPU_021197		Possible missing exon C-terminal region.  The GLEAN3 model only covers the N-terminal region of the Query sequence used. \nGLEAN3_21197 is contained within GLEAN3_21198.\n
GLEAN3_21198	SPU_021198		GLEAN3_21198 contains GLEAN3_21197.\n
GLEAN3_28113	SPU_028113		Posible exon duplication.  The GLEAN3 model may have a duplication of the C-terminal region revealed by the alignment with the Query sequence used. \nGLEAN3_28113 is a near exact alignment to GLEAN3_18723 and is contained within it.\n
GLEAN3_01777	SPU_001777		Only the N-terminal region of the Query sequence is covered by the GLEAN3 model. \n \nGLEAN3_01773 is contained within GLEAN3_01777.\n
GLEAN3_04494	SPU_004494		A clear sequence match to Msh5 but with low coverage of the Query sequence.\n
GLEAN3_11199	SPU_011199		GLEAN3_11199 contains GLEAN3_21406.\n
GLEAN3_21406	SPU_021406		GLEAN3_21406 is a fragment of GLEAN3_11199.\n
GLEAN3_18944	SPU_018944		GLEAN3_18944 contains an extended c-terminus of low complexity sequence relative to the query sequence used.\n
GLEAN3_07033	SPU_007033		Fragment, missing C terminus due to incomplete scaffold\n
GLEAN3_28358	SPU_028358		Fragment, missing C terminus, possibly other exons due to incomplete scaffolds\n
GLEAN3_03760	SPU_003760		Allele: GLEAN3_03908\n
GLEAN3_03908	SPU_003908		Allele: GLEAN3_03760\n
GLEAN3_00064	SPU_000064		gi|68420855|ref|XP_700381.1|  PREDICTED: similar to Muscarinic acetylcholine receptor M3, partial  \n[Danio rerio]\n
GLEAN3_00078	SPU_000078		G-protein coupled receptor 88\n
GLEAN3_06016	SPU_006016		The first 60 aa of this glean number (KDIGRRLGLLEADLENIESDYPKQKERGYQMLLKWRQMTRNKDLVKTLVQGLQSVQRVDLADKYGPRFEALFPSEIESD) \npresents homology with the death domains of proteins from TNFR family \n \nHowever the rest of the sequence is more closely related to NOD/NALP proteins although the four last exons encode a sterol-desaturase domain which normally does not belong to this type of molecules.  \n \nAssembly problem must had occurred during the generation of this sequence\n
GLEAN3_00283	SPU_000283		NB: sequence identical to GLEAN3_07382\n
GLEAN3_07382	SPU_007382		NB: sequence identical to GLEAN3_00283\n
GLEAN3_23408	SPU_023408		end of Nek10 sequence. See GLEAN3_18375 from complete gene features\n
GLEAN3_18440	SPU_018440		GLEAN3_18441 predicts the first half of SND1 and GLEAN3_18440 has the rest of the gene.\n
GLEAN3_18441	SPU_018441		GLEAN3_18441 predicts the first half of SND1 and GLEAN3_18440 has the rest of the gene.\n
GLEAN3_26759	SPU_026759		GLEAN3_02501 is a partial duplicate prediction.\n
GLEAN3_02501	SPU_002501		Partial duplicate prediction for GLEAN3_26759\n
GLEAN3_02533	SPU_002533		This prediction is likely incorrect. There are at least two separate genes present in this GLEAN. Later half of the prediction matches the human MELK gene well. See the alignment.\n
GLEAN3_07194	SPU_007194		GLEAN3_07194, GLEAN3_24350 and GLEAN3_27411 are tudor domain containing proteins with weak homology to human tudor domain containing protein 1 (TDRD1). TDRD1 ortholog in urchin is represented by GLEAN3_17916. These may be novel tudor domain proteins or may be incorrect predictions.\n
GLEAN3_24350	SPU_024350		GLEAN3_07194, GLEAN3_24350 and GLEAN3_27411 are tudor domain containing proteins with weak homology to human tudor domain containing protein 1 (TDRD1). TDRD1 ortholog in urchin is represented by GLEAN3_17916. These may be novel tudor domain proteins or may be incorrect predictions.\n
GLEAN3_27411	SPU_027411		GLEAN3_07194, GLEAN3_24350 and GLEAN3_27411 are tudor domain containing proteins with weak homology to human tudor domain containing protein 1 (TDRD1). TDRD1 ortholog in urchin is represented by GLEAN3_17916. These may be novel tudor domain proteins or may be incorrect predictions.\n
GLEAN3_11603	SPU_011603		GLEAN3_11603 covers to 700 of 1087 residues in the Query.\n
GLEAN3_12136	SPU_012136		GLEAN3_12136 coverage limited to first 432 of 615 residues in the Query sequence used.\n
GLEAN3_19559	SPU_019559		GLEAN3_19559 contains Uba Domain N-terminal but by sequence match it is as named.\n
GLEAN3_22259	SPU_022259		GLEAN3_22259 missed 55 residues relative to Query.\n
GLEAN3_25405	SPU_025405		Possible missing upstream exon relative to Query.\n
GLEAN3_00486	SPU_000486		GLEAN3_00486 coverage of the Query is 267-477 of 608 aa protein sequence.  Additional regions of the Query are present on scaffold 26695.  Add exons to the features table.\n
GLEAN3_15411	SPU_015411		GLEAN3_25651 has the first part of the SKIV2L gene and GLEAN3_15411 has the latter half.\n
GLEAN3_25651	SPU_025651		GLEAN3_25651 has the first part of the SKIV2L gene and GLEAN3_15411 has the latter half.\n
GLEAN3_18262	SPU_018262		GLEAN3_18262 sequence is close but not identical to GLEAN3_28109 sequence.\n
GLEAN3_11552	SPU_011552		This prediction should be combined with Glean3_11551.  It contains exons encoding cub domains that are very likely to complete the C-terminal sequence of the astacin protease in 11551.\n
GLEAN3_07658	SPU_007658		From Pfam 19.0 \n \nAccession number: PF02301 \nHORMA domain \n \nThe HORMA (for Hop1p, Rev7p and MAD2) domain has been suggested to recognise chromatin states that result from DNA adducts, double stranded breaks or non-attachment to the spindle and acts as an adaptor that recruits other proteins. MAD2 is a spindle checkpoint protein which prevents progression of the cell cycle upon detection of a defect in mitotic spindle integrity.  \n
GLEAN3_16042	SPU_016042		Possible missing N-terminal coding exon relative to Query used.\n
GLEAN3_08287	SPU_008287		Blasts as myotubularin-related protein 2 isoform 2.\n
GLEAN3_09486	SPU_009486		e val = 2e-67 for NP_005724 \nAlmost exact match to XP_790534 : PREDICTED: similar to kinesin family member 20A, partial [Strongylocentrotus purpuratus]  \n1197 nt spread out over 8 or 9 exons. \nExon 8 may represent a false prediction of an exon or may include sequence errors.  All other exons were perfect matches to accession #  XP_790534.  With a 1 nucleotide shift, exon 8 is approximately 80% identical between the described GLEAN3_09486 exon 8 and XP_790534. \nSame sequence is found on Scaffoldi2484 from sp_20060316.asm. \nAnnotation by RA Obar, RL Morris, J Bhatia, BA Jeffrey, AM Musante, EJ Jin, BJ Rossetti and AP Rawson\n
GLEAN3_01874	SPU_001874		When blasted with mus, homo sapiens gene did not obtain same glean3 hit.\n
GLEAN3_22840	SPU_022840		e val for NP_524883=3e-100. \ne val for NP_612433=1e-63; kinesin family member 12 [Homo sapiens].   \nSimilarity to NP_612433 is based on overlap of C terminal half of Glean3_22840 with N terminus of 612433.  612433 contains only partial kinesin motor domain when compared with human CENP-E.  \nAnnotation by RA Obar, RL Morris, SA Tower, SC Cummings, EA Kovacs, and AP Rawson.  \n
GLEAN3_11918	SPU_011918		GLEAN3_25220 also has very good alignment.\n
GLEAN3_07768	SPU_007768		From SwissPro entry - \n"Interacts with BIRC4/XIAP. These two proteins are likely to coexist in a complex with TAK1, TRAF6, TAB1 and TAB2 (By similarity)."  \n
GLEAN3_21537	SPU_021537		#\nmyotubularin-related protein 9\n
GLEAN3_25276	SPU_025276		myotubularin related protein 12 \nNo myotubularin domain in this protein.\n
GLEAN3_14336	SPU_014336		From Swiss Pro  \n"May function as a ubiquitin-protein or polyubiquitin hydrolase. This deubiquitinating enzyme which functions at the endosome, is able to oppose the ubiquitin-dependent sorting of receptors to lysosomes (By similarity)."  \n
GLEAN3_26259	SPU_026259		May be PTPR10D. Partial sequence.  Contains PTP catalytic domain.\n
GLEAN3_22005	SPU_022005		From Swiss Pro \n"Probable component of the SCF (SKP1-CUL1-F-box protein) E3 ubiquitin ligase complex which mediates the ubiquitination and subsequent proteasomal degradation of target proteins involved in cell cycle progression, signal transduction and transcription. Through the RING-type zinc finger, seems to recruit the E2 ubiquitination enzyme to the complex and brings it into close proximity to the substrate. May play a role in protecting cells from apoptosis induced by redox agents. " \n
GLEAN3_26336	SPU_026336		#\nGLEAN3_26336 likely codes for part 1 of the DHX34 gene. \nGLEAN3_27857 likely codes for part 2 of the DHX34 gene. \nGLEAN3_11079 likely codes for part 3 of the DHX34 gene.\n
GLEAN3_27857	SPU_027857		GLEAN3_26336 likely codes for part 1 of the DHX34 gene. \nGLEAN3_27857 likely codes for part 2 of the DHX34 gene. \nGLEAN3_11079 likely codes for part 3 of the DHX34 gene.\n
GLEAN3_11079	SPU_011079		GLEAN3_26336 likely codes for part 1 of the DHX34 gene. \nGLEAN3_27857 likely codes for part 2 of the DHX34 gene. \nGLEAN3_11079 likely codes for part 3 of the DHX34 gene.\n
GLEAN3_10141	SPU_010141		Same as GLEAN3_00897.\n
GLEAN3_00897	SPU_000897		Same as GLEAN3_10141.\n
GLEAN3_12067	SPU_012067		Assemble fragments to obtain Query coverage \nGLEAN3_11485 N-terminal coverage \nGLEAN3_27393 also N-terminal from alternate region or query \nGLEAN3_12067 middle region of query covered \nGLEAN3_18381 C-terminus \n
GLEAN3_09949	SPU_009949		GLEAN3_09949 is a fragment of GLEAN3_00595.\n
GLEAN3_11022	SPU_011022		Comparison to best blast hit suggests that the gene model lacks both N- and C- terminal sequences.  Note: the best blast hit encodes a huge protein more than 2800 amino acids.\n
GLEAN3_01344	SPU_001344		Partial sequence.\n
GLEAN3_17796	SPU_017796	The BLAST results displayed a blank page. Excel data display incredibly long repeats	multiple ankyrin repeats in the encoded protein\n
GLEAN3_17839	SPU_017839		the encoded protein has several ankyrin repeats\n
GLEAN3_01143	SPU_001143		the encoded protein has several ankyrin repeats\n
GLEAN3_15973	SPU_015973		the encoded protein has several ankyrin repeats\n
GLEAN3_01707	SPU_001707		the encoded protein has several ankyrin repeats\n
GLEAN3_15269	SPU_015269		the encoded protein has several ankyrin repeats\n
GLEAN3_19767	SPU_019767		the encoded protein has several ankyrin repeats\n
GLEAN3_22497	SPU_022497		the encoded protein has several ankyrin repeats\n
GLEAN3_10321	SPU_010321		the encoded protein has several ankyrin repeats\n
GLEAN3_15601	SPU_015601		the encoded protein has several ankyrin repeats\n
GLEAN3_04926	SPU_004926		Partial sequence of DUSP4\n
GLEAN3_09401	SPU_009401		Sequence spans collagen and head domains.  Profile scan using ScanProsite identified C1q profile from residues 133 to 269 (score 28.379).\n
GLEAN3_21191	SPU_021191		May have an extra exon towards the end of the prediction.\n
GLEAN3_00282	SPU_000282		Possible missing c-terminual coding exon relative to query.\n
GLEAN3_07253	SPU_007253		predicted:similar to sterol regulatory element binding protein\n
GLEAN3_17267	SPU_017267		predicted: similar to  mucin19 in S.purp\n
GLEAN3_14795	SPU_014795		PREDICTED: similar to TRPC4-associated protein isoform b \n(transient receptor potential cation channel)\n
GLEAN3_23016	SPU_023016		PREDICTED: similar to Fras1 related extracellular matrix protein  \n1\n
GLEAN3_23856	SPU_023856		 \nThe AF-6 protein (also known as afadin and canoe) is a multidomain cell junction protein that contains two N-terminal Ras-associating (RA) domains in addition to FHA (forkhead-associated), DIL (class V myosin homology region), and PDZ domains and a proline-rich region\n
GLEAN3_09389	SPU_009389		Hypothetical protein similar to CG15216-PA gene model  \nNo other info known\n
GLEAN3_25634	SPU_025634		glean feature incomplete, missing exons predicted by genesh. \n
GLEAN3_09107	SPU_009107		it is homolog of Mus musculus RIKEN cDNA 2310067G05 gene, function is not known. If you know the gene function, you can rename it and take it.\n
GLEAN3_12221	SPU_012221		There are 6 to 8 copies of this protein in sea urchin genome based on blast search.\n
GLEAN3_22632	SPU_022632		The protein is shorter than its mouse homolog.\n
GLEAN3_05033	SPU_005033		There are about 7 more copies of Calm1 in sea urchin genome. GLEAN3_05032, GLEAN3_05033, GLEAN3_21511, GLEAN3_10425, GLEAN3_08000, GLEAN3_17814, GLEAN3_24195. \n
GLEAN3_18898	SPU_018898		#\nThis sequence was published in: Wedaman, K.P., Knight, A.E., Kendrick-Jones ,J. and Scholey, J.M. "Sequences of sea urchin kinesin light chain isoforms."  J. Mol. Biol. 231 (1), 155-158 (1993). \nThere are 4 known spliceoforms (mRNAs) encoded by this gene, "kinesin light chain isoform 1" through "isoform 4."  The one predicted in GLEAN3_18898 has been named  "kinesin light chain isoform 4" (KLC-4).\n
GLEAN3_23443	SPU_023443		VERY LARGE ADHESION PROTEIN - MAY BE A CONCATENATION \n \nLDLa x5 CCP2-EGFCa x3-NIDO-aAMOP-VWD-CA -EGF-many EGFCA-FOLN-TM \n \nNovel architecture\n
GLEAN3_21797	SPU_021797		GLEAN3_01260 is a partial duplicate prediction for GLEAN3_21797.\n
GLEAN3_13905	SPU_013905		GLEAN3_08049 and GLEAN3_13905 are duplicate predictions. \n
GLEAN3_08049	SPU_008049		GLEAN3_08049 and GLEAN3_ 13905 are duplicate predictions. \n
GLEAN3_13521	SPU_013521		GLEAN3_13521 is a partial duplicate prediction for GLEAN3_10382 (First 180 AA from both GLEAN's). Rest of 13521 does not appear to be similar to any protein in database.\n
GLEAN3_10338	SPU_010338		This prediction is most similar to DDX43, though it could be a different DDX protein. It certainly is like a DDX protein in any case.\n
GLEAN3_16062	SPU_016062		This GLEAN is almost certainly an incorrect version of the gene represented by GLEAN3_16061, which is a Tubulin binding cofactor A (TBCA) homolog.  These two (adjacent) GLEANs differ only in their predicted amino-termini, but the predicted amino-terminus of GLEAN3_16061 matches the rest of the proteins in the TBCA family well, while the predicted amino-terminus of GLEAN3_16062 does not.\n
GLEAN3_01260	SPU_001260		GLEAN3_01260 is a partial duplicate prediction for GLEAN3_21797.\n
GLEAN3_10463	SPU_010463		This gene is in three GLEAN's. GLEAN3_10463 has part 1 (to ~570 AA), GLEAN3_24100 has part 2 (from ~ 200-1246 AA) and GLEAN3_24101 has part 3 (~ 1028-1304 AA). AA numbers refer to human protein. In addition, GLEAN3_10463 prediction overlaps GLEAN3_24100 (~200-600 AA from 10463 ovelap with 1-396 AA from 24100). \n
GLEAN3_24100	SPU_024100		This gene is in three GLEAN's. GLEAN3_10463 has part 1 (to ~570 AA), GLEAN3_24100 has part 2 (from ~ 200-1246 AA) and GLEAN3_24101 has part 3 (~ 1028-1304 AA). AA numbers refer to human protein. In addition, GLEAN3_10463 prediction overlaps GLEAN3_24100 (~200-600 AA from 10463 ovelap with 1-396 AA from 24100). \n
GLEAN3_24101	SPU_024101		This gene is in three GLEAN's. GLEAN3_10463 has part 1 (to ~570 AA), GLEAN3_24100 has part 2 (from ~ 200-1246 AA) and GLEAN3_24101 has part 3 (~ 1028-1304 AA). AA numbers refer to human protein. In addition, GLEAN3_10463 prediction overlaps GLEAN3_24100 (~200-600 AA from 10463 ovelap with 1-396 AA from 24100). \n
GLEAN3_01832	SPU_001832		Even though this is only a partial prediction, it precisely matches human SF3B14 protein. The rest of the human SF3B14 protein is not represented by other GLEANS's. \nWGS clone SPWDP1E744370A may have the mussing 50 AA at end.\n
GLEAN3_28676	SPU_028676	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 3 different scaffolds. If the 3 scaffolds were combined, the sequence would have an overall orderly and continuous arrangement without any gaps or repeats present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	1 missing Kelch repeat (only 5 of 6 expected relative to query) could indicate incomplete model near c-terminus.\n
GLEAN3_21088	SPU_021088		Likely missing initial ~200 AA as compared to the human protein. \n \nGLEAN3_04163 is a duplicate prediction for GLEAN3_21088.\n
GLEAN3_15453	SPU_015453		GLEAN3_15453 is a partial duplicate prediction for GLEAN3_05681 and MAY represent a better model for the latter half of this gene.\n
GLEAN3_05681	SPU_005681		GLEAN3_15453 is a partial duplicate prediction for GLEAN3_05681 and MAY represent a better model for the latter half of this gene.\n
GLEAN3_12779	SPU_012779		GLEAN3_09634 is a duplicate prediction. May be missing first exon.\n
GLEAN3_09634	SPU_009634		GLEAN3_09634 is a duplicate prediction for GLEAN3_12779.\n
GLEAN3_08095	SPU_008095		GLEAN3_05311 is a duplicate prediction for GLEAN3_08095.\n
GLEAN3_18805	SPU_018805		From Swiss Prot entry \n"FUNCTION: Substrate-recognition component of the SCF (SKP1-CUL1-F-box protein)-type E3 ubiquitin ligase complex (By similarity)." \n
GLEAN3_10835	SPU_010835		Possibly missing the first exon.\n
GLEAN3_05311	SPU_005311		GLEAN3_05311 is a duplicate prediction for GLEAN3_08095.\n
GLEAN3_20121	SPU_020121		GLEAN3_20121 has the first part of the gene. GLEAN3_14430 has the rest of the gene.\n
GLEAN3_00801	SPU_000801		GLEAN3_23248 has first part of the gene. GLEAN3_00801 has the rest.\n
GLEAN3_23248	SPU_023248		GLEAN3_23248 has first part of the gene. GLEAN3_00801 has the rest.\n
GLEAN3_24643	SPU_024643		GLEAN3_24643 is a partial duplicate prediction for GLEAN3_24644.\n
GLEAN3_24644	SPU_024644		GLEAN3_24643 is a partial duplicate prediction for GLEAN3_24644.\n
GLEAN3_24273	SPU_024273		Phylogenetic analysis shows that this glean model is in a clade with human Niemann-Pick C1 which is a patched related protein.\n
GLEAN3_28882	SPU_028882		Phylogenetic analysis shows that this glean is highly similar to Glean3_24273.  They both are in a clade with human Niemann Pick C1.\n
GLEAN3_27985	SPU_027985		GLEAN3_26660 has the same hit.\n
GLEAN3_24492	SPU_024492		GLEAN3_22249 has the same hit.\n
GLEAN3_17600	SPU_017600		May be missing an exon at the beginning and end.\n
GLEAN3_00170	SPU_000170		GLEAN3_00170 has the first part of the gene and GLEAN3_00171 has the rest.\n
GLEAN3_02448	SPU_002448		same as glean3_13119.\n
GLEAN3_23519	SPU_023519		GLEAN3_23520 is a partial duplicate prediction of GLEAN3_23519.\n
GLEAN3_23520	SPU_023520	From the BLAST results and the excel data, it is evident that the entire sequence is distributed onto 3 different scaffolds. The when examining the excel data it appears that if the three scaffolds were to be combined, the overall sequence would have an orderly and continuous arrangement. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 6. 	GLEAN3_23520 is a partial duplicate prediction of GLEAN3_23519.\n
GLEAN3_00171	SPU_000171		GLEAN3_00170 has the first part of the gene and GLEAN3_00171 has the rest.\n
GLEAN3_07870	SPU_007870	After reviewing the data and performing BLAST searches, it was determined that no orderly GLEAN model fit sufficiently. The best results were on >v2.1_scaffold11280, however, only 1026 base pairs matched out of a total of 6753 indicating that the sequence did not map very well. There was EST information available on GBrowse V0.5 and the transcriptome intensity scores appeared to be strong. \nAdditional information from Baylor page (comments regarding gene information):\nThis GLEAN MAY represent the RANBP2 ortholog in Urchin. RANBP2 in humans encodes a very large RAN-binding protein that immunolocalizes to the nuclear pore complex. 	This GLEAN MAY represent the RANBP2 ortholog in Urchin. RANBP2 in humans encodes a very large RAN-binding protein that immunolocalizes to the nuclear pore complex.\n
Sp-185/333-01	SPU_030144		A partial gene on the end of the scaffold that includes the leader, and the start of the open reading frame (Elements 1-2).  \n
Sp-185/333-02	SPU_030145		Scaffold65222 is entirely 185/333 sequence, but the scaffold starts in the intron, so there is no start codon.  The sequence is element pattern most likely C4, although the scaffold sequence ends just before the putative stop codon.\n
Sp-185/333-03	SPU_030146		Partial gene: Leader, Intron, elements 1-3.\n
GLEAN3_02645	SPU_002645		This GLEAN MAY code for SRPK1.\n
GLEAN3_27906	SPU_027906		Portion of derived peptide sequence matches c-lectin domain (smart00034-E value=3e-04). \n \nExpressed in PMC est libraries.  On same scaffold as PM27.\n
glean3_28945	SPU_030147		Partial gene--continues off the beginning of Scaffold1445.  Did not appear in the original glean3_XXXXX models.\n
GLEAN3_22984	SPU_022984		GLEAN3_22984 and GLEAN3_24392 are both likely candidates for SFRS8. They internally have a significant overlap as well.\n
GLEAN3_24392	SPU_024392		GLEAN3_22984 and GLEAN3_24392 are both likely candidates for SFRS8. They internally have a significant overlap as well.\n
GLEAN3_17377	SPU_017377		GLEAN3_17377 is a duplicate prediction for GLEAN3_22855.\n
GLEAN3_22855	SPU_022855		GLEAN3_17377 is a duplicate prediction for GLEAN3_22855.\n
GLEAN3_10081	SPU_010081		e val = e -136 for NP_878906. \nThis peptide is identical in length (476aas) and sequence to Glean3_00875 on scaffold 113994. \nAnnotated by RA Obar, BD Dyer, RL Morris.\n
GLEAN3_25021	SPU_025021		Possible duplicated gene, GLEAN3_27654\n
GLEAN3_27654	SPU_027654		Possible duplicated gene, GLEAN3_25021\n
GLEAN3_01555	SPU_001555		Possible assemble error, GLEAN3_08070 maybe belongs to 3' of this gene\n
GLEAN3_08070	SPU_008070		Possible assemble error, GLEAN3_01555 maybe belongs to 5' of this gene\n
GLEAN3_13727	SPU_013727		GLEAN3_13727 has first part of the LSM14A gene and GLEAN3_13728 has the latter half.\n
GLEAN3_13728	SPU_013728		GLEAN3_13727 has first part of the LSM14A gene and GLEAN3_13728 has the latter half.\n
GLEAN3_25049	SPU_025049		GLEAN3_16338 is a duplicate prediction for GLEAN3_25409\n
GLEAN3_16338	SPU_016338		GLEAN3_16338 is a duplicate prediction for GLEAN3_25409\n
GLEAN3_25266	SPU_025266		Missing N-terminus.  N-terminus is GLEAN3_28560.  \n
GLEAN3_28560	SPU_028560		missing C-terminus.  C-terminus is predicted in GLEAN3_25266.  Center doamin is overlapped.  \n
GLEAN3_20437	SPU_020437		GLEAN3_03537 has first part of USP52 and GLEAN3_20437 has the rest.\n
GLEAN3_03537	SPU_003537		GLEAN3_03537 has first part of USP52 and GLEAN3_20437 has the rest.\n
GLEAN3_08448	SPU_008448		This GLEAN3 prediction is likely to be incorrect. \n
GLEAN3_28580	SPU_028580		GLEAN3_28452 is a duplicate prediction for GLEAN3_28580.\n
GLEAN3_28452	SPU_028452		GLEAN3_28452 is a duplicate prediction for GLEAN3_28580.\n
GLEAN3_27408	SPU_027408		GLEAN3_27408 has the first part and GLEAN3_16447 has the rest of the EXOSC10 gene. In addition, GLEAN3_27408 and GLEAN3_16447 share a significant partially identical overlap.\n
GLEAN3_16447	SPU_016447		GLEAN3_27408 has the first part and GLEAN3_16447 has the rest of the EXOSC10 gene. In addition, GLEAN3_27408 and GLEAN3_16447 share a significant partially identical overlap.\n
GLEAN3_24074	SPU_024074	After reviewing the data, it appears that there is no sufficient match for this particular GLEAN model.  There appears to be no orderly arrangement of the sequence and poor coverage. The sequence appears to be distributed onto at least 3 different scaffolds. There was no Est information available and the transcriptome score intensities appear to be widely distributed (values from 4-36). This is an un-annotated gene so no additional information was available.	2 different genes.  One portion is similar to Solute carrier family 23, member 1. The other portion is similar to PTPRT (Receptor type protein tyrosine phosphatase T).  In phylogenetic analysis, the PTPRT portion does not clade with the PTPR K/M/T/U group.  It formed a unique clade with Glean3_27290...both were renamed PTPRW.\n
GLEAN3_06216	SPU_006216		GLEAN3_19209 is a significant partial duplicate prediction for GLEAN3_06216.\n
GLEAN3_19209	SPU_019209	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. If the 2 scaffolds were to be combined, the sequence would have an orderly and continuous arrangement without any repeats or gaps present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values in the 5-10 range	GLEAN3_19209 is a significant partial duplicate prediction for GLEAN3_06216.\n
GLEAN3_22282	SPU_022282		May be the PTP domain of PTPRA. Partial gene.  Best hit was PTPSp8.\n
GLEAN3_09557	SPU_009557		Partial sequence. PTPN3? PTPN4?\n
GLEAN3_02877	SPU_002877		GLEAN3_02877, 06741, 17776, 17115 and 10544 are partially identical duplicate predictions of varying degree.\n
GLEAN3_06741	SPU_006741		GLEAN3_02877, 06741, 17776, 17115 and 10544 are partially identical duplicate predictions of varying degree.\n
GLEAN3_17776	SPU_017776		GLEAN3_02877, 06741, 17776, 17115 and 10544 are partially identical duplicate predictions of varying degree.\n
GLEAN3_17115	SPU_017115		GLEAN3_02877, 06741, 17776, 17115 and 10544 are partially identical duplicate predictions of varying degree.\n
GLEAN3_10544	SPU_010544		GLEAN3_02877, 06741, 17776, 17115 and 10544 are partially identical duplicate predictions of varying degree.\n
GLEAN3_26886	SPU_026886		There may be an extra exon(s) in the prediction.\n
GLEAN3_17940	SPU_017940		Partial sequence.\n
GLEAN3_07363	SPU_007363		Partial sequence.\n
GLEAN3_14401	SPU_014401		The following exon prediction is probably incorrect as it codes for another peptide sequence found in a different class of proteins. \n>GLEAN3_14401|Scaffold442|42154|42721| DNA_SRC: Scaffold442 START: 42154 STOP: 42721 STRAND: +  \nATGGAAGAGCAAATCACCGCAAATCTTTTTAATTGCTCTATCATGAATGCTATGCCCTACGACGATGATA \nACTTTGAGTCGCCATCGACATCACCACCTACATACGCCGAGCTCACACCCGCTGTCAATCACACTTTCAA \nTCACGGCAACATCAATTTTGATCACAACACCAGTTACGACGACGGCAACATAAGATATGAACACGACAAC \nAGCAACCATAACTTTGACGAACAAGTACCCTTGAGCACCGCGCATCTTCTTGACATCTTATCGACGACGG \nATGTCGACATCAACAATATCGCAAATGACGGGGAGGAAGAGGGAAGCGACGAGGGGAGCGAACTCGCAGC \nGTATCTCTTTCAGAATTCGGAATGGATTACGAATAACGCGACTTTAGACGATTCTCAATATTCAACTGCA \nGTTAACGGTGACCCGCAACACTTTCAGAGTTGCTACACGAATAAGTCCATGGGCTATGGCAACACTTCGT \nTCAACAGCAGCTATCATGAGGCTCACACCTTGCCACAAGTACCTTATTTTGGACATACTGATACTCAACA \nTGCTCAAG \n
GLEAN3_10793	SPU_010793		missing N-terminus residues, partial\n
GLEAN3_02993	SPU_002993		GLEAN3_02993 has the first 1/3rd of the gene. GLEAN3_25005 has the last 1/3rd. Middle part appears to be missing.\n
GLEAN3_25005	SPU_025005		GLEAN3_02993 has the first 1/3rd of the gene. GLEAN3_25005 has the last 1/3rd. Middle part appears to be missing.\n
GLEAN3_23532	SPU_023532		Domains: DEATH, NACHT, LRRs \n
GLEAN3_25680	SPU_025680		#\nDomains: DEATH, NACHT, LRRs \n
GLEAN3_28681	SPU_028681		Domains: DEATH, NACHT, LRRs\n
GLEAN3_03200	SPU_003200		Domains: DEATH, NACHT, LRRs \n
GLEAN3_09017	SPU_009017		#\nDomains: DEATH, NACHT, LRRs\n
GLEAN3_14128	SPU_014128		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_04053	SPU_004053		Domains: DEATH, NACHT, LRRs\n
GLEAN3_02641	SPU_002641		Domains: DEATH, NACHT, LRRs\n
GLEAN3_26921	SPU_026921		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_25914	SPU_025914	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. These two scaffolds had the highest bit score and the lowest e-values compared to the rest of the BLAST results. When reviewing the excel data and comparing it to the BLAST results, it is apparent that it these two scaffolds were to be combined the sequence would have an orderly continuous arrangement, without any gaps or internal repeats present. There was no Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10	Domains: DEATH, NACHT, LRRs.\n
GLEAN3_15340	SPU_015340		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_17054	SPU_017054		#\nDomains: DEATH, NACHT, LRRs.\n
GLEAN3_17993	SPU_017993		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_23628	SPU_023628		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_09111	SPU_009111		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_22780	SPU_022780		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_02868	SPU_002868		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_20380	SPU_020380		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_27858	SPU_027858		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_15972	SPU_015972		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_02436	SPU_002436		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_24975	SPU_024975		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_03715	SPU_003715		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_21243	SPU_021243		Domains: DEATH, NACHT, LRRs.\n
Sp-VEGF-3	SPU_030148		This model was created based on RACE sequence (5'end) and completed based on a Fgenesh++ model (S.P_Scaffold78.seq.N000007).\n
GLEAN3_21148	SPU_021148		Found by Tandem Mass spectrometry of S. purpuratus sperm membranes\n
GLEAN3_06122	SPU_006122		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm).\n
GLEAN3_20008	SPU_020008		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: Identical models were generated by other gene prediction protocols.\n
GLEAN3_07020	SPU_007020		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm).\n
GLEAN3_00764	SPU_000764		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: The exon structure and embryonic expression of this gene are supported by the genome-wide tiling array hybridization data.\n
GLEAN3_15127	SPU_015127		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: This model lacks the SAM domains found in other members of the Sp-Sarm-related subfamily of genes (Sp-Sarm 1-4). Accordingly, it shows a weaker clustering with these models. The analysis of the scaffold in which this model is located does not reveal any obvious missing sequence, which would suggest a true loss of this domain.\n
GLEAN3_21841	SPU_021841		This gene was modified and annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nThis model was modified based on otherwise identical Fgenesh++/AB predictions that incorporate an additional C-term exon. This exon is supported by the tiling array data, and it codes for a (sub-optimal) SAM domain, which is present in other members of this subfamily of Sp-Sarm related genes. \n \nNB: The structure and embryonic expression of this gene is supported by the embryonic tiling array data.\n
GLEAN3_18859	SPU_018859		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models (NB: this model is one example, it is located at the end of the scaffold and there might be missing N-ter sequence). It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm).\n
GLEAN3_27640	SPU_027640		This gene was modified and annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nThis model was modified based on an otherwise identical BCM:Gene prediction that incorporates an additional N-term exon. This additional exon codes for sequence that includes a sub-optimal prediction for an Armadillo/b-catenin domain, which is present in other Sp-Sarm-related genes. The additional exon corresponds to an adjacent glean model (GLEAN3_27639) and thus the modified version of this model fuses both glean models.\n
GLEAN3_18168	SPU_018168		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm).\n
GLEAN3_03495	SPU_003495		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: The exon structure and embryonic expression of this model are strongly supported by the tiling-array data.\n
GLEAN3_04557	SPU_004557		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: The exon structure and embryonic expression of this model are strongly supported by the tiling array data.\n
GLEAN3_04107	SPU_004107		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm).\n
GLEAN3_08302	SPU_008302		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models (this model is one such example; there may be some missing N-ter sequence). It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nNB: The exon structure and embryonic expression of this model are strongly supported by tiling array data.\n
GLEAN3_07088	SPU_007088		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nThis and all other Sp-SARM-related genes cluster "tightly" with Sp-Sarm (GLEAN3_11042) and other Sarm genes in MSA trees built based on the sequence of their TIR domains. In addition, most of Sp-Sarm-related genes have similar domain compositions to that of Sarm genes (TIR, Armadillo/b-catenin and SAM domains), and those that do not are often located at the end of their respective scaffold and might be incomplete models. It should be noted, however, that the subfamily of Sarm-related genes shows a different domain organization than that of Sarms (including Sp-Sarm). \n \nThis model also includes a predicted transmembrane domain, a feature not seen in other members of the subfamily of Sp-Sarm-related models. \n \nNB: The exon structure and embryonic expression of this model are strongly supported by tiling array data. Moreover, some tiling array signal falls into introns of this prediction, and there could therefore exist more exonic sequence that was not called for by the prediction protocols.\n
GLEAN3_16014	SPU_016014		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). \n \nNB: This model is located in a small scaffold. Therefore, there could be missing sequence towards both the N-ter and C-ter ends of the model.\n
GLEAN3_13299	SPU_013299		This seems a duplication of GLEAN3_07952. Please see GLEAN_07952 for details. \n \nNote that the adjacent GLEAN3_13298 is also most likely a duplication of GLEAN3_07951.\n
GLEAN3_07952	SPU_007952		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). The TIR domain in this model showed a weak co-segregation with TIR-domains from MyD88 genes in a MSA tree. However its domain composition/structure does not resemble any of the MyD88 genes. \n \nNB: The exon structure and embryonic expression of this model are partly supported by tiling array data. Also note this model seems duplicated in GLEAN 3_13299 (99% seq identity at aminoacid level).\n
GLEAN3_12671	SPU_012671		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). This model, however, co-clustered tightly with three other Sp-TIR-containing models (Sp-TIR-c5,6,7). Despite this strong co-seggregation, Sp-TIR-c4-7 do not show any characteristic domain structure/composition.\n
GLEAN3_14926	SPU_014926		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species).\n
GLEAN3_03608	SPU_003608		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). This model, however, co-clustered tightly with three other Sp-TIR-containing models (Sp-TIR-c4,6,7). Despite this strong co-seggregation, Sp-TIR-c4-7 do not show any characteristic domain structure/composition. \n \nNB: The exon structure and embryonic expression of this model are strongly supported by tiling array data.\n
GLEAN3_13352	SPU_013352		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). This model, however, co-clustered tightly with three other Sp-TIR-containing models (Sp-TIR-c4,5,7). Despite this strong co-seggregation, Sp-TIR-c4-7 do not show any characteristic domain structure/composition. \n \nNB: This model is located at the end of a scaffold and represents a partial model (there is no ATG at th N-ter of the sequence).\n
GLEAN3_20131	SPU_020131		This gene was annotated based on a manual inspection of multiple protein sequence alignments and domain structures. \n \nSp-TIR-containing models do not show any characteristic domain composition, nor do they cluster significantly with other TIR-containing genes (in S.purpuratus or any other species). This model, however, co-clustered tightly with three other Sp-TIR-containing models (Sp-TIR-c4-6). Despite this strong co-seggregation, Sp-TIR-c4-7 do not show any characteristic domain structure/composition. \n \nNB: Some exons of this model show very strong signal in the tiling array embryonic hybridization experiment, which supports the embryonic expression of this model.\n
GLEAN3_12667	SPU_012667		This model was annotated based on a manual inspection of sequence alignments and domain composition. \n \nWe named this model with the sufix "a" (and not a number) to avoid the misleading assumption that this model would represent any specific ortholog of vertebrate SAA genes (e.g. SAA1). \n \nThis model is very similar in sequence to the adjacent GLEAN3_12668 (Sp-Saa-b). However, there are enough differences between them both at the nt and aa level, and therefore they most likely represent two different genes. \n
GLEAN3_12668	SPU_012668		This model was annotated based on a manual inspection of sequence alignments and domain composition. \n \nWe named this model with the sufix "b" (and not a number) to avoid the misleading assumption that this model would represent any specific ortholog of vertebrate SAA genes (e.g. SAA2). \n \nThis model is very similar in sequence to the adjacent GLEAN3_12667 (Sp-Saa-a). However, there are enough differences between them both at the nt and aa level, and therefore they most likely represent two different genes. \n
GLEAN3_14752	SPU_014752		This model was annotated based on reciprocal blasting and similarity of domain structure/organization with vertebrate Nck. \n \nIts embryonic expression is partly supported by signal from the tiling array hybridization data.\n
GLEAN3_00045	SPU_000045		Shows significant homology to human Ficolin-1.  Sequence overlap occurs in the fibringen domain.\n
GLEAN3_20449	SPU_020449		 fragment\n
GLEAN3_20501	SPU_020501		 fragment\n
GLEAN3_20584	SPU_020584		 fragment\n
GLEAN3_20736	SPU_020736		 fragment\n
GLEAN3_20959	SPU_020959		 tiny fragment\n
GLEAN3_21072	SPU_021072		 fragment\n
GLEAN3_21329	SPU_021329		 tiny fragment\n
GLEAN3_21353	SPU_021353		 fragment\n
GLEAN3_21423	SPU_021423		 partial, missing C-terminus half\n
GLEAN3_21534	SPU_021534		 fragment\n
GLEAN3_21540	SPU_021540		 tiny fragment\n
GLEAN3_21636	SPU_021636		 fragment\n
GLEAN3_21657	SPU_021657		 missing N- and C-terminus\n
GLEAN3_21711	SPU_021711		 fragment\n
GLEAN3_21836	SPU_021836		 only N-terminal fragment\n
GLEAN3_21984	SPU_021984		 tiny fragment\n
GLEAN3_22014	SPU_022014		 fragment\n
GLEAN3_22413	SPU_022413		 fragment\n
GLEAN3_23181	SPU_023181		 fragment\n
GLEAN3_23246	SPU_023246		 fragment\n
GLEAN3_23276	SPU_023276		 fragment\n
GLEAN3_23521	SPU_023521		 tiny fragment\n
GLEAN3_23818	SPU_023818		 fragment, missing N-terminus half\n
GLEAN3_23997	SPU_023997		 fragment\n
GLEAN3_24225	SPU_024225		 fragment\n
GLEAN3_24330	SPU_024330		 fragment, missing N-terminus region and a stretch in middle\n
GLEAN3_24422	SPU_024422		 tiny fragment\n
GLEAN3_24573	SPU_024573		 fragment\n
GLEAN3_24701	SPU_024701		 tiny fragment\n
GLEAN3_24730	SPU_024730		 tiny fragment\n
GLEAN3_24872	SPU_024872		 small fragment\n
GLEAN3_24892	SPU_024892		 fragment\n
GLEAN3_24939	SPU_024939		 fragment\n
GLEAN3_25048	SPU_025048		 fragment\n
GLEAN3_25189	SPU_025189		 fragment\n
GLEAN3_25694	SPU_025694		 fragment\n
GLEAN3_25798	SPU_025798		 missing C-terminus residues\n
GLEAN3_25847	SPU_025847		 small fragment\n
GLEAN3_25877	SPU_025877		 small fragment\n
GLEAN3_26128	SPU_026128		 fragment\n
GLEAN3_26792	SPU_026792		 fragment\n
GLEAN3_27323	SPU_027323		 missing N-terminus residues\n
GLEAN3_27582	SPU_027582		 fragment\n
GLEAN3_28841	SPU_028841		 fragment\n
GLEAN3_11626	SPU_011626		 fragment\n
GLEAN3_19005	SPU_019005		 fragment\n
GLEAN3_03531	SPU_003531		 extra C-terminus residues\n
GLEAN3_04263	SPU_004263		 fragment\n
GLEAN3_09652	SPU_009652		 fragment\n
GLEAN3_09845	SPU_009845		 fragment\n
GLEAN3_15734	SPU_015734		 tiny fragment\n
GLEAN3_17456	SPU_017456		 fragment\n
GLEAN3_18249	SPU_018249		 fragment\n
GLEAN3_24068	SPU_024068		 fragment\n
GLEAN3_26707	SPU_026707		 fragment\n
GLEAN3_28520	SPU_028520		 fragment\n
GLEAN3_28870	SPU_028870		 fragment\n
GLEAN3_00214	SPU_000214		 missing some N-terminus residues\n
GLEAN3_01700	SPU_001700		 fragment\n
GLEAN3_02107	SPU_002107		 fragment\n
GLEAN3_02203	SPU_002203		 fragment\n
GLEAN3_02274	SPU_002274		 fragment\n
GLEAN3_02980	SPU_002980		 tiny fragment\n
GLEAN3_03017	SPU_003017		 fragment\n
GLEAN3_03265	SPU_003265		 fragment\n
GLEAN3_03408	SPU_003408		 missing N-terminus region\n
GLEAN3_03563	SPU_003563		 fragment\n
GLEAN3_03804	SPU_003804		 fragment\n
GLEAN3_04207	SPU_004207		 fragment\n
GLEAN3_04637	SPU_004637		 fragment\n
GLEAN3_05345	SPU_005345		 partial\n
GLEAN3_06178	SPU_006178		 fragment\n
GLEAN3_07650	SPU_007650		 fragment\n
GLEAN3_07810	SPU_007810		 fragment\n
GLEAN3_07990	SPU_007990		 small fragment\n
GLEAN3_08078	SPU_008078		 fragment\n
GLEAN3_08090	SPU_008090		 tiny fragment\n
GLEAN3_08096	SPU_008096		 fragment\n
GLEAN3_08341	SPU_008341		 partial\n
GLEAN3_08681	SPU_008681		 fragment\n
GLEAN3_09010	SPU_009010		 fragment\n
GLEAN3_09101	SPU_009101		 fragment\n
GLEAN3_09349	SPU_009349		 fragment\n
GLEAN3_09393	SPU_009393		 fragment\n
GLEAN3_09442	SPU_009442		 partial, missing N-terminus region\n
GLEAN3_09533	SPU_009533		 partial\n
GLEAN3_09631	SPU_009631		 fragment\n
GLEAN3_09694	SPU_009694		 tiny fragment\n
GLEAN3_04300	SPU_004300		This is a partial sequence.  Does not form a clade with human Ppm1g in phylogenetic analysis of the PP2C subfamily of PPM phosphatases.   Most similar to GLEAN3_14625.\n
GLEAN3_05327	SPU_005327		Partial sequence.  This is N-terminal to GLEAN3_04300. Identification is partially based on the EST sequence matching these two sequences.\n
GLEAN3_28868	SPU_028868		Partial sequence.\n
GLEAN3_06956	SPU_006956		Partial sequence containing PP2Ac domain.\n
GLEAN3_19872	SPU_019872		Most of the exons encoded; exon 10  is identical to GLEAN3_04123 exon 2 while the final exon (11) is also in GLEAN3_04123 (bases 2099-2275). \n
GLEAN3_17129	SPU_017129	From the BLAST results as well as the excel data, it is evident that this is the best fit for this particular GLEAN model. When reviewing the excel data in comparison with the BLAST results it was clear that this scaffold contained an orderly arrangement without any large gaps present. However, the scaffold did not begin until the 36th base pair but this was the only missing sequence information. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 5. 	#\nDomains: NACHT, LRRs. This gene model is at the end of a scaffold. Could be incomplete.\n
GLEAN3_00896	SPU_000896		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_04043	SPU_004043		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_17038	SPU_017038		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_17708	SPU_017708		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_22294	SPU_022294		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_01392	SPU_001392		NOTE: Sequence and alignment suggest this is the 5' end of a single eIF-5B gene, comprised of GLEANs 01392 (C-Term) and 01393 (N-term).\n
GLEAN3_01393	SPU_001393		NOTE: Sequence and alignment suggest this is the 5' end of a single eIF5B gene, comprised of GLEANs 01392 (C-Term) and 01393 (N-term). \n
GLEAN3_01394	SPU_001394		NOTE-PARTIALseq--identical to middle of eIF5B (glean 01392)\n
GLEAN3_00506	SPU_000506		PARTIAL.  When aligned to human SELB, this contains the start site, but ends before Glean22870 begins --(22870) is the "REST" of SELB\n
GLEAN3_22870	SPU_022870		PARTIAL.  When aligned to human SELB, this contains most of SELB, but without the 5'end.  Note--GLEAN 00506 appears to contain the start site and ends before this glean begins.  \n
GLEAN3_16991	SPU_016991		NOTE partial sequence which is a duplication of the middle of GLEAN 22870 (the bulk of SELB).  \n
GLEAN3_03646	SPU_003646		GLEAN3_09292 is identical to 3' end of this sequence\n
GLEAN3_09292	SPU_009292		partial--Duplication of 3' part of Glean 03646-eIF2alpha\n
GLEAN3_06351	SPU_006351		Sp1200 Bacterially activated coelomocyte, arrayed \ncDNA clone Sp1200 5' similar to amassin, mRNA sequence.\n
GLEAN3_23924	SPU_023924		This is the same as GLEAN3_06351, with the addition of an extra exon, and some additional sequences in one of the other exons. Considered to be an allele of GLEAN3_06351. mRNA obtained from coelomocytes exposed to bateria.\n
GLEAN3_23548	SPU_023548		Significant homology to human ficolin2 (FCN2).  Overlaps occur at fibrinogen C-terminal domain (Expasy annotation)\n
GLEAN3_04912	SPU_004912		GLEAN3_02231 is a duplicate prediction for GLEAN3_04912.\n
GLEAN3_21032	SPU_021032		GLEAN3_21032 is a partial duplicate prediction for GLEAN3_08786.\n
GLEAN3_08786	SPU_008786		GLEAN3_21032 is a partial duplicate prediction for GLEAN3_08786.\n
GLEAN3_10773	SPU_010773		First ~400 aa from the human protein have no homology in urchin GLEAN's.\n
GLEAN3_05511	SPU_005511		Identified from cDNA clone from gram negative bacterially activated coelomocyte of sea urchin.\n
GLEAN3_13960	SPU_013960		Identified from cDNA clone from gram negative bacterially activated coelomocyte of sea urchin.\n
GLEAN3_27883	SPU_027883		GLEAN3_13201 has the first part of the DHX36 gene. GLEAN3_27883 has the rest of the gene.\n
GLEAN3_13201	SPU_013201		GLEAN3_13201 has the first part of the DHX36 gene. GLEAN3_27883 has the rest of the gene.\n
GLEAN3_27440	SPU_027440		Partial duplicate prediction for GLEAN3_27883.\n
GLEAN3_13786	SPU_013786	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 3 different scaffolds. However, there are numerous gaps and repeats within the scaffolds and if the three scaffolds were combined the overall sequence coverage would be poor. There was some Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be 	There are extra exon(s) at the end of the GLEAN.\n
GLEAN3_08600	SPU_008600	After reviewing the data and performing a BLAST search, it appears that this is the best fit for this particular GLEAN model. The BLAST results indicated that this was the second best results based on bit score and e-value. However, this scaffold has more coverage furthermore; the differences between the bit scores and e-values were very close compared to that of the first results. There are several gaps present within this sequence, specifically from 913-987, 1083-1123 and 1416-1496. This is an un-annotated gene so no additional gene information was available from Baylor annotations (comments). There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being <10	GLEAN3_08600 is a duplicate partial prediction for GLEAN3_13786.\n
GLEAN3_14106	SPU_014106		GLEAN3_14106 shows a significant partial overlap with GLEAN3_26336 which codes for the first part of DHX34. First 170 aa from GLEAN3_14106 show no significant homology to human proteins.\n
GLEAN3_08946	SPU_008946		GLEAN3_14505 has the first part of DDX56 and GLEAN3_08946 has the rest.\n
GLEAN3_14505	SPU_014505		GLEAN3_14505 has the first part of DDX56 and GLEAN3_08946 has the rest.\n
GLEAN3_12487	SPU_012487		GLEAN3_12487 is a partial duplicate prediction for GLEAN3_08946.\n
GLEAN3_16488	SPU_016488		GLEAN3_16488 is a partial duplicate prediction for GLEAN3_16467.\n
GLEAN3_02662	SPU_002662		partial sequence containing TGF-beta domain, it has the identical c terminal end as Glean3_12786, but appears to be different towards it's N terminal end\n
GLEAN3_22079	SPU_022079		contains TGFbeta domain\n
GLEAN3_28397	SPU_028397		GLEAN3_16552 has one part and GLEAN3_28397 appears to have the last part of DDX42.\n
GLEAN3_16552	SPU_016552		GLEAN3_16552 has one part and GLEAN3_28397 appears to have the last part of DDX42.\n
GLEAN3_21726	SPU_021726		contains TGFbeta_propeptide domain\n
GLEAN3_22653	SPU_022653		partial sequence, contains TGFbeta_propeptide domain\n
GLEAN3_26612	SPU_026612		Similar to other catalitic subunits of telomerase\n
GLEAN3_09443	SPU_009443		e val for NP_065867= 9e-54; kinesin family member 17 [Homo sapiens].  \nIn the alignment with the best human hit, there is a gap of 13 aa in Glean3_09443 which corresponds to a gap of 374 aa in the NP_065867. \nAnnotation by RA Obar, RL Morris, LE Shorey, SA Tower, and B Rossetti.\n
GLEAN3_26884	SPU_026884		This GLEAN3 sequence is apparently missing some of the beginning sequences of the other Sp-amassins. Note the sequences in the alignments. The data against which it was compared came from the coelomocytes of a bacterially activated sea urchin.\n
GLEAN3_00409	SPU_000409		This is likely a duplication of GLEAN3_13950 (98+% ID ata aa level). Please see GLEAN3_13950 for details.\n
GLEAN3_05998	SPU_005998		GLEAN3_05998 and GLEAN3_19123 code for DDX52. They have a partial identical overlap that may have a haplotype as well.\n
GLEAN3_19123	SPU_019123		GLEAN3_05998 and GLEAN3_19123 code for DDX52. They have a partial identical overlap that may have a haplotype as well.\n
GLEAN3_00362	SPU_000362		Potential MASP\n
GLEAN3_03126	SPU_003126		This model was annotated on a manual inspection of sequence alignments and domain structure. \n \nThis model shows a very high degree of similarity to part of an adjacent model (GLEAN3_03127) corresponding to one of the Sp-SRCR genes. The region of similarity includes both the extra and intracellular juxtamembrane regions, the transmembrane domain and a long cytoplasmic tail of low complexity, but excludes the SRCR domains, which are replaced by 2xEGF + 2xIG domains. \n \nThe position and orientation of both models in a single uninterrupted contig suggests that they did not originate as an assembly problem, but that they may represent a true gene duplication/divergence event. Of note, a very similar situation can be seen with GLEAN3_22566/GLEAN3_22567.\n
GLEAN3_22566	SPU_022566		This model was annotated on a manual inspection of sequence alignments and domain structure. \n \nThis model shows a very high degree of similarity to part of three adjacent models (GLEAN3_22567-9) corresponding to Sp-SRCR genes. The region of similarity includes both the extra and intracellular juxtamembrane regions, the transmembrane domain and a long cytoplasmic tail of low complexity, but excludes the SRCR domains, which are replaced by EGF+IG domains. \n \nThe position and orientation of these models in long, uninterrupted contigs suggests that they did not originate as an assembly problem, but that they may represent true gene duplication/divergence events. Of note, a very similar situation was seen with GLEAN3_03126/GLEAN3_03127.\n
GLEAN3_09374	SPU_009374		This GLEAN3 model encodes a partial ORC6 sequence in which exon 3 is probably missing due to an inappropriate fusion of contigs. See GLEAN_05343 for complete ORC6.\n
GLEAN3_06096	SPU_006096		An incomplete MCM2 gene sequence is also find on GLEAN3_11491\n
GLEAN3_06848	SPU_006848	For this particular GLEAN model there was no Cds information available from either SpBase or Baylor annotations. However, there was in fact mRNA information available from SpBase. When reviewing the excel data, it appears that the sequence could be distributed onto 2 different scaffolds. Specifically onto subject gb|DS015099| and AAGJ02106196. Subject gb|DS015099 has two different base pair duplicates that occur within the sequence. Besides the internal repeats present, both scaffolds appear to have an orderly arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with all the values being greater than 10.	view GLEAN_12983 for complete CDS\n
GLEAN3_12983	SPU_012983		This Glean3 model encodes the entire MCM3 gene sequence; however the first exon probably not belongs to this protein, being artifactually fused to the MCM3 gene in the scaffold. \n
GLEAN3_03296	SPU_003296	For this particular GLEAN model there was no Cds information available from either Baylor annotations or SpBase. However, there was mRNA information available from SpBase. When reviewing the excel data, it appears that the sequence is distributed on to 2 different scaffolds (subjects gb|DS006852| and gb|DS007174|). It is unclear if sequence has good coverage without the BLAST results, but if these 2 scaffolds were combined the sequence would have an orderly and continuous arrangement without any gaps or repeats present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values ranging from 5-10.\nAdditional gene information from Baylor annotations (comments):\nThis is highly correlated with two highly conserved regions:\n1) cytidine_deaminase-like region (2e-14);\n2) SNF7 family (9e-22).\nBut an alternative splicing gives different regions with less highly correlated scores; thus, this one is consideredmore likely.	This is highly correlated with two highly conserved regions:  \n1) cytidine_deaminase-like region (2e-14) \n2) SNF7 family (9e-22) \nBut an alternative splicing gives different regions with less highly correlated scores; thus, this one is considered more likely\n
GLEAN3_28359	SPU_028359	For this particular GLEAN model there was no Cds information available from Baylor annotations or from SpBase. There was however, mRNA information available from SpBase. When examining the excel data, it was apparent that there was numerous repeats present resulting in an un-orderly sequence arrangement. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being <5	Highly correlated with highly conserved region identified as nucleoside_deaminase. The Accession number is from a zebra fish protein that had not been identified.\n
GLEAN3_07007	SPU_007007		An incomplete MCM8 sequence is also found in GLEAN3_07296\n
GLEAN3_07296	SPU_007296		Partial sequences. See anotation of GLEAN3_07007 for the MCM8 full-length coding sequence\n
GLEAN3_23032	SPU_023032		The scaffold assembly should be revised. \nAnother GLEAN encodes the CDC45 sequence: GLEAN3_24816, in that case also exons are missing or artefactually assembled. \n
GLEAN3_24816	SPU_024816		#\nThe scaffold assembly should be revised. \nAnother GLEAN encodes the CDC45 sequence: GLEAN3_23032, in that case also exons are missing or artefactually assembled. \n
GLEAN3_26280	SPU_026280	After reviewing the data and performing a BLAST search it appears that this is the best fit for this particular GLEAN model. When comparing the excel data with the BLAST results, the sequence data does not coincide. The excel data displayed a large sequence of repeats and the coverage was only until about 658, while the BLAST results indicated that the sequence ended at about 2120.  There was Est. support available from GBrowse assembly V0.5 and the transcriptome intensity scores were very low (below 5). 	This is KRP95 of kinesin-2 cloned from Sp by Cole et al, 1993. \ne val = 3e-37 for NP_004789, KIF3B [Homo sapiens] \nThis is amino terminal portion of protein in 11 exons.  Two other scaffolds required to complete the gene.  KRP95 continues on scaffold91496 (one exon) then scaffold58510 (eight exons). \nAnnotated by RA Obar, RL Morris, B Rossetti, AM Musante, and EJ Jin.\n
GLEAN3_21526	SPU_021526	From the results of the BLAST search in comparison with the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. Both scaffolds contain several gaps present, however from the excel data, it appears that where ever one scaffold is missing information the other scaffold will fill it in. There was some Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	Exon 2 is missing from Scaffold1158 due to a string of N's but is present on Scaffold181669.\n
GLEAN3_06237	SPU_006237		Segment of KRP95, annotated fully in GLEAN3_26280. \nAnnotation by RL Morris, R.A.Obar, and B Rossetti.\n
GLEAN3_09764	SPU_009764		Segment of KRP95, annotated fully in GLEAN3_26280. \nAnnotation by RL Morris, R.A.Obar, and B Rossetti.\n
GLEAN3_27513	SPU_027513		#\nDomains: DEATH, NACHT, LRRs.\n
GLEAN3_01054	SPU_001054		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_16921	SPU_016921		Domains: DEATH, NACHT, LRRs. \nThis gene could be incomplete. The scaffold is quite small. \n
GLEAN3_08547	SPU_008547	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence (v2.1_scaffold37344) has an orderly and continuous arrangement until base pair 2838 is reached. The second scaffold (v2.1_scaffold49974) is unique in that it continues that rest of the sequence until the end, covering the last 1390 base pairs continuously (from 2837-4227). There was no Est. support from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 6.5.	#\nDomains: DEATH, NACHT, LRRs.\n
GLEAN3_13504	SPU_013504		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_17341	SPU_017341		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_01548	SPU_001548		Domains: NACHT, LRRs. Genscan model has DEATH domain but no methionine. Likely incomplete.\n
GLEAN3_21447	SPU_021447		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_02372	SPU_002372	After reviewing the data and performing a BLAST search, it appears that there is no sufficient fit for this particular GLEAN model. There are numerous repeats and gaps within the sequence. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with half of the values being less than 10 and the other half greater than 10.	This gene model is likely a fusion of two models. The Fgenesh model has DEATH domain and NACHT.\n
GLEAN3_02423	SPU_002423		Domains: DEATH, NACHT, LRRs. This model could be incomplete, the scaffold is very short.\n
GLEAN3_26071	SPU_026071		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_21478	SPU_021478		Domains: CARD, DEATH, NACHT, LRRs. \nSome LRRs could be missing since the scaffold is short. \n
GLEAN3_26020	SPU_026020		Domains: NACHT, LRRs. Fgenesh model has DEATH domain.\n
GLEAN3_22394	SPU_022394		Domains: DEATH, NACHT, LRRs. Genscan model has additional exons. This gene model could be incomplete, this scaffold is small.\n
GLEAN3_08431	SPU_008431		Domains: DEATH, NACHT, LRRs, TM. \nThe Genscan prediction does not have the TM.\n
GLEAN3_24649	SPU_024649		Domains: NACHT, LRRs. \nGenscan model has different exon/intron structure and contains DEATH domain. Some LRRs could be missing since this model is at the end of a scaffold.\n
GLEAN3_15105	SPU_015105		Domains: DEATH, NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete.\n
GLEAN3_03934	SPU_003934		Domains: Signal peptide, DEATH, NACHT, LRRs. \nThe Fgenesh model has 2 additional exons: could have more LRRs.\n
GLEAN3_05462	SPU_005462		Domains: DEATH, NACHT, LRRs. \nThis gene model is on a small scaffold. It could be incomplete.\n
GLEAN3_02183	SPU_002183		Shows essentially identical sequences to the XP_786692 accession number for the ADEAMc (adenosine deaminase tRNA specific) gene for the Sea Urchin as predicted by bioinformatics data.\n
GLEAN3_04872	SPU_004872		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_02962	SPU_002962		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_23550	SPU_023550		#\nDomains: DEATH, NACHT, LRRs. \n
GLEAN3_25138	SPU_025138		Domains: DEATH, NACHT. \nThe Fgenesh model has 4 additional exons and has LRRs.\n
GLEAN3_28387	SPU_028387		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_02231	SPU_002231		GLEAN3_02231 is a duplicate prediction for GLEAN3_04912.\n
GLEAN3_04163	SPU_004163		GLEAN3_04163 is a duplicate prediction for GLEAN3_21088.\n
GLEAN3_04597	SPU_004597		GLEAN3_04597 is a duplicate prediction for GLEAN3_10556.\n
GLEAN3_22130	SPU_022130		Domains: Signal peptide, DEATH, NACHT, LRRs. \n
GLEAN3_05004	SPU_005004		#\nGLEAN3_05004 is a partial duplicate prediction for GLEAN3_03407\n
GLEAN3_05099	SPU_005099		Partial sequence.\n
GLEAN3_12661	SPU_012661		looks like C-terminal region of a myosin 10 \n \nPH-MyTH4-B41 - missing N-terminal region containing MySc and IQ repeats\n
GLEAN3_18982	SPU_018982		looks like C-terminal region of a myosin 10 \n \nPHx2-MyTH4-B41 - missing N-terminal region containing MySc and IQ repeats\n
GLEAN3_23549	SPU_023549		looks like a partial of myosin 10 \nMySc-IQx3-PHx2-MyTH4 - missing B41 domain at C-terminus\n
GLEAN3_26077	SPU_026077		looks like a C-terminal fragment of myosin 7 or 15 \n \nSH3-MyTH4-B41 \n \nmissing N-terminal part with MySc-IQ-IQ-IQ-MyTH4-B41\n
GLEAN3_28273	SPU_028273		looks like a fragment of a myosin 10 \n \nPH-PH-MyTH4 \n \nmissing both N-terminus - MySc plus IQ repeats and C-terminus - B41  \n \n \n
GLEAN3_06732	SPU_006732		just a reelin domain - nothing else - could be part of a reelin\n
GLEAN3_28149	SPU_028149		multiple ANK domains, one SAM and a PARP domain \n \nlooks like Tankyrase - ADP ribosylase reportedly involved in modifying telomere-associated proteins and regulating GLUT4 traffic in the Golgi\n
GLEAN3_07551	SPU_007551		9 ankyrin repeats and a single SAM domain \n \nToo many repeats to be exactly SANS - unless that actually has more - but definitely a homolog\n
GLEAN3_22872	SPU_022872		four ankyrin domains and a single SAM - looks like a pretty good match to SANS - has one more ankyrin repeat \n \nSAM is the USH1G_HUMAN Usher syndrome type 1G protein (Scaffold protein containing ankyrin repeats and SAM domain)\n
GLEAN3_25058	SPU_025058		has 4 ankyrin repeats, two SAM domains and a PTB domain \n \nThis gene organization is found in vertebrates usually with 5 or 6, ankyrin repeats (also in bees). \n \nNames assigned to those genes include cajalin (rat) and odin (human) \nAlso known as EB-1 and ANKS1B\n
GLEAN3_14167	SPU_014167		large protein with N-terminal ANK/SH3-----SAM-SAM and then a long run of undistinguished sequence. \n \nThis structure exists in chordates - named caskin-1 (CASK-interacting protein) in humans. \n \nNote - Shank, a PSD-organizing protein involved in dendritic spine organization, is similar but has PDZ domain as well.\n
GLEAN3_00935	SPU_000935		This is the 3' end of the gene.  The 5' end is GLEAN3_13520\n
GLEAN3_13520	SPU_013520		FARP matches approximately the first 250aa. This is the 5' end of the gene.. the 3'end appears to be GLEAN3_00935\n
GLEAN3_07771	SPU_007771		Duplication of GLEAN3_08684.  Looks like a splice varient\n
GLEAN3_11527	SPU_011527		This is the 3' end--GLEAN3_25443 contains the first half of the gene.\n
GLEAN3_10861	SPU_010861		PARTIAL Sequence, almost identical to middle of GLEAN 14908 ECT2\n
GLEAN3_19189	SPU_019189		Best Blast is a Kalirin protein (a highly related family member), but based on sequence alignment and domain structure this is a TRIO not KALRN.  The TRIO Like 1 gene is in three parts.  5'end is GLEAN 22793.  The middle is 02796, while this GLEAN is the rest.\n
GLEAN3_22793	SPU_022793		This is the 5'end of this gene.  The middle is GLEAN 02796, while the end is GLEAN 19189.\n
GLEAN3_28457	SPU_028457		This is the end of ITSN2 gene.  The rest of the gene (5'end) is in GLEAN 03961.\n
GLEAN3_03961	SPU_003961		This is the 5'end of this glean.  The rest is located in GLean 28457.\n
GLEAN3_19587	SPU_019587		This gene is spread among 3 GLEANs.  This is the 5'end.  The middle is GLEAN 15117, and the 3' end is GLEAN 28316.\n
GLEAN3_15117	SPU_015117		This gene is spread among 3 GLEANs.  This is the middle.  The 5'end is GLEAN 19587, and the 3' end is GLEAN 28316.\n
GLEAN3_28316	SPU_028316		This gene is in three parts.  This is the 3'end.  The 5'end is GLEAN 19587, and the middle is GLEAN 15117.\n
GLEAN3_14333	SPU_014333		Appears to be a splice varient of GLEAN3_02673\n
GLEAN3_02796	SPU_002796		This is the middle of this gene.  The 5' end is GLEAN 22793, while most (and the 3'end) is 19189.\n
GLEAN3_13479	SPU_013479		This is the middle and 3'end of this gene.  The 5' end is GLEAN3_18498\n
GLEAN3_18498	SPU_018498		This is the 5'end of the gene.  The rest of the gene is found in GLEAN 013479\n
GLEAN3_01797	SPU_001797		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix C.2 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNote that GLEAN3_01794 (Sp-MACPF-C.1), a model adjacent to this gene, also contains the MACPF domain. A comparison of their protein sequences reveals high similarity but a fair number of differences as well. It is to be determined whether this fact reflects the erroneous assembly of different haplotypes (both genes are indeed located in an area of numerous contigs) or if reflects a true gene duplication event.\n
GLEAN3_02550	SPU_002550		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix B.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nThe modification to this model was made based on an otherwise identical Fgenesh++ prediction that includes a prediction for a signal peptide, a feature of all other Sp-MACPF genes, and an additional exon supported by the tiling array hybridization data. \n \nThe embryonic expression and modified structure of this model are strongly supported by the tiling array data. \n \nNB: An adjacent model (GLEAN3_02548/9) is very similar in sequence and exon structure. However, there are noticeable differences between both. It is difficult at this point to determine whether this represents an assembly error or a true duplication/reversion event.\n
GLEAN3_14984	SPU_014984		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nIn their original form, most predictions for this model forced two C-ter exon into the structure of this gene due to the presence of a large NNNNNNNN gap (150+ kb) that likely includes the last exon of this gene. We have modified this model by deleting the last two exons and noting the incompleteness of this model. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix A.3 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_15144	SPU_015144		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix B.2 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nThe embryonic expression and structure of this model are strongly supported by the tiling array data. Furthermore, there is a significant amount of signal coming from one of the introns. However, no other gene prediction protocol called for an exon in the region. \n \nNote that this model includes a prediction for a transmembrane domain towards the C-ter of the protein, which is uncharacteristic of the other Sp-MACPF genes. However, since there is no alternative models for this region, we cannot rule out that this is a true feature of this gene. It should be noted, however, that this model is located at the end of a scaffold, and the exon call may have been forced.\n
GLEAN3_22318	SPU_022318		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix D.2 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_26119	SPU_026119		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix D.3 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_27405	SPU_027405		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix D.4 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNote that this model is likely incomplete. Based on the position of this model close to an end of a scaffold, and the structure of other closely related models and Sp-MACPF genes in general, it is likely that there is missing N-ter sequence (including a signal peptide). \n \nThe embryonic expression and structure of this model are partly supported by the tiling array data.\n
GLEAN3_28756	SPU_028756		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix E.2 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_22230	SPU_022230		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix E.3 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_14229	SPU_014229		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix F.1 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nNote that most other Sp-MACPF genes include a signal peptide. However, none of the gene prediction protocols found one for this model. This is unlikely due to lack of scaffold sequence.\n
GLEAN3_02923	SPU_002923		Very similar to GLEAN3_02921; looks like local duplication.\n
GLEAN3_27596	SPU_027596		Very similar but not identical to GLEAN3_06866.\n
GLEAN3_13704	SPU_013704		duplicate of glean3_01739\n
GLEAN3_00386	SPU_000386		duplicate of glean3_12252\n
GLEAN3_08485	SPU_008485		This model was annotated based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix E.4 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees.\n
GLEAN3_02549	SPU_002549		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThe modification to this model was made based on an NCBI model that spans and perfectly matches this and an adjacent glean model (GLEAN3_02548). The modified nucleotide and protein sequences are provided for each of the fused glean models; but the gene features of only this model have been modified to reflect the fusion. \n \nThe predicted domain structure for the modified model includes only a signal peptide and the MACPF domain, a feature of all other Sp-MACPF genes. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix B.0 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nThe embryonic expression and modified structure of this model are strongly supported by the tiling array data. \n \nNB: An adjacent model (GLEAN3_02550) is very similar in sequence and exon structure. However, there are noticeable differences between both. It is difficult at this point to determine whether this represents an assembly error or a true duplication/reversion event.\n
GLEAN3_02548	SPU_002548		This model was annotated and modified based on a manual inspection of multiple protein sequence alignments and domain structure. \n \nThe modification to this model was made based on an NCBI model that spans and perfectly matches this and an adjacent glean model (GLEAN3_02549). The modified nucleotide and protein sequences are provided for each of the fused glean models; but the gene features of only GLEAN3_02549 have been modified to reflect the fusion. The gene features for this model have been accepted in their present form for simplicity. Please refer to GLEAN3_02549 for the modified exon structure. \n \nThe predicted domain structure for the modified model includes only a signal peptide and the MACPF domain, a feature of all other Sp-MACPF genes. \n \nThis model belongs to a family of models that contain a Membrane attack/perforin domain (MACPF) and show similarity in domain structure to apextrin. The sufix B.0 is arbitrary and just reflects the clustering of this model with other Sp-MACPF genes in phylogenic trees. \n \nThe embryonic expression and modified structure of this model are strongly supported by the tiling array data. \n \nNB: An adjacent model (GLEAN3_02550) is very similar in sequence and exon structure. However, there are noticeable differences between both. It is difficult at this point to determine whether this represents an assembly error or a true duplication/reversion event.\n
GLEAN3_21313	SPU_021313		The lim domains are in GLEAN3_04021, located in scaffold5\n
GLEAN3_22817	SPU_022817		duplicate of glean3_25302.  See that model for data.\n
GLEAN3_02246	SPU_002246		Tiling data suggests several exons are wrong.\n
GLEAN3_02766	SPU_002766		no embryonic expression based on tiling\n
GLEAN3_12277	SPU_012277		This is the PH domain that corresponds to the rest of the Sp-Tec protein (glean3_12278).\n
GLEAN3_09691	SPU_009691		PLC eta is distributed over four GLEAN3 predictions (GLEAN3_09688-09691).  Complete annotation is located on GLEAN3_09691 THIS ONE HERE!!!!\n
GLEAN3_09690	SPU_009690		Protein sequence continued on glean3_09691 glean3_09689 glean3_09688.  This annotation contains the PH domain of PLC eta. \n \nComplete annotation is located on GLEAN3_09691\n
GLEAN3_09689	SPU_009689		PLC eta is distributed over four GLEAN3 predictions (GLEAN3_09688-09691).  Complete annotation is located on GLEAN3_09691\n
GLEAN3_09688	SPU_009688		PLC eta is distributed over four GLEAN3 predictions (GLEAN3_09688-09691).  Complete annotation is located on GLEAN3_09691\n
GLEAN3_02630	SPU_002630		partial duplicate of glean3_21309.  Homeodomain and C-terminal are identical to Sp-Hox8. \n
GLEAN3_16561	SPU_016561		High similarity with human CDKL5. Partial N-terminal sequence\n
GLEAN3_22670	SPU_022670		High similarity with human CDKL5. Partial N-terminal sequence\n
GLEAN3_03424	SPU_003424		serine-threonine, not tyrosine\n
GLEAN3_03389	SPU_003389		containa an olfactomedin domain - no collagen repeats or other predicted domains.\n
GLEAN3_24073	SPU_024073		LY x2-EGF x5-SEA-EGF - no obvious TM domain \n \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_25178	SPU_025178		LY-EGF x6-SEA-EGF - no obvious TM domain \n \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_10472	SPU_010472		CCP x4 - SEA - EGF - apparent transmembrane domain \n \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_03285	SPU_003285		SEA-EGF - apparent TM domain \n \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_27741	SPU_027741		large protein with EGF/SEA/EGF flanked on both sides by long low complexity sequence blocks \n \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_06884	SPU_006884		SEA/EGF - apparent TM domain \nSEA/EGF COMBINATION IS CHARACTERISTIC OF MUCINS\n
GLEAN3_05135	SPU_005135	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. The sequence doesn't begin until the 76th base pair on scaffold v2.1_scaffold75030. When reviewing the BLAST results, it is apparent that there are numerous internal repeats within both scaffolds and a gap within the second scaffold that ranges from 625-696. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5 (excluding one outlier at 41.)	single SEA domain - characteristic of mucins\n
GLEAN3_20148	SPU_020148		sequence identical to GLEAN3_08768\n
GLEAN3_01134	SPU_001134		Exact same sequence that GLEAN3_05105\n
GLEAN3_27666	SPU_027666		Likely orthologue of Aurora A;Partially included in GLEAN3_27833 \n
GLEAN3_06082	SPU_006082		C-terminal part of GLEAN3_00964\n
GLEAN3_27384	SPU_027384		small part of GLEAN3_00964\n
GLEAN3_14001	SPU_014001		part of GLEAN3_05312\n
GLEAN3_00852	SPU_000852		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_03186	SPU_003186		Domains: DEATH, NACHT, LRRs. \nFgenesh model has 6 additional small exons at the C-terminus, and contains additional LRRs.\n
GLEAN3_17245	SPU_017245		Domains: Signal peptide, DEATH, NACHT, LRRs. \nFgenesh model has 6 additional c-terminal exons, the first 4 of which contain additional LRRs.\n
GLEAN3_14503	SPU_014503		Domains: Signal peptide, DEATH, NACHT, LRRs. \n
GLEAN3_28650	SPU_028650		#\nProbable homologue of human ficolin 2.\n
GLEAN3_28805	SPU_028805		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_08597	SPU_008597		Domains: Signal peptide, NACHT, LRRs. \nThe Genscan model has a DEATH domain and no signal peptide, and may be more accurate.\n
GLEAN3_19700	SPU_019700		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_04096	SPU_004096		IDENTICAL TO GLEAN3_04496 \nNOT Embryonically expressed.  May be pseudo-gene or adult-only gene.\n
GLEAN3_01444	SPU_001444		Domains: Signal peptide, DEATH, NACHT, LRRs. \nThe Fgenesh model has 2 additional small exons, which may contain additional LRRs not detected by the SMART program.\n
GLEAN3_23120	SPU_023120		Domains: NACHT, LRRs. \nThe Genscan model is probably incomplete since it doesn't start with a Met, but it has an additional 5' exon and contains a DEATH domain.\n
GLEAN3_28483	SPU_028483		Domains: Signal peptide, DEATH, NACHT, LRRs. \nThe Fgenesh model has slightly different exon/intron structure at the 3' end and could contain additional LRRs not detected by the SMART program.\n
GLEAN3_00816	SPU_000816		Domains: Signal peptide, DEATH, NACHT, LRRs. \n
GLEAN3_19699	SPU_019699		Domains: Signal peptide, DEATH, NACHT, LRRs. \n
GLEAN3_04165	SPU_004165		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_15481	SPU_015481		Domains: DEATH, NACHT, LRRs. \nThe Fgenesh model has 7 additional small 3' exons that code for additional LRRs.\n
GLEAN3_06733	SPU_006733		Domains: NACHT, LRRs. \nThis gene model is at the end of a short scaffold and could be incomplete, missing the DEATH domain.\n
GLEAN3_08283	SPU_008283		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has 3 additional small 3' exons that code for additional LRRs.\n
GLEAN3_13465	SPU_013465		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_25204	SPU_025204		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_00457	SPU_000457		Domains: NACHT, LRRs. \nThe Genscan model has a different exon/intron structure and has DEATH domains (one at N-terminus and one c-terminal to the NACHT domain). \n
GLEAN3_05993	SPU_005993		Domains: DEATH, NACHT, LRRs. \nThis gene model is at the end of a scaffold and could be missing LRRs.\n
GLEAN3_06203	SPU_006203		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has 7 additional small exons at the 3' end and has additional LRRs. \n
GLEAN3_00015	SPU_000015		Domains: DEATH, NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing LRRs.\n
GLEAN3_19696	SPU_019696		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_12523	SPU_012523		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain.\n
GLEAN3_25948	SPU_025948		Domains: DEATH (2), NACHT, LRRs. \nThis gene model overlaps with Glean3_25952. The Genscan model has a different terminal exon and does not overlap the other gene model. \n
GLEAN3_07113	SPU_007113		Domains: NACHT, LRRs. \nGlean3_07116 has low e-value DEATH domain.\n
GLEAN3_22412	SPU_022412		Domains: Sulfatase, NACHT, LRRs. \nThe Fgenesh model also has a DEATH domain before the NACHT domain. This is a very unusual domain combination. \nAlso, there is a very large exon between the exons coding for the sulfatase domain and the NACHT and LRRs.\n
GLEAN3_21844	SPU_021844		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain.\n
GLEAN3_11441	SPU_011441		Domains: NACHT, LRRs. \nThe Fgenesh model combines Glean3_11441 and Glean3_11440. It has a DEATH domain.\n
GLEAN3_10053	SPU_010053		Domains: NACHT, LRRs. \nThe Genscan model combines Glean3_10053 and Glean3_10052, which contains the DEATH domain.\n
GLEAN3_11439	SPU_011439		Domains: NACHT, DEATH, LRRs. \n
GLEAN3_06229	SPU_006229		#\nDomains: Signal peptide, DEATH, NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing LRRs.\n
GLEAN3_10097	SPU_010097		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_28630	SPU_028630		Domains: DEATH, NACHT. \nGlean3_28631 contains 4 exons coding for LRRs. The accurate gene model should probably be a fusion of these 2 gene models, such as the Genscan.\n
GLEAN3_05301	SPU_005301		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_15052	SPU_015052		Domains: DEATH, NACHT, LRRs. \nThis gene model sits at the end of a scaffold and could be incomplete, missing LRRs.\n
GLEAN3_24605	SPU_024605		9 armadillo repeats - matches best with plakoglobin/gamma catenin but also, less well, with beta catenin and poorly with p120/delta catenin\n
GLEAN3_20240	SPU_020240		Domains: NACHT, LRRs. \nThis gene model is likely incomplete. There is an Fgenesh model just 5' of this glean that codes for 2 DEATH domains.\n
GLEAN3_14761	SPU_014761		Domains: DEATH, DEATH, NACHT, LRRs. \nThe Fgenesh model has 2 additional exons that code for LRRs. This gene model is at the end of a scaffold and could be incomplete, missing LRRs.\n
GLEAN3_08833	SPU_008833		Domains: Signal peptide, DEATH, NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing LRRs and/or a second DEATH domain.\n
GLEAN3_01016	SPU_001016		Domains: NACHT, LRRs.  \nThe Genscan models has 2 additional exons (Glean3_01015) that code for a DEATH domain. The gene features were modified to reflect this model. \nThe Fgenesh model has 2 additional 3' exons that code for additional LRRs.\n
GLEAN3_20414	SPU_020414		The KRP170 gene spans GLEAN3_20414 (Scaffold1612/Scaffoldi17703) and GLEAN3_26503 (Scaffold56862/Scaffoldi4507).  The mRNA was published as Chui,K.K., Rogers,G.C., Kashina,A.M., Wedaman,K.P., Sharp,D.J., Nguyen,D.T., Wilt,F. and Scholey,J.M.  "Roles of two homotetrameric kinesins in sea urchin embryonic cell division."   J. Biol. Chem. 275 (48), 38005-38011 (2000).  The GenBank entry for this gene is gi|10697491|gb|AF292395.2|. \nAnnotation by RA Obar, RL Morris, AL Silverio, BJ Chick, EJ Jin.\n
GLEAN3_16794	SPU_016794		Domains: NACHT, LRRs. \nThis gene model is at the end of scaffold and is likely incomplete, missing the DEATH domain.\n
GLEAN3_28595	SPU_028595		Domains: Signal peptide, DEATH, NACHT, LRRs. \n
GLEAN3_03640	SPU_003640		Domains: DEATH, NACHT, LRRs. \nThe Fgenesh model has additional exons that could code for more LRRs.\n
GLEAN3_03762	SPU_003762		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_03366	SPU_003366		Domains: DEATH, NACHT, LRRs. \nThe Fgenesh model just downstream has 5 exons that code for LRRs.\n
GLEAN3_27511	SPU_027511		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has 3 additional exons at the 3' end that code for additional LRRs.\n
GLEAN3_18384	SPU_018384		Domains: DEATH, NACHT, LRRs. \nThe Fgenesh model is slightly different and codes for additional LRRs. \n
GLEAN3_26189	SPU_026189		#\nDomains: NACHT, LRRs. \nThe Genscan model is slightly different and also encodes a DEATH domain. It is likely incomplete however, since this is a short scaffold.\n
GLEAN3_28060	SPU_028060		Domains: Signal peptide, DEATH, NACHT, LRRs. \nThis gene model could be incomplete since it is on a small scaffold.\n
GLEAN3_25179	SPU_025179		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has addtional small exons that code for additional LRRs.\n
GLEAN3_05026	SPU_005026	For this particular GLEAN model an error message was received when doing a BLAST search. When reviewing the excel data, it was evident that the sequence was very long with numerous repeats present.	Domains: DEATH, DED, NACHT,LRRs.\n
GLEAN3_09160	SPU_009160		Domains: DEATH. \nThe Genscan model combines Glean3_09060 and Glean3_09061. Glean3_09061 has NACHT and LRRs. This gene model has been modified to reflect this gene model.\n
GLEAN3_09161	SPU_009161	For this particular GLEAN model there were no CDS found in both the SpBase search engine as well as the Baylor annotations search engine. There were also no gene features available on either site as well. However, the mRNA sequence was available from SpBase.\nAdditional information found: This gene model is combined with Glean3_09160. Please refer to this model for details.	This gene model is combined with Glean3_09160. Please refer to this model for details.\n
GLEAN3_28631	SPU_028631		This gene model is combined with Glean3_28631. Please refer to this gene model for details.\n
GLEAN3_25166	SPU_025166		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_09659	SPU_009659		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_05609	SPU_005609		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_28433	SPU_028433		Domains: Signal peptide, DEATH, NACHT, LRRs.\n
GLEAN3_11855	SPU_011855		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_17505	SPU_017505		Domains: DEATH, NACHT, LRRs. \nThe Glean3_17506 has additional LRRs that are probably part of this gene model.\n
GLEAN3_17506	SPU_017506		This gene model contains LRRs only and probably belong to Glean3_17505. See this other gene model for details.\n
GLEAN3_17196	SPU_017196		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_03247	SPU_003247		Domains: DEATH, NACHT, LRRs. \nBoth the Genscan and the Fgenesh models have different exon/intron structures in the 3' end of the gene, which code for additional LRRs.\n
GLEAN3_08498	SPU_008498		Domains: NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_27610	SPU_027610		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_03797	SPU_003797		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_22001	SPU_022001		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_28820	SPU_028820		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_16810	SPU_016810		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_26622	SPU_026622		Domains: NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing DEATH domain(s). \n
GLEAN3_05383	SPU_005383		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_00318	SPU_000318		Gene neighbour of the ParaHox cluster; in chordates.\n
GLEAN3_23183	SPU_023183		Domains: NACHT, LRRs. \nThe Genscan model has a different exon/intron sturcture and also codes for a DEATH domain N-terminal of the NACHT domain. \n
GLEAN3_16257	SPU_016257		#\nDomains: DEATH, NACHT, LRRs. \nThis gene model has a final intron of more than 150kb. This is very unusual. The Fgenesh model has a different final exon, which is much closer to the rest of the gene, and could be a more accurate model.\n
GLEAN3_24419	SPU_024419		EST data:  \n \nBG784137 SEAUMC004094 Sea urchin primary mesenchyme cell cDNA library Strongylocentrotus purpuratus cDNA clone PC_0020_A1_G10_MR 5', mRNA sequence \n \n>gi|57955070|gb|CX692889.1|CX692889 yde99f06.y1 Sea urchin EST Lib1 Strongylocentrotus purpuratus cDNA clone yde99f06 5' similar to TR:Q9VK69 Q9VK69 CG5525 PROTEIN. ;, mRNA sequence\n
GLEAN3_16926	SPU_016926		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_01210	SPU_001210		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_04343	SPU_004343		Domains: DEATH, NACHT, LRRs. \n
GLEAN3_14122	SPU_014122		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_25600	SPU_025600		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has additional small exons that code for more LRRs.\n
GLEAN3_00523	SPU_000523		Domains: DEATH, NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing LRRs.\n
GLEAN3_08707	SPU_008707		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has additional small exons that code for additional LRRs.\n
GLEAN3_07446	SPU_007446		Domains: DEATH, NACHT, LRRs. \nThis gene model is on a short scaffols and could be incomplete, missing LRRs.\n
GLEAN3_18173	SPU_018173		7 ARMADILLO REPEATS - similar to karyopherin (importin) alpha 6  - clusters with human importins\n
GLEAN3_13477	SPU_013477		8 ARMADILLO REPEATS - similar to karyopherin (importin) alpha 6  - clusters with human importins\n
GLEAN3_04456	SPU_004456		single armadillo repeat - clusters with importins \n
GLEAN3_24721	SPU_024721		This is likely to be split between this gene and glean 24930. Correct protein sequence is predicted to be: \nMAQAYVEDTLMVRHELKHAYGKVFLVRKVGNHNQGKLYAMKVLKKATIVQ \nKAKTAEHTMTERQVLEAVRSCPFLVTLHYAFQTDSKLNLILDYVNGGELF \nTHLYQREHFRESEVRIYIAEIIIALDCLHRILTSHPPMPNTFSKEVKDFI \nNKLLVKDPTKRLGCNGVKDIKSHSFFKGLNWDDVAAKRVSPPFRPHINGE \nLDTSNFAEEFTSLVPADSPADIPKTADARVFRVGYSFIAPSILYSDNAIT \nQDMLTQPSEHNRPSLASILSIHELKDSPFNKYYELDMKSAPIGDGSFSIC \nRRCTHRKTEKEYAVKIVSRRVACTQEITTLQLCQKHPNIVHLKEEFKDKL \nHTYIIMELCKGGELLGRIRKKKHFDELEASMIMRKLVSAVDYMHSRGIVH \nRDLKPENILFTDDSDDAELKIIDFGFARITNSNQPLKTPCFSLHFAAPEV \nLKRAYEQDGEYDASCDVWSLGVILYTMLSGRVPFQDPSISKSNSASDIMK \nRIKHGNFSFDGEEWNSVSTPAKDLIKGLLTVDPSRRLTTDDLLQNEWIQG \nQQLSTSTPLMTPDILNSCASIQKRVKATMRAFHTAQREGFLLTDVSNAPL \nAKRRKKKKDSSTETRSSSSESTHSQSSSSQESTTPTPTANPVLTIPVTTV \nSCAPRTTTATGAPSIPSVQPLPSLSKQTGARLDQYESLESLGFSPILPFS \nAGGSQELPPLLARQDSGYVGQMPSYAQVTPVPRTNVGSHGVTYAPILDPS \nMYPCGLQQPILDFSSSIPEYLSVQYASTEQPSIPMTVPRTLHQPHPHPLP \nLPHQHLSHLPTISEDPSTT \n
GLEAN3_00512	SPU_000512		#\nPredicts the carboxy-terminal end of a dual oxidase with an exon structure similar to that of Sp-Udx1 (GLEAN3_00513).  May link to GLEAN3_25507, which represents a paralogous amino-terminal domain to Sp-Udx1 (GLEAN3_00513).\n
GLEAN3_25518	SPU_025518		Best hit to Xenopus Timeless, needs to be corelated with GLEAN3_06230\n
GLEAN3_26036	SPU_026036		Domains: DEATH, DEATH, LRRs, NACHT, LRRs, ZU5. \nThe Fgenesh model has different exon/intron structure, is missing the last 2 exons, and therefore the ZU5 model and is likely more accurate, since this domain is not associated with this type of protein in any other organism.  \n \nThe presence of LRRs between the DEATH domains and the NACHT domain is also unusual. \n
GLEAN3_00863	SPU_000863		Domains: DEATH, NACHT, LRRs. \nThe Genscan model fuses Glean3_00863 with Glean3_00864 (which is comprised of 4 small exons) that could code for additional LRRs.\n
GLEAN3_20641	SPU_020641		Would prefer to use Arl13b (for clarity's sake), but ARL2L1 appears to the preferred naming scheme. \nAlso, note that tiling array shows signal on only 6/10 exons (missing signal at 5' exons).  No EST to confirm.  Likely that 5' end is wrong.\n
GLEAN3_23642	SPU_023642		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_27808	SPU_027808		Domains: DEATH,NACHT, LRRs.\n
GLEAN3_26400	SPU_026400		Domains: NACHT, LRRs. \nThis gene model is at the end of a scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_25167	SPU_025167		Domains: NACHT, LRRs. \nThe Fgenesh model has additional exons in the 5' region that code for a DEATH domain. However, this model is truncated at the 3' end and is missing a large portion of the LRRs. A combination of both models is likely the most accurate version of this gene. \n
GLEAN3_22070	SPU_022070		Domains: NACHT, LRRs. \nThis gene model is at the end of a scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_01884	SPU_001884		Domains: NACHT, LRRs. \nThe Genscan model has 2 additional exons that code for additional LRRs. \nThis gene model is on a short scaffols and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_15206	SPU_015206		Domains: DEATH, EGF, NACHT, LRRs. \nThe presence of the EGF domain in this model is unusual. \n \nThe Genscan model combines Glean3_15206 and Glean3_15207. Glean3_15207 has 4 exons that code for additional LRRs. The Genscan model is therefore likely to be more accurate.\n
GLEAN3_15207	SPU_015207		This gene model contains LRRs only but is likely a part of Glean3_25206 model. Please see this other gene model for details.\n
GLEAN3_11035	SPU_011035		Domains: DEATH, DEATH, NACHT, LRRs.\n
GLEAN3_22441	SPU_022441		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_06456	SPU_006456		Domains: NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing the DEATH domain(s) and LRRs.\n
GLEAN3_06587	SPU_006587		Domains: Signal peptide, NACHT, DEATH, LRRs. \nThis domain organization of this gene model is unusual, since the DEATH domain is usually N-terminal to the NACHT domain.\n
GLEAN3_15205	SPU_015205		Domains: DEATH, EGF(2), NACHT, LRRs. \nThe presence of the EGF domains in this gene model is unusual but it is also seen in the gene model just downstream on the same scaffold: Glean3_15206.\n
GLEAN3_01630	SPU_001630		Domains: NACHT, LRRs. \nThis gene model is located on a short scaffold and is likely incomplete, missing the DEATH domain(s).\n
GLEAN3_12713	SPU_012713		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_01549	SPU_001549		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and is likely incomplete, missing the DEATH domain(s).\n
GLEAN3_05732	SPU_005732		Domains: DEATH, NACHT, LRRs. \nThe Genscan model has three additional small exons that code for additional LRRs.\n
GLEAN3_27300	SPU_027300		Domains: DEATH, EGF, NACHT, LRRs. \nThe presence of the EGF domain in this gene model is unusual. \n \nThe Fgenesh model has 2 additional terminal exons that code for additional LRRs. \nThis gene is at the end of a scaffold and could be incomplete, missing LRRs. \n
GLEAN3_20569	SPU_020569		#\nDomains: DEATH, EGF, NACHT, LRRs. \nThe presence of the EGF domain in this gene model is unusual. \n
GLEAN3_02888	SPU_002888		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and is likely incomplete, missing DEATH domain(s) and LRRs.\n
GLEAN3_24709	SPU_024709		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_10052	SPU_010052		Domains: DEATH. \nThis gene model is likely part of a larger model that includes Glean3_10053. See this model for further details.\n
GLEAN3_00738	SPU_000738		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_03303	SPU_003303		Domains: NACHT, LRRs. \nThe Genscan model has 3 additional exons at the 5' end that code for a DEATH domain. This gene model is likely to be more accurate than the glean. \n \nThis gene model is on a short scaffold and could be incomplete, missing a DEATH domain and/or LRRs.\n
GLEAN3_10091	SPU_010091		Domains: DEATH, NACHT, LRRs. \nThe DEATH and NACHT domains overlap. The Genscan model has an additional small exon before the one coding for the NACHT domain and effectively "separates" the DEATH and NACHT domains. However, the last 2 exons of the Genscan model (not in the Glean model) code for a Zn finger domain and probably do not belong to this protein. The accurate gene model is likely part of the Genscan model and the Glean model.\n
GLEAN3_14719	SPU_014719		Domains: NACHT, LRRs. \nThis gene model is located on a short scaffold and is likely incomplete, missing DEATH domain(s) and/or LRRs.\n
GLEAN3_21630	SPU_021630		two ANATO domains and nothing else - a "chordate-specific" domain found in complement C3/4/5 and in fibulins.  This looks like a fragment - probably of a fibulin - could be haplotype pair for GLEAN3_26629, which encodes a fibulin. \n
GLEAN3_16532	SPU_016532		#\nDomains: LRRs. \nThe Fgenesh model has additional exons that code for Signal peptide, DEATH and NACHT domains. The gene features and sequences of this gene model were modified to reflect this.\n
GLEAN3_27053	SPU_027053		Duplication of the C-term of glean3_02874.\n
GLEAN3_15768	SPU_015768		Domains: NACHT, LRRs.  \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_26304	SPU_026304		Domains: NACHT, DEATH, LRRs. \nThe presence of a DEATH domain C-terminal to the NACHT domain is unexpected since this domain structure is not observed in the vertebrate NLRs, where PYD and CARD domains are N-terminal to the NACHT domain. \n \nThe Fgenesh model (S.P_Scaffold1348.seq.N000001:  2,207-2,237) located 5' of this Glean model codes for a DEATH domain that likely belongs to this gene.\n
GLEAN3_01608	SPU_001608		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s). \n
GLEAN3_28485	SPU_028485		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s). \n
GLEAN3_06019	SPU_006019		#\nDomains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s). \n
GLEAN3_05410	SPU_005410		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_19497	SPU_019497		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_05581	SPU_005581		Domains: DEATH, NACHT, LRRs, NACHT, LRRs. \nThis structure is very unusual, since most NLRs have a single NACHT domain. There is a large (~100kb) intron in this prediction. Therefore, this could be a fusion of 2 gene models.\n
GLEAN3_21370	SPU_021370		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_07620	SPU_007620		Domains: NACHT, LRRs. \nThis gene model is on a small scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_23759	SPU_023759		Domains: LRRs. \nThis gene model contains part of a NACHT domain as well. Glean3_23758 has the N-terminal part of the NACHT domain as well as a Signal peptide and DEATH domain. The accurate gene model is probably a fusion of the two, similar to the Genscan prediction. The Genscan model is missing a part of the NACHT domain.\n
GLEAN3_23758	SPU_023758		Domains: Signal peptide, DEATH. \nThis gene model probably belongs as a fusion with GLEAN3_23759. Please see this other model for details.\n
GLEAN3_10667	SPU_010667		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_27207	SPU_027207		Domains: NACHT, LRRs. \nThis gene is located on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_22564	SPU_022564		Domains: NACHT, LRRs. \nThe Fgenes model has three additional exons at the 5'end which code for a DEATH domain. This model is probably a more accurate reflection of the actual gene.\n
GLEAN3_16060	SPU_016060		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and/or LRRs.\n
GLEAN3_13038	SPU_013038		Domains: NACHT, LRRs. \nThis gene model is at the end of a scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_27035	SPU_027035		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s) and LRRs.\n
GLEAN3_15033	SPU_015033		Domains: NACHT, LRRs. \nThis gene model is on a short scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_11097	SPU_011097		Domains: NACHT, LRRs. \nThis gene mode is located at the end of a scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_21930	SPU_021930		Domains: DEATH, NACHT, LRRs.\n
GLEAN3_06219	SPU_006219		Domains: DEATH, LRRs. \nBoth the Genscan and the Fgenesh models have a different exon/intron structure and code for a NACHT domain. The Genscan NACHT domain is better (higher e-value). However, this model may not be the most accuarate one, since it is does not start with a Methionine. \nThe gene features and sequences were changed to reflect the 5'end of the Glean3 model and the NACHT domain region of the Genscan model.  \n(The gene feature edit function was not available, so an additional "exon" was created to make the existing one bigger.)\n
GLEAN3_10039	SPU_010039	For this particular GLEAN model it appears that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence appears to have several small gaps throughout the sequence that only span about 10-15 base pairs apart. The second scaffold has an orderly and continuous distribution without any gaps or repeats present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the values being less than 5.	Domains: NACHT, LRRs, CLECT, CLECT. \nThis gene model is likely a fusion of two models, as the Fgenesh model only has the first 4 exons and doesn't code for the CLECT domain, which are not normally found in this type of protein. Also, the intron separating the NLR domains from the CLECT domains is very large (~150kb) which is very unusual. \nThis gene model is at the beginning of a scaffold and could be incomplete, missing the DEATH domain(s).\n
GLEAN3_10116	SPU_010116		Could contain a repetitive non-coding sequence in the second predicted exon, without this sequence the predicted ORF is very similar in length and homology to human Dystroglycan\n
GLEAN3_03619	SPU_003619		Domains: DEATH, 6x(EGF), NACHT. \nThe presence of the EGF domain in this model is unusual. \n \nThe LRRs are not present in this glean model but are included in the corresponding Genscan prediction. The genscan prediction is missing part of the NACHT domain, however. A hybrid of both gene predictions is probably the accurate model.\n
GLEAN3_01423	SPU_001423		Domains: DEATH, NACHT, LRRs. \nThis gene model is located on a small scaffold and could be incomplete, missing LRRs. \nAlso, the DEATH and NACHT domains overlap in the SMART prediction. This overlap is less extensive in the Genscan model which has a larger 3rd exon. The gene features and sequences for this model reflect the Genscan model.\n
GLEAN3_18582	SPU_018582		This GLEAN matches a GenBank entry, "gi|72180194|ref|XP_794270.1| similar to dynein 2 light intermediate chain" exactly.\n
GLEAN3_22644	SPU_022644		No signal in tiling array or EST.  May be pseudogene or adult only expression.\n
GLEAN3_09807	SPU_009807		NOTE: Based on sequence, alignment and domain analysis--This look like the 5' end of a single OPA gene that is composed of GLEANS 09807 (5'end)  and 06815 (3'end), with no gaps between.\n
GLEAN3_09674	SPU_009674		NOTE: No signal in tiling array and no EST.  May be pseudogene or adult expressed.\n
GLEAN3_16744	SPU_016744		NOTE: No signal in tiling array and no EST.  May be pseudogene or adult expressed.\n
GLEAN3_10656	SPU_010656		high homology to chondroitin glucuronate 5-epimerase, dermatan sulfate epimerase, squamous cell carcinoma antigen recognized by T cells 2, SART2 \n \nNCAG1 may be an enzyme with dual epimerase and O-sulfotransferase activities involved in dermatan sulfate biosynthesis (Maccarana et al., in press, JBC 2006)\n
GLEAN3_12840	SPU_012840		Better hit against Human Traf1 sequence than GLEAN3_26479\n
GLEAN3_05419	SPU_005419	From the BLAST results it is evident that the sequence is distributed onto two different scaffolds. When examining the excel data, it is apparent that there are no internal repeats or gaps present within either of the two scaffolds and if they were to be combined the sequence would have an orderly continuous arrangement. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with all of the values being greater than 5 and several values that were greater than 10. 	Kinase domain + some 3' UTR partially cloned out of egg cDNA \nThe model is probably missing the 5' most sequence based on comparisons to A.miniata SFK3 \n \n Gleen for CDS and Peptide sequences Accepted, however further verification should be done for sequence that is not within the cloned mRNA sequence.\n
GLEAN3_02160	SPU_002160		likely an incomplete version of GLEAN10659. Refer to GLEAN3_10659 for the complete sequence.\n
GLEAN3_17675	SPU_017675		likely an incomplete prediction, the C-terminal region of the protein is not predicted.\n
GLEAN3_07092	SPU_007092		This is the S.purpuratus version of gi|1817526|dbj|BAA09934.1| intermediate chain 1 [Anthocidaris crassispina].\n
GLEAN3_19893	SPU_019893		Structure: CARD-DEATH. \nThe CARD domain is a poor hit in a SMART analysis, but does appear in the 2nd table.\n
GLEAN3_06410	SPU_006410		Structure: CARD-DEATH-DEATH. \nThe CARD domain is a poor hit in a SMART analysis but does appear in the 2nd table.\n
GLEAN3_19506	SPU_019506		This GLEAN represents the sea urchin Dynein Intermediate Chain 2, as defined by the Anthocidaris crassispina cDNA (gi|2494214|sp|Q16959|DYI2_ANTCR Dynein intermediate chain 2, ciliary).  GLEAN3_19506 originally represented only the first 477 amino acid residues, while GLEAN3_05973 represented the last 437 residues.  These were merged into GLEAN3_19506.\n
GLEAN3_06354	SPU_006354		The prototype of this gene product is from the sea urchin Anthocidaris crassispina (gi|2760163|dbj|BAA24185.1| outer arm dynein light chain 1 [Anthocidaris crassispina]).\n
GLEAN3_18854	SPU_018854		This GLEAN represents the sea urchin Dynein Light Chain 2, as defined by the Anthocidaris crassispina cDNA (gi|2760161|dbj|BAA24184.1| outer arm dynein light chain 2 [Anthocidaris crassispina]). \n
GLEAN3_08471	SPU_008471		This GLEAN represents the sea urchin Outer Arm Dynein Light Chain 2, as defined by the Anthocidaris crassispina cDNA (gi|3336986|dbj|BAA31751.1| outer arm dynein LC3 [Anthocidaris crassispina]), and is identical to the GenBank RefSeq entry "XP_783725.1 PREDICTED: similar to t-complex testis expressed 1."\n
GLEAN3_04009	SPU_004009		This GLEAN represents the sea urchin Dynein Light Chain Type 6, as defined by the Anthocidaris crassispina cDNA (gi|2811014|sp|O02414|DYL1_ANTCR Dynein light chain LC6, flagellar outer arm), which has an identical amino acid sequence.   The S. purpuratus RefSeq ID is XP_785110.1. \nThere are several other GLEANs that are identical or nearly identical to this one: \nGLEAN3_18607 (100% Amino Acid Identity) \nGLEAN3_08799 (93% Amino acid Identity)                                            \nGLEAN3_24498 (91% Amino acid Identity)                                            \nGLEAN3_24497 (88% Amino acid Identity)                                            \nGLEAN3_18567 (88% Amino acid Identity)                                            \nGLEAN3_25272 (88% Amino acid Identity)                                            \nGLEAN3_24499 (88% Amino acid Identity)                                            \nGLEAN3_08800 (86% Amino acid Identity)                                            \nGLEAN3_24500 (87% Amino acid Identity)                                            \nGLEAN3_08801 (76% Amino acid Identity)                                           \nGLEAN3_27937 (70% Amino acid Identity) \nGLEAN3_27938 (68% Amino acid Identity)\n
GLEAN3_18607	SPU_018607		This GLEAN represents the sea urchin Dynein Light Chain Type 6, as defined by the Anthocidaris crassispina cDNA (gi|2811014|sp|O02414|DYL1_ANTCR Dynein light chain LC6, flagellar outer arm), which has an identical amino acid sequence.   The S. purpuratus RefSeq ID is XP_785110.1. \nThis gene model is a member of a group of several GLEANs that are identical or nearly identical to GLEAN3_04009: \nGLEAN3_18607 (100% Amino Acid Identity) \nGLEAN3_08799 (93% Amino acid Identity)                                            \nGLEAN3_24498 (91% Amino acid Identity)                                            \nGLEAN3_24497 (88% Amino acid Identity)                                            \nGLEAN3_18567 (88% Amino acid Identity)                                            \nGLEAN3_25272 (88% Amino acid Identity)                                            \nGLEAN3_24499 (88% Amino acid Identity)                                            \nGLEAN3_08800 (86% Amino acid Identity)                                            \nGLEAN3_24500 (87% Amino acid Identity)                                            \nGLEAN3_08801 (76% Amino acid Identity)                                           \nGLEAN3_27937 (70% Amino acid Identity) \nGLEAN3_27938 (68% Amino acid Identity)\n
GLEAN3_15529	SPU_015529		homologous to DNA-PKcs, 5'end of gene is most likely missing, Glean3_15528 also contains parts of this gene,  \nGlean3_15484 contains 5'end / amino-terminus of gene\n
GLEAN3_08799	SPU_008799		This GLEAN represents the sea urchin Dynein Light Chain Type 6, as defined by the Anthocidaris crassispina cDNA (gi|2811014|sp|O02414|DYL1_ANTCR Dynein light chain LC6, flagellar outer arm), which has an identical amino acid sequence.   The S. purpuratus RefSeq ID is XP_785110.1. \nThis gene model is a member of a group of several GLEANs that are identical or nearly identical to GLEAN3_04009: \nGLEAN3_18607 (100% Amino Acid Identity) \nGLEAN3_08799 (93% Amino acid Identity)                                            \nGLEAN3_24498 (91% Amino acid Identity)                                            \nGLEAN3_24497 (88% Amino acid Identity)                                            \nGLEAN3_18567 (88% Amino acid Identity)                                            \nGLEAN3_25272 (88% Amino acid Identity)                                            \nGLEAN3_24499 (88% Amino acid Identity)                                            \nGLEAN3_08800 (86% Amino acid Identity)                                            \nGLEAN3_24500 (87% Amino acid Identity)                                            \nGLEAN3_08801 (76% Amino acid Identity)                                           \nGLEAN3_27937 (70% Amino acid Identity) \nGLEAN3_27938 (68% Amino acid Identity) \nThe predicted amino acid sequence of GLEAN3_08799 is identical to those of GLEAN3_08800 and GLEAN3_08801. \nBecause there is a good chance that they do not represent three distinct gene products, they were named as follows: \nGLEAN3_08799: Sp-Dynein Light Chain-1-3a \nGLEAN3_08800: Sp-Dynein Light Chain-1-3b \nGLEAN3_08801: Sp-Dynein Light Chain-1-3c\n
GLEAN3_03242	SPU_003242		This GLEAN represents an Outer Arm Dynein Light Chain-like polypeptide (RefSeq ID: gi|72012233|ref|XP_782355.1| PREDICTED: similar to outer arm dynein light chain like (XJ558) [Strongylocentrotus purpuratus]).\n
GLEAN3_04498	SPU_004498		This GLEAN represents an Outer Arm Dynein Light Chain-like polypeptide (RefSeq ID: gi|72070876|ref|XP_791620.1| PREDICTED: similar to outer arm dynein light chain like).\n
GLEAN3_26533	SPU_026533		This gene model represents a segment of the axonemal Dynein intermediate Chain 3.  Similarity to Chlamydomonas reinhardtii Inner_Dynein_Arm_1_Intermediate Chain IC140 (C_530081|166736|IA1-IC140|) and Homo sapiens axonemal dynein intermediate polypeptide 2.  Contains a WD-40 motifs.  The protein is also essentially identical to the Anthocidaris crassispina (gi|2494216|sp|Q16960|DYI3_ANTCR Dynein intermediate chain 3, ciliary).\n
GLEAN3_24517	SPU_024517	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. On the first portion of the sequence (v2.1_scaffold85334) the sequence is continuous until 480 and doesn't continue until 717. This region is filled in by v2.1_scaffold6747 . If the two scaffolds were combined, the sequence would have a continuous orderly arrangement. There was Est support available and from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	putative homolog of xrcc4, 3'end / C-terminus is missing in gene model\n
GLEAN3_14075	SPU_014075		Annotated by RL Morris and B Rossetti.\n
GLEAN3_06911	SPU_006911		This GLEAN corresponds to the first 292 amino acid residues (of a total of 686) of a well-characterized Microtubule-associated Protein known as the "77kDa-MAP."  GLEAN3_06911 overlaps with GLEAN3_05744 by 18 codons, followed by the remainder of the carboxyl-terminus of the protein (a total of 686 codons plus the stop codon) contained in GLEAN3_05744.  The annotation for the full-length gene product is associated with GLEAN3_06911.\n
GLEAN3_05744	SPU_005744		This GLEAN corresponds to the last 402 amino acid residues (of a total of 686) of a well-characterized sea urchin Microtubule-associated Protein known as the "77kDa-MAP."  GLEAN3_06911 overlaps with GLEAN3_05744 by 18 codons, followed by the remainder of the carboxyl-terminus of the protein (a total of 686 codons plus the stop codon) contained in GLEAN3_05744.  The annotation for the full-length gene product is associated with GLEAN3_06911.\n
GLEAN3_09444	SPU_009444		This GLEAN represents a homolog of the mammalian  Microtubule-associated proteins 1A/1B light chain 3B precursor (gi|72168709|ref|XP_783653.1| PREDICTED: similar to Microtubule-associated proteins 1A/1B light chain 3B precursor (MAP1A/MAP1B LC3) (MAP1A/1B light chain 3) [Strongylocentrotus purpuratus]).  There seem to be two MAP1A/1B_LC3-like proteins encoded in the S. purpuratus genome: GLEAN3_09444 (~72% sequence identity) and GLEAN3_08954 (~60% sequence identity).\n
GLEAN3_22897	SPU_022897		This GLEAN is a relative of TPX2, microtubule-associated protein, RefSeq: "gi|72167299|ref|XP_796944.1| PREDICTED: similar to TPX2, microtubule-associated protein homolog [Strongylocentrotus purpuratus]."\n
GLEAN3_08221	SPU_008221		The last 238 amino acid residues of this GLEAN are related to TPX2, microtubule-associated protein, RefSeq: "gi|72167299|ref|XP_796944.1| PREDICTED: similar to TPX2, microtubule-associated protein homolog [Strongylocentrotus purpuratus]."\n
GLEAN3_28717	SPU_028717		Glean3_28716 is the front part of GLEAN3_28717.  Reported as separate genes but should be the same gene.  I've attached the GLEAN3_28716 sequences in front of the GLEAN3_28717 sequences.  \n
GLEAN3_21172	SPU_021172		The two first predicted exons likely belong to a different gene called calumenin. \n
GLEAN3_10527	SPU_010527		sequence identical to GLEAN3_12840\n
GLEAN3_04366	SPU_004366		Glean has falsely predicted the following exon: 3\n
GLEAN3_07226	SPU_007226		eIF3e spans 2 glean prediction 07726 (N-ter) and 09248 (Cter), probably missing exons in between.\n
GLEAN3_09248	SPU_009248		eIF3e spans 2 glean prediction 07726 (N-ter) and 09248 (C-ter), probably missing exons in between.\n
GLEAN3_25856	SPU_025856		The prediction is likely incorrect regarding the first 160 aminoacids. \n
GLEAN3_00502	SPU_000502		homology with UPF3 from aa 1200 to 1588 AAG48511.1,  \naa 1-1200 similar to Retrovirus-related POL polyprotein (Endonuclease) AAH66867.1\n
GLEAN3_05736	SPU_005736		This gene was annotated with A. pectinifera mRNA and peptide.  The complete gene is annotated in Glean3_12078. \nAligns with ApIP3R AA sequence from 1670-1769.\n
GLEAN3_27674	SPU_027674		Annotation of this gene was done with alignments using A. pectinifera mRNA and pepetide.  The full annotation of this gene is on Glean3_12078.  Of note, there were two Glean3 predicted exons in sequential order which were exactly the same.  One was erased.  This glean aligns with ApIP3R AAs 2058-2673.\n
GLEAN3_21143	SPU_021143		NCBI is calling this a DSP4, but the best non-predicted BLAST hit is a DSP1\n
GLEAN3_02908	SPU_002908		59% identity with the corresponding region of human VARS2 (P26640) whereas it has only 46% identity with the human VARS2-like (NP_065175.3) \n \nSp-VARS isoformB has 47% identity with the Sp-VARSisoformA (glean 3_08058) \n \nThe construction was made from \n7 exons from glean3_28547 plus 1 exon from glean3_03860 to build the N-ter region \nand 15 exons from glean3_02908 to build the C-ter region\n
GLEAN3_28547	SPU_028547		fragments of Sp-VARS isoformB \ncomplete gene annotated in GLEAN3_02908 \n \n55% identity with corresponding region in human VARS2 (P26640)\n
GLEAN3_14014	SPU_014014		close similarity in the overlapping region with GLEAN3_01009\n
GLEAN3_01009	SPU_001009		close similarity in the overlapping region with GLEAN3_14014\n
GLEAN3_25860	SPU_025860		Overlaps with GLEAN_17572\n
GLEAN3_17572	SPU_017572		Overlaps with GLEAN3_25860.\n
GLEAN3_22195	SPU_022195		a very simila protein is predicted in Glean3_03052. \n
GLEAN3_03052	SPU_003052		 A protein very similar is predicted in GLEAN3_22195\n
GLEAN3_25517	SPU_025517		Cyclic ADP-ribose is an important calcium mobilizing metabolite produced by the ADP-ribosyl cyclase (cyclases) family of enzymes. Three evolutionarily conserved ADP-ribosyl cyclase superfamily members have been identified, one from the invertebrate Aplysia californica and two from mammalian tissues, CD38 and CD157. \n \nThis annotation most bears most homology to CD157.\n
GLEAN3_18717	SPU_018717		matches human amino acid sequence from position 52-396 with 82% identity, BLAST score 1e-167\n
GLEAN3_03985	SPU_003985		best vertebrate hit NM_204934\n
GLEAN3_02461	SPU_002461		Cyclic ADP-ribose is an important calcium mobilizing metabolite produced by the ADP-ribosyl cyclase (cyclases) family of enzymes. Three evolutionarily conserved ADP-ribosyl cyclase superfamily members have been identified, one from the invertebrate Aplysia californica and two from mammalian tissues, CD38 and CD157. \n \nincomplete sequence that has most homology to the ADP-ribosyl cyclase family member from Aplysia californica \n
GLEAN3_07538	SPU_007538		Three evolutionarily conserved ADP-ribosyl cyclase superfamily members have been identified, one from the invertebrate Aplysia californica and two from mammalian tissues, CD38 and CD157. \n \nThis annotation bears greatest homology to family member CD38\n
GLEAN3_10746	SPU_010746		This model was modified and annotated based on a manual inspection of multiple protein sequence alignments and its predicted domain architecture. \n \nThe 5' features of the corresponding Fgenesh model were chosen over those of the original glean model because they generate a model that better corresponds in domain structure with the genes to which this model Blasts back (B7-1/CD80). The 3' features of the glean/genscan and Fgenesh models are also different. Since they do not give rise to significant differences in domain structure or Blasting to genes from other phyla, we have decided to accept the former.\n
GLEAN3_19025	SPU_019025		Appears to be an additional exon of GLEAN3_19024\n
GLEAN3_19026	SPU_019026		Appears to be an additional exon of GLEAN3_19024.\n
GLEAN3_11076	SPU_011076		This model was annotated based on a manual inspection of multiple protein sequence alignments and its predicted domain architecture.\n
GLEAN3_20633	SPU_020633		vertebrate homolog DM. NM 057490\n
GLEAN3_20159	SPU_020159		GLEAN3_20159 appears to be C-terminal portion of Chlamydomonas IFT140: AAT95430. eval=1.00E-145. explains aas 183-775 of IFT140. \nGLEAN3_21918 appears to be N-terminal portion of IFT140. eval=2.00E-84. explains aas 677-1249 of IFT140. \nAnnotated by RL Morris. \n
GLEAN3_21918	SPU_021918		#\nGLEAN3_20159 appears to be C-terminal portion of Chlamydomonas IFT140: AAT95430. eval=1.00E-145. explains aas 183-775 of IFT140. \nGLEAN3_21918 appears to be N-terminal portion of IFT140. eval=2.00E-84. explains aas 677-1249 of IFT140. \nAnnotated by RL Morris.\n
GLEAN3_23605	SPU_023605		GLEAN3_23605 eval=0.0 against "FAP80, IFT122A, Intraflagellar Transport Protein 122A [Chlamydomonas reinhardtii]".  \nGLEAN3_23605 explains aas 83-795 of Chlamydomonas FAP80/IFT122A which is 1162 aas long. \nAnnotated by RL Morris.\n
GLEAN3_25443	SPU_025443		This is the 5- end of TIAM2.  The 3'end is GLEAN3_11527\n
GLEAN3_18291	SPU_018291	From the BLAST results and the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. Both scaffolds contain an orderly arrangement without any gaps or repeats present. There was Est support available from GBrowse assembly and the transcriptome intensity scores appeared to be weak with most of the values being less than 5. 	This glean contains a RhoGEF domain but also an F-Box domain????  Either a novel GEF or the F-box is not part of this gene.\n
GLEAN3_07086	SPU_007086		This is most of the gene.  However, the 5' end is GLEAN3_20692\n
GLEAN3_04041	SPU_004041		This is a duplication of 020022, which looks like a splice variant\n
GLEAN3_12009	SPU_012009		This is most of the gene for Dock180, the 5' end is GLEAN3_17939\n
GLEAN3_17939	SPU_017939		This is the 5' part of Sp-Dock180.  The rest of the gene is GLEAN3_12009\n
GLEAN3_27143	SPU_027143		This is the 5' end of SRGAP.  SRGAP is in three GLEANs, with the middle being 027715 and the bulk (3'end) being 022632\n
GLEAN3_27715	SPU_027715		This is the middle part of SRGAP.  This 5' end is 027143, and the 3' end is 022632\n
GLEAN3_26597	SPU_026597		This is the 5' end of FAM13A1.  The 3' end is GLEAN3_00258\n
GLEAN3_00258	SPU_000258		This is the 3' end of FAM13A1.  The 5' end is GLEAN3_26587.\n
GLEAN3_25352	SPU_025352		This is a duplication of most of GLEAN3_00258, which itself is the 3'end of FAM13A1.\n
GLEAN3_12388	SPU_012388		This gene was annotated based on a manual inspection of domain architecture and multiple protein sequence alignments. \n \nThis model blasts back and shows a similar domain structure (signal peptide/2xIG-v/TM) to that of diverse vertebrate IGSF genes, many of which have a well documented immune function.\n
GLEAN3_26664	SPU_026664		C-term part of the protein seems too long\n
GLEAN3_27781	SPU_027781		seems to be an artifactual duplication of GLEAN3_23813 \n... or it's the inverse :)\n
GLEAN3_13624	SPU_013624		This model was annotated based on a manual inspection of its predicted domain architecture and its Blasting to known genes. \n \nThis model codes for what seems a fairly novel protein, for it shows very weak blasting to known genes in genebank. Its sequence does not show any high confidence domain architecture (as predicted by SMART and Pfam); however, one particularly interesting predicted domain architecture for this model is sp+3xIG[low scores]+TM+ITAM[low score].\n
GLEAN3_28030	SPU_028030		N-Term of the protein, seems inaccurate. The good part (half of the peptidase S8 domain which is lacking)is most likely the GLEAN3_28031  \n \nthe most C-term part seems as well inaccurate\n
GLEAN3_28031	SPU_028031		Seems to be the N'term part (Seems to contain the lacking peptitase S8 domain half) of GLEAN_28030 wich is just near\n
scaffoldi3224	SPU_030153		sequence here is only the conserved homeobox domain.\n
GLEAN3_17228	SPU_017228		This model was annotated based on a manual inspection of its predicted domain architecture and its similarity to known genes. \n \nIts sequence and domain structure are similar to those of various vertebrate immune IGSF genes (e.g. TCR, CD276, CD4). Furthermore, if the sequence of the glean model is fused to exons called by other predictions, there is an even better Blasting to these genes. Note neither this or any of the other corresponding predictions include a transmembrane domain, indicating that this might be a partial model.  \n
GLEAN3_20457	SPU_020457		This model was annotated based on a manual inspection of its domain architecture and alignment to known sequences. \n \nThis model Blasts back to human CD276/B7-H3, and it has a partially similar predicted domain structure (sp+V-set+C2-set+TM). The alternative predictions are very similar to the glean model and they do not provide any additional information on this gene.\n
GLEAN3_24439	SPU_024439		This model was annotated based on a manual inspection of sequence alignments and predicted domain architectures. \n \nThis glean model codes for a protein that Blasts back and has a similar domain structure to that of vertebrate SIRPB2 and SIRPG, although it has a slightly longer cytoplasmic portion. This model is represented in two separate Fgenesh predictions, the first of which has a domain structure more similar to that of SIRPs. However, there is equivalent signal from the tiling array data for all the exons, which would suggest that they all correspond to the same gene; we have therefore accepted the glean model in its original form.\n
GLEAN3_24787	SPU_024787		This model codes for what seems a novel domain architecture: 2xIGv+2xCCP/Sushi. Eventhough there are gaps in the sequence between the N-ter IG domains (thus making it possible that this is a "forced" [artifactual] model), the 2nd IG domain and both Sushi domains are encoded in one uninterrupted contig. \n \nThe IG and Sushi domains Blast separately to sequences in Genbank, which supports further the idea that, if real, this gene would have a novel domain architecture. Of note, the IG domains blast to various vertebrate IGSF genes relevant for immunity. \n \n
GLEAN3_28300	SPU_028300		This model was modified to incorporate an extra 5' exon based on an otherwise identical Fgenesh prediction. The modified model incorporates a signal peptide into the predicted sequence, which better resembles the structure of the vertebrate B7 family genes to which this gene Blasts back. It codes for sp+V-set+C2-set[low]. The sequence is most likely incomplete (transmembrane domain missing), which is expected since this model locate to a scaffold end.\n
GLEAN3_21592	SPU_021592		GLEAN3_21592 eval=1E-30 against "CPC1, Central Pair Complex 1 [Chlaydomonas reinhardtii]" \nAnnotated by RL Morris, B Rossetti, and A Shorette.\n
GLEAN3_10897	SPU_010897		This is a duplicaiton of the SH3 domain of GLEAN 19856\n
GLEAN3_25252	SPU_025252		This is a duplicaiton of part of GLEAN3_00206, which itself appears to be a partial PIK3R1.\n
GLEAN3_25017	SPU_025017		has a C-terminal C1q domain and ~400 N-terminal extension - but without collagen repeats\n
GLEAN3_24653	SPU_024653		C-terminal C1q domain and Nterminal extension of about 200 amino acids but without collagen repeats.\n
GLEAN3_28732	SPU_028732		C-terminal C1q domain plus N-terminal extension of about 200 amino acids but without collagen repeats.\n
GLEAN3_24033	SPU_024033		C-terminal C1q domain plus N-terminal extension of about 200 amino acids but without collagen repeats.\n
GLEAN3_28510	SPU_028510		This model was modified based on a corresponding Genscan model whose domain structure better resembles the vertebrate B7 family of genes (to which this model blasts back). The added exons were indeed predicted by Glean3, but as part of the adjacent glean model (GLEAN3_28511).\n
GLEAN3_02608	SPU_002608		This model is located in a small scaffold, and is most likely incomplete (missing a transmembrane domain). \n \n
GLEAN3_16836	SPU_016836		#\nN-terminal EMI domain followed by large number of EGF and EGFLam repeats.  There are homologs in vertebrates but they are a bit shorter and have TM domain.\n
GLEAN3_12032	SPU_012032		LRRNT/EGF/FN3/TM - looks as if its missing N-terminus with rest of an LRR unit - such structures exist in vertebrates\n
GLEAN3_10102	SPU_010102		SERIES OF LRRtyp REPEATS FOLLOWED BY KR AND 4 FA58C \nA novel domain architecture.\n
GLEAN3_19694	SPU_019694		GPS/7TM2 - EXTRACELLULAR DOMAIN HAS NO DOMAINS PREDICTED \nINTRACELLULAR DOMAIN HAS SEVERAL DOMAINS DOMAINS (SR/WSC/CCP) THAT LOOK SUSPICIOUS\n
GLEAN3_28171	SPU_028171		previously cloned gene\n
GLEAN3_16565	SPU_016565		previously cloned genomic DNA\n
GLEAN3_17445	SPU_017445		This gene model has a CARD domain at the N-terminus and Blasts back to the human VISA/CARDIF/Ips-1/MAVS in the top 10 hips. It however lacks the TM at the C-terminus.\n
GLEAN3_13091	SPU_013091		centromere specific histone H3\n
GLEAN3_10425	SPU_010425		This GLEAN appears to be an ortholog of the Chlamydoimonas reinhardtii ODA-DC3 gene (C_240117|160952 ODA-DC3, Outer Dynein Arm Docking Complex 3, Mr 25,000\n
GLEAN3_14310	SPU_014310		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model has a DEATH domain instead of a CARD domain, located at the C-terminus instead of the N-termimus.\n
GLEAN3_25885	SPU_025885		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model has an N-terminal DEATH domain instead of a CARD domain.\n
GLEAN3_11866	SPU_011866		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene's CARD domain is located at the C-terminus instead of the N-terminus.\n
GLEAN3_14311	SPU_014311		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model's CARD domain overlaps the DEXDc domain by ~25 amino acids.\n
GLEAN3_07126	SPU_007126		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene models codes for a N-terminal DEATH domain instead of a CARD domain.\n
GLEAN3_14119	SPU_014119		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene combines with part of the sequence from SPU_014118 to make a complete gene model (Helicase_C domain is missing from SPU_014119). \nThis gene model codes for a N-terminal DEATH domain instead of a CARD domain.\n
GLEAN3_10536	SPU_010536		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \n
GLEAN3_19134	SPU_019134	From the BLAST results as well as the excel data, it appears that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data it is evident that both scaffolds do not have any internal repeats or gaps present and if both scaffolds were to be combined, the entire sequence would have an orderly and continuous arrangement. There was no Est information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	 extra 400 amino acids on N-terminus\n
GLEAN3_15823	SPU_015823		 partial, tiny fragment\n
GLEAN3_03850	SPU_003850		 partial, missing N-terminus\n
GLEAN3_04180	SPU_004180		 partial, missing N-terminus, may join with Glean3_11497\n
GLEAN3_11497	SPU_011497		 partial, missing C-terminus, may join with Glean3_04180\n
GLEAN3_05315	SPU_005315		 partial, missing some N-terminus residues\n
GLEAN3_13104	SPU_013104		 fragment\n
GLEAN3_16142	SPU_016142		 partial, missing N- and C-terminus\n
GLEAN3_16259	SPU_016259		 partial, fragment\n
GLEAN3_10535	SPU_010535		 partial\n
GLEAN3_13319	SPU_013319		 partial\n
GLEAN3_26353	SPU_026353		 partial\n
GLEAN3_05476	SPU_005476		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model is at the end of a scaffold and is likely incomplete, it is missing part of the DEXDc domains and the effector domain (DEATH or CARD) at the N-terminus. \n
GLEAN3_19617	SPU_019617		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model is a the end of a scaffold and is likely incomplete, it is missing the Helicase_c domain.\n
GLEAN3_16718	SPU_016718		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene is at the end of a scaffold and is likely incomplete, it is missing the Helicase_c domain.\n
GLEAN3_20020	SPU_020020		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nHowever the domain structure is unusual for this type of protein: It has an ERCC4 domain at the C-terminus (after a long stretch of low complexity sequence, which contains a poor hit to a DEATH domain). Both the Genscan and Fgenesh models are identical to the Glean3 prediction.\n
GLEAN3_00006	SPU_000006		This gene model clusters with the RIG-I family of helicases in phylogenetic tree based on a multiple sequence alignment of the DEXDc domains.  \nThis gene model does not appear to encode an "effector" domain (N-terminal CARD domain in vertebrates) It is possible that it is incomplete, although the Genscan and Fgenesh predictions are identical to the Glean3 model and there is another ORF in opposite orientation just upstream.\n
GLEAN3_12192	SPU_012192		potential novel kinase family member. Blasts reasonably well to Fumerate Dehydratase\n
GLEAN3_26236	SPU_026236		This prediction does not include the N-terminal sequences of the protein which are present in Glean3_26235.  26235 and 26236 should be combined.\n
GLEAN3_07435	SPU_007435		The automated GLEAN prediction contained an exon near the 5'-end of this gene that threw the alignment of the sequence out of line with all other alpha tubulins, so it was deleted.\n
GLEAN3_21670	SPU_021670		This GLEAN has disappeared from the curated GLEAN set in GenBoree (043006).\n
GLEAN3_20749	SPU_020749		Annotated by BLAST only. \nBest human hit is NP_699160\n
GLEAN3_28158	SPU_028158		Annotation by BLAST only \npartial gene missing N terminus and exon in the middle\n
GLEAN3_13551	SPU_013551		Glean model had extra C terminal exon \nMay be missing small real C terminal\n
GLEAN3_12976	SPU_012976		possibly missing 1-2 exons in middle and 1 exon at N terminus \ntrees with Ciona intestinalis predicted NATs\n
GLEAN3_26720	SPU_026720		This model is likely incomplete (sits on a small scaffold) and is almost identical to GLEAN3_03486 annotated by Charlie Whittaker. For this reason we have followed their Whittaker's annotation to annotate this model.\n
GLEAN3_14392	SPU_014392		Partial sequence duplication or homolog to GLEAN3_01360.\n
GLEAN3_01360	SPU_001360	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto 2 different scaffolds. When reviewing the excel data, it is apparent that there are several internal repeats present specifically in scaffold v2.1_scaffold80884. Besides the internal repeats, both scaffolds contain an orderly arrangement without any gaps present. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10.	May represent an duplication or paralog since GLEAN3_14392 also matches a significant part of the published katanin B sequence.\n
GLEAN3_27562	SPU_027562		MORC protein sequence revealed putative nuclear localization signals, two predicted coiled-coil structural motifs and limited homology to GHL (GyraseB, Hsp90, MutL) ATPase. Epitope-tagged MORC protein expressed in COS7 cells localized to the nucleus\n
GLEAN3_23500	SPU_023500		May be a partial gene.  The best CDS match to meiosis defective 1 only spans the first 450 amino acid residue, and matches with the middle of the mei-1 protein.\n
GLEAN3_28506	SPU_028506		Only one predicted exon is conserved: \n>GLEAN3_28506|Scaffold82783|17550|17749| DNA_SRC: Scaffold82783 START: 17550 STOP: 17749 STRAND: +  \nAAACGAGCAACGCATCCATCGATTACAGCAAAGAGAATGAAACCGCCACTCTTACATTCCCCTCACCCCT \nTGCTGTCGGCAGCGGTGATCTGGCCCTGGAGTTCACAGGAGAGCTCAATGATAAGATGAAAGGGTTCTAT \nCGAAGCAAGTACACCACACCAGCTGGTGAAGAAAGATACTGTGCTGTTACTCAGTTTGAG \n
GLEAN3_21117	SPU_021117		#\nOnly two predicted exons are conserved: \n>GLEAN3_21117|Scaffold22466|7279|7378| DNA_SRC: Scaffold22466 START: 7279 STOP: 7378 STRAND: +  \nCTCAGGCAGAGGTAGACATGAAGGCCTGGTTTTGCCGTAGAGGGCGAAGAAGAAATCCACCACATCACAA \nCGTAACCTGGCATCGTGAGATGTACCTGAA \n>GLEAN3_21117|Scaffold22466|7944|8065| DNA_SRC: Scaffold22466 START: 7944 STOP: 8065 STRAND: +  \nTTCATGTGGGCCCAGAGCCTCTCTACTAGTTCCTCTGTGCTAAGCTTCGACTGTTCCTTGTACTTAAACG \nGTGGGTTGGCTATCAGCAGCCGTAGTAGCTTGTGCCTGGGTTGGAGATACAT \n
GLEAN3_04070	SPU_004070		fragment\n
GLEAN3_26235	SPU_026235		fragment\n
GLEAN3_12022	SPU_012022		The following exons are the only ones conserved with other CPA proteins. \n>GLEAN3_12020|Scaffold48388|53670|53789| DNA_SRC: Scaffold48388 START: 53670 STOP: 53789 STRAND: +  \nCTTGATTTCTGGAAGCGCCCGTCGAAGGTTGGACGGCCCGTCGACGTGATGGTCTCCCCCGCCCAGCAGT \nTGAGCTTCGTTAGCTCCGCGAGCCGCCCTGGACTCTTTATCGAGACTTGG \n>GLEAN3_12020|Scaffold48388|55561|55662| DNA_SRC: Scaffold48388 START: 55561 STOP: 55662 STRAND: +  \nATCGCGAAGTCACCATCTGCAACGAACGTGGCTTACATCCAGGGAGGCATCCACGCCCGCGAATGGGTCA \nGCCCAGCTACAGTCATCAACCTCATCAAAAAT \n>GLEAN3_12020|Scaffold48388|56376|56483| DNA_SRC: Scaffold48388 START: 56376 STOP: 56483 STRAND: +  \nTACATAGATAACTACGGCAGTGATGATACGGTGACGAGCATGTTGGATAACTTTGTGTGGATCATTGTAC \nCCGTCTACAACATCGATGGATACAAGTTCAGCCACACC \n>GLEAN3_12020|Scaffold48388|58257|58347| DNA_SRC: Scaffold48388 START: 58257 STOP: 58347 STRAND: +  \nGACGATCGTATGTGGCGCAAGAATCGCAACCCCAACGTAGGAGGCTGCGCTGGAGTCGATCTGAACCGCA \nACTATGACTTCGAGTGGGGAG \n>GLEAN3_12020|Scaffold48388|61004|61206| DNA_SRC: Scaffold48388 START: 61004 STOP: 61206 STRAND: +  \nGTGCCAGCAAACAGAGGTGTACCCAGGATTATCAGGGCACAGAGCCGCTGAGTGAACCCGAGAACAGCGG \nCTCCAAGGCTTTCCTGCAAGGCTTTGGTTCAAACCTCAAACTCTTCATTGATTTCCACGCCTATGGCCAG \nTACTGGCTCTACCCATGGGGTTACACCAGGAGAACCCTTGCACAACCAGATAGAGACGATCAG \n>GLEAN3_12020|Scaffold48388|62567|62757| DNA_SRC: Scaffold48388 START: 62567 STOP: 62757 STRAND: +  \nACCCCGCAACAGGTGCAAGCGAAGACTTTGGATACGGCTCCCTGGGTGTGAAGTACACCTACGTGGTGGA \nGCTGAGGGATGAGGGCACTTTCGGGTTCTCGCTCCCCGCCTACCAGATCCAGCCCACCGGTGAGGAGATC \nTTCGCCGGTATGAAGACACTCGGCAAGCAGCTCGTTGCCGAGTATGCTTAG \n\t\t\n
GLEAN3_27565	SPU_027565		coomparison to best blast hit suggests that the prediction may be missing N-terminal sequence.\n
GLEAN3_06011	SPU_006011		potential novel kinase family member\n
GLEAN3_10455	SPU_010455		potential novel kinase family member\n
GLEAN3_11808	SPU_011808		potential novel kinase family member\n
GLEAN3_12859	SPU_012859		potential novel kinase family member\n
GLEAN3_13806	SPU_013806		potential novel kinase family member\n
GLEAN3_16864	SPU_016864		potential novel kinase family member\n
GLEAN3_17078	SPU_017078		potential novel kinase family member\n
GLEAN3_18352	SPU_018352		potential novel kinase family member\n
GLEAN3_20735	SPU_020735		potential novel kinase family member\n
GLEAN3_22611	SPU_022611		potential novel kinase family member\n
GLEAN3_28859	SPU_028859		potential novel kinase family member\n
GLEAN3_28625	SPU_028625		probably TK family member \n
GLEAN3_02076	SPU_002076		#\nprobable TKL family member\n
GLEAN3_06622	SPU_006622		probable TKL family member \n
GLEAN3_08015	SPU_008015		probable TKL family member \n
GLEAN3_02493	SPU_002493		#\npotential novel kinase family member\n
GLEAN3_05457	SPU_005457		potential novel kinase family member\n
GLEAN3_22612	SPU_022612		potential novel kinase family member\n
GLEAN3_12527	SPU_012527		Best alignment in the central region, where coiled coil domain is likely encoded.\n
GLEAN3_21921	SPU_021921		partial sequence; based on NCBI predicted data set\n
GLEAN3_09093	SPU_009093	For this particular GLEAN model, there was no Cds information available from either Baylor annotations or SpBase. There was however mRNA information available from SpBase. When reviewing the excel data, it is apparent that there are several repeats within the scaffolds as well as gaps. There was no Est support from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with all of the values bring greater than 10. \nAdditional gene information from Baylor annotations (comments): very similar to 0014185.	very similar to 0014185\n
GLEAN3_08923	SPU_008923		distinct from 07733, 25829\n
GLEAN3_25421	SPU_025421		Eval 8e-12 against cd00159, RhoGAP, GTPase-activator protein for Rho-like GTPases...\t \n181 aas are 100% identical to NCBI prediction XP_798334 "PREDICTED: similar to Kinesin-like motor protein C6orf102, partial [Strongylocentrotus purpuratus]" \nAnnotated by RL Morris \n
GLEAN3_15483	SPU_015483		polymeric globin gene similar to shrimp polymeric globin\n
GLEAN3_08766	SPU_008766		GLEAN3_08766 is exactly identical to the C-terminal region of XP_789874.  Evalue =1e-88  for XP_789874 against against "NP_055690.1|  kinesin family member 14 [Homo sapiens]".   \nAnnotated by RL Morris\n
GLEAN3_01284	SPU_001284		Eval=2e-113 using GLEAN3_01284 against "NP_055889.2|  kinesin family member 1B isoform b [Homo sapiens]" \nAnnotated by RL Morris\n
GLEAN3_09400	SPU_009400		AAF04841 = Sp-kinesin-C cloned sequence.   \ne val against AAF04841 = 0. \nAnnotated by RL Morris.\n
GLEAN3_15484	SPU_015484		Homologous to DNA-PKcs, 5'end of gene is most likely missing. This model shows similarity to the N-ter sequence of DNA-PKcs. Glean3_15529 contains the 3'end / carboxy-terminus of the gene.\n
GLEAN3_11294	SPU_011294		Middle third of this gene is SpILKAP b.\n
GLEAN3_04669	SPU_004669		Exon 1 not confirmed by cDNA; \n5' end at edge of 16 kb contig;\t \nonly gene on contig \n
GLEAN3_20182	SPU_020182		#\nAllele of GLEAN3_0466, 3' end; \none exon in 13 kb contig\n
GLEAN3_13544	SPU_013544		Gene model may be missing N-terminal 70-80 aa based on alignment to mammalian homologs.\n
GLEAN3_27014	SPU_027014		Blasts to PTPRT, but forms part of a novel clade in phylogenetic analysis with PTPRFn1 and PTPRLec genes.  Partial sequence. See also GLEAN3_08466, GLEAN3_23162, and GLEAN3_24820.\n
GLEAN3_20604	SPU_020604		Blasts to PTPRA, but in phylogenetic analysis it forms part of a novel clade with PTPRLec1, PTPRLec2, PTPRLec4, PTPRLec5, PTPRLec6, PTPRFn1, and PTPRFn2.\n
GLEAN3_23115	SPU_023115		This protein has structure typical of a PTPR, but does not clade with known PTPRs.  It forms a unique clade and has been renamed PTPRiz.\n
GLEAN3_28575	SPU_028575		Blasts to Receptor-type tyrosine-protein phosphatase mu precursor, but does not clade with the PTPR K/M/T/U group in phylogenetic analysis.  Renamed PTPRY3.  Clades with GLEAN3_15923 and GLEAN3_20542 (PTPRY2).\n
GLEAN3_19920	SPU_019920		Partial sequence.\n
GLEAN3_00652	SPU_000652		See putative conserved domains at: \nhttp://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi?RID=1137535304-18825-160717183845.BLASTQ4\n
GLEAN3_22669	SPU_022669		Blasts to MTMR6, homologous to myotubularin related proteins 6, 7, and 8 in phylogenetic analysis.\n
GLEAN3_11860	SPU_011860		This blasts to PPEF1, but phylogenetic analysis showed that it was more similar to PPEF2.  Glean3_08844 is likely the identical protein.\n
GLEAN3_24691	SPU_024691		GLEAN3_05570 is a partial duplicate prediction.\n
GLEAN3_05570	SPU_005570		Partial duplcate prediction for GLEAN3_24691.\n
GLEAN3_14812	SPU_014812		Gene sequence modified as per HMM model\n
GLEAN3_25188	SPU_025188		sequence modified as per HMM model\n
HIPKa	SPU_030163		HMM prediction \n
GLEAN3_04962	SPU_004962	For this particular GLEAN model there was no Cds information available from SpBase or Baylor annotations. There was however mRNA information available from SpBase. When examining the excel data, it appears that the sequence may be distributed onto three different scaffolds. The first part of the sequence appears to be on subject gb|DS001520|, the second part on AAGJ02167917, and the remaining part on gb|DS012062|. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong (greater than 5) and distributed into two distinct clusters of values. 	protein sequence modified as per HMM model; this is consistent with tiling data\n
GLEAN3_15281	SPU_015281		protein sequence modified as per HMM model\n
GLEAN3_26828	SPU_026828		Protein sequence modifidied as per HMM prediction\n
GLEAN3_07333	SPU_007333		Protein sequence modified as per HMM model\n
Nek1c	SPU_030165		Prediction as per HMM model\n
GLEAN3_14818	SPU_014818		Protein sequence modified as per HMM prediction\n
GLEAN3_16754	SPU_016754		prediction is probably wrong- way too long. Predicted HMM model sequence is: \n \nLTQAFGRSTAVLSMIGYIIGLGDRHLDNVLVNFVTGEVVHIDYNVCFEKG \nKNLRVPERVPFRMTQNVQAALGITGVE \n
SFK1b	SPU_030166		#\nprotein sequence (incomplete) as per HMM prediction. \n
GLEAN3_24930	SPU_024930		see comments for glean 24721. This is 3' part of gene. Predicted protein sequence is \nMAQAYVEDTLMVRHELKHAYGKVFLVRKVGNHNQGKLYAMKVLKKATIVQ \nKAKTAEHTMTERQVLEAVRSCPFLVTLHYAFQTDSKLNLILDYVNGGELF \nTHLYQREHFRESEVRIYIAEIIIALDCLHRILTSHPPMPNTFSKEVKDFI \nNKLLVKDPTKRLGCNGVKDIKSHSFFKGLNWDDVAAKRVSPPFRPHINGE \nLDTSNFAEEFTSLVPADSPADIPKTADARVFRVGYSFIAPSILYSDNAIT \nQDMLTQPSEHNRPSLASILSIHELKDSPFNKYYELDMKSAPIGDGSFSIC \nRRCTHRKTEKEYAVKIVSRRVACTQEITTLQLCQKHPNIVHLKEEFKDKL \nHTYIIMELCKGGELLGRIRKKKHFDELEASMIMRKLVSAVDYMHSRGIVH \nRDLKPENILFTDDSDDAELKIIDFGFARITNSNQPLKTPCFSLHFAAPEV \nLKRAYEQDGEYDASCDVWSLGVILYTMLSGRVPFQDPSISKSNSASDIMK \nRIKHGNFSFDGEEWNSVSTPAKDLIKGLLTVDPSRRLTTDDLLQNEWIQG \nQQLSTSTPLMTPDILNSCASIQKRVKATMRAFHTAQREGFLLTDVSNAPL \nAKRRKKKKDSSTETRSSSSESTHSQSSSSQESTTPTPTANPVLTIPVTTV \nSCAPRTTTATGAPSIPSVQPLPSLSKQTGARLDQYESLESLGFSPILPFS \nAGGSQELPPLLARQDSGYVGQMPSYAQVTPVPRTNVGSHGVTYAPILDPS \nMYPCGLQQPILDFSSSIPEYLSVQYASTEQPSIPMTVPRTLHQPHPHPLP \nLPHQHLSHLPTISEDPSTT \n
GLEAN3_25882	SPU_025882		This is likely to be the N terminus of 14404 (DAPK). The fused protein sequence is pasted below, but is probably still missing a few exons based on alignment to human DAPK1 (NP_004929). \n \nMAMFRTESVEEFYQIGEDIGSGQFSEVKKVTEKSTGKDYAGKFIRKRRST \nASRRGVKREDIVREVSILEELSHDNIISLHDAFELQKEVVLILELVTGGE \nLFHYLAEEDHVNEEVAAQFVKKILEALKHMHDRNICHLDLKPENIMLLNR \nNTQNIMLIDFGLSRRIKPGEDIRDIMGTAEFVAPEIINFEPLSLNTDMWP \nVFISSRLPNIQLIQLSRSQVIQLVCVFISPKMGHRREQLALRKMSKALRS \nDWEHGETALHLAAGYGHVDILEYLQAKGASIDVADKTGETPLHVAGRYGQ \nVEAVQYLCDQAVNSNLADEDGETPLHIAAWHGYTSIVQTLCKAGATLDLK \nNKDGETTLLCAAARGHLDIVKILVEAGALLNTIDKHGITPLHHAVRRQHY \nDIVKYLVDSNCDVNLQDKLGDTPLNVACKEGALDLVEMLHAVGAKRDILN \nRHKNSALHMAARGGHIEVVRYLCLAGALIHQRNQDGLTASQLASLEGHED \nVADVLTQVEGDKSKDLFINQLNSTSGPLHRIRIKVLGQSGVGKTALIDSL \nKCGYFRGLFRRSRSNISLIGSSSNGRSSPRSPRSPRSPLTPMFGNGKKMD \nGGRFFMESLKRKQLSSTSSSFDVDSEVTRGIEFTHGTIPGAGDFTFLEFS \nGEDTYHTAYPHFLSDEGAIHLVVFSLDDMFEEQLAQVTYWMNFLRSQLPA \nTEPVGYCGKYRQQPKIALVATHADHTQCPKQPTGELISGEGNIVLYQTKR \nLFGRLFDICDVLFVMDANSAQSKDVKMLRTHISSLRNSILKDKSKDLFIN \nQLNSTSGPLHRIRIKVLGQSGVGKTALIDSLKCGYFRGLFRRSRSNISLI \nGSSSNGRSSPRSPRSPRSPLTPMFGNGKKMDGGRFFMESLKRKQLSSTSS \nSFDADSEVTRGIEFTHGTIPGAGDFTFLEFSGEDTYHTAYPHFLSDEGAI \nHLVVFSLDDMFEEQLAQVTYWMNFLRSQLPATEPVGYCGKYRQQPKIALV \nATHADHTQCPKQPTGEMISGEGNIVLYQTKRLFGRLFDICDVLFVMDANS \nAQSKDVKMLRTHISSLRNSILKVEAPVSVLCEAVASALPAWRRTFVNFPV \nLTWQQFSEGIHASINPLAGQAHLREVGRQLHLMGEVQCFGSELLQEVIVI \nEPTWLCSGIIGRLLSHDATEQPEGQYSIHYIQSLFPDTDAMDISQLMEAM \nDICVHGTVCEIPAVMRCPAPEGIWEKEDENGNFRVYGGVRMQLSDCGSTL \nPSGLFSRIQMSLRRNFQQDMEDTTDNELVMWRNGAKCSSGSIEGLISMTN \nDECAIEIKVRGYNDTRQGCFIFLEDLVHLVKHVLVDSYPGLPLNMEVLSP \nIQLSSHEKTIMVYNACSLLRLQLRTERTVENPISNQEEDFVDIFCFGSES \nVESNLIAGVDLHLSEIPSLTRRQISMLLDPPDPMGKDWCLLAVGLGLTEK \nIPMLDTLNRRCGPDESDSPTERLLQEWGKEETNSVGVLLNKVKDLGREDV \nLRVLMQGSPLYKFVPDPRALEEGRQSGSGSNHSSGTVASR \n
GLEAN3_14118	SPU_014118		#\nThis gene combines with part of the sequence from SPU_014119 to make a complete gene model (Helicase_C domain is missing from SPU_014119). \nSee other gene model for further details.\n
GLEAN3_11306	SPU_011306		in progress\n
GLEAN3_26949	SPU_026949		more similar to vertebrate transferrins than invertebrate \nappears to be novel form\n
Sp-Mafs	SPU_030167		Likely missing first exon (non-coding) and second exon encoding N-terminal sequence. \nFirst reported by Coolen, et al. (2005). Phylogenomic analysis and expression patterns of large Maf genes in Xenopus tropicalis provide new insights into the functional evolution of the gene family in osteichthyans. Dev Genes Evol 215, 327-39.\n
GLEAN3_09476	SPU_009476		missing exon in middle\n
GLEAN3_08954	SPU_008954		There seem to be two MAP1A/1B_LC3-like proteins encoded in the S. purpuratus genome: GLEAN3_09444 (~72% sequence identity) and GLEAN3_08954 (~60% sequence identity).\n
GLEAN3_04008	SPU_004008		Contains FHA domain and Reverse transcriptase domain \nForkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).\n
GLEAN3_01270	SPU_001270		 only N-terminus fragment\n
GLEAN3_01271	SPU_001271		 only N-terminus fragment\n
GLEAN3_03343	SPU_003343		 fragment\n
GLEAN3_10265	SPU_010265		 partial, missing C-terminus half, should join with Glean3_10266\n
GLEAN3_10266	SPU_010266		 fragment, should join with Glean3_10265\n
GLEAN3_10602	SPU_010602		 fragment\n
GLEAN3_10738	SPU_010738		 fragment\n
GLEAN3_13126	SPU_013126		 fragment\n
GLEAN3_13531	SPU_013531		 partial, missing N-terminus\n
GLEAN3_14927	SPU_014927		 fragment\n
GLEAN3_15609	SPU_015609		 fragment\n
GLEAN3_16267	SPU_016267		 fragment\n
GLEAN3_17253	SPU_017253		 fragment\n
GLEAN3_18825	SPU_018825		 partial\n
GLEAN3_19381	SPU_019381		 fragment\n
GLEAN3_19849	SPU_019849		 partial, missing N- and C-terminus\n
GLEAN3_20358	SPU_020358		 partial, missing N-terminus\n
GLEAN3_22124	SPU_022124		 fragment, missing N-terminus region\n
GLEAN3_01506	SPU_001506		 fragment\n
GLEAN3_02449	SPU_002449		 fragment\n
GLEAN3_03043	SPU_003043		 fragment\n
GLEAN3_10028	SPU_010028		 fragment\n
GLEAN3_10065	SPU_010065		 fragment\n
GLEAN3_10066	SPU_010066		 fragment\n
GLEAN3_01514	SPU_001514		 fragment, extra mismatched stretch on N-terminus\n
GLEAN3_03613	SPU_003613		 partial\n
GLEAN3_06198	SPU_006198		 extra mismatched long stretch on C-terminus\n
GLEAN3_06971	SPU_006971		 fragment\n
GLEAN3_08301	SPU_008301		 fragment\n
GLEAN3_08303	SPU_008303		 partial\n
GLEAN3_08612	SPU_008612		 fragment\n
GLEAN3_09147	SPU_009147		 fragment, unmatched stretch on N-terminus\n
GLEAN3_06296	SPU_006296		Hypothetical protein \nsimilarities with solute carrier \nATP binding cassette \nlactotrasferrin\n
GLEAN3_01715	SPU_001715		Hypothetical protein with no homologs in other species\n
GLEAN3_05307	SPU_005307		probable ortholog of human Zinc-finger 318 \nthe naming is different form the Stefan Materna naming  \nbecause of some uncertainties in the homology \n
GLEAN3_01945	SPU_001945		Hypothetical protein with BLAST hits with various prots with similiraties with Kazrin (among others) \nNo clear orthology \n  \nContains SMC and 3 SAM domains  \nSMC=nucleotide binding cassette \nSAM=Sterile alpha motif.; Widespread domain in signalling and nuclear proteins. In EPH-related tyrosine kinases, appears to mediate cell-cell initiated signal transduction via the binding of SH2-containing proteins to a conserved tyrosine that is phosphorylated. In many cases mediates homodimerization. \n
GLEAN3_21165	SPU_021165		#\nPREDICTED: similar to Laminin-like protein K08C7.3 precursor  \n
GLEAN3_06349	SPU_006349		transposon\n
GLEAN3_22598	SPU_022598		PREDICTED: similar to flavin containing monooxygenase 5 \n \nContains a long Fibrinogen-related domain (FReD); C terminal globular domain of fibrinogen. Fibrinogen is involved in blood clotting, being activated by thrombin to assemble into fibrin clots. The N-termini of 2 times 3 chains come together to form a globular arrangement called the disulfide knot. The C termini of fibrinogen chains end in globular domains, which are not completely equivalent. C terminal globular domains of the gamma chains (C-gamma) dimerize and bind to the GPR motif of the N-terminal domain of the alpha chain, while the GHR motif of N-terminal domain of the beta chain binds to the C terminal globular domains of another beta chain (C-beta), which leads to lattice formation. \n
GLEAN3_05714	SPU_005714		#\nHypothetical prot similar to CTD binding prot \n \nContains a RING and a PHD domain \n \nRING-finger (Really Interesting New Gene) domain, a specialized type of Zn-finger of 40 to 60 residues that binds two atoms of zinc; defined by the 'cross-brace' motif C-X2-C-X(9-39)-C-X(1-3)- H-X(2-3)-(N/C/H)-X2-C-X(4-48)C-X2-C; probably involved in mediating protein-protein interactions; identified in a proteins with a wide range of functions such as viral replication, signal transduction, and development; has two variants, the C3HC4-type and a C3H2C3-type (RING-H2 finger), which have different cysteine/histidine pattern; a subset of RINGs are associated with B-Boxes (C-X2-H-X7-C-X7-C-X2-C-H-X2-H) \n \nPHD-finger. PHD folds into an interleaved type of Zn-finger chelating 2 Zn ions in a similar manner to that of the RING and FYVE domains.\n
GLEAN3_22038	SPU_022038		contains two Cap-ED domains  \nPREDICTED: similar to cAMP-dependent protein kinase type I-alpha  regulatory subunit\n
GLEAN3_07251	SPU_007251		PREDICTED: similar to sequestosome 1 isoform 1 \n \nContains PB1 sdomain and ZZ domain \nPB1 domain ; Phox and Bem1p domain, present in many eukaryotic cytoplasmic signalling proteins. The domain adopts a beta-grasp fold, similar to that found in ubiquitin and Ras-binding domains. A motif, variously termed OPR, PC and AID, represents the most conserved region of the majority of PB1 domains, and is necessary for PB1 domain function. This function is the formation of PB1 domain heterodimers, although not all PB1 domain pairs associate \n \nZinc finger, ZZ type. Zinc finger present in Drosophila ref(2)P, NBR1, Human sequestosome 1 and related proteins. The ZZ motif coordinates two zinc ions and most likely participates in ligand binding or molecular scaffolding. Drosophila ref(2)P appears to control the multiplication of sigma rhabdovirus. NBR1 (Next to BRCA1 gene 1 protein) interacts with fasciculation and elongation protein zeta-1 (FEZ1) and calcium and integrin binding protein (CIB), and may function in cell signalling pathways. Sequestosome 1 is a phosphotyrosine independent ligand for the Lck SH2 domain and binds noncovalently to ubiquitin via its UBA domain.\n
GLEAN3_10285	SPU_010285		PREDICTED: similar to ubiquitously transcribed tetratricopeptide  repeat gene, X chromosome \n \nContains 1 TPR domain and 1 jmjC domain \n \nTetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and Cdc27p components of the cyclosome/APC, the Pex5p/Pas10p receptor for peroxisomal targeting signals, the Tom70p co-receptor for mitochondrial targeting signals, Ser/Thr phosphatase 5C and the p110 subunit of O-GlcNAc transferase; three copies of the repeat are present here \n \njmjC domain. The jmjC domain is thought to be involved in chromatin organisation by modulating heterochromatisation. \n
GLEAN3_16785	SPU_016785		Homology is no very strong (7e-11) \nBest genbank hit is form Danio rerio  \nname is temporary waiting for a better one\n
GLEAN3_16786	SPU_016786		PREDICTED: similar to Retinal homeobox protein Rx3 \n
GLEAN3_16787	SPU_016787		PREDICTED: similar to Alpha-1B adrenergic receptor \nContains a 7 transmembrane receptor (rhodopsin family)(8e-18)\n
GLEAN3_19627	SPU_019627		Predicted: putative protein which \ncontains a Putative homoserine kinase type II (protein kinase fold) [General function prediction only] \nthe level of homologyis weak \n
GLEAN3_23530	SPU_023530		predicted protein with similarities to At rich interactive domain Swi1-like\n
GLEAN3_24578	SPU_024578		PREDICTED: similar to zn-finger, CCHC type and RNA-directed DNA polymerase  and Integrase, catalytic domain containing protein  \nfamily member (1E419) \n \nHypothetical prot with reverse transcriptase and integrase domains\n
GLEAN3_15699	SPU_015699		This gene encodes a signal peptide and a C1q domain.\n
GLEAN3_15700	SPU_015700		this gene encodes a signal peptide and a C1q domain\n
GLEAN3_25635	SPU_025635		GLEAN3_03194 may represent the latter half of this gene. \n
GLEAN3_12648	SPU_012648		The prediction appears to be a combination of two genes, one encoding a bzip transcription factor and they other a phosphatidylinositol glycan.  Only GLEAN3_12648|Scaffold85774|13949|15052| is conserved with the bzip transcription factor gene, zhangfei.\n
GLEAN3_23034	SPU_023034		GLEAN3_14930 is a partial duplicate prediction.\n
GLEAN3_14930	SPU_014930		Partial duplicate prediction of GLEAN3_23034\n
GLEAN3_11973	SPU_011973		GLEAN3_11973 has part I. GLEAN3_01984 has the middle part. Last part may be missing.\n
GLEAN3_01984	SPU_001984		GLEAN3_11973 has part I. GLEAN3_01984 has the middle part. Last part may be missing.\n
GLEAN3_00127	SPU_000127		GLEAN3_09993 is a duplicate prediction.\n
GLEAN3_09993	SPU_009993		GLEAN3_09993 is a duplicate prediction for GLEAN3_00127.\n
GLEAN3_00893	SPU_000893		GLEAN3_00893 encodes the middle part of USP34. GLEAN3_13835 encodes the last. GLEAN3_13835 and GLEAN3_08562 are overlapping duplicate predictions. First part of the gene homologous to the human USP34 is missing. GLEAN3_28691 (homologous with ~12-312 human USP34) and GLEAN3_05174 (homologous with ~ 698-910 aa of human protein) are only partially similar to the human USP34.\n
GLEAN3_02882	SPU_002882		GLEAN3_02882 is a partially incorrect prediction for USP46. It appears to have an extra exon at the beginning and ~300 aa at the end are non-homologous.\n
GLEAN3_03944	SPU_003944		GLEAN3_03944 has part I, GLEAN3_03945 as part II and GLEAN3_23070 has part III of USP32 gene. In addition, GLEAN3_03945 and GLEAN3_23070 share an overlap of ~100 AA.\n
GLEAN3_03945	SPU_003945		GLEAN3_03944 has part I, GLEAN3_03945 as part II and GLEAN3_23070 has part III of USP32 gene. In addition, GLEAN3_03945 and GLEAN3_23070 share an overlap of ~100 AA.\n
GLEAN3_23070	SPU_023070		GLEAN3_03944 has part I, GLEAN3_03945 as part II and GLEAN3_23070 has part III of USP32 gene. In addition, GLEAN3_03945 and GLEAN3_23070 share an overlap of ~100 AA.\n
GLEAN3_08736	SPU_008736		Human USP48 gene isoforms are very different in size. Isoform a is 1035 aa long where as isoform b is 485 aa long. GLEAN3_08736 is homologous to the isoform b but along with GLEAN3_01900 may also be considered homologous to human USP48 isoform a. There are ~170 aa missing from the isoform a in urchin, if GLEAN3_08736 and 01900 are coding for USP48 isoform a.\n
GLEAN3_01900	SPU_001900		Human USP48 gene isoforms are very different in size. Isoform a is 1035 aa long where as isoform b is 485 aa long. GLEAN3_08736 is homologous to the isoform b but along with GLEAN3_01900 may also be considered homologous to human USP48 isoform a. There are ~170 aa missing from the isoform a in urchin, if GLEAN3_08736 and 01900 are coding for USP48 isoform a.\n
GLEAN3_09955	SPU_009955		GLEAN3_21779 is a partial duplicate prediction for GLEAN3_09955.\n
GLEAN3_06393	SPU_006393		GLEAN3_06393 encodes fisr half of the gene. GLEAN3_11524 has the rest.\n
GLEAN3_11524	SPU_011524		GLEAN3_06393 encodes fisr half of the gene. GLEAN3_11524 has the rest.\n
GLEAN3_11940	SPU_011940		Appears to be missing ~200 at the beginning of USP22.\n
GLEAN3_21779	SPU_021779		GLEAN3_21779 is a partial duplicate prediction for GLEAN3_09955.\n
GLEAN3_28265	SPU_028265		First ~400 aa (as compared to human USP10) are missing from GLEAN3 predictions.\n
GLEAN3_13119	SPU_013119		this is the same as GLEAN3_02448.\n
GLEAN3_15421	SPU_015421		also sim to glean3_22485\n
Sp-Il17-9	SPU_030184		This gene was annotated based on FgeneshAB and Fgenesh++. \n
Sp-Il17-10	SPU_030185		#\nThis gene was annotated based on FgeneshAB and ++.\n
Sp-Il17-11	SPU_030186		This gene was annotated based on FgeneshAB and ++. \n
Sp-Il17-12	SPU_030187		This gene was annotated based on FgeneshAB and ++. It may be partial because unknown sequence is located behind the first exon.\n
Sp-Il17-13	SPU_030188		This gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-p1	SPU_030189		#\nThis gene model was annotated based on FgeneshAB and ++. It is partial and may link to Sp-Il17-12.\n
Sp-Il17-14	SPU_030190		This gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-15	SPU_030191		This gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-16	SPU_030192		#\nThis gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-17	SPU_030193		This gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-p2	SPU_030194		#\nThis partial gene model was annotated based on FgeneshAB. The model is  located in a short scaffold. \n
Sp-Il17-p3	SPU_030195		This partial gene model was annotated based on FgeneshAB, ++ and BlastN.  The model is located at the end of a contig. \n
Sp-Il17-18	SPU_030196		This gene was annotated based on FgeneshAB and ++. \n
Sp-Il17-19	SPU_030197		#\nThis gene was annotated based on FgeneshAB and ++. \n
Sp-Il17-20	SPU_030198		This gene model was annotated based on FgeneshAB and ++. \n
Sp-Il17-21	SPU_030199		#\nThis gene model was annotated based on FgeneshAB and ++. The model is partial and located at the end of a scaffold. \n
Sp-Il17-22	SPU_030200		This gene model was annotated based on a part of NCBI and FgeneshAB prediction. The model is partial and located at the end of a contig. \n \n
Sp-Il17-p4	SPU_030201		#\nThis gene model was annotated based on FgeneshAB. The model was partial in a short scaffold, so Il17 domain was not found. But the nucleotide sequence was 93% similar to that of aother Sp-Il17.\n
Sp-Il17-p5	SPU_030202		This gene model was annotated based on FgeneshAB and ++. The model was located at the end of a scaffold and Il17 domain was not found. But the nucleotide sequence was 93% similar to another Sp-Il17. \n
Sp-Il17-23	SPU_030203		This gene model was annotated based on FgeneshAB and ++. It has a partial Il17 domain.\n
Sp-Il17-24	SPU_030204		This gene model was annotated based on FgeneshAB and ++. \n
GLEAN3_14962	SPU_014962		It is possible that GLEAN3_00634 may code for the first part of this protein.\n
GLEAN3_17602	SPU_017602		GLEAN3_16877 may represent the first half of this gene.\n
GLEAN3_17871	SPU_017871		Duplicate prediction for GLEAN3_07962.\n
GLEAN3_16537	SPU_016537		GLEAN3_01217 is a longer duplicate prediction.\n
GLEAN3_01217	SPU_001217		GLEAN3_01217 is a longer duplicate prediction for GLEAN3_16537.\n
GLEAN3_00228	SPU_000228		GLEAN3_03049 had first half of the HCFC2 gene. GLEAN3_00228 likely codes the rest, it may be incorrectly predicted.\n
GLEAN3_03049	SPU_003049		GLEAN3_03049 had first half of the HCFC2 gene. GLEAN3_00228 likely codes the rest, it may be incorrectly predicted.\n
GLEAN3_05886	SPU_005886		Missing first exon?\n
GLEAN3_22607	SPU_022607		GLEAN3_28866 is a partial duplicate prediction for GLEAN3_22607.\n
GLEAN3_23287	SPU_023287		GLEAN3_08313 is a partial duplicate prediction.\n
GLEAN3_24246	SPU_024246		GLEAN3_24246 codes for part I of TSC2 and GLEAN3_23402 codes the rest.\n
GLEAN3_23402	SPU_023402		GLEAN3_24246 codes for part I of TSC2 and GLEAN3_23402 codes the rest.\n
GLEAN3_28104	SPU_028104		GLEAN3_28850 is a duplicate long prediction that is likely incorrect.\n
GLEAN3_28850	SPU_028850		Incorrect longer prediction for RPL15. \n
GLEAN3_23692	SPU_023692		Appears to be short by ~29 aa at beginning.\n
GLEAN3_05571	SPU_005571		Incomplete gene model.\n
GLEAN3_09084	SPU_009084		Missing last half.\n
GLEAN3_24153	SPU_024153		Nterminus of APC1; the remaining parts of the gene are found in GLEAN3_08018 and 12580\n
GLEAN3_08018	SPU_008018		see GLEAN3_24153 for annotation\n
GLEAN3_12580	SPU_012580		see GLEAN3_24153 for annotation\n
GLEAN3_21616	SPU_021616		This Glean contains the C-terminal sequence of Psf2 (aa 186-301). \n
GLEAN3_17818	SPU_017818		GLEAN3_17818 covers the C-terminal sequence of the GINS protein subunit Psf1 \nGLEAN3_17817 covers the N-terminal sequence of the GINS protein subunit Psf1\n
GLEAN3_17817	SPU_017817		 \nGLEAN3_17817 covers the N-terminal sequence of the GINS protein subunit Psf1 \nGLEAN3_17818 covers the C-terminal sequence of the GINS protein subunit Psf1 \n
GLEAN3_18318	SPU_018318		Missing ~60aa at the end.\n
GLEAN3_02606	SPU_002606		GLEAN3_27769 is a partial duplicate prediction for GLEAN3_02606.\n
Sp-DNAH4	SPU_030222		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number AAM12861 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold78480, scaffold102459,scaffold95646 and scaffold102781. Sp-DNAH4 is 54% identical to human axonemal dynein heavy chain 3  \n \n
Sp-DNAH2	SPU_030224		#\nThe coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number NP_065928 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold2032, scaffold62207, scaffold46527, scaffold22838.\n
Sp-DNAH3	SPU_030225		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number NP060009 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.  Merge of scaffold108582 and scaffold87263\n
Sp-DNAH8	SPU_030229		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number CAI42433(Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold289, scaffold56250, scaffoldi854 (3/2006 assembly), scaffold56250 \n
Sp-DNAH14	SPU_030233		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number XP_943287(Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold86991, scaffold33286, scaffold58811, scaffold65994,  \n
Sp-DYNC2H1	SPU_030235		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous sea urchin (Tripneustes gratilla) gene, accession number AAA63583 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold23743, scaffold_v2_46821, scaffold28672, scaffold_v2_2321, scaffold53559\n
GLEAN3_20648	SPU_020648		Note that GLEAN3_20648 probably has the C-terminal exon of this gene.\n
GLEAN3_20649	SPU_020649		This probably represents the C terminal exon of GLEAN3_20648, and should be combined with that model.\n
GLEAN3_14454	SPU_014454		GLEAN3_15818 is a partial duplicate.\n
Sp-DNAH5	SPU_030226		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number CAI42433 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.  Merge of scaffold1668, scaffold59332, scaffold1157, scaffold11477\n
Sp-DNAH6	SPU_030227		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number XP_532984 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.  Merge of scaffold1866 and  scaffold105269 \n \n
Sp-DNAH7	SPU_030228		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number NP_061720 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold1775 and scaffold50668 \n
Sp-DNAH9	SPU_030230		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous sea urchin (Tripneustes gratilla) gene, accession number CAA42170. The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.(Morris, RL, et al., Dev. Biol. in press). Merge of scaffold27, scaffold28695\n
Sp-DNAH10	SPU_030231		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number XP_543369 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.  Merge of scaffold75058 scaffold105976, scaffold102255, scaffold52292, scaffold48159\n
Sp-DNAH12	SPU_030232		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number XP_541831 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold108117, scaffold694, scaffold133848\n
Sp-DNAH15	SPU_030234		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number NP_001360 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. Merge of scaffold1897, scaffold128501, scaffold55352, scaffold71847, scaffold26249\n
Sp-DYNC1H1	SPU_030236		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number Q14204 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu.  Merge of scaffold9923, scaffold88369, scaffold88369, scaffold3805\n
GLEAN3_14060	SPU_014060		First part of the gene model prediction is incorrect.\n
Sp-DNAH1	SPU_030223		The coding sequence of this gene has been assembled from exons identified in the 3/2005 Version of the S. purpuratus genome by indexing to the homologous mammalian gene, accession number BAA92648 (Morris, RL, et al., Dev. Biol. in press). The exons span multiple scaffolds (see below) in the current version of the genome and their detailed coordinates will be added to the database at a later stage. In the meantime, exon details can be obtained by email from the annotators robar@scientist.com and igibbons@berkeley.edu. \nMerge of scaffold60126, scaffold75782, scaffold50318, scaffold83954. Sp-DNAH1 is 63% identical to human axonemal dynein heavy chain 1\n
GLEAN3_02458	SPU_002458		GLEAN3_21702 is a duplicate prediction.\n
GLEAN3_21702	SPU_021702		Duplicate prediction for GLEAN3_02458\n
GLEAN3_11152	SPU_011152		Partial duplicate of GLEAN3_06459.\n
GLEAN3_09946	SPU_009946		Prediction is short by ~20 AA.\n
GLEAN3_07971	SPU_007971		GLEAN3_11900 is a duplicate prediction.\n
GLEAN3_11900	SPU_011900		GLEAN3_07971 duplicate.\n
GLEAN3_22682	SPU_022682		Likely has an extra ~100 aa predicted towards the end.\n
GLEAN3_06487	SPU_006487		Incorrect gene model. First ~120 AA are completely unrelated.\n
GLEAN3_11456	SPU_011456		GLEAN3_19736 is a duplicate prediction.\n
GLEAN3_19736	SPU_019736		GLEAN3_11456 is a duplicate prediction.\n
GLEAN3_04087	SPU_004087		Partial duplicate prediction for GLEAN3_02438.\n
GLEAN3_08650	SPU_008650		Incorrect gene model. Extra AA at beginning and end.\n
GLEAN3_18079	SPU_018079		GLEAN3_04721 is a duplicate prediction.\n
GLEAN3_13834	SPU_013834		Longer than necessary prediction.\n
GLEAN3_08394	SPU_008394		GLEAN3_06605 has first half and GLEAN3_08394 has the rest.\n
GLEAN3_06605	SPU_006605		GLEAN3_06605 has first half and GLEAN3_08394 has the rest.\n
GLEAN3_10936	SPU_010936		GLEAN3_05825 is a partial duplicate prediction.\n
GLEAN3_05825	SPU_005825		GLEAN3_05825 is a duplicate prediction for GLEAN3_10936.\n
GLEAN3_18866	SPU_018866		Missing ~25 AA at end.\n
GLEAN3_10692	SPU_010692		GLEAN3_26671 is a partial duplicate prediction.\n
GLEAN3_05700	SPU_005700		GLEAN3_05700 is a partial duplicate of GLEAN3_5699.\n
GLEAN3_11352	SPU_011352		GLEAN3_20308 and GLEAN3_01585 are partial identical predictions.\n
GLEAN3_20308	SPU_020308		Partial duplicate prediction for GLEAN3_11352.\n
GLEAN3_11303	SPU_011303		GLEAN3_11303 appears to have first part and GLEAN3_04643 the latter half.\n
GLEAN3_04643	SPU_004643		GLEAN3_11303 appears to have first part and GLEAN3_04643 the latter half.\n
GLEAN3_21496	SPU_021496		this gene was identified and published by  Nemer et al 1991 \n
GLEAN3_24608	SPU_024608		GLEAN3_25585 appears to have partI of CAND1 and GLEAN3_24608 the other part. There is an overlap of ~50 AA.\n
GLEAN3_25585	SPU_025585		GLEAN3_25585 appears to have partI of CAND1 and GLEAN3_24608 the other part. There is an overlap of ~50 AA.\n
GLEAN3_15870	SPU_015870		GLEAN3_27123 is a partial duplicate prediction.\n
GLEAN3_27123	SPU_027123		Duplicate prediction for GLEAN3_15870.\n
GLEAN3_05722	SPU_005722		GLEAN3_023814 is a partial duplicate prediction.\n
GLEAN3_23814	SPU_023814		Partial duplicate prediction for GLEAN3_05722.\n
GLEAN3_24801	SPU_024801		Likely missing first exon.\n
GLEAN3_25140	SPU_025140		GLEAN3_18792 has first part and GLEAN3_25140 has the latter half. There is some overlap in the two predictions.\n
GLEAN3_18792	SPU_018792		GLEAN3_18792 has first part and GLEAN3_25140 has the latter half. There is some overlap in the two predictions.\n
GLEAN3_23345	SPU_023345		GLEAN3_03864 appears to have partI. GLEAN3_23345 has partII. GLEAN3_00408 appears to have the rest of the protein.\n
GLEAN3_03864	SPU_003864		GLEAN3_03864 appears to have partI. GLEAN3_23345 has partII. GLEAN3_00408 appears to have the rest of the protein.\n
GLEAN3_00408	SPU_000408		GLEAN3_03864 appears to have partI. GLEAN3_23345 has partII. GLEAN3_00408 appears to have the rest of the protein.\n
GLEAN3_10706	SPU_010706		May be missing ~100 AA at the end.\n
GLEAN3_10014	SPU_010014		GLEAN3_10014 has first part and GLEAN3_06607 has the latter half.\n
GLEAN3_06607	SPU_006607		GLEAN3_10014 has first part and GLEAN3_06607 has the latter half.\n
GLEAN3_18853	SPU_018853		Hit to the bestrophin homology is embedded within a longer prediction. \n
GLEAN3_17216	SPU_017216		GLEAN3_10982 is a partial duplicate prediction.\n
GLEAN3_10982	SPU_010982		GLEAN3_10982 is a partial duplicate prediction.\n
GLEAN3_17745	SPU_017745		Model is longer than necessary.\n
GLEAN3_04715	SPU_004715		Model is likely missing a few AA at beginning.\n
GLEAN3_01164	SPU_001164		Likely missing ~50 AA at beginning.\n
GLEAN3_18367	SPU_018367		Prediction is short ~50 AA at beginning.\n
GLEAN3_25476	SPU_025476		Partial duplicate prediction for GLEAN3_17333.\n
GLEAN3_02228	SPU_002228		GLEAN3_22387 is a partial duplicate prediction.\n
GLEAN3_22387	SPU_022387		GLEAN3_02228 is a partial duplicate prediction.\n
GLEAN3_21649	SPU_021649		GLEAN3_18977 has part I and GLEAN3_21649 appears to have the rest of the gene.\n
GLEAN3_18977	SPU_018977	From the BLAST results and the excel data it appears that the sequence is distributed onto 2 different scaffolds. The beginning portion of the sequence is distributed onto v2.1_scaffold64444 and part of the remaining sequence is distributed onto v2.1_scaffold23041. The sequence ends at 579 but the second scaffold is truncated at 455. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with most of the scores ranging less than 5. 	GLEAN3_18977 has part I and GLEAN3_21649 appears to have the rest of the gene.\n
GLEAN3_15415	SPU_015415		GLEAN3_03876 is a partial duplicate prediction.\n
GLEAN3_03876	SPU_003876		GLEAN3_03876 is a partial duplicate prediction for GLEAN3_15415.\n
GLEAN3_04528	SPU_004528		May be missing ~50 AA at end.\n
GLEAN3_24347	SPU_024347		This model may contain 2 genes. Latter half is the DNA cytosine methylase (DNMT3A).\n
GLEAN3_09023	SPU_009023		GLEAN3_12849 is a partial duplicate prediction for GLEAN3_09023.\n
GLEAN3_25761	SPU_025761		GLEAN3_25761 has part I. GLEAN3_20254 has part II. GLEAN3_20255 has part III. Missing a few AA at the end.\n
GLEAN3_20255	SPU_020255		GLEAN3_25761 has part I. GLEAN3_20254 has part II. GLEAN3_20255 has part III. Missing a few AA at the end.\n
GLEAN3_20254	SPU_020254		GLEAN3_25761 has part I. GLEAN3_20254 has part II. GLEAN3_20255 has part III. Missing a few AA at the end.\n
GLEAN3_19071	SPU_019071		GLEAN3_19071 is a duplicate prediction for GLEAN3_24656.\n
GLEAN3_24656	SPU_024656		GLEAN3_19071 is a duplicate prediction for GLEAN3_24656.\n
GLEAN3_09473	SPU_009473		Missing ~200 AA at the beginning.\n
GLEAN3_14637	SPU_014637		GLEAN3_11917 is a partial duplicate prediction for GLEAN3_14637.\n
GLEAN3_11917	SPU_011917		GLEAN3_11917 is a partial duplicate prediction for GLEAN3_14637.\n
GLEAN3_06672	SPU_006672		Missing ~150 AA at end.\n
GLEAN3_11688	SPU_011688		GLEAN3_11688 is a duplicate prediction for GLEAN3_08690. It also may represent a longer incorrect model for this gene.\n
GLEAN3_08690	SPU_008690		GLEAN3_11688 is a duplicate prediction for GLEAN3_08690. It also may represent a longer incorrect model for this gene.\n
GLEAN3_09423	SPU_009423		GLEAN3_09422 has the first part and GLEAN3_09423 has the rest of the gene.\n
GLEAN3_09422	SPU_009422		GLEAN3_09422 has the first part and GLEAN3_09423 has the rest of the gene.\n
GLEAN3_25290	SPU_025290		GLEAN3_25290 is a partial duplicate prediction for GLEAN3_04522.\n
GLEAN3_11094	SPU_011094		GLEAN3_06390 may encode for first part of NOC3L gene. GLEAN3_11094 has the rest. GLEAN3_16413 is a partial duplicate prediction for GLEAN3_11094.\n
GLEAN3_06390	SPU_006390		GLEAN3_06390 may encode for first part of NOC3L gene. GLEAN3_11094 has the rest.\n
GLEAN3_16413	SPU_016413		GLEAN3_06390 may encode for first part of NOC3L gene. GLEAN3_11094 has the rest. GLEAN3_16413 is a partial duplicate prediction for GLEAN3_11094.\n
GLEAN3_03253	SPU_003253		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_04821	SPU_004821		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_12701	SPU_012701		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_18955	SPU_018955		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_28421	SPU_028421		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_05616	SPU_005616		GLEAN3_12701 has part I, GLEAN3_18955 has part II, GLEAN3_03253 has part III and GLEAN3_04821 has the last part of the gene. GLEAN3_28421 is a partial duplicate prediction for GLEAN3_04821. GLEAN3_05616 is a partial duplicate prediction for GLEAN3_18955.\n
GLEAN3_28675	SPU_028675		GLEAN3_28675 is a partial duplicate prediction for GLEAN3_26740.\n
GLEAN3_09849	SPU_009849		May be missing ~25 AA be beginning.\n
GLEAN3_04974	SPU_004974		GLEAN3_04974 is a duplicate prediction for GLEAN3_06042.\n
GLEAN3_14021	SPU_014021		GLEAN3_14021 is a partial duplicate prediction for GLEAN3_13399.\n
GLEAN3_07111	SPU_007111		GLEAN3_07455 is a duplicate longer prediction.\n
GLEAN3_00285	SPU_000285		GLEAN3_00285 has most of the gene except the last part which appears to be on GLEAN3_09456. There are errors in the prediction at the end of GLEAN3_00285 and at beginning of GLEAN3_09456 which perfectly match with the COPZ1 gene.\n
GLEAN3_09546	SPU_009546		GLEAN3_00285 has most of the gene except the last part which appears to be on GLEAN3_09456. There are errors in the prediction at the end of GLEAN3_00285 and at beginning of GLEAN3_09456 which perfectly match with the COPZ1 gene.\n
GLEAN3_12559	SPU_012559		GLEAN3_12559 and GLEAN3_26314 both contain the AP2S1 gene but both are erroneous long predictions. Beginning of the predictions do not code for any significant proteins in DB.\n
GLEAN3_26314	SPU_026314		GLEAN3_12559 and GLEAN3_26314 both contain the AP2S1 gene but both are erroneous long predictions. Beginning of the predictions do not code for any significant proteins in DB.\n
GLEAN3_18163	SPU_018163		GLEAN3_18163 and GLEAN3_16841 are partially duplicate predictions.\n
GLEAN3_16841	SPU_016841		GLEAN3_18163 and GLEAN3_16841 are partially duplicate predictions.\n
GLEAN3_02616	SPU_002616		GLEAN3_18244 is a partial duplicate prediction for GLEAN3_02616.\n
GLEAN3_18244	SPU_018244		GLEAN3_18244 is a partial duplicate prediction for GLEAN3_02616.\n
GLEAN3_01148	SPU_001148		GLEAN3_01148 - GLEAN3_03209 - GLEAN3_09645 represent three parts of the CLPTM1 gene, perhaps in that order (and with overlap). It is difficult to be certain as only parts of GLEAN3_03209 and GLEAN3_09645 are homologous to CLPTM1.\n
GLEAN3_03209	SPU_003209		GLEAN3_01148 - GLEAN3_03209 - GLEAN3_09645 represent three parts of the CLPTM1 gene, perhaps in that order (and with overlap). It is difficult to be certain as only parts of GLEAN3_03209 and GLEAN3_09645 are homologous to CLPTM1.\n
GLEAN3_09645	SPU_009645		GLEAN3_01148 - GLEAN3_03209 - GLEAN3_09645 represent three parts of the CLPTM1 gene, perhaps in that order (and with overlap). It is difficult to be certain as only parts of GLEAN3_03209 and GLEAN3_09645 are homologous to CLPTM1.\n
GLEAN3_00731	SPU_000731		GLEAN3_00731 and GLEAN3_07244 represent two parts of the CLN3 protein. There is perhaps ~50 AA missing from GLEAN3_07244 model.\n
GLEAN3_07244	SPU_007244		GLEAN3_00731 and GLEAN3_07244 represent two parts of the CLN3 protein. There is perhaps ~50 AA missing from GLEAN3_07244 model.\n
GLEAN3_01258	SPU_001258		GLEAN3_10663 is a duplicate prediction for GLEAN3_01258.\n
GLEAN3_10663	SPU_010663		GLEAN3_10663 is a duplicate prediction for GLEAN3_01258.\n
GLEAN3_10747	SPU_010747		GLEAN3_25121 is a duplicate prediction for GLEAN3_10747.\n
GLEAN3_25121	SPU_025121		GLEAN3_25121 is a duplicate prediction for GLEAN3_10747.\n
GLEAN3_21311	SPU_021311		This gene model is incomplete. Missing ~350 AA at the end. GLEAN3_14811 MAY represent the missing half. It does show COPB2 as the best hit in searches against DB.\n
GLEAN3_23769	SPU_023769		Missing ~100 AA at end.\n
GLEAN3_09246	SPU_009246		Model is incorrect.\n
GLEAN3_26357	SPU_026357		GLEAN3_03524 is a partial duplicate prediction for GLEAN3_26357.\n
GLEAN3_03524	SPU_003524		GLEAN3_03524 is a partial duplicate prediction for GLEAN3_26357.\n
GLEAN3_24920	SPU_024920		GLEAN3_04667 is a partial duplicate prediction for GLEAN3_24920.\n
GLEAN3_04667	SPU_004667		GLEAN3_04667 is a partial duplicate prediction for GLEAN3_24920.\n
GLEAN3_02885	SPU_002885		Has an additional ~45 AA at beginning of protein.\n
GLEAN3_07188	SPU_007188		GLEAN3_07188 has the first half of DDB1 and GLEAN3_07379 has the rest. A few AA may be missing at the beginning.\n
GLEAN3_07379	SPU_007379		GLEAN3_07188 has the first half of DDB1 and GLEAN3_07379 has the rest. A few AA may be missing at the beginning.\n
GLEAN3_09266	SPU_009266		GLEAN3_09266 is a partially duplicate prediction for GLEAN3_01826.\n
GLEAN3_10339	SPU_010339		GLEAN3_07300 is a partial duplicate prediction for GLEAN3_10339.\n
GLEAN3_07300	SPU_007300		GLEAN3_07300 is a partial duplicate prediction for GLEAN3_10339.\n
GLEAN3_19786	SPU_019786		GLEAN3_19786 is a partial duplicate ptediction for GLEAN3_13953.\n
GLEAN3_04860	SPU_004860		GLEAN3_24432 is a partial duplicate prediction for GLEAN3_04860.\n
GLEAN3_24432	SPU_024432		GLEAN3_24432 is a partial duplicate prediction for GLEAN3_04860.\n
GLEAN3_03593	SPU_003593	After examining the data and performing a BLAST search, it appears that the sequence is distributed onto 2 scaffolds. The sequence doesn't begin until the 6th base pair, but from there, if the two scaffolds were combined the sequence would have an orderly continuous arrangement, without any repeats or gaps present. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be	GLEAN3_03593 has the first half of FBXL10 gene. GLEAN3_025153 MAY encode the other half. GLEAN3_02864 is a partial duplicate prediction for GLEAN3_03593.\n
GLEAN3_25153	SPU_025153		GLEAN3_03593 has the first half of FBXL10 gene. GLEAN3_025153 MAY encode the other half. GLEAN3_02864 is a partial duplicate prediction for GLEAN3_03593.\n
GLEAN3_02864	SPU_002864		GLEAN3_03593 has the first half of FBXL10 gene. GLEAN3_025153 MAY encode the other half. GLEAN3_02864 is a partial duplicate prediction for GLEAN3_03593.\n
GLEAN3_15361	SPU_015361		Contains a leucine-rich repeat (LRR-R1) domain.\n
GLEAN3_05729	SPU_005729		GLEAN3_05729 and GLEAN3_08183 are similar duplicate predictions.\n
GLEAN3_08183	SPU_008183		GLEAN3_05729 and GLEAN3_08183 are similar duplicate predictions.\n
GLEAN3_24772	SPU_024772		The gene model on the N-terminal end appears to be incorrect.\n
GLEAN3_07722	SPU_007722		GLEAN3_07722 encodes the complete CYFIP2 gene. GLEAN3_07272 and GLEAN3_11333 are partial duplicate predictions for GLEAN3_07722.\n
GLEAN3_07272	SPU_007272		GLEAN3_07722 encodes the complete CYFIP2 gene. GLEAN3_07272 and GLEAN3_11333 are partial duplicate predictions for GLEAN3_07722.\n
GLEAN3_11333	SPU_011333		GLEAN3_07722 encodes the complete CYFIP2 gene. GLEAN3_07272 and GLEAN3_11333 are partial duplicate predictions for GLEAN3_07722.\n
GLEAN3_17645	SPU_017645		GLEAN3_12959 is a partial duplicate prediction for GLEAn3_17645.\n
GLEAN3_12959	SPU_012959		GLEAN3_12959 is a partial duplicate prediction for GLEAn3_17645.\n
GLEAN3_00487	SPU_000487		GLEAN3_00487 has the first half of the gene. GLEAN3_27232 MAY have the rest.\n
GLEAN3_27232	SPU_027232		GLEAN3_00487 has the first half of the gene. GLEAN3_27232 MAY have the rest.\n
GLEAN3_03575	SPU_003575		GLEAN3_03575 and GLEAN3_11118 are partial duplicate predictions for GLEAN3_27232.\n
GLEAN3_11118	SPU_011118		GLEAN3_03575 and GLEAN3_11118 are partial duplicate predictions for GLEAN3_27232.\n
GLEAN3_04739	SPU_004739		GLEAN3_04739 is a partial duplicate prediction for GLEAN3_12301.\n
GLEAN3_12301	SPU_012301		GLEAN3_04739 is a partial duplicate prediction for GLEAN3_12301.\n
GLEAN3_09085	SPU_009085		Likely isoform of DHPS.\n
GLEAN3_17680	SPU_017680		Likely isoform of DHPS.\n
GLEAN3_22098	SPU_022098		Likely isoform of DHPS.\n
GLEAN3_25930	SPU_025930		Duplicate prediction for GLEAN3_12301.\n
GLEAN3_00231	SPU_000231		GLEAN3_05915 has part II of this gene. GLEAN3_27703 MAY encode the first part. GLEAN3_00273 is a duplicate prediction for GLEAN3_27703.\n
GLEAN3_05915	SPU_005915		GLEAN3_05915 has part II of this gene. GLEAN3_27703 MAY encode the first part. GLEAN3_00273 is a duplicate prediction for GLEAN3_27703.\n
GLEAN3_27703	SPU_027703		GLEAN3_05915 has part II of this gene. GLEAN3_27703 MAY encode the first part. GLEAN3_00273 is a duplicate prediction for GLEAN3_27703.\n
GLEAN3_06485	SPU_006485		GLEAN3_06485 and GLEAN3_08534 are duplicate predictions.\n
GLEAN3_08534	SPU_008534		GLEAN3_06485 and GLEAN3_08534 are duplicate predictions.\n
GLEAN3_18067	SPU_018067		GLEAN3_21911 is a duplicate prediction for GLEAN3_18067.\n
GLEAN3_21911	SPU_021911		GLEAN3_21911 is a duplicate prediction for GLEAN3_18067.\n
GLEAN3_10109	SPU_010109		Similar to both ZDHHC2 or ZDHHC20.\n
GLEAN3_17163	SPU_017163		GLEAN3_17163 has part I and GLEAN3_17164 has part II of ZDHHC7 gene.\n
GLEAN3_17164	SPU_017164		GLEAN3_17163 has part I and GLEAN3_17164 has part II of ZDHHC7 gene.\n
GLEAN3_21834	SPU_021834		GLEAN3_21833 has part I and GLEAN3_21834 has part II of ZDHHC17 gene. \n
GLEAN3_21833	SPU_021833		GLEAN3_21833 has part I and GLEAN3_21834 has part II of ZDHHC17 gene. \n
GLEAN3_12476	SPU_012476		#\nThis gene blasts back to the mannose receptor for a VERY large range of animals from Human to Danio.\n
GLEAN3_06842	SPU_006842		GLEAN3_27068 is a duplicate prediction for GLEAN3_06842.\n
GLEAN3_27068	SPU_027068		GLEAN3_27068 is a duplicate prediction for GLEAN3_06842.\n
GLEAN3_12025	SPU_012025		GLEAN3_12025 has the first half of the gene. GLEAN3_22214 has the rest.\n
GLEAN3_22214	SPU_022214		GLEAN3_12025 has the first half of the gene. GLEAN3_22214 has the rest.\n
GLEAN3_09348	SPU_009348		Likely missing latter half of LIN9 gene. GLEAN3_08517 MAY code for the missing half.\n
GLEAN3_02194	SPU_002194		GLEAN3_27247 is a partial duplicate prediction for GLEAN3_02194.\n
GLEAN3_27247	SPU_027247		GLEAN3_27247 is a partial duplicate prediction for GLEAN3_02194.\n
GLEAN3_04401	SPU_004401		May be missing an exon at the beginning.\n
GLEAN3_07535	SPU_007535		GLEAN3_07535 contains first part of the gene and GLEAN3_04388 has the later half. There is some overlap between the two models.\n
GLEAN3_04388	SPU_004388		GLEAN3_07535 contains first part of the gene and GLEAN3_04388 has the later half. There is some overlap between the two models.\n
GLEAN3_15460	SPU_015460		GLEAN3_15460 is duplicate prediction for GLEAN3_09441.\n
GLEAN3_09441	SPU_009441		GLEAN3_15460 is duplicate prediction for GLEAN3_09441.\n
GLEAN3_15152	SPU_015152		GLEAN3_04341 is a partial duplicate prediction for GLEAN3_15152.\n
GLEAN3_04341	SPU_004341		GLEAN3_04341 is a partial duplicate prediction for GLEAN3_15152.\n
GLEAN3_28370	SPU_028370		GLEAN3_28370 codes for first part of DNAH1 protein and GLEAN3_00013 codes for the rest. There may be an exon missing in between the two predictions.\n
GLEAN3_00013	SPU_000013		GLEAN3_28370 codes for first part of DNAH1 protein and GLEAN3_00013 codes for the rest. There may be an exon missing in between the two predictions.\n
GLEAN3_03564	SPU_003564		GLEAN3_15049 has first part of the DNAH7 gene and GLEAN3_03564 has the rest.\n
GLEAN3_15049	SPU_015049		GLEAN3_15049 has first part of the DNAH7 gene and GLEAN3_03564 has the rest.\n
GLEAN3_27326	SPU_027326		GLEAN3_27326 may have the first part of the gene. Rest is coded by GLEAN3_17271.\n
GLEAN3_17271	SPU_017271		GLEAN3_27326 may have the first part of the gene. Rest is coded by GLEAN3_17271.\n
GLEAN3_24529	SPU_024529		GLEAN3_24529 has the first part of DNAH5 gene and GLEAN3_03660 has the latter. There is overlap between the two GLEAN models (~2161-2763 AA).\n
GLEAN3_03660	SPU_003660		GLEAN3_24529 has the first part of DNAH5 gene and GLEAN3_03660 has the latter. There is overlap between the two GLEAN models (~2161-2763 AA).\n
GLEAN3_07564	SPU_007564		Missing the first half.\n
GLEAN3_02335	SPU_002335		GLEAN3_02335 has most of the first part. GLEAN3_24805 codes for the rest. It is possible that these two GLEAN's may be partial predictions of two different genes.\n
GLEAN3_24805	SPU_024805		GLEAN3_02335 has most of the first part. GLEAN3_24805 codes for the rest. It is possible that these two GLEAN's may be partial predictions of two different genes.\n
GLEAN3_20931	SPU_020931		GLEAN3_10157 and GLEAN3_20427 are partial duplicate predictions for GLEAN3_20931.\n
GLEAN3_10157	SPU_010157		GLEAN3_10157 and GLEAN3_20427 are partial duplicate predictions for GLEAN3_20931.\n
GLEAN3_20427	SPU_020427		GLEAN3_10157 and GLEAN3_20427 are partial duplicate predictions for GLEAN3_20931.\n
GLEAN3_17547	SPU_017547		Incomplete gene model.\n
GLEAN3_22833	SPU_022833		GLEAN3_22833 is a duplicate prediction for GLEAN3_17547.\n
GLEAN3_08312	SPU_008312		GLEAN3_28010 is a duplicate prediction for GLEAN3_08312.\n
GLEAN3_28010	SPU_028010		GLEAN3_28010 is a duplicate prediction for GLEAN3_08312.\n
GLEAN3_22907	SPU_022907		Gene model incorrect. Too long.\n
GLEAN3_27407	SPU_027407		GLEAN3_27407 is a partial duplicate prediction for GLEAN3_27406.\n
GLEAN3_27406	SPU_027406		GLEAN3_27407 is a partial duplicate prediction for GLEAN3_27406.\n
GLEAN3_04463	SPU_004463		GLEAN3_25916 is a partial duplicate prediction for GLEAN3_04463.\n
GLEAN3_25916	SPU_025916		GLEAN3_25916 is a partial duplicate prediction for GLEAN3_04463.\n
GLEAN3_26427	SPU_026427		GLEAN3_13244 is a partial duplicate prediction for GLEAN3_26427.\n
GLEAN3_13244	SPU_013244		GLEAN3_13244 is a partial duplicate prediction for GLEAN3_26427.\n
GLEAN3_10328	SPU_010328		GLEAN3_26065 is a partial duplicate prediction for GLEAN3_10328.\n
GLEAN3_10620	SPU_010620		Larger than required prediction.\n
GLEAN3_23970	SPU_023970		GLEAN3_23970 is a partial duplicate prediction for GLEAN3_19111.\n
GLEAN3_19111	SPU_019111		GLEAN3_23970 is a partial duplicate prediction for GLEAN3_19111.\n
GLEAN3_08304	SPU_008304		GLEAN3_08304 may represent a partial prediction for MYST2 (which is largely encoded by GLEAN3_17172, but is missing a piece that is present in GLEAN3_08304).\n
GLEAN3_17172	SPU_017172		GLEAN3_08304 may represent a partial prediction for MYST2 (which is largely encoded by GLEAN3_17172, but is missing a piece that is present in GLEAN3_08304).\n
GLEAN3_09734	SPU_009734		Missing ~70 AA at beginning.\n
GLEAN3_14886	SPU_014886		GLEAN3_14887 is a longer duplicate prediction for GLEAN3_14886.\n
GLEAN3_14887	SPU_014887		GLEAN3_14887 is a longer duplicate prediction for GLEAN3_14886.\n
GLEAN3_11305	SPU_011305		Prediction likely short by ~50 AA at beginning.\n
GLEAN3_13192	SPU_013192		GLEAN3_13192 is a partial duplicate of GLEAN3_19276.\n
GLEAN3_13812	SPU_013812		GLEAN3_13812 encodes Part I of XRN1 and GLEAN3_13813 probably encodes the rest (though may be partially incorrect).\n
GLEAN3_19276	SPU_019276		GLEAN3_19276 is a partial prediction that may be missing ~180-200 AA at end\n
GLEAN3_13813	SPU_013813		GLEAN3_13812 encodes Part I of XRN1 and GLEAN3_13813 probably encodes the rest (though may be partially incorrect).\n
GLEAN3_21007	SPU_021007		GLEAN3_21007 may be missing a few AA at the end. GLEAN3_24819 is a partial duplicate prediction for GLEAN3_21007. GLEAN3_00416 may be as well.\n
GLEAN3_24819	SPU_024819		GLEAN3_21007 may be missing a few AA at the end. GLEAN3_24819 is a partial duplicate prediction for GLEAN3_21007. GLEAN3_00416 may be as well.\n
GLEAN3_00461	SPU_000461		GLEAN3_21007 may be missing a few AA at the end. GLEAN3_24819 is a partial duplicate prediction for GLEAN3_21007. GLEAN3_00416 may be as well.\n
GLEAN3_04023	SPU_004023		GLEAN3_04023 is a partial duplicate prediction for GLEAN3_11694.\n
GLEAN3_11694	SPU_011694		GLEAN3_04023 is a partial duplicate prediction for GLEAN3_11694.\n
GLEAN3_04608	SPU_004608		GLEAN3_04608 is a duplicate prediction for GLEAN3_04445.\n
GLEAN3_04445	SPU_004445		GLEAN3_04608 is a duplicate prediction for GLEAN3_04445.\n
GLEAN3_14305	SPU_014305		GLEAN3_27401 is a partial duplicate prediction for GLEAN3_14305.\n
GLEAN3_27401	SPU_027401		GLEAN3_27401 is a partial duplicate prediction for GLEAN3_14305.\n
GLEAN3_23752	SPU_023752		Prediction is incorrect .. much longer than required.\n
GLEAN3_26862	SPU_026862		GLEAN3_26862 has the first part of the FTSJ3 gene and GELAN3_21738 likely has the rest. A couple of exons are likely missing - one at the beginning and one between the two GLEAN's.\n
GLEAN3_21738	SPU_021738		GLEAN3_26862 has the first part of the FTSJ3 gene and GELAN3_21738 likely has the rest. A couple of exons are likely missing - one at the beginning and one between the two GLEAN's.\n
GLEAN3_17569	SPU_017569		Likely missing last exon.\n
GLEAN3_20191	SPU_020191		GLEAN3_20191 is a partial duplicate prediction for GLEAN3_01541.\n
GLEAN3_01541	SPU_001541		GLEAN3_20191 is a partial duplicate prediction for GLEAN3_01541.\n
GLEAN3_23204	SPU_023204		GLEAN3_07955 is a duplicate prediction for GLEAN3_23204.\n
GLEAN3_07955	SPU_007955		GLEAN3_07955 is a duplicate prediction for GLEAN3_23204.\n
GLEAN3_11958	SPU_011958		GLEAN3_11958 has most of the gene except the last exon or two which are encoded by GLEAN3_08969. There is a significant overlap between the two GLEAN's.\n
GLEAN3_08969	SPU_008969		GLEAN3_11958 has most of the gene except the last exon or two which are encoded by GLEAN3_08969. There is a significant overlap between the two GLEAN's.\n
GLEAN3_04785	SPU_004785		GLEAN3_06645 and GLEAN3_10748 are partial duplicate predictions for GLEAN3_04785.\n
GLEAN3_06645	SPU_006645		GLEAN3_06645 and GLEAN3_10748 are partial duplicate predictions for GLEAN3_04785.\n
GLEAN3_10748	SPU_010748	After reviewing the data and performing a BLAST search it appears that the sequence is dispersed onto 2 different scaffolds. The sequence appears to have an orderly arrangement if the two scaffolds were combined. The BLAST results displayed an additional stretch of sequence from 1-29 that was not available in the Excel spreadsheet. The additional 1-29 bases were continuous with the data provided in the excel spreadsheet. Some EST information was available and the transcriptome intensity score were strong ranging from 30-60. This is an un-annotated gene so no additional comments were available.	GLEAN3_06645 and GLEAN3_10748 are partial duplicate predictions for GLEAN3_04785.\n
GLEAN3_11454	SPU_011454		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group IA.\n
GLEAN3_12464	SPU_012464		The intron of this gene model included a long unknown sequence. So the model was modified as an intronless gene by comparison to Fgenesh++ and NCBI prediction. \nThis is a member of sea urchin-specific Tlr Group IB. \n
GLEAN3_27226	SPU_027226		This model is incorrect. It encodes the MTO1 gene in the first part but the latter half appears to be similar to GLIPR1L1(NP_689992).\n
GLEAN3_13160	SPU_013160	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. There was no Est information available on GBrowse assembly V0.5 and the transcriptome intensity scores appeared to widely dispersed with several values ranging from about 5-50. Additional information found from Baylor gene information: This GLEAN contains N-terminal region of the full-length Mdn1 gene, corresponding to amino acids 1 to ~1532 of human midasin. The mid rgion of the Sp-MDN1 gene is in GLEAN3_09702 and the C-terminal region is GLEAN3_22614. The MDN1 gene contains a conserved MIDAS domain COG5271.2 and a hexomeric AAA ATPase domain distantly related to that of dynein	This GLEAN contains N-terminal region of the full-length Mdn1 gene, corresponding to amino acids 1 to ~1532 of human midasin. The mid rgion of the Sp-MDN1 gene is in GLEAN3_09702 and the C-terminal region is GLEAN3_22614. The MDN1 gene contains a conserved MIDAS domain COG5271.2 and a hexomeric AAA ATPase domain distantly related to that of dynein\n
GLEAN3_09702	SPU_009702		This GLEAN contains the mid-region of the full length midasin polypeptide, corresponding to approximately amino acids 2660-3943 of human midasin. The N-terminal region is GLEAN3_13160 and the C-terminal region is GLEAN3_22614\n
GLEAN3_22908	SPU_022908		Likely missing an exon.\n
GLEAN3_23058	SPU_023058		Likely missing an exon (~30 AA).\n
GLEAN3_02179	SPU_002179		Missing ~100 AA.\n
GLEAN3_24540	SPU_024540		GLEAN3_13159 is a partial duplicate prediction for GLEAN3_24540.\n
GLEAN3_13159	SPU_013159		GLEAN3_13159 is a partial duplicate prediction for GLEAN3_24540.\n
GLEAN3_17807	SPU_017807		GLEAN3_17807 is a partial duplicate prediction for GLEAN3_18692.\n
GLEAN3_28576	SPU_028576		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. \nThis is a member of sea urchin-specific Tlr Group I(orphan).\n
GLEAN3_07429	SPU_007429		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.  \nThis is a member of sea urchin-specific Tlr Group IC. \n
GLEAN3_11570	SPU_011570		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR. This is a member of sea urchin-specific Tlr Group IC. \n
GLEAN3_26347	SPU_026347		This GLEAN forms part of the annotated full-length Sp-DNAH1 gene (SPU_030223)\n
GLEAN3_11876	SPU_011876		This GLEAN forms part of the annotated full-length Sp-DNAH10 gene (SPU_030231)\n
GLEAN3_15054	SPU_015054		This Glean forms part of the annotated full-length Sp-DNAH10 gene (SPU_030231)\n
GLEAN3_28362	SPU_028362		This glean forms part of the annotated full-length Sp-DNAH10 gene (SPU_030231)\n
GLEAN3_26539	SPU_026539		This GLEAN forms part of the annotated full length Sp-DNAH10 -gene (SPU_030231)\n
GLEAN3_28189	SPU_028189		This Glewan forms part of the annotated full-length Sp-DNAH10 gene (SPU_030231)\n
GLEAN3_02747	SPU_002747		This Glean forms part of the annotated full length Sp-DNAH12 gene (SPU_030232)\n
GLEAN3_05317	SPU_005317		This Glean forms part of the annotated full length Sp-DNAH12 gene (SPU_030232)\n
GLEAN3_20747	SPU_020747		This Glean forms part of the annotated full-length Sp-DNAH12 gene (SPU_030232)\n
GLEAN3_02750	SPU_002750		This Glean forms part of the annotated full length Sp-DNAH14 gene (SPU_030233)\n
GLEAN3_28434	SPU_028434		This glean forms part of the annotated full-length Sp-DNAH14 gene (SPU_030233)\n
GLEAN3_08188	SPU_008188	For this particular GLEAN model, there was no Cds information from both Baylor annotations and SpBase. There was however mRNA information available from SpBase. When examining the excel data, it appears that the sequence is distributed onto 2 different scaffolds. Subject AAGJ02141483 and AAGJ02094195 appear to have the most orderly arrangement however, without the BLAST results, it is difficult to determine whether or not these are the best fits for this GLEAN model. On subject AAGJ02094195, the sequence doesn't begin until about 400, resulting in a large gap within the sequence. There was no Est. information available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak with all of the values being less than 10.	This Glean forms part of the annotated full length Sp-DNAH14 gene (SPU_030233)\n
GLEAN3_12139	SPU_012139		This Glean forms part of the annotated full length Sp-DNAH14 gene (SPU_030233)\n
GLEAN3_16536	SPU_016536		Intronless Toll-like receptor with predicted secretory signal peptide, LRR-NT, LRR(22), LRR-CT, TM and TIR.   \nThis is a member of sea urchin-specific Tlr Group ID.\n
GLEAN3_26081	SPU_026081		This Glean is part of the annotated full-length Sp-DNAH15 gene (SPU_030234)\n
GLEAN3_17935	SPU_017935		This glean is part of the annotated full-length Sp-DNAH15 gene (SPU_030234)\n
GLEAN3_18822	SPU_018822		This Glean is part of the annotated full-length Sp-DNAH2 gene (SPU_030224)\n
GLEAN3_10136	SPU_010136		This Glean is part of the annotated full length Sp-DNAH2 gene (SPU_030224)\n
GLEAN3_18784	SPU_018784		This Glean is part of the annotated full-length Sp-DNAH2 gene (SPU_030224)\n
GLEAN3_27159	SPU_027159		This Glean forms part of the annotated full-length Sp-DNAH3 gene (SPU_030236)\n
GLEAN3_26238	SPU_026238		GLEAN3_12397 is a partial duplicate prediction for GLEAN3_26238.\n
GLEAN3_12397	SPU_012397		GLEAN3_12397 is a partial duplicate prediction for GLEAN3_26238.\n
GLEAN3_04548	SPU_004548		Missing an exon in middle.\n
GLEAN3_05987	SPU_005987		GLEAN3_05987 is a composite prediction of two genes. Sp-NUC205 and the last ~350 AA code for Sp-PAQR5. \nGLEAN3_15468 and GLEAN3_21249 are partial duplicate predictions for GLEAN3_05987.\n
GLEAN3_15468	SPU_015468		GLEAN3_15468 and GLEAN3_21249 are partial duplicate predictions for GLEAN3_05987.\n
GLEAN3_21249	SPU_021249		GLEAN3_15468 and GLEAN3_21249 are partial duplicate predictions for GLEAN3_05987.\n
GLEAN3_10384	SPU_010384		Missing first exon.\n
GLEAN3_12748	SPU_012748		GLEAN3_12748 is a partial duplicate prediction for GLEAN3_18535.\n
GLEAN3_27020	SPU_027020		Incorrect gene model. Extra exons.\n
GLEAN3_15720	SPU_015720		Likely has an extra exon. \nGLEAN3_02618 and GLEAN3_22235 are partial duplicate predictions for GLEAN3_15720.\n
GLEAN3_22235	SPU_022235		GLEAN3_02618 and GLEAN3_22235 are partial duplicate predictions for GLEAN3_15720.\n
GLEAN3_02618	SPU_002618		GLEAN3_02618 and GLEAN3_22235 are partial duplicate predictions for GLEAN3_15720.\n
GLEAN3_24386	SPU_024386		This gene model may represent a pseudogene or contain a sequence error. Intron sequence matches coding sequence of other Sp-Tlr genes and contains stop codons. Modified gene model has some frame shifts, but reflects best gene structure. \nThis is a member of sea urchin-specific Tlr Group IE. \n
GLEAN3_17828	SPU_017828		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_25360	SPU_025360		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_08522	SPU_008522		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_00614	SPU_000614		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_04170	SPU_004170		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_28108	SPU_028108		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_15037	SPU_015037		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_21335	SPU_021335		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_06655	SPU_006655		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_16709	SPU_016709		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_22759	SPU_022759		A number of other GLEAN's may be partially similar to Sacsin. These may include GLEAN3_08522, 15037, 21335, 28108, 25360, 04170 and 00614. \nGLEAN3_17828,_08522 and _25360 are very simlar and form one group. \nGLEAN3_04170, _28108, _15037, _21335, _00614, _06655, _16709 and _22759 form a second group of sacsin like genes.\n
GLEAN3_15550	SPU_015550		Likely incorrect. Longer than required.\n
GLEAN3_15087	SPU_015087		GLEAN3_15087 has the second half of TSR1 gene. GLEAN3_19664 likely codes for the first half.\n
GLEAN3_19664	SPU_019664		GLEAN3_15087 has the second half of TSR1 gene. GLEAN3_19664 likely codes for the first half.\n
GLEAN3_27482	SPU_027482		Longer than required prediction.\n
GLEAN3_05961	SPU_005961		Missing one or more exons at beginning (~80 AA).\n
GLEAN3_05268	SPU_005268		GLEAN3_20117 is a duplicate prediction for GLEAN3_05268.\n
GLEAN3_20117	SPU_020117		GLEAN3_20117 is a duplicate prediction for GLEAN3_05268.\n
GLEAN3_22232	SPU_022232		GLEAN3_22232 is a partial duplicate prediction for GLEAN3_13282.\n
GLEAN3_02056	SPU_002056		GLEAN3_20938 is a partial duplicate prediction for GLEAN3_2056.\n
GLEAN3_20938	SPU_020938		GLEAN3_20938 is a partial duplicate prediction for GLEAN3_2056.\n
GLEAN3_18123	SPU_018123		Incorrect gene model. Has an extra exon (or more) in the middle and is missing the last exon (or more).\n
GLEAN3_23316	SPU_023316		Incorrect gene model. This appears to be a conserved gene across species.\n
GLEAN3_26256	SPU_026256		GLEAN3_26256 is a longer duplicate prediction for GLEAN3_23423.\n
GLEAN3_22950	SPU_022950		Only second half of the WAPAL gene is encoded by GLEAN3_22950. First part is missing. \n
GLEAN3_21483	SPU_021483		#\nIncorrect gene model. Likely has an extra exon predicted.\n
GLEAN3_01415	SPU_001415		Inspection of the tiling array suggests that glean may have missed the following exons: KSKRAPREEDTAPKRRREEAAGSSKQSPTKKKISSGRQAAGSGGGTPTQDELAPDPRESAKPAAQKRAEGPIKSDQTVRVEEKQESDSESSGRSSSGKGAKLASLPELME\n
GLEAN3_02008	SPU_002008		Inspection of the tiling array suggests that glean may have missed the following exons: DSYRREILLLYSLWQGLPNETSSYQPRPCAHGRETLRVRDMPQSFHRAGYSPKAQDHPFWPEALQVRDLRPSLCRQKRPQLSC,CELCPRKFVRKNFLNAHMKLHQGIKPKKPPERSFTCTICNKVLKTRASYQTHNRIHTGEKSFCCTLCGKAFPTKPRLINHVRVHTGEKPYECETCHKAFTEPGTLRRHKIIHSGLKPYKCETCDRAFADKSALNSHVKMHTGQKSHSCEFCGKMFWTATNMRQHAKTHRKKSMFECGVCSKEIFGQENLTAHLVEHEAEQR,LSNVPSAISGFRQKGQRTQHQKKVHKVKMEEGNEAEGEVVSSEDGPVIIPNVKRVFKCRVCQVEFEAKEELKEHKLTHKELENGDDEYVPISVKVSRPKKVVETFKCDICNNSFAQKAYLERHRRVHTGEKPFGCTLCEKKFSDMTSLRRHKSIHTGAKPF,QCEVCEKFFKTKKTLQKHGAIHDEEKRYECDVCQKRFSRKAYLVSHSTIHTGEKPYTCEDCGRQFRDRSSMKRHMNTHKGIKRYECNVCQKQFTDKSAANIHLRIHTGEKPYECYECK\n
GLEAN3_02009	SPU_002009		Inspection of the tiling array suggests that glean may have missed the following exons: DGQRLKHMRQKHSLAKCDECGACFEDEALLQRHMKMHSQVTMFMCDVCGSTFTKKSYLTFHMVVHEKEDLSERMSMKEVDEGKVPALPSKGQKQLQIVDDDEDDDDVGDGGNDPDDSDWEEPLAAKKKGSKFDCDRCSRSFASLRGLKMHQRMLHITVEEEPQSESSEEEEEDAKTEEDAMIEEKGGNDKSKHCPVCKKSFVSVRQLTRHESTHASWDCTYCSKTFRTSWILKEHLNTHTGQRPYQCTECDKTFKSHGALRRHTIIHKGTKPYRCDLCDMRFSDGSSLKSHKKRRSCR,RATRNVDHAGDKQVECDVCLKKFYTGFQMRTHRLTHGGQDHKEENLLRCESCSKAFLSPSGLEKHKKSGKCGKKFTCPFCTDSFIYKYEREKHMETHAEFVNKADKEDEVTDGVKKSKVFKCPICEQKFPNLRTFTIHRKKHERGKVYACEVCNKVFSTPISLKYHRKLHTGQGPKCSVCDKTFYNLKSLRRHERIHTGQKPYNCGFC,SLSYLSSTHTESFVQIVGRHSSTFPDSSLPQAESTNLNLETMGEDAQPTNQDVDTVMLETELTNESADTMTPEAELTDENADPVILVSEMTNQSADTMTPEAGLTNENADPVILVSEMTNQSADTMTPEAGLTNENADPVILASEMTNQSVDTMIPEDQPQTNAEPRSSSLEVQGKL,LSNVPSAISGFRQKGQRTQHQKKVHKVKMEEGNEAEGEVVSSEDGPVIIPNVKRVFKCRVCQVEFEAKEELKEHKLTHKELENGDDEYVPISVKVSRPKKVVETFKCDICNNSFAQKAYLERHRRVHTGEKPFGCTLCEKKFSDMTSLRRHKSIHTGAKPF\n
GLEAN3_07169	SPU_007169		Inspection of the tiling array suggests that glean may have missed the following exons: TRYKRICDRHICDDPVLAMLNLPLPNINSKSITNFISKQEKRSISLQPLNPKSSNIAKAFLLIPFRHLSSFQRCSRTQGMHRLSTVSYLMNLASAGVRRGTHSSMNQGLFHLQGMSQHLAHLQCLYLLLVTNQNLSYPLLPPR\n
GLEAN3_11001	SPU_011001		Inspection of the tiling array suggests that glean may have missed the following exons: FYVNFLLLFFRFSLVSLYIWRYHHGLPVKRCGKLATYHPVNRSATVISYLKNSQPCQPLDMASTSNPSLLPQKNIKQEIIVVFEPPAREMASSSSHSSVPQPETSQEINVVPVSPVKDMPSTSNPSSGHETGQENDMVPDSTGIELSKDDLKAGGGKVGSSSKKKSGCEKDSDEYKRRRERNNEAVRKSRQKSRQKASETEVRVTELKKENADLEQRVTLLHKELELLKDLFLTHANELPDPSTTFGLFNANPRLGSSSPNPALSRRIVLKTESLTVSLTCRNVPESITTTT\n
GLEAN3_11380	SPU_011380		Inspection of the tiling array suggests that glean may have missed the following exons: FLDEGQVLLRLLLFVLDNPLIKVLIKQTFELLIYVCLGLYSVRMNCMLPGLILSCSIFYPAERAVRLFLTSTVSWPLEYYFLLTDQHAYLHLRSVAGCLEGMAISRGLLLLVCHLLTLVFALEHILQEGLCCSVHENQ\n
GLEAN3_14683	SPU_014683		Inspection of the tiling array suggests that glean may have missed the following exons: SPVISDAQKFQCSHCEGLFSSAKLILRHIRCEHSDGEPCEMMPALAWKRKGKKKGREKSVAIKFKFNHPINHPIVRKRKNSEEEEECDFRCGTCVKSFPSLGRLKEHELFHEMMHGDKPYECSECNQRYTAQSSLNRHEREVHGFLDDYKPRSRPKRLKAHVPKKPLHCRYCGQGYKSRGALANHERRIHGSRHPIREPDLPNDEPKDMLGRPSDYYQRPFKCRFCPKRYVSWTTVEQHEKEVHTREGTFKCSHCPKVCASESRLKEHLVVHKYMHMHRCTLCPRSFASESALNNHQGEHTGLKPFKCEICSRGFRTRKLTLKHKQRMHQERPKRYICSICNKGFAEKCNLKVHERRHKGIRQFVCLECGKGFTARFSLTAHMQAMHIKERPFACEICGKSFALNHHYNHHMAKHRLDGDDSIPQ,RRMYRKSHFTVVTVAKGTNHAVHSRTTRGESMALGTRFGNRTYRTMSPRICLVDPLITTSDPSSADFVQRDTFPGQRLNNTRRRSTREKALSSAVIVPRFAPVRAV,SVKEIQTIKQREQCSSSSHQASASSSSSDTSNPTPNTSKDESQLLAALNLKKTKSIQDLPQNLLFRATPEGKVDGVVAKERIEKGVEFGPYAGTLLDEEQGWTRDTTWEVRRAVFHKTVF,FPLDSAHGVSNAGIIHQARQQLPVHLRGHGVKSSTLSANYAPPITTHEPIRERNDLPITTHESVSSIIQPLTTPESGAKSNVPRPQGTVCNFCLVGFC,TRCPGVRDRTLVVTQRSVNKLFVARNCHSFTLKKPRQFILWQFTAPYLNKPLLSLSLSLSPSLYATSLNLENELSSASTDSNLTLYH\n
GLEAN3_14686	SPU_014686		Inspection of the tiling array suggests that glean may have missed the following exons: FSFNVSDAQKFQCSLCEDLFSSSKLILRHIRLEHTDGKPHDKLPLVTPKKREKKHVRLKISSKHLFKTKKKAREKEESDLKCATCGKVFLSSGRLKAHEIFHDYNQDHTCPICGKRQKNAQTWAKHMNLHKPASEARPHKCNECNGRYKSKAALRKHQHQVHGYPCRLCSERFSRMKDCKTHEQTHQAFIPPHAAVEYLVAELSPDEPKDMLGKRANYYQRRFKCRYCPKRYSDHYTVRNHEKENHTGEGTFKCSHCPKAYTSESRLKTHLLFHEQTHIYRCMLCPSSFASESALNSHQGEHTGLKPVKCDVCGKGFRTRKHSLAHRRRVHQERPKRFFCSYCNFGFAEKGDFNKHEQRHKGIRRYMCRECGMPFTSNTSLTAHIRALHTKERPFSCEICGKTFALNFKYTLHMVRHNVQVNGSSLQQQ,QNSLLMSRKTCLVNVPITTNEGSSAVIVQRDIVTITLSGTMRKRTIPVKEPSNAVIAQKLIPAKAVLKLTSCSMSRRTYTAACFVQAALHRKAPSTVTKENIPD,SQSNVMSVAKDLELGSIPLRIGVVFIRNVQSVSSAPTAILGLQKRAILTNMSSGTRALDDTCVGSVECPSPQIHLSQLTFEHCIPRKDHFHVKYAGKLLP,LGQSSIKSAPSLQSTDHRKQCLPSSKPFHQANASPPAPDVSAAEPPLFAALNLTKTISVMDLPLSLSLRVASQDKVEGVVAKDTVEKGVEFGPYTGTLLDEEQGSSKETTWEV,VSHLLNQLLHYKAPTIESSVYLRQSRFIKLMQVLRPQTSLLPNLLFLQRSISQRQSPSWIFHCLSHYELHRRTRSKEWLPRIQLRRGWSLDPTQEHCWMRSRDRLRRQPGRY,ANIIHQTPPQPPVTLPVHIRGHGVKSSTLSANYAPPINTTHGAVEEERNDLQIATHGSVEASKMLPLAFHKSVVERNFQPTTSYESGLHVESNALPIITCKSA\n
GLEAN3_14687	SPU_014687		Inspection of the tiling array suggests that glean may have missed the following exons: QCRFPILKPLYHEIQIAVWMPQGAVQGMVSEADPLRTQADAVRLCGMVGPWGRVQHVAGVVAVVVAKVVDVVGFPVDLFQCSRNQVDNYLPLLIYYEGIKEN,VLNVAVHTLVKFEILESCMNSNILPVYLAVEHFQFTSWFNDLSASEVDIAVILICFHVLFRFTPCASQMRHPVFTSVNVEENPTWITDL\n
GLEAN3_17847	SPU_017847		Inspection of the tiling array suggests that glean may have missed the following exons: EWTGGRHYVMIQDWTDLEGGHHVVTAHRDENPNRPKETQRNRHVKRPRSLHSMMDRKLPLDIADNLRPNQKKSWITTKDPLHHLTCDIHLGKWRRRLHGLVSGLKLLVRMLKLKLTL,PPGHEIVLADRVTGKRVALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLPVRVFNPTREQGIIKAGTAIASLSALEEMGTEMCKQTMPTETSASKNMTSDKQRNRDGDARRATTHFFQCDSCTKSYEKKDSLQRHKREVHIRKYRCGQCDYRTGRKTEIERHQGAHAVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPLRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDPTKSSCQKTPESPQHDGSEASPRHSRQSTPEPEEELDNDERSPSPLDLRYSSRKMASETAWASEWTEIASKNAETQTDT,ESPVNPIELWKKGPIVARSVVKTDEDVLPVRVFNPTREQRNIKAGTAIASLSAVEEVGTEMCNQTMPTETSASRNMPPDKQGNKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHEVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPPRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDLTKSSCQKTPESPQHDGSEAS,WCQKWDPVTMAQGRNRQNRPTPGEMMKHLERSMTEVDQMMEQLRASMSQVPVADPEPGRQEAFPPTLSGQTASGPTADSQMPAAQRVRFSSTPREQSRRLPSAGTGVHITPPLFSPP\n
GLEAN3_17848	SPU_017848		Inspection of the tiling array suggests that glean may have missed the following exons: EWTGGRHYVMIQDWTDLEGGHHVVTAHRDENPNRPKETQRNRHVKRPRSLHSMMDRKLPLDIADNLRPNQKKSWITTKDPLHHLTCDIHLGKWRRRLHGLVSGLKLLVRMLKLKLTL,PPGHEIVLADRVTGKRVALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLPVRVFNPTREQGIIKAGTAIASLSALEEMGTEMCKQTMPTETSASKNMTSDKQRNRDGDARRATTHFFQCDSCTKSYEKKDSLQRHKREVHIRKYRCGQCDYRTGRKTEIERHQGAHAVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPLRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDPTKSSCQKTPESPQHDGSEASPRHSRQSTPEPEEELDNDERSPSPLDLRYSSRKMASETAWASEWTEIASKNAETQTDT,GKGEVYLAELRQRCRQPKESLQELGQTIRELCTLSYPEFDEKGQDRLARGHFLDAVVTPEIREGLFRAQPRTLDDAVEAALNTEAFLRMEGQRNEVKRSTTYSRALEECEVSAIREQQPRNPTIDEIVKKVLDALDMRNGRNTIKPDVPDRRPEQTMPTKVSEREDNRCFNCNELGHWRNQCPYPRKVRGGTAPPAAEKANTNLQWATANGMTDEEEQARVGSSQDPNRKGLFLE,RCVNCLSGCSIQQGNMGSQRQALPLPHCQRWKKWAQKCAIRRCPLKPVQAETCHPTSRETRTLMPEVRLPTSFNVTLVPKVMRRKTL,DFLVQINAVLDCQKMELRTEWGIIPCLDSEGESFCRRIVAGEEYSIPPGHEMVLANRVTGEKIALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLIACQGVQFNKGTWDHKGRHCHCLIVSAGRSGHRNVQSDDAH,LPVRVFNSTREHGITKAGTAIASLSALEEVGTEMCNQTMPIETSASRNMPPDKQGDKDVDARGTTTHFFQCDSCTKSYEKKDSL,LPVRVFNSTREQGITKAGTAIASLSALEEVGTEMCNQTMPIETSASRNMPPDKQEDKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHDVAVVP,CLSGCSIQQGNRGSQRQALPLPHCQRWKKWAQKCAIRRCPLKPVQAETCHPTSRKTRTLMPDVRLPTSFNVTLVPKVMRRKTLCKDTSGKCIS,WCQKWDPVTMAQGRNRQNRPTPGEMMKHLERSMTEVDQMMEQLRASMSQVPVADPEPGRQEAFPPTLSGQTASGPTADSQMPAAQRVRFSSTPREQSRRLPSAGTGVHITPPLFSPP,SPRRYRSSRRKSKSLQRDLTKSSCQKTPESPQHDGSEAPPRHSRQSTPEPGEELDNDERSPSPLDLRYSSRKMASETAWASEWTEIASKNAETQTDTDSSR,RNRHVKRPRSLHNMMDRKLPLDIADNLRPNQEKSWITTKGPLHHLTCDIHLGKWRRRLPGLVSGLKLLVRMLKLKLTLIVLVK\n
GLEAN3_17849	SPU_017849		Inspection of the tiling array suggests that glean may have missed the following exons: EWTGGRHYVMIQDWTDLEGGHHVVTAHRDENPNRPKETQRNRHVKRPRSLHSMMDRKLPLDIADNLRPNQKKSWITTKDPLHHLTCDIHLGKWRRRLHGLVSGLKLLVRMLKLKLTL,PPGHEIVLADRVTGKRVALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLPVRVFNPTREQGIIKAGTAIASLSALEEMGTEMCKQTMPTETSASKNMTSDKQRNRDGDARRATTHFFQCDSCTKSYEKKDSLQRHKREVHIRKYRCGQCDYRTGRKTEIERHQGAHAVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPLRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDPTKSSCQKTPESPQHDGSEASPRHSRQSTPEPEEELDNDERSPSPLDLRYSSRKMASETAWASEWTEIASKNAETQTDT,ESPVNPIELWKKGPIVARSVVKTDEDVLPVRVFNPTREQRNIKAGTAIASLSAVEEVGTEMCNQTMPTETSASRNMPPDKQGNKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHEVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPPRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDLTKSSCQKTPESPQHDGSEAS,GKGEVYLAELRQRCRQPKESLQELGQTIRELCTLSYPEFDEKGQDRLARGHFLDAVVTPEIREGLFRAQPRTLDDAVEAALNTEAFLRMEGQRNEVKRSTTYSRALEECEVSAIREQQPRNPTIDEIVKKVLDALDMRNGRNTIKPDVPDRRPEQTMPTKVSEREDNRCFNCNELGHWRNQCPYPRKVRGGTAPPAAEKANTNLQWATANGMTDEEEQARVGSSQDPNRKGLFLE,RCVNCLSGCSIQQGNMGSQRQALPLPHCQRWKKWAQKCAIRRCPLKPVQAETCHPTSRETRTLMPEVRLPTSFNVTLVPKVMRRKTL,DFLVQINAVLDCQKMELRTEWGIIPCLDSEGESFCRRIVAGEEYSIPPGHEMVLANRVTGEKIALCEGLVESPVNPTELWKKGPIVARSVVKADEDVLIACQGVQFNKGTWDHKGRHCHCLIVSAGRSGHRNVQSDDAH,LPVRVFNSTREHGITKAGTAIASLSALEEVGTEMCNQTMPIETSASRNMPPDKQGDKDVDARGTTTHFFQCDSCTKSYEKKDSL,LPVRVFNSTREQGITKAGTAIASLSALEEVGTEMCNQTMPIETSASRNMPPDKQEDKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHDVAVVP,CLSGCSIQQGNRGSQRQALPLPHCQRWKKWAQKCAIRRCPLKPVQAETCHPTSRKTRTLMPDVRLPTSFNVTLVPKVMRRKTLCKDTSGKCIS,SPRRYRSSRRKSKSLQRDLTKSSCQKTPESPQHDGSEAPPRHSRQSTPEPGEELDNDERSPSPLDLRYSSRKMASETAWASEWTEIASKNAETQTDTDSSR,RNRHVKRPRSLHNMMDRKLPLDIADNLRPNQEKSWITTKGPLHHLTCDIHLGKWRRRLPGLVSGLKLLVRMLKLKLTLIVLVK\n
GLEAN3_17850	SPU_017850		Inspection of the tiling array suggests that glean may have missed the following exons: ESPVNPIELWKKGPIVARSVVKTDEDVLPVRVFNPTREQRNIKAGTAIASLSAVEEVGTEMCNQTMPTETSASRNMPPDKQGNKDVDARRTTTHFFQCDSCTKSYEKKDSLQRHKREVHILKYRCDQCDYRTGRKTEMERHQGAHEVAVVPVRKHSLTPVSSPRCKPTSTPITGTSEKADASWRHVRMDWRSPPRDDPRLDRPRGRSPRRYRSSRRKSKSPQRDLTKSSCQKTPESPQHDGSEAS,WCQKWDPVTMAQGRNRQNRPTPGEMMKHLERSMTEVDQMMEQLRASMSQVPVADPEPGRQEAFPPTLSGQTASGPTADSQMPAAQRVRFSSTPREQSRRLPSAGTGVHITPPLFSPP\n
GLEAN3_19384	SPU_019384		Inspection of the tiling array suggests that glean may have missed the following exons: TWIVTFGVRFHINADIVDIMKDEIAAAVVFLTRIVKRNTSLTAEQMSKFSEKLALTLIEKFRNHWYEDKPSKGQAYRCIRVSRNEPRDSVISKTAKDCGIHYNHLNLPAELCLWVDPLEVSCR,NQICCRFVNRHAPFSFICRFGERGTVCEVATFNQQTLTDNRPSTPISSSNNAFNDNNANQLSPTPSPPSSPPRQLVINDNLRPNSGRKMVNSSQYVFNRNATNTPRVIQRPQNKVWVRQPNPEQYRWVNKSIAGRA\n
GLEAN3_21758	SPU_021758		Inspection of the tiling array suggests that glean may have missed the following exons: EGLTPEALAQAGLTQNYVNAFTQQTLESLANSQGDITAENQISLQTQQIQQQLEAITGQSASLFSHAVNIQPVQEELPPPPPQETNRLPTTNTSSVFATNEGEKKTYSCHFCEKTFKKSSHLKQHIRSHTGEKPCKCMQCGRSFVSASTLRNHMRTHSGIKSFKCNLCNTSFTTNGSLVRHMNIHTDQRPHTCQQCGEAFRKQLDLKRHLKDHATESDDGEQLEDGKRPRNVIRFNEEEAQQIAKKPLARNATTSERILVQMVNDKNRVSEVLTQEDVLKSRPNFPNRCIHCSKSFKKPCDLVRHVRTHTGEKPFKCTECERTFAVKSTLVCHVRTHKGGKQEVCHICKTT,QVVVQNSGIQNIIPSLQNQSFIPSTSGMTTQVVAPPNSVAPFLQGSDVQMTIHDTLAQNVSHGEALMTNKMYTIHTTDRGTHLVQTDTTPSQADTSNLGQSFQLSLSGDQLSLQPNILLQQPNIQNPVEYTPSSSIAHSQEQITISTNGALGDSESSGTVTVNVVDFANLASHENVQTTASSSAAVPLQEEEESGEEDEDDDDVDDEDEEELEEESEDELQYVTEGGNLPSTGPMIAGHHVMREKPAMSAKDAAAAIGSIYPCT,EKPYKCPHCDKAFNQNGALHVHLTKHTGTKPHSCEFCGQKFAQRGNLRAHIVRVHEINSELEQRFECPQCCCNFRKMSSLNAHISRFHNSEEASSPLLSLKAALSAIQEGADWLDSTVDTPALAGALEQLINMLEENAGEGDPLDRIQEIMSGNTINTDILQQALDNSGVTNNVVEGQTETPAPAAPVAPVAPDAPDPAPEIPTASTTAPTVSTETTVAPANNQLNQLFSFVAE,KCLICDCLFTTNGSLKRHMSTHTDVRPFMCPYCQKTFKTSVNCKKHMKMHKRELALQAKQQENIEQDPNHVQEIQSIAEQASNLAEHLAIEQASNSILALAQGQLTVGQDGLHGANALNETS\n
GLEAN3_22024	SPU_022024		Inspection of the tiling array suggests that glean may have missed the following exons: YFSTMTIRHPLIHFAIQTYKYSSFKRQYWKTNGGEVCREDQRQRRQDNEGWPKRKRFISTSGSGVKRKLEGACQKEVYIHICQ,LCSQLSWRVKHLKHISVWTLYEGASIKDSDRRLRGAVDTHYIITGPEQTIRQSASKLLVSSAFKNELSNFILKEWGKEHYWNIYSGRTRFASYGGG\n
GLEAN3_23140	SPU_023140		Inspection of the tiling array suggests that glean may have missed the following exons: RAFPPAGLMNPRSSDCVGVTRPEGHLFGVGFVGLGGSEVVSVAAKQIFLTCSCDSAKLIPGTALHNLQYGFRNVVPFSSVTKFW,DTMARVLLIDTVKLSRAPRVLLSSKARQVVMAKTQVHKKCETTDREVQSSGTDVPSPLIICRSCARKSTTHNTMMSLLLTQRRM\n
GLEAN3_23510	SPU_023510		Inspection of the tiling array suggests that glean may have missed the following exons: GFCYEKSLLLHMKSHIGEMPHKGLVCKRGLSSNSFLLRHIRSHTGEKPYQCLVCGKSFAHNSTLKLHTLRTHPGELRASFLEGISGAPTRENPFHLFWGACLVPSGLCIRSGCRVT,SKLDISSCSLCSGGNSRMQSSHQSMSGDRSHGGERRPGLPEDFPLETGRGTRSNSSERESCPSEYDSCLIPSQVAYFHEKRAADRETADATTSYHKPGGMHKDRNADITMEYDEMINQSSQLRAAGSDQNEPSCSLTEEKLFLCC,THTGEKPYQCKFCDKKYSRSSSLGVHIRTHTGEKPYQCKHCDVSFSRVETLSRHIGTHTGKEPFECSFCKKTFSHNGHLSRHLKIHTGERPFKCSVCNKTFSERGYLKDHQVIH,EGSHFKSKIHISSHSLCSGGDSSLQSSYQSMRNTQEMSGDRSCDGARHLPEDFPLQDYLGTQSDSSERESCTSENDSYLIPSQVACFHEKQAADRETLDATTSNHKPEGIDID\n
GLEAN3_24746	SPU_024746		Inspection of the tiling array suggests that glean may have missed the following exons: NLTEHMGTHIGASPHHCPLCKSTFTRLSSLKKHVRNHHGKCPFQCVFCNKTFNEKDDLLVHVKIHTKKDLYHCLLCNKPWPTIRGLSFHIQLTHKDKSLASVLPETKHSPQGISSQEIPKPTQENIIPVALFSTTNPKDHSVNP\n
GLEAN3_24902	SPU_024902		Inspection of the tiling array suggests that glean may have missed the following exons: FFLSLFRFLHMKKSMCVDQSTQTMPDIYSCSHCGSSLLAKPMPPLARPQPSGPQQLPGLESSQARCINKKDNGDDYTYDDGIVTLDNEKELLGPDKWVSEAPDHPVAIFVKEEVMSDLEQSNSMHSDKEESFFDPRQDQEFRSDSKLEHSETGWYEEEEEEEEEEEEDEDEMEDDEEIDYEALERLDPTYDPFIGSRPHQLVSIVNF,GVSDDEFQDKQDRVKKCLTGNHSSVRSECPGVCPLCSTIWNSLADRTSHLQTHIPKDQGRQAFKEACEAIEKKIGKEYTQRMGWCEVCDKFSSSLHTHMSNV\n
GLEAN3_25849	SPU_025849		Inspection of the tiling array suggests that glean may have missed the following exons: SFVFSVSIHFGDEQHIETEHGVAQPVQTKDTVPSRGKLVTPATVSQHQDSMEVQIDYDDDEDNSYAAWMTEPDDGNDSDTQPSDTAKAVESDCPSAIQQPREEHPCPDCHVILHSTWDLDLHKLHDCTESNNSIQYVYNCDQCRKSFHRWAFTVAHLRKVHNCNMSHSEIIKKLDELKSTKSKKSGNKDEKCLPSEKKAPKRPGLIEAKGQTTRKKVARRKKRDNGDMQEVKKQTKSDRRNAPRLSKRPCIRLNKDLIKIFESERDDENNLTTNNRSIKPERTAGVSNVEKAQQRHTSDEHQNLNFKTEANNSSGGELATGRLNDERQGQGSTAEQVEEKGEDDLILMKEVELVIKDGERETSATIAMPESEISKAELEEETGNTLISQRDEGQGQAAEGVEENQDASANPIMEKKRKIHTADLKVHESQEGKSEPRDMIEHKASHETAERETKEASNLDADDDENNFTTIQKSTENAIVCSLCDLQFETKHDRSKHMPSHKEHRLQYKCSTCGKTFNRKVIYRTHVETHQDKTKRQKYH,RRECLKPTMRKSQEEMVSLINLLPHHLHPQTQGLTTLTMTVQTSKEAMLLKLLRPVMGKWLETGASPKRGCAGCNTPQPVRILSLKLHCRRVKSEKGERKKKR,LTAHTDVYKYSCDVCGKKFKRTSLRNSHMKVHSNDPANKPFKCELCSKAFAAQGKLKVHMDWHYNIRSYTCDVCGKSFLTKGNLDKHQFLHKGTKPHECKICSRGFMDLPGLRKHLDLVHKITLKKVVTQRVLEANDAKESGGDGVPHKPAATPSSPSNSGSDDADNDSPDFKRGNVAQAPEASDGQVVRDRGVSKEGVCRVQHPPTCQDIVAKIALPSGKKRKRGEEKKEE,NEIFFILFFNLLCSIPLTLRFIVQSFPTGIIGFSFEVKVLMFIRSVPLLCFCYVGNPCSSLRFNNHVVRGELIFIISFTFENLYPIFIQTDTKSLGKQRCLSSVSFCLLFHFLCISYIYFLPPCHLLPHLPSCFSEAWTSLNYCLRWKAFFIRIAIF,RSRPGAFLVLGGFCLGAGIGLLGAGTGLCCTVLSASLCNNPPSDMSCICSVSVSAWTPTPSSTRSSVCDIPLSSSDWSTC,TLYNHAICADLSPMCCVRSHGNPGMGVPLSADELHSGSHFLPLLRCLMFGCHYHLHHQVLSSMQHKNYPRHHRNRSVPPWNLGVVKR\n
GLEAN3_27708	SPU_027708		Inspection of the tiling array suggests that glean may have missed the following exons: SRCHVHASFVGRHVLSCSESLAAEVALVGPFTPVFWKVVLERILRREGAWALLALEGPFQRVFPPVHDKPGLFIECQWTELTFVSTTIRMHSHFVLVHLVVSGELLWAVLTFEDLWFLAGDKYSLSKHVLW,TWRSTIITLAVAWLTLWQSSISWVECGKCRLLSRHNTLVPMRDSKCESFDQQSGDISTRLKLVHDDTNRMSKLSNNSSPEWLRL\n
GLEAN3_07083	SPU_007083		Motor domain\n
GLEAN3_03746	SPU_003746		motor domain \n
GLEAN3_01633	SPU_001633		motor domain\n
GLEAN3_07595	SPU_007595		GLEAN3_16421 is a partial duplicate prediction for GLEAN3_07595.\n
GLEAN3_16421	SPU_016421		GLEAN3_16421 is a partial duplicate prediction for GLEAN3_07595.\n
GLEAN3_05795	SPU_005795		Missing first ~80-100 AA.\n
GLEAN3_15843	SPU_015843	From the BLAST results as well as the excel data, it is evident that the sequence is distributed onto two different scaffolds. It is also apparent that there is an overlap between the two scaffolds that occur from 1353-2346. When examining the excel data, it is clear that there are no internal repeats or gaps present within the two scaffolds. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be strong with most of the values being greater than 10. 	GLEAN3_02037 is a partial duplicate prediction for GLEAN3_15843.\n
GLEAN3_02037	SPU_002037		GLEAN3_02037 is a partial duplicate prediction for GLEAN3_15843.\n
GLEAN3_26535	SPU_026535		GLEAN3_26535 has part II of the gene. GLEAN3_19403 may code for Part I. There is still ~100 AA missing between the two GLEAN's.\n
GLEAN3_19403	SPU_019403		GLEAN3_26535 has part II of the gene. GLEAN3_19403 may code for Part I. There is still ~100 AA missing between the two GLEAN's.\n
GLEAN3_21620	SPU_021620		GLEAN3_05597 has most of the gene but a small part that is better coded by GLEAN3_21620.\n
GLEAN3_05597	SPU_005597		GLEAN3_05597 has most of the gene but a small part that is better coded by GLEAN3_21620.\n
GLEAN3_13688	SPU_013688		Incorrect gene prediction. GLEAN3_13687 is a partial duplicate prediction for UNC50 gene.\n
GLEAN3_13687	SPU_013687		Incorrect gene prediction. GLEAN3_13687 is a partial duplicate prediction for UNC50 gene.\n
GLEAN3_03464	SPU_003464		May have an extra exon at beginning.\n
GLEAN3_01579	SPU_001579		GLEAN3_25102 is a partial duplicate prediction for GLEAN3_01579.\n
GLEAN3_25102	SPU_025102		GLEAN3_25102 is a partial duplicate prediction for GLEAN3_01579.\n
GLEAN3_10312	SPU_010312		Missing first ~20 AA.\n
GLEAN3_16553	SPU_016553		GLEAN3_16553 and GLEAN3_18172 both overlapping but non-complete models for UTP14A.\n
GLEAN3_18172	SPU_018172		GLEAN3_16553 and GLEAN3_18172 both overlapping but non-complete models for UTP14A.\n
GLEAN3_14712	SPU_014712		GLEAN3_20985 is a partial duplicate prediction for GLEAN3_14712.\n
GLEAN3_20985	SPU_020985		GLEAN3_20985 is a partial duplicate prediction for GLEAN3_14712.\n
GLEAN3_28809	SPU_028809		Incorrect gene model. Extra exon(s).\n
GLEAN3_08110	SPU_008110		e val = 0 against NP_036442. \nExact match to XP_795366: PREDICTED: similar to Chromosome-associated kinesin KIF4A(Chromokinesin) [Strongylocentrotus purpuratus]. \n3447 nts spread over 25 exons. \nThis same sequence is found on Scaffoldi3148 from sp_20060316_asm. \nAnnotated by RA Obar, RL Morris, AL Silverio, BJ Chick,  AM Musante, AS Shorette.\n
GLEAN3_25767	SPU_025767	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. When examining the excel data it appears that the both scaffolds have no gaps or internal repeats present. However if the data from both scaffolds were to be combined, there would be a sequence overlap between the 2 scaffolds from 1210-1951. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	e val for Q02224=e-3e-38; CENPE_HUMAN [Homo sapiens].   \ne val for AAI10990=2e-30; KIF19 protein [Homo sapiens]. \nKinesin-7 family member.   \nSee also Glean3_17809 and Glean3_23126 which also hit Q02224. \nCENPE_HUMAN data obtained from UniProtKB/Swiss-Prot entry Q02224.    \nAnnotation by RA Obar, RL Morris, SA Tower, and AP Rawson.  \n
GLEAN3_06544	SPU_006544		Incorrect gene model. Excellent homology to xylulokinase only in the latter half of the gene model. Annotated as such. \n
GLEAN3_19081	SPU_019081		GLEAN3_04122 is a partial duplicate prediction for GLEAN3_19081. GLEAN3_19081 is likely missing a few amino acids in middle.\n
GLEAN3_04122	SPU_004122		GLEAN3_04122 is a partial duplicate prediction for GLEAN3_19081. GLEAN3_19081 is likely missing a few amino acids in middle.\n
GLEAN3_04700	SPU_004700		GLEAN3_04741 is a partial duplicate prediction for GLEAN3_04700.\n
GLEAN3_04741	SPU_004741		GLEAN3_04741 is a partial duplicate prediction for GLEAN3_04700.\n
GLEAN3_06047	SPU_006047		GLEAN3_06047 is missing a few AA at beginning and end. GLEAN3_15592 is a partial duplicate prediction that goes to the end of the CCT8 protein. There is a significant overlap between the two predictions.\n
GLEAN3_15592	SPU_015592		GLEAN3_06047 is missing a few AA at beginning and end. GLEAN3_15592 is a partial duplicate prediction that goes to the end of the CCT8 protein. There is a significant overlap between the two predictions.\n
GLEAN3_09300	SPU_009300		Likely missing ~50 AA in middle (one or more exon).\n
GLEAN3_22270	SPU_022270		GLEAN3_22269 is a partial duplicate prediction for GLEAN3_22270.\n
GLEAN3_22269	SPU_022269		GLEAN3_22269 is a partial duplicate prediction for GLEAN3_22270.\n
GLEAN3_24103	SPU_024103		Incorrect gene model. Longer than necessary.\n
GLEAN3_28735	SPU_028735		Missing ~100 AA.\n
GLEAN3_01512	SPU_001512		GLEAN3_17897 is a partial duplicate prediction for GLEAN3_01512.\n
GLEAN3_17897	SPU_017897		GLEAN3_17897 is a partial duplicate prediction for GLEAN3_01512.\n
GLEAN3_13005	SPU_013005		GLEAN3_02924 is a partial duplicate prediction for GLEAn3_13005.\n
GLEAN3_02924	SPU_002924		GLEAN3_02924 is a partial duplicate prediction for GLEAn3_13005.\n
GLEAN3_17669	SPU_017669		GLEAN3_25669 is a partial duplicate prediction for GLEAN3_17669.\n
GLEAN3_25669	SPU_025669		GLEAN3_25669 is a partial duplicate prediction for GLEAN3_17669.\n
GLEAN3_04893	SPU_004893		GLEAN3_1686 is a partial duplicate prediction for GLEAN3_04893.\n
GLEAN3_01686	SPU_001686		GLEAN3_1686 is a partial duplicate prediction for GLEAN3_04893.\n
GLEAN3_14697	SPU_014697		GLEAN3_11718 is a partial duplicate prediction for GLEAN3_14697.\n
GLEAN3_11718	SPU_011718		GLEAN3_11718 is a partial duplicate prediction for GLEAN3_14697.\n
GLEAN3_20801	SPU_020801		May be missing ~100 AA.\n
GLEAN3_12946	SPU_012946		GLEAN3_05913 is a partial duplicate prediction for GLEAN3_12946.\n
GLEAN3_05913	SPU_005913		GLEAN3_05913 is a partial duplicate prediction for GLEAN3_12946.\n
GLEAN3_21795	SPU_021795		Missing first ~80 AA.\n
GLEAN3_13396	SPU_013396		GLEAN3_00788 has first part of the TIMM44 gene. GLEAN3_13396 has the rest.\n
GLEAN3_00778	SPU_000778		GLEAN3_00788 has first part of the TIMM44 gene. GLEAN3_13396 has the rest.\n
GLEAN3_08672	SPU_008672		GLEAN3_06174 is a partial duplicate prediction for GLEAN3_08672.\n
GLEAN3_06174	SPU_006174		GLEAN3_06174 is a partial duplicate prediction for GLEAN3_08672.\n
GLEAN3_10671	SPU_010671		Likely missing the last exon.\n
GLEAN3_17059	SPU_017059		GLEAN3_19019 has the first part and GLEAN3_17059 the rest. There is a significant overlap between the two.\n
GLEAN3_19091	SPU_019091		GLEAN3_19019 has the first part and GLEAN3_17059 the rest. There is a significant overlap between the two.\n
GLEAN3_20010	SPU_020010		Only the first half of TOP3A is predicted by this GLEAN, rest is missing from the predictions.\n
GLEAN3_17723	SPU_017723		Missing the first half of the gene.\n
GLEAN3_05321	SPU_005321		Likely has an extra exon predicted. \n
GLEAN3_21928	SPU_021928		#\nMissing first ~40 AA.\n
GLEAN3_11091	SPU_011091		Model is missing ~50 AA.\n
GLEAN3_27066	SPU_027066		Missing first half.\n
GLEAN3_03970	SPU_003970		GLEAN3_09819 is a partial duplicate prediction for GLEAN3_03970.\n
GLEAN3_09819	SPU_009819		GLEAN3_09819 is a partial duplicate prediction for GLEAN3_03970.\n
GLEAN3_15304	SPU_015304		Incomplete model. First half is not correctly predicted.\n
GLEAN3_13174	SPU_013174		GLEAN3_20504 is a partial duplicate prediction for GLEAN3_13174.\n
GLEAN3_20504	SPU_020504		GLEAN3_20504 is a partial duplicate prediction for GLEAN3_13174.\n
GLEAN3_15044	SPU_015044		GLEAN3_11230 is a partial duplicate prediction for GLEAN3_15044.\n
GLEAN3_11230	SPU_011230		GLEAN3_11230 is a partial duplicate prediction for GLEAN3_15044.\n
GLEAN3_11567	SPU_011567		GLEAN3_11567 has first part of the gene and GLEAN3_25450 has the rest.\n
GLEAN3_25450	SPU_025450		GLEAN3_11567 has first part of the gene and GLEAN3_25450 has the rest.\n
GLEAN3_27693	SPU_027693		GLEAN3_22721 is a partial duplicate prediction for GLEAN3_27693.\n
GLEAN3_22721	SPU_022721		GLEAN3_22721 is a partial duplicate prediction for GLEAN3_27693.\n
GLEAN3_08040	SPU_008040		GLEAN3_27793 is a partial duplicate prediction for GLEAN3_08040.\n
GLEAN3_27793	SPU_027793		GLEAN3_27793 is a partial duplicate prediction for GLEAN3_08040.\n
GLEAN3_15114	SPU_015114		GLEAN3_15114 is a partial duplicate prediction for GLEAN3_15351.\n
GLEAN3_01978	SPU_001978		Incorrect gene model. Longer than required. \nGLEAN3_03188 is a partial duplicate prediction for GLEAN3_01978.\n
GLEAN3_03183	SPU_003183		GLEAN3_03188 is a partial duplicate prediction for GLEAN3_01978.\n
GLEAN3_12816	SPU_012816		GLEAN3_12816 has first part of the gene and GLEAN3_25309 has the latter. \n
GLEAN3_25309	SPU_025309		GLEAN3_12816 has first part of the gene and GLEAN3_25309 has the latter. \n
GLEAN3_27769	SPU_027769	From the BLAST results as well as the excel data, it appears that this is the best fit for this particular GLEAN model. When examining the excel data, it is clear that the scaffold contains an orderly arrangement without any gaps or repeats present. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5.	GLEAN3_27769 is a partial duplicate prediction for GLEAN3_02606.\n
GLEAN3_28866	SPU_028866		GLEAN3_28866 is a partial duplicate prediction for GLEAN3_22607.\n
GLEAN3_26082	SPU_026082		Transcriptome and alignment with best blast hit suggests that prediction may lack an N-terminal exon\n
GLEAN3_19582	SPU_019582		GLEAN3_19260 is a partial duplicate prediction for GLEAN3_19582.\n
GLEAN3_19260	SPU_019260		GLEAN3_19260 is a partial duplicate prediction for GLEAN3_19582.\n
GLEAN3_21510	SPU_021510		GLEAN3_01446 has the first part of the HECTD1 gene. GLEAN3_21510 has the rest. There is an overlap between the two GLEANs.\n
GLEAN3_05518	SPU_005518		Missing an exon at beginning.\n
GLEAN3_19431	SPU_019431		Missing first half of the gene. GLEAN3_09004 is a partial duplicate prediction for GLEAN3_19431.\n
GLEAN3_09004	SPU_009004		Missing first half of the gene. GLEAN3_09004 is a partial duplicate prediction for GLEAN3_19431.\n
GLEAN3_03914	SPU_003914		GLEAN3_05017 is a partial duplicate prediction for GLEAN3_03914.\n
GLEAN3_05017	SPU_005017		GLEAN3_05017 is a partial duplicate prediction for GLEAN3_03914.\n
GLEAN3_07280	SPU_007280		GLEAN3_07280, GLEAN3_13615, GLEAN3_21285 and GLEAN3_27378 are all "phospholipid scramblase like" sequences.\n
GLEAN3_13615	SPU_013615		GLEAN3_07280, GLEAN3_13615, GLEAN3_21285 and GLEAN3_27378 are all "phospholipid scramblase like" sequences.\n
GLEAN3_21285	SPU_021285		GLEAN3_07280, GLEAN3_13615, GLEAN3_21285 and GLEAN3_27378 are all "phospholipid scramblase like" sequences.\n
GLEAN3_05123	SPU_005123		GLEAN3_05123 has the first part of SCFD1 gene and GLEAN3_05492 has the other half.\n
GLEAN3_05492	SPU_005492		GLEAN3_05123 has the first part of SCFD1 gene and GLEAN3_05492 has the other half.\n
GLEAN3_17144	SPU_017144		GLEAN3_17144 appears to have the first half of the VPS33B gene, while GLEAN3_02218 may have the remaining part.\n
GLEAN3_02218	SPU_002218		GLEAN3_17144 appears to have the first half of the VPS33B gene, while GLEAN3_02218 may have the remaining part.\n
GLEAN3_24158	SPU_024158		May be missing the last exon.\n
GLEAN3_05862	SPU_005862		GLEAN3_01745 is a duplicate prediction for GLEAN3_05862.\n
GLEAN3_01745	SPU_001745		GLEAN3_01745 is a duplicate prediction for GLEAN3_05862.\n
GLEAN3_02387	SPU_002387		Incorrect gene model. Likely extra exon(s) predicted for the longer model GLEAN3_02387. GLEAN3_16361 has better sequence homology with GBF1 but is incomplete.\n
GLEAN3_16361	SPU_016361		Incorrect gene model. Likely extra exon(s) predicted for the longer model GLEAN3_02387. GLEAN3_16361 has better sequence homology with GBF1 but is incomplete.\n
GLEAN3_03406	SPU_003406		GLEAN3_25852 is a partial duplicate prediction for GLEAN3_03406. \nGLEAN3_03406 is likely missing the last exon(s).\n
GLEAN3_25852	SPU_025852		GLEAN3_25852 is a partial duplicate prediction for GLEAN3_03406.\n
GLEAN3_19999	SPU_019999		Missing first ~80 AA.\n
GLEAN3_06680	SPU_006680		Incorrect gene model. Longer than is necessary.\n
GLEAN3_08487	SPU_008487		GLEAN3_08487 is a duplicate prediction for GLEAN3_13194.\n
GLEAN3_13647	SPU_013647	From the BLAST results as well as the excel data it was evident that the sequence is distributed onto 2 different scaffolds. The sequence didn't begin until the 17th base pair and had an orderly arrangement until the end of the scaffold. However, there were 2 internal repeats present within the scaffold on 569-666 and 665-825. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be somewhat strong with most of the values being greater than 5. 	GLEAN3_13648 is a duplicate prediction for GLEAN3_13647.\n
GLEAN3_13648	SPU_013648		GLEAN3_13648 is a duplicate prediction for GLEAN3_13647.\n
GLEAN3_27976	SPU_027976		Likely missing first exon.\n
GLEAN3_10670	SPU_010670		Incorrect gene model. Prediction longer than necessary.\n
GLEAN3_03197	SPU_003197	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. The first portion of the sequence (v2.1_scaffold29830) has a continuous orderly arrangement without any repeats or gaps present. It also has the lowest e-value and highest bit score results from the BLAST search. The rest of the sequence is distributed onto v2.1_scaffold50215, which has an orderly continuous arrangement as well but had a lower bit score and higher e-value when compared to the rest of the results. There was some Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be somewhat strong with most of the scores being greater than 10.	Incorrect gene model. Likely a hybrid prediction.\n
GLEAN3_13577	SPU_013577		Extra ~40AA (exon).\n
GLEAN3_03805	SPU_003805		GLEAN3_25072 is a similar prediction.\n
GLEAN3_25072	SPU_025072		GLEAN3_03805 is a similar prediction. \n
GLEAN3_00723	SPU_000723		Missing first ~25 AA.\n
GLEAN3_26892	SPU_026892		GLEAN3_26892 has most of the TUBGCP3 gene. Last 150 aa are likely encoded by GLEAN3_10158. GLEAN3_21617 is a partial duplicate prediction for GLEAN3_26892.\n
GLEAN3_21617	SPU_021617		GLEAN3_26892 has most of the TUBGCP3 gene. Last 150 aa are likely encoded by GLEAN3_10158. GLEAN3_21617 is a partial duplicate prediction for GLEAN3_26892.\n
GLEAN3_10158	SPU_010158		GLEAN3_26892 has most of the TUBGCP3 gene. Last 150 aa are likely encoded by GLEAN3_10158. GLEAN3_21617 is a partial duplicate prediction for GLEAN3_26892.\n
GLEAN3_11574	SPU_011574		Exons 24, 14, 31 and 46 are not conserved or expressed according to the transcriptome data and are therefore questionable.\n
GLEAN3_09648	SPU_009648		Missing first ~100 AA.\n
GLEAN3_02510	SPU_002510		GLEAN3_28365 is a partial duplicate prediction for GLEAN3_02510.\n
GLEAN3_28635	SPU_028635		GLEAN3_28365 is a partial duplicate prediction for GLEAN3_02510.\n
GLEAN3_01119	SPU_001119		GLEAN3_01119 and GLEAN3_23988 are overlapping incomplete predictions for SSRP1. GLEAN3_01119 is missing ~100 AA at the beginning and end of SSRP1. GLEAN3_23988 is missing ~250 AA at the beginning but goes all the way to the end of SSRP1.\n
GLEAN3_23988	SPU_023988		GLEAN3_01119 and GLEAN3_23988 are overlapping incomplete predictions for SSRP1. GLEAN3_01119 is missing ~100 AA at the beginning and end of SSRP1. GLEAN3_23988 is missing ~250 AA at the beginning but goes all the way to the end of SSRP1.\n
GLEAN3_22853	SPU_022853		GLEAN3_22853 is a partial duplicate prediction for GLEAN3_19373.\n
GLEAN3_11902	SPU_011902		GLEAN3_07972 is a partial duplicate prediction for GLEAN3_11902.\n
GLEAN3_07972	SPU_007972		GLEAN3_07972 is a partial duplicate prediction for GLEAN3_11902.\n
GLEAN3_03845	SPU_003845		GLEAN3_22665 encodes the first part of the RABEP1 gene and GLEAN3_03845 appears to encode the latter half. There is overlap between the two predictions.\n
GLEAN3_22665	SPU_022665		GLEAN3_22665 encodes the first part of the RABEP1 gene and GLEAN3_03845 appears to encode the latter half. There is overlap between the two predictions.\n
GLEAN3_11194	SPU_011194		PREDICTED: similar to fertility related protein WMP1 [Strongylocentrotus purpuratus],spermatogenesis-associated protein 7 [Rattus norvegicus]\n
GLEAN3_11195	SPU_011195		PREDICTED: hypothetical protein [Mus musculus] \n
GLEAN3_11196	SPU_011196		#\nTryptophanyl-tRNA synthetase, cytoplasmic (Tryptophan--tRNA ligase) (TrpRS)\n
GLEAN3_11198	SPU_011198		PREDICTED: similar to polymerase (DNA directed), alpha 2 (70kD subunit) [Strongylocentrotus purpuratus] \n
GLEAN3_11201	SPU_011201	After reviewing the data it appears that there is no sufficient GLEAN model that fits model_GLEAN3_11201. The excel data had numerous repeats within the 1-228 region and this may be the reason there is no good fit onto V2.1.  The sequence is distributed onto 2 different scaffolds. If the 2 scaffolds were combined the sequence would be continuous with no gaps or repeats. There was Est. information available on GBrowse assembly V0.5 and the transcriptome intensity scores appear to be unusually distributed with the majority of scores on the weak side and one large outlier (225).This is an un-annotation gene so no additional comments were available. 	PREDICTED: hypothetical protein [Strongylocentrotus purpuratus],PREDICTED: similar to golgi-specific brefeldin A-resistance guanine nucleotide exchange factor 1 [Strongylocentrotus purpuratus] \n
GLEAN3_03899	SPU_003899		#\nhypothetical protein LOC423694 [Gallus gallus], \n
GLEAN3_03902	SPU_003902		myosin V [Strongylocentrotus purpuratus]\n
GLEAN3_03903	SPU_003903		PREDICTED: hypothetical protein\n
GLEAN3_23288	SPU_023288		PREDICTED: hypothetical protein \n
GLEAN3_23289	SPU_023289		PREDICTED: similar to Ribonuclease Oy (RNase Oy)\n
GLEAN3_08157	SPU_008157		hypothetical protein DDBDRAFT_0206412 [Dictyostelium discoideum AX4]\n
GLEAN3_08158	SPU_008158		PREDICTED: similar to peroxisomal biogenesis factor 6-like protein [Strongylocentrotus purpuratus]\n
GLEAN3_14604	SPU_014604		PREDICTED: similar to smad nuclear interacting protein [Strongylocentrotus purpuratus] \n
GLEAN3_15924	SPU_015924		#\nPREDICTED: similar to acid-sensing ion channel 1, partial [Strongylocentrotus purpuratus] \n
GLEAN3_25080	SPU_025080		PREDICTED: similar to Leucine rich repeat containing 15 [Strongylocentrotus purpuratus] \n
GLEAN3_25081	SPU_025081		#\nPREDICTED: similar to Transmembrane and coiled-coil domains 2 [Strongylocentrotus purpuratus] \n
GLEAN3_23431	SPU_023431		PREDICTED: similar to selectin-like protein\n
GLEAN3_07137	SPU_007137		PREDICTED: similar to LDL receptor adaptor protein (ARH) [Bos taurus] \n
GLEAN3_07138	SPU_007138		#\nPREDICTED: hypothetical protein [Strongylocentrotus purpuratus] \n
GLEAN3_07139	SPU_007139		PREDICTED: hypothetical protein \nPREDICTED: similar to heparan sulfate sulfotransferase [Strongylocentrotus purpuratus] \n
GLEAN3_07140	SPU_007140		#\nPREDICTED: similar to WD repeat domain 66 [Strongylocentrotus purpuratus]\n
GLEAN3_08948	SPU_008948		PREDICTED: similar to xanthine dehydrogenase [Strongylocentrotus purpuratus]\n
GLEAN3_08949	SPU_008949		PREDICTED: similar to codanin 1 [Gallus gallus] \n
GLEAN3_08950	SPU_008950		PREDICTED: similar to opsin [Strongylocentrotus purpuratus]\n
GLEAN3_08951	SPU_008951		PREDICTED: KIAA0586 isoform 7 [Pan troglodytes]\n
GLEAN3_08952	SPU_008952		hypothetical protein LOC324723 [Danio rerio]\n
GLEAN3_08955	SPU_008955		PREDICTED: similar to small zinc finger-like protein [Strongylocentrotus purpuratus], also similar to G protein-coupled receptor 1 [Strongylocentrotus purpuratus] \n \n
GLEAN3_08956	SPU_008956		#\nPREDICTED: similar to neurexin iv [Strongylocentrotus purpuratus]\n
GLEAN3_08957	SPU_008957		PREDICTED: similar to ligand Delta-1, partial [Strongylocentrotus purpuratus] \n
GLEAN3_08958	SPU_008958		PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] \n
GLEAN3_08960	SPU_008960		#\nPREDICTED: similar to CG4089-PA [Tribolium castaneum]\n
GLEAN3_27146	SPU_027146		PREDICTED: hypothetical protein isoform 1 [Strongylocentrotus purpuratus]\n
GLEAN3_23395	SPU_023395		PREDICTED: similar to putative urease accessory protein F [Strongylocentrotus purpuratus] \n
GLEAN3_23396	SPU_023396		similar to fibrosurfin, partial [Strongylocentrotus  \npurpuratus] \n
GLEAN3_23399	SPU_023399		#\nPREDICTED: similar to hyalin [Strongylocentrotus purpuratus]\n
GLEAN3_00994	SPU_000994	Due to the bits scores, the E-value, and the low sequence coverage it appears that there is no good model fit for model. There was also no Est information available from GBrowse assembly V0.5 and it was somewhat unclear whether or not the transcriptome intensity scores were strong or moderate due to several values being <10 and one large value of about 40. 	PREDICTED: similar to alpha macroglobulin [Strongylocentrotus purpuratus].\n
GLEAN3_00996	SPU_000996		PREDICTED: hypothetical protein [Strongylocentrotus purpuratus], similar to tumor protein p53 inducible protein 11, isoform CRA_b [Homo sapiens] \n
GLEAN3_02116	SPU_002116		PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]. also similar to Rap2-binding protein 9 isoform 4 [Macaca mulatta]. \n
Sp-185/333-07	SPU_030265		#\nThis gene goes out of frame in element 2, and therefore may be either a pseudogene or a problem in the assembly.\n
GLEAN3_00031	SPU_000031		SP, lec, PAN? kringle, lec\n
GLEAN3_00141	SPU_000141		lec, EGF-like (epidermal-growth-factor like domain), laminin-EGF, fibronectin 3, TM\n
GLEAN3_00192	SPU_000192		lec\n
GLEAN3_00271	SPU_000271		EGF(epidermal growth factor like domain), lec\n
GLEAN3_00289	SPU_000289		Clec X 2\n
GLEAN3_00290	SPU_000290		Clec X 3 (or 2.5)\n
GLEAN3_00542	SPU_000542		lec, PAN\n
GLEAN3_00543	SPU_000543		lec, EGF (epidermal growth factor like domain), EGF(epidermal growth factor like domain), TM, cyt; macrophage mannose binding lec?\n
GLEAN3_00837	SPU_000837		SP, lec\n
GLEAN3_01027	SPU_001027		low complexity, Clec X 2, low complexity, EGF (epidermal growth factor like domain)-like X 3, TM, low complexity\n
GLEAN3_01200	SPU_001200		EGF, EGF, EGF, EGF, EGF, EGF, EGF, EGF, lec, EGF \n*EGF = Epidermal Growth Factor like domain\n
GLEAN3_01527	SPU_001527		SP, lec, hyalin, lec, hyalin, hyalin, apple domain (plasminogen)\n
GLEAN3_01878	SPU_001878		gal_lec, transmembrane\n
GLEAN3_01887	SPU_001887		EGF (Epidermal growth factor like domain, lec\n
GLEAN3_01898	SPU_001898		lec apple domain\n
GLEAN3_02005	SPU_002005		Gal_lec\n
GLEAN3_02383	SPU_002383		SP, lec\n
GLEAN3_02420	SPU_002420		Clec X 2, low complexity, LY(Low-density lipoprotein-receptor YWTD domain) X 2\n
GLEAN3_02718	SPU_002718	From the BLAST results as well as the excel data, it appears that the sequence is distributed onto two different scaffolds. When examining the excel data, it is apparent that there are several base pair duplicates and sequence overlaps within both scaffolds. There was no Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appeared to be weak (>5) excluding one outlier value at about 37. 	lec, CCP(Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)), TM\n
GLEAN3_02720	SPU_002720		SP, gal_lec, coiled coil, TM, TM, TM, TM, TM, TM, TM, low complexity region\n
GLEAN3_02861	SPU_002861		EGF(epidermal growth factor like domain), hyalin, hyalin, lec\n
GLEAN3_03246	SPU_003246		SP, gal_lec, transmembrane, low complexity\n
GLEAN3_03284	SPU_003284		Kringle, lec, EGF(epidermal growth factor like domain)-CA(cadherin repeats)\n
GLEAN3_03610	SPU_003610		Sp, Clec X 3, TM\n
GLEAN3_04014	SPU_004014		lec\n
GLEAN3_04476	SPU_004476		hyalin, hyalin, lec\n
GLEAN3_04818	SPU_004818		Clec X 2\n
GLEAN3_04831	SPU_004831		low complexity, gal_lec\n
GLEAN3_05013	SPU_005013		SP, CCP, lec, CCP, CCP, low complexity, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_05028	SPU_005028		lec, LDLA, LDLA \n \n*LDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_05248	SPU_005248		lec, CCP (Low-density lipoprotein receptor domain class A), TM, cyt\n
GLEAN3_05596	SPU_005596		EGF (epidermal growth factor like domain), lec, lec\n
GLEAN3_05706	SPU_005706		SP, EGF (Epidermal growth factor like domain), hyalin, lec\n
GLEAN3_05725	SPU_005725		CUB, coag factor 5/8 C terminal domain, lec, LDLa, LDLa, LDLa, space,  LDLa \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. \n*LDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_05888	SPU_005888		Clec X 2\n
GLEAN3_05909	SPU_005909		CCP, CCP, lec, CCP, CCP, EGF \n \n* CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_05910	SPU_005910		lec C-type, CCP, CCP, TM, cyt \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_06083	SPU_006083		lec, hyalin, hyalin, hyalin, hyalin, apple domain\n
GLEAN3_06306	SPU_006306		gal_lec, gal_lec\n
GLEAN3_06310	SPU_006310		EGF(epidermal growth factor link domain), lec\n
GLEAN3_06620	SPU_006620		apple, lec\n
GLEAN3_06648	SPU_006648		Clec, CCP, CCP, low complexity, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_06649	SPU_006649		SP,  CCP, CCP, lec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_07118	SPU_007118		CCP, lec, CCP TM \n \n* CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_07565	SPU_007565		gal_lec\n
GLEAN3_07566	SPU_007566		low complex, lec\n
GLEAN3_08001	SPU_008001		lec\n
GLEAN3_08065	SPU_008065		SP,  lec, EGF,EGF, protein tyr phosphatase or fibronectin3 \n \n*EGF = Epidermal Growth Factor like Domain\n
GLEAN3_08976	SPU_008976		CCP, lec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_09174	SPU_009174		SP, EGF (epidermal growth factor like domain), hyalin, hyalin,  lec\n
GLEAN3_09222	SPU_009222		CCP, lec, lec, CCP, CCP, EGF, EGF \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n \n*EGF = epidermal growth factor like domain\n
GLEAN3_09615	SPU_009615		lec\n
GLEAN3_09886	SPU_009886		CLECT X 4\n
GLEAN3_10019	SPU_010019		SP, gal_lec, transmembrane\n
GLEAN3_10099	SPU_010099		Sp, lec\n
GLEAN3_10313	SPU_010313		gal_lec, gal_lec, gal_lec\n
GLEAN3_10317	SPU_010317		Clec X 2, low complexity\n
GLEAN3_10470	SPU_010470		Clec X 2, PAN_1\n
GLEAN3_10615	SPU_010615		SP, lec, PAN?\n
GLEAN3_10639	SPU_010639		PAN apple, lec, PAN apple, TM\n
GLEAN3_11014	SPU_011014		F5F8 type C, C_lec, gal_lec, C_lec, FA58C(Coagulation factor 5/8 C-terminal domain, discoidin domain), Protein has FTP (eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain) at relatively low probability \n
GLEAN3_11167	SPU_011167		lec, CCP, CCP, EGF, EGF, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n*EGF = epidermal growth factor like domain\n
GLEAN3_11176	SPU_011176		SCOP or Clectin, low complexity, GPS (G-protein-coupled receptor proteolytic site domain), 7TM-2\n
GLEAN3_11192	SPU_011192		EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, lec \n \n*EGF(epidermal growth factor link domain)-Ca(Cadherin repeats)\n
GLEAN3_11632	SPU_011632		lec\n
GLEAN3_11829	SPU_011829		Sp, Clec X 3, PAN 1\n
GLEAN3_11867	SPU_011867		EGF (epidermal growth factor like domain) X 2, FTP(eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain)\n
GLEAN3_11971	SPU_011971		SP, CUB, CUB,CUB, lec, LDLa, LDLa, LDLa, LDLa, LDLa, LDLa \n \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. \n \n*LEDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_12121	SPU_012121		CCP, CCP, lec, CCP CCO, EGF-Ca, EGF-Ca, EGF-Ca, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n \n*EGF-Ca = Epidermal Growth factor like domain - Cadherin repeats. \n
GLEAN3_12302	SPU_012302		#\nlow complexity, gal_lec\n
GLEAN3_12477	SPU_012477		CLec X 5, PAN_AP (divergent subfamily of APPLE domains), CLec X 3\n
GLEAN3_12478	SPU_012478		Clec, PAN_AP(divergent subfamily of APPLE domains), Clec, Clec, Clec\n
GLEAN3_12479	SPU_012479		Sp, Clec X 4, TM\n
GLEAN3_12585	SPU_012585		lec, CCP, CCP, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_12742	SPU_012742		Clec X 2, CCP, EGF(epidermal growth factor like domian)-like X 2, TM\n
GLEAN3_12837	SPU_012837		TM, lec\n
GLEAN3_12869	SPU_012869		CUB, CUB, CUB, CUB, CUB or lec, LDLa, LDLa, LDLa, LDLa, TM, TM, TM, TM  TM, TM, TM  \n \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein. \n \n*LDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_12951	SPU_012951		PAN, FA5/8C, FA5/8C, FA5/8C, lec, CUB, CUB, CUB, lec \n \n*FA58C = Coagulation factor 5/8 C-terminal domain, discoidin domain \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.\n
GLEAN3_13186	SPU_013186		SP,  CCP, CCP, leclec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_13388	SPU_013388		lec\n
GLEAN3_13825	SPU_013825		lec, low complex\n
GLEAN3_13860	SPU_013860		Sp, lec, lec\n
GLEAN3_13872	SPU_013872		Sp, lec,lec \n
GLEAN3_14004	SPU_014004		SP, low complexity, gal_lec\n
GLEAN3_14097	SPU_014097		SRCR , lec, EGF (Epidermal Growth Factor Like Domain) \n
GLEAN3_14218	SPU_014218		#\ngal_lec, EGF_like, EGF_like, transmembrane \n \n*EGF = epidermal growth factor like domain\n
GLEAN3_14219	SPU_014219		lec, low complelx, TM, long cyt\n
GLEAN3_14220	SPU_014220		Low complexity, TM \nAlternativer: Clec below threshold match \n
GLEAN3_14585	SPU_014585		(FTP(eel-Fucolectin Tachylectin-4 Pentaxrin-1 Domain) domain) F5F8_type_C(Coagulation factor 5/8 C-terminal domain, discoidin domain), gal_lec, gal_lec, gal_lec,\n
GLEAN3_14623	SPU_014623		lec, PAN\n
GLEAN3_14672	SPU_014672		CUB, lec, low complex\n
GLEAN3_15065	SPU_015065		Clec X 2, PAN_1\n
GLEAN3_15487	SPU_015487		Sp, Lec\n
GLEAN3_15726	SPU_015726	After reviewing the data in the excel file and performing a BLAST search on model_GLEAN3_15726, it appears that the best matches found were on two different scaffolds. Scaffold >v2.1_scaffold42451 has no visible repeats and covers about 500 base pairs of 753. Scaffold v2.1_scaffold73364 only covered 209 base pairs. model_GLEAN3_15726 is an un-annotated model and it was unclear what the complete scaffold coverage was. There was no EST information available from the GBrowse V0.5 assembly and the transcriptome intensity scores were weak (less than 10) as well. 	lec\n
GLEAN3_15961	SPU_015961		lec, pan\n
GLEAN3_15962	SPU_015962		Clec X 2, PAN 1\n
GLEAN3_16088	SPU_016088		DUF1339,lec,TM\n
GLEAN3_16103	SPU_016103		SP,  CCP, CCP, lec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_16529	SPU_016529		CCP, CCP, lec, low complexity region \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_16629	SPU_016629		Clec, CCP X 3, EGF (Epidermal growth factor like domain)-like, CCP, TM \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_16681	SPU_016681		CCP, lec, CCP, CCP, EGF(Epidermal Growth Factor like domain)/Ca X34, TM, short cyt \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n \n
GLEAN3_16771	SPU_016771		#\nlec \n \n
GLEAN3_16772	SPU_016772		lec\n
GLEAN3_17006	SPU_017006		CCP, CCP, CCP, lec, CCP, CCP, CCP, TM cyt \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n \n
GLEAN3_17007	SPU_017007		#\nSP,  CCP, CCP, CCP, lec, CCP, TM cyt \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_17109	SPU_017109		Gal_lec\n
GLEAN3_17519	SPU_017519		CCP(Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)), lec\n
GLEAN3_17520	SPU_017520		lec, CCP, CCP \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_17842	SPU_017842		lec overlaps with VWF type C\n
GLEAN3_17887	SPU_017887		Sp, Clec, PAN_AP(divergent subfamily of APPLE domains), Clec X3, SCOP\n
GLEAN3_18258	SPU_018258		lec, CCP, CCP, EGF(epidermal growth factor link domain) \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_18501	SPU_018501		low complexity, Clec X 2\n
GLEAN3_18550	SPU_018550		SP, gal_lec, gal_lec, low complexity\n
GLEAN3_18601	SPU_018601		lec, EGF(epidermal growth factor domain\n
GLEAN3_18682	SPU_018682		low complexity, gal_lec, gal_lec\n
GLEAN3_18940	SPU_018940	After reviewing the data and performing a BLAST search, it appears that the sequence is distributed onto 2 different scaffolds. v2.1_scaffold65146 has a low bit score as well as a high e-value when comparing it to the other scaffold results. When examining the excel data as well as the BLAST results, it is apparent that there are numerous repeats as well as gaps within the scaffolds resulting in poor coverage. There was Est support available from GBrowse assembly V0.5 and the transcriptome intensity scores appear to be weak overall, with most of the scores being less than 5 (excluding 2 outliers at about 35 and 60). 	Clec X4, TM\n
GLEAN3_19060	SPU_019060		lec, low complexity\n
GLEAN3_19088	SPU_019088		SP, lec\n
GLEAN3_19150	SPU_019150		lec, LDLa, LDLa \n \n*LDLDA = Low-density lipoprotein receptor domain class A\n
GLEAN3_19438	SPU_019438		gal_lec, gal_lec, PAN\n
GLEAN3_19576	SPU_019576		SP, lec, EGF, EGF \n \n*EGF = epidermal growth factor link domain\n
GLEAN3_19805	SPU_019805		Clec, Clec, low complexity\n
GLEAN3_19986	SPU_019986		SP,  lec, EGF, EGF \n \n*EGF = Epidermal growth factor like domain\n
GLEAN3_20377	SPU_020377		lec, PAN\n
GLEAN3_20424	SPU_020424		 low complexity, lec, lec\n
GLEAN3_20513	SPU_020513		CCP(Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)), lec\n
GLEAN3_20760	SPU_020760		hyalin, hyalin, lec\n
GLEAN3_21221	SPU_021221		low complexity, Clec X 2, low complexity\n
GLEAN3_21505	SPU_021505		Clec X 2, low complexity regions\n
GLEAN3_21675	SPU_021675		low complexity, gal_lec\n
GLEAN3_21709	SPU_021709		SP, lec, space, PAN\n
GLEAN3_22197	SPU_022197		SP, CUB, CUB, CUB, lec, TM \n \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.\n
GLEAN3_22470	SPU_022470		SP, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, Clec, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca, EGF-Ca \n \n* EGF = epidermal growth factor like domain \n*CA = Cadherin repeats.\n
GLEAN3_22595	SPU_022595		low complexity, gal_lec\n
GLEAN3_22718	SPU_022718		SP,F_raikovi_mat, gal_lec, EGF_like, transmembrane\n
GLEAN3_22719	SPU_022719		SP, gal_lec\n
GLEAN3_23114	SPU_023114		Clec X 2, several low complexity regions\n
GLEAN3_23130	SPU_023130		lec, long low complexity\n
GLEAN3_23175	SPU_023175		lec overlap with LDLa, lec overlap with 2 \n LDLa, LDLa \n \n*LDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_23607	SPU_023607		EGF-Ca, lec, EGF-Ca, EGF-Ca, lec, EGF-Ca, lec, EGF-Ca, lec, EGF-Ca, lec, EGF-Ca, lec \n \n*EGF = epidermal growth factor like domain  \n*CA = Cadherin repeats.\n
GLEAN3_23714	SPU_023714		Sp, Clec X 2\n
GLEAN3_24218	SPU_024218		CCP X 2, CLec X 2, CCP \n \n* CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_24494	SPU_024494		lec, low complexity\n
GLEAN3_24638	SPU_024638		Clec X 3\n
GLEAN3_24786	SPU_024786		FA5/8C, FA5/8C, lec, EGF-Ca, lec C-type, CUB, CUB, CUB, TM \n \n*FA5/8C = Coagulation factor 5/8 C-terminal domain, discoidin domain \n \n*EGF-Ca = Epidermal growth factor like domain - Cadherin repeats. \n \n*CUB = Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.\n
GLEAN3_25051	SPU_025051		Clec X 3, TM, low complexity\n
GLEAN3_25074	SPU_025074		lec, lec\n
GLEAN3_25097	SPU_025097		F5/F8 type C (Coagulation factor 5/8 C-terminal domain, discoidin domain), lec, lec C-type, CUB (Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.)\n
GLEAN3_25194	SPU_025194		SP, CCP, CCP, CCP, TM, CCP, CCP, CCP, lec, EGF overlaps with histone deacetylase interacting domain \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR) \n*EGF = Epidermal Growth Factor like domain\n
GLEAN3_25414	SPU_025414		SP, gal_lec, gal_lec, gal_lec, gal_lec\n
GLEAN3_26103	SPU_026103		CCP, CCP, lec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_26208	SPU_026208		lec, CCP, CCP \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_27084	SPU_027084		CUB (Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.) overlaps with lec, LDLa, LDLa, LDLa, LDLa, LDLa \n \n*LDLA = Low-density lipoprotein receptor domain class A\n
GLEAN3_27332	SPU_027332		SP, lec, long space, hormone receptor domain with many Cys, TM, TM, TM, TM, TM, TM, TM, long cyt\n
GLEAN3_28067	SPU_028067		SP,  CCP, lec, TM, cyt\n
GLEAN3_28298	SPU_028298		SP,  CCP, CCP, lec \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_28326	SPU_028326		CUB (Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein.), lec, LDLa (Low-density lipoprotein receptor domain class A) \n
GLEAN3_28539	SPU_028539		Sp, Clec, Kring X 3, Clec X 6, HDAC_interact(Histone deacetylase (HDAC) interacting), SCOP\n
GLEAN3_28565	SPU_028565		Sp, lec\n
GLEAN3_28712	SPU_028712		CLec, Fibrinogen, PAN_AP(divergent subfamily of APPLE domains), CLec\n
GLEAN3_28846	SPU_028846		lec, CCP (Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)), TM, cyt\n
GLEAN3_14450	SPU_014450		CCP, lec, CCP \n \n*CCP = Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR)\n
GLEAN3_19008	SPU_019008		dystrophin-like protein [Strongylocentrotus purpuratus] \n,3908 aa \n
GLEAN3_19010	SPU_019010		#\nhypothetical protein isoform 1 [Strongylocentrotus  \npurpuratus]\n
GLEAN3_10580	SPU_010580		similar to ubiquitin-conjugating enzyme E2D 3 [Strongylocentrotus purpuratus]\n
GLEAN3_10581	SPU_010581		similar to cathepsin l [Strongylocentrotus purpuratus], 821 aa\n
GLEAN3_10582	SPU_010582		similar to transcriptional intermediary factor 1 alpha  \n[Strongylocentrotus purpuratus],220 aa\n
GLEAN3_09812	SPU_009812		2OG-Fe(II) oxygenase domain\n
GLEAN3_16862	SPU_016862		frizzled domain, CCP X2, CLECT, CCP\n
GLEAN3_23863	SPU_023863		GLEAN3_23863 is a partial duplicate prediction for GLEAN3_28180\n
GLEAN3_25316	SPU_025316		Domains: low complexity, Clec X 2, low complexity, Clec X 2, low complexity\n
GLEAN3_00239	SPU_000239		Many eukaryotic proteins that are known or supposed to bind single-stranded RNA contain one or more copies of a putative RNA-binding domain of about 90 amino acids. This is known as the eukaryotic putative RNA-binding region RNP-1 signature or RNA recognition motif (RRM). RRMs are found in a variety of RNA binding proteins, including heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing, and protein components of small nuclear ribonucleoproteins (snRNPs). The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases. Two individual models were built which identify subtypes of this domain, but there is no functional difference between the subtypes. \n
GLEAN3_27751	SPU_027751		Among the different families of transporter only two occur ubiquitously in all classifications of organisms. These are the ATP-Binding Cassette (ABC) superfamily and the Major Facilitator Superfamily (MFS). The MFS transporters are single-polypeptide secondary carriers capable only of transporting small solutes in response to chemiosmotic ion gradients\n
GLEAN3_23347	SPU_023347		Ras proteins are membrane-associated molecular switches that bind GTP and GDP and slowly hydrolyze GTP to GDP. The balance between the GTP bound (active) and GDP bound (inactive) states is regulated by the opposite action of proteins activating the GTPase activity and that of proteins which promote the loss of bound GDP and the uptake of fresh GTP. The latter proteins are known as guanine-nucleotide dissociation stimulators (GDSs) (or also as guanine-nucleotide releasing (or exchange) factors (GRFs)). Proteins that act as GDS can be classified into at least two families, on the basis of sequence similarities, the CDC24 family (see IPR001331) and the CDC25 family. \nThe size of the proteins of the CDC25 family range from 309 residues (LTE1) to 1596 residues (sos). The sequence similarity shared by all these proteins is limited to a region of about 250 amino acids generally located in their C-terminal section (currently the only exceptions are sos and ralGDS where this domain makes up the central part of the protein). This domain has been shown, in CDC25 an SCD25, to be essential for the activity of these proteins. \n
GLEAN3_18998	SPU_018998		Hormone receptor (7 transmember receptor) \n
GLEAN3_18997	SPU_018997		Hormone receptor (7 Transmember protein)\n
GLEAN3_10716	SPU_010716		ATP-binding cassette (ABC) transporters are multidomain membrane proteins, responsible for the controlled efflux and influx of substances (allocrites) across cellular membranes. They are minimally composed of four domains, with two transmembrane domains (TMDs) responsible for allocrite binding and transport and two nucleotide-binding domains (NBDs) responsible for coupling the energy of ATP hydrolysis to conformational changes in the TMDs. Both NBDs are capable of ATP hydrolysis, and inhibition of hydrolysis at one NBD effectively abrogates hydrolysis at the other. Hydrolysis at the two NBDs may occur in an alternative fashion although they appear substantially functionally symmetrical in terms of their binding to diverse nucleotides\n
GLEAN3_10968	SPU_010968		Epidermal growth factors and transforming growth factors belong to a general class of proteins that share a repeat pattern involving a number of conserved Cys residues. Growth factors are involved in cell recognition and division. The repeating pattern, especially of cysteines (the so-called EGF repeat), is thought to be important to the 3D structure of the proteins, and hence its recognition by receptors and other molecules. The type 1 EGF signature includes six conserved cysteines believed to be involved in disulphide bond formation. The EGF motif is found frequently in nature, particularly in extracellular proteins.\n
GLEAN3_05786	SPU_005786		Zinc finger domains are nucleic acid-binding protein structures first identified in the Xenopus laevis transcription factor TFIIIA. These domains have since been found in numerous nucleic acid-binding proteins. A zinc finger domain is composed of 25 to 30 amino-acid residues including 2 conserved Cys and 2 conserved His residues in a C-2-C-12-H-3-H type motif. The 12 residues separating the second Cys and the first His are mainly polar and basic, implicating this region in particular in nucleic acid binding. The zinc finger motif is an unusually small, self-folding domain in which Zn is a crucial component of its tertiary structure. All bind 1 atom of Zn in a tetrahedral array to yield a finger-like projection, which interacts with nucleotides in the major groove of the nucleic acid. The Zn binds to the conserved Cys and His residues. Fingers have been found to bind to about 5 base pairs of nucleic acid containing short runs of guanine residues. They have the ability to bind to both RNA and DNA, a versatility not demonstrated by the helix-turn-helix motif. The zinc finger may thus represent the original nucleic acid binding protein. It has also been suggested that a Zn-centred domain could be used in a protein interaction, e.g. in protein kinase C. Many classes of zinc fingers are characterized according to the number and positions of the histidine and cysteine residues involved in the zinc atom coordination. In the first class to be characterized, called C2H2, the first pair of zinc coordinating residues are cysteines, while the second pair are histidines. \nThis THAP domain is a putative DNA-binding domain with a C2CH architecture that probably binds a zinc ion. The domain is widespread in Drosophila species, Mus musculus, Homo sapiens and has been reported in Caenorhabditis elegans. \n
GLEAN3_10967	SPU_010967		#\nThe CUB domain is an extracellular domain of approximately 110 residues which is found in functionally diverse, mostly developmentally regulated proteins and in peptidases belonging to MEROPS peptidase families M12A (astacin) and S1A (chymotrypsin). Almost all CUB domains contain four conserved cysteines which probably form two disulphide bridges (C1-C2, C3-C4). The structure of the CUB domain has been predicted to be a beta-barrel similar to that of immunoglobulins. Proteins that have been found to contain the CUB domain include mammalian complement subcomponents C1s/C1r, which form the calcium-dependent complex C1, the first component of the classical pathway of the complement system; hamster serine protease Casp, which degrades type I and IV collagen and fibronectin in the presence of calcium; mammalian complement-activating component of Ra-reactive factor (RARF), a protease that cleaves the C4 component of complement; vertebrate enteropeptidase (EC 3.4.21.9), a type II membrane protein of the intestinal brush border, which activates trypsinogen; vertebrate bone morphogenic protein 1 (BMP-1), a protein which induces cartilage and bone formation and expresses metalloendopeptidase activity; sea urchins blastula proteins BP10 and SpAN; Caenorhabditis elegans hypothetical proteins F42A10.8 and R151.5; neuropilin (A5 antigen), a calcium-independent cell adhesion molecule that functions during the formation of certain neuronal circuits; fibropellins I and III from sea urchin; mammalian hyaluronate-binding protein TSG-6 (or PS4), a serum and growth factor induced protein; mammalian spermadhesins; and Xenopus embryonic protein UVS.2, which is expressed during dorsoanterior development.\n
GLEAN3_14285	SPU_014285		This Glean was picked up from one dimensional gel            electrophoresis and mass spectrometry.  It appears to be missing ~300nt at the 5' end and ~200 nt at the 3' end compared to the best human hit.\n
GLEAN3_28871	SPU_028871		This gene is most likely an allel of Glean3_07497\n
GLEAN3_07497	SPU_007497		This gene has an allel in Glean3_28871\n
GLEAN3_19286	SPU_019286		Two non-overlapping peptides were pulled from an IP coupled MS/MS from unfertilized eggs.\n
GLEAN3_14951	SPU_014951	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. However, the 2nd scaffold had a high e-value and a low bit score when compared the other BLAST results (>v2.1_scaffold76004 was about the 31st alignment down the list). There was no Est information available from GBRowse assembly V0.5 and the transcriptome intensity scores appeared to be weak and widely dispersed with all of values less than 5 except for an outlier at about 20. This is an un-annotated gene so no additional gene information was available from Baylor annotations (comments). 	an SH2 domain pulled up from the Anger group mRNA expression database associated with SFK1...unclear if it is really expressed.  First ~400aa of sequence has homology to "solute carrier family 15", followed by a full SH2 domain and then a partial TyK domain.  Probably not a SFK. \n
GLEAN3_10845	SPU_010845		Looks to be an allel of Glean3_22112\n
GLEAN3_23905	SPU_023905		C term of the prediction seems to be a piece of the Fringe protein\n
GLEAN3_03256	SPU_003256		The GLEAN3 model shows little confidence and may thus be incorrect.  However, the sequence of the GLEAN model CDS is nearly identical to the probe used by Rast et. al. 2002 Dev. Biol. and should thus is likely the "Kakapo" discussed in that paper.  \n
GLEAN3_14715	SPU_014715		May be a fragment of GLEAN3_00296 or GLEAN3_03985 (or another gene altogether) as this model is significantly shorter than either of these gelsolin-like genes.  However, this Glean model is nearly identical to the probe for Gelsolin used in Rast et. al.(2002) Dev Bio and suggests that this gene is expressed in the sea urchin genome in an area around the blastopore > 24 hour along with Kakapo and apobec\n
GLEAN3_08512	SPU_008512		Alignmenet with best blast sequence suggests that the model may lack N- and C-terminal sequences.\nThere are a large number of nearly identical sequences on many  scaffolds that are not on the glean3 list.  There are also multiple copies on the glean3 list.
GLEAN3_10275	SPU_010275	After reviewing the data and performing a BLAST search it appears that the sequence is distributed onto 2 different scaffolds. However, the sequence from the BLAST results does not coincide with the sequence from the excel data. The BLAST results indicate that the sequence ends at about 4500, but the excel data is truncated at 1147. 	This is the N termainal region of PLCg.  The full annotation can be found on scaffold 53431 and GLEAN3_27462.\nFragment, extra stretch in middle.
GLENA3_03774	SPU_030161	none	small C-type lectin.
GLEAN3_06056	SPU_006056		This glean result contains the C terminal reigon of the PLCg sequence.  The n terminal sequence is held on scaffold 53431 and GLEAN3_27462.
NGFFFamide-precursor	SPU_030074	(by Maurice Elphick) This gene encodes a putative neuropeptide precursor. It was identified by BLAST analysis of the sea urchin genomic sequence data using the sea cucumber neuropeptide NGIWYamide as the query. The precursor appears to encode two copies of a NGIWYamide-like peptide, which has the sequence NGFFFamide.\nThe gene was originally predicted by gnomon to comprise two exons, with the first exon encoding a puative N-terminal signal peptide (supported by SignalP3.0 analysis of the protein sequence) and the second exon encoding two copies of NGFFFamide in tandem, separated and bounded by dibasic cleavage sites.\nSubsequent analysis of EST data from a radial nerve library (EC439145; GI: 109403168 and EC438106; GI: 109402129) has revealed that this gene comprises 4 exons and enocodes a 266 residue protein comprising:\n1. 26 amino acid residue signal peptide\n2.  a 114 amino acid residue sequence\n3. a 18 residue sequence (KRNGFFFGKRNGFFFGKR), comprising two copies of the putative NGFFFamide neuropeptide separated and flanked by potential dibasic cleavage sites (KR). \n4. a 108 amino acid residue sequence that shares a high level of sequence identity with neurophysins, proteins that form the C-terminal region of precursors for the neuropeptide hormones vasopressin, oxytocin and vasotocin.	This gene encodes a putative neuropeptide precursor. It was identified by BLAST analysis of the sea urchin genomic sequence data using the sea cucumber neuropeptide NGIWYamide as the query. The precursor appears to encode two copies of a NGIWYamide-like peptide, which has the sequence NGFFFamide. \nThe gene comprises two exons, as predicted by gnomon; this is supported by the tiling data, which shows signals corresponding exactly with the two predicted protein coding exons. \nThe first exon encodes a puative N-terminal signal peptide, supported by SignalP3.0 analysis of the protein sequence. \nThe second exon encodes two copies of NGFFFamide in tandem, separated and bounded by dibasic cleavage sites. \n
GLEAN3_04779	SPU_004779	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04915	SPU_004915	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04933	SPU_004933	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13938	SPU_013938	many similar protease genes in Strongylocentrotus purpuratus	none
GLEAN3_00341	SPU_000341	none	none
GLEAN3_00625	SPU_000625	hypothetical protein. similar to SPU_000560.	none
GLEAN3_01269	SPU_001269	none	none
GLEAN3_01273	SPU_001273	also homologous to SC5AC, sodium-coupled monocarboxylate transporter 1, or sodium solute transporter Vito-a, electrogenic sodium monocarboxylate cotransporter	none
GLEAN3_01779	SPU_001779	none	none
GLEAN3_01809	SPU_001809	none	none
GLEAN3_02262	SPU_002262	none	none
GLEAN3_02474	SPU_002474	none	none
GLEAN3_03140	SPU_003140	none	none
GLEAN3_03973	SPU_003973	contains 2 Ion_trans_2 superfamily motifs and a FRQ1 domain	none
GLEAN3_04547	SPU_004547	none	none
GLEAN3_04607	SPU_004607	strongly homologous to bacterial as well as mammalian agmatinases	none
GLEAN3_04740	SPU_004740	homologous to numerous transport proteins in Drosophila, and in bacteria	none
GLEAN3_04984	SPU_004984	none	none
GLEAN3_05125	SPU_005125	homologous to numerous unnamed, hypothetical proteins in bacteria	none
GLEAN3_05136	SPU_005136	none	none
GLEAN3_05480	SPU_005480	none	none
GLEAN3_05554	SPU_005554	contains S-methyl_trans domain	none
GLEAN3_05623	SPU_005623	none	none
GLEAN3_05630	SPU_005630	also homologous to numerous known or putative proteins	none
GLEAN3_06317	SPU_006317	none	none
GLEAN3_06382	SPU_006382	contains 2 SDF superfamily motifs	none
GLEAN3_06992	SPU_006992	none	none
GLEAN3_07000	SPU_007000	contains 2 SH2 superfamily motifs	none
GLEAN3_07163	SPU_007163	none	none
GLEAN3_07221	SPU_007221	none	none
GLEAN3_07673	SPU_007673	none	none
GLEAN3_08832	SPU_008832	none	none
GLEAN3_10121	SPU_010121	none	none
GLEAN3_10207	SPU_010207	none	none
GLEAN3_10268	SPU_010268	none	none
GLEAN3_10392	SPU_010392	none	none
GLEAN3_11126	SPU_011126	none	none
GLEAN3_11657	SPU_011657	none	none
GLEAN3_12361	SPU_012361	contains BTB superfamily domain near N-terminus	none
GLEAN3_12886	SPU_012886	none	none
GLEAN3_12980	SPU_012980	contains 2 ANK superfamily motifs	none
GLEAN3_13999	SPU_013999	contains 4 ANK superfamily motifs	none
GLEAN3_14529	SPU_014529	none	none
GLEAN3_15478	SPU_015478	none	none
GLEAN3_15716	SPU_015716	contains MFS_1 domain	none
GLEAN3_16024	SPU_016024	none	none
GLEAN3_16145	SPU_016145	none	none
GLEAN3_16578	SPU_016578	none	none
GLEAN3_17285	SPU_017285	none	none
GLEAN3_18084	SPU_018084	none	none
GLEAN3_18220	SPU_018220	none	none
GLEAN3_18782	SPU_018782	contains 2 PBPb superfamily motifs	none
GLEAN3_18852	SPU_018852	none	none
GLEAN3_19023	SPU_019023	none	none
GLEAN3_19256	SPU_019256	contains 2 Bromodomain superfamily motifs	none
GLEAN3_19340	SPU_019340	none	none
GLEAN3_19424	SPU_019424	none	none
GLEAN3_19590	SPU_019590	contains ABC_transp_aux domain	none
GLEAN3_19979	SPU_019979	none	none
GLEAN3_20314	SPU_020314	none	none
GLEAN3_20875	SPU_020875	contains 4 C2 superfamily motifs	none
GLEAN3_20900	SPU_020900	contains RhaT domain	none
GLEAN3_22452	SPU_022452	none	none
GLEAN3_22457	SPU_022457	contains Prominin domain	none
GLEAN3_22860	SPU_022860	contains 5 ANK superfamily motifs	none
GLEAN3_28720	SPU_028720	none	none
GLEAN3_28684	SPU_028684	contains LPD_N domain	none
GLEAN3_28629	SPU_028629	contains 3 AdoHcyase superfamily motifs and D1tE domain	none
GLEAN3_28532	SPU_028532	none	none
GLEAN3_28403	SPU_028403	none	none
GLEAN3_28353	SPU_028353	contains Smc domain and Qor domain	none
GLEAN3_27962	SPU_027962	none	none
GLEAN3_27791	SPU_027791	also homologous to bacterial glutamate formiminotransferase	none
GLEAN3_27382	SPU_027382	none	none
GLEAN3_27320	SPU_027320	none	none
GLEAN3_26924	SPU_026924	none	none
GLEAN3_26640	SPU_026640	none	none
GLEAN3_26328	SPU_026328	none	none
GLEAN3_25857	SPU_025857	none	none
GLEAN3_25845	SPU_025845	none	none
GLEAN3_25840	SPU_025840	none	none
GLEAN3_25832	SPU_025832	none	none
GLEAN3_25831	SPU_025831	none	none
GLEAN3_25827	SPU_025827	none	none
GLEAN3_25806	SPU_025806	none	none
GLEAN3_25791	SPU_025791	none	none
GLEAN3_25787	SPU_025787	contains 3 ARM superfamily motifs	none
GLEAN3_25785	SPU_025785	none	none
GLEAN3_25782	SPU_025782	none	none
GLEAN3_25764	SPU_025764	none	none
GLEAN3_25762	SPU_025762	none	none
GLEAN3_25754	SPU_025754	none	none
GLEAN3_25727	SPU_025727	none	none
GLEAN3_25726	SPU_025726	none	none
GLEAN3_25725	SPU_025725	none	none
GLEAN3_25713	SPU_025713	also homologous to miscellaneous proteins	none
GLEAN3_25693	SPU_025693	none	none
GLEAN3_25672	SPU_025672	none	none
GLEAN3_25655	SPU_025655	none	none
GLEAN3_25654	SPU_025654	contains ARM superfamily motif near N-terminus	none
GLEAN3_25644	SPU_025644	none	none
GLEAN3_25639	SPU_025639	none	none
GLEAN3_25626	SPU_025626	none	none
GLEAN3_25623	SPU_025623	contains 2-Hacid domain	none
GLEAN3_25614	SPU_025614	none	none
GLEAN3_25611	SPU_025611	none	none
GLEAN3_25571	SPU_025571	contains 6 Kelch_1 superfamily motifs	none
GLEAN3_25557	SPU_025557	none	none
GLEAN3_25515	SPU_025515	none	none
GLEAN3_25483	SPU_025483	none	none
GLEAN3_25480	SPU_025480	none	none
GLEAN3_25475	SPU_025475	none	none
GLEAN3_25453	SPU_025453	also homologous to mammalian G protein-coupled receptors 85 and 173	none
GLEAN3_25451	SPU_025451	none	none
GLEAN3_25435	SPU_025435	contains RPN2 domain	none
GLEAN3_25393	SPU_025393	contains SSL2 domain	none
GLEAN3_25366	SPU_025366	none	none
GLEAN3_25362	SPU_025362	none	none
GLEAN3_25357	SPU_025357	none	none
GLEAN3_25340	SPU_025340	none	none
GLEAN3_25306	SPU_025306	contains 2 CXC superfamily motifs	none
GLEAN3_25305	SPU_025305	none	none
GLEAN3_25298	SPU_025298	none	none
GLEAN3_25275	SPU_025275	none	none
GLEAN3_25234	SPU_025234	contains 2 RRM superfamily motifs and HEC1 domain	none
GLEAN3_25141	SPU_025141	contains 2 PA superfamily motifs	none
GLEAN3_25139	SPU_025139	none	none
GLEAN3_25120	SPU_025120	none	none
GLEAN3_25115	SPU_025115	none	none
GLEAN3_25110	SPU_025110	contains DUF1740 domain	none
GLEAN3_25107	SPU_025107	contains Smc domain	none
GLEAN3_25090	SPU_025090	contains Rph domain	none
GLEAN3_25089	SPU_025089	contains 2 Kelch_1 superfamily motifs and Ehrlichia_rpt domain and COG3055 domain	none
GLEAN3_25028	SPU_025028	none	none
GLEAN3_24997	SPU_024997	contains Herpes_LMP2 domain	none
GLEAN3_24983	SPU_024983	none	none
GLEAN3_24927	SPU_024927	none	none
GLEAN3_24893	SPU_024893	contains Adaptin_N domain	none
GLEAN3_24875	SPU_024875	contains 2 IBR superfamily motifs	none
GLEAN3_24849	SPU_024849	none	none
GLEAN3_24814	SPU_024814	contains Sec63 superfamily motif at N-terminus	none
GLEAN3_24810	SPU_024810	none	none
GLEAN3_24802	SPU_024802	contains 2 MTH_psq superfamily motifs	none
GLEAN3_24750	SPU_024750	none	none
GLEAN3_24741	SPU_024741	contains 4 CCP superfamily motifs	none
GLEAN3_24728	SPU_024728	none	none
GLEAN3_24719	SPU_024719	none	none
GLEAN3_24718	SPU_024718	none	none
GLEAN3_24696	SPU_024696	none	none
GLEAN3_24693	SPU_024693	none	none
GLEAN3_24675	SPU_024675	none	none
GLEAN3_24667	SPU_024667	contains DnaJ superfamily motif at N-terminus	none
GLEAN3_24658	SPU_024658	none	none
GLEAN3_24645	SPU_024645	none	none
GLEAN3_24637	SPU_024637	none	none
GLEAN3_24620	SPU_024620	contains 6 HS1_rep superfamily motifs, and SH3 superfaily motif at C-terminus	none
GLEAN3_24607	SPU_024607	none	none
GLEAN3_24587	SPU_024587	none	none
GLEAN3_24560	SPU_024560	none	none
GLEAN3_24534	SPU_024534	none	none
GLEAN3_24524	SPU_024524	contains Smc domain	none
GLEAN3_24514	SPU_024514	none	none
GLEAN3_24490	SPU_024490	contains 4 MAM superfamily motifs and 2 LDLa superfamily motifs	none
GLEAN3_24445	SPU_024445	none	none
GLEAN3_24401	SPU_024401	contains 2 ANK superfamily motifs near C-terminus	none
GLEAN3_24399	SPU_024399	contains hATC superfamily motif at C-terminus	none
GLEAN3_24398	SPU_024398	none	none
GLEAN3_24397	SPU_024397	none	none
GLEAN3_24376	SPU_024376	also homologous to mammalian nucleolar protein 4 (NOL4) (Mus musculus)	none
GLEAN3_24334	SPU_024334	none	none
GLEAN3_24329	SPU_024329	none	none
GLEAN3_24310	SPU_024310	none	none
GLEAN3_24269	SPU_024269	none	none
GLEAN3_24230	SPU_024230	none	none
GLEAN3_24180	SPU_024180	none	none
GLEAN3_24179	SPU_024179	none	none
GLEAN3_24177	SPU_024177	none	none
GLEAN3_24172	SPU_024172	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_24168	SPU_024168	none	none
GLEAN3_24167	SPU_024167	none	none
GLEAN3_24150	SPU_024150	none	none
GLEAN3_24142	SPU_024142	contains 3 ANK superfamily motifs	none
GLEAN3_24140	SPU_024140	none	none
GLEAN3_24138	SPU_024138	none	none
GLEAN3_24137	SPU_024137	none	none
GLEAN3_24130	SPU_024130	contains 2 UPF0016 superfamily motifs	none
GLEAN3_24114	SPU_024114	none	none
GLEAN3_24111	SPU_024111	none	none
GLEAN3_24081	SPU_024081	none	none
GLEAN3_24069	SPU_024069	none	none
GLEAN3_24063	SPU_024063	none	none
GLEAN3_24059	SPU_024059	contains Smc domain	none
GLEAN3_24049	SPU_024049	contains 2 Ion_trans_2 superfamily motifs	none
GLEAN3_23992	SPU_023992	contains 5 RCC1 superfamily motifs and ATS1 domain	none
GLEAN3_23975	SPU_023975	none	none
GLEAN3_23968	SPU_023968	none	none
GLEAN3_23964	SPU_023964	none	none
GLEAN3_23958	SPU_023958	contains COG5594 domain	none
GLEAN3_23947	SPU_023947	contains 2 TPR superfamily motifs	none
GLEAN3_23925	SPU_023925	none	none
GLEAN3_23907	SPU_023907	none	none
GLEAN3_23893	SPU_023893	none	none
GLEAN3_23881	SPU_023881	contains SRP40_C superfamily motif at C-terminus	none
GLEAN3_23855	SPU_023855	none	none
GLEAN3_23854	SPU_023854	none	none
GLEAN3_23837	SPU_023837	none	none
GLEAN3_23802	SPU_023802	none	none
GLEAN3_23790	SPU_023790	none	none
GLEAN3_23785	SPU_023785	none	none
GLEAN3_23778	SPU_023778	none	none
GLEAN3_23768	SPU_023768	none	none
GLEAN3_23766	SPU_023766	contains HCR domain	none
GLEAN3_23765	SPU_023765	contains COG5273 domain	none
GLEAN3_23743	SPU_023743	none	none
GLEAN3_23740	SPU_023740	contains THAP superfamily motif at N-terminus. homologous to Drosophila transposases.	none
GLEAN3_23733	SPU_023733	contains SbcC domain	none
GLEAN3_23701	SPU_023701	none	none
GLEAN3_23700	SPU_023700	none	none
GLEAN3_23678	SPU_023678	none	none
GLEAN3_23659	SPU_023659	contains COG5252 domain	none
GLEAN3_23653	SPU_023653	none	none
GLEAN3_23646	SPU_023646	none	none
GLEAN3_23640	SPU_023640	contains 2 JmjC superfamily motifs	none
GLEAN3_23632	SPU_023632	contains MFS_1 domain	none
GLEAN3_23616	SPU_023616	none	none
GLEAN3_23610	SPU_023610	none	none
GLEAN3_23598	SPU_023598	novel, conserved protein	none
GLEAN3_23584	SPU_023584	contains Vitellogenin_N domain	none
GLEAN3_23575	SPU_023575	none	none
GLEAN3_23571	SPU_023571	contains 3 Thioredoxin-like superfamily motifs	none
GLEAN3_23561	SPU_023561	none	none
GLEAN3_23527	SPU_023527	none	none
GLEAN3_23506	SPU_023506	none	none
GLEAN3_23466	SPU_023466	contains FRQ1 domain	none
GLEAN3_23460	SPU_023460	contains AMP-binding domain	none
GLEAN3_23459	SPU_023459	none	none
GLEAN3_23452	SPU_023452	none	none
GLEAN3_00581	SPU_000581	contains AST1 domain and HECTc domain	none
GLEAN3_00714	SPU_000714	none	none
GLEAN3_01252	SPU_001252	none	none
GLEAN3_01263	SPU_001263	contains COG5543 domain	none
GLEAN3_01695	SPU_001695	contains SMC_N domain	none
GLEAN3_02007	SPU_002007	contains Crp domain and CAP_ED superfamily motif near the N-terminus	none
GLEAN3_02147	SPU_002147	none	none
GLEAN3_03093	SPU_003093	none	none
GLEAN3_03417	SPU_003417	contains COG3391 domain	none
GLEAN3_04321	SPU_004321	none	none
GLEAN3_05526	SPU_005526	contains Pecanex_C superfamily motif at C-terminus	none
GLEAN3_05792	SPU_005792	contains 2 EGF_CA superfamily motifs	none
GLEAN3_05870	SPU_005870	none	none
GLEAN3_06665	SPU_006665	contains LacZ domain	none
GLEAN3_06902	SPU_006902	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_07115	SPU_007115	contains 2 ANK superfamily motifs	none
GLEAN3_07747	SPU_007747	contains 4 Periplasmic_Binding_Protein_T1 superfamily motifs and 3 ANF_receptor domain motifs	none
GLEAN3_08107	SPU_008107	contains UCH domain	none
GLEAN3_08442	SPU_008442	contains COG5635 domain. member of very large S. purpuratus-specific protein family	none
GLEAN3_08721	SPU_008721	none	none
GLEAN3_10379	SPU_010379	none	none
GLEAN3_10729	SPU_010729	contains 2 P_loop_NTPase superfamily motifs and 2 DYN1 domain motifs	none
GLEAN3_11332	SPU_011332	none	none
GLEAN3_13250	SPU_013250	contains 2 zf-RING superfamily motifs	none
GLEAN3_13313	SPU_013313	contains PRK8566 domain	none
GLEAN3_13984	SPU_013984	contains Metallo-dependent_Hydrolase superfamily motif at C-terminus. S. purpuratus-specific protein.	none
GLEAN3_14416	SPU_014416	contains 3 BRCT superfamily motifs	none
GLEAN3_14617	SPU_014617	contains 5 LDL_r superfamily motifs and COG3391 domain	none
GLEAN3_15386	SPU_015386	contains Tex domain	none
GLEAN3_16624	SPU_016624	contains 6 ANK superfamily motifs	none
GLEAN3_16630	SPU_016630	contains 5 ANK superfamily motifs	none
GLEAN3_16642	SPU_016642	contains DNA_pol_B domain	none
GLEAN3_16664	SPU_016664	contains SMC_N domain	none
GLEAN3_16730	SPU_016730	none	none
GLEAN3_16735	SPU_016735	contains Tra5 domain	none
GLEAN3_16741	SPU_016741	none	none
GLEAN3_16742	SPU_016742	none	none
GLEAN3_16805	SPU_016805	none	none
GLEAN3_16835	SPU_016835	none	none
GLEAN3_16843	SPU_016843	none	none
GLEAN3_16873	SPU_016873	none	none
GLEAN3_16885	SPU_016885	none	none
GLEAN3_16910	SPU_016910	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_16927	SPU_016927	contains 14 ANK superfamily motifs and Arp domains	none
GLEAN3_16928	SPU_016928	contains 11 ANK superfamily motifs and Arp domains	none
GLEAN3_16981	SPU_016981	none	none
GLEAN3_16982	SPU_016982	contains AIR1 domain	none
GLEAN3_16986	SPU_016986	contains 10 ANK superfamily motifs and Arp domains	none
GLEAN3_17043	SPU_017043	contains HOOK domain	none
GLEAN3_17087	SPU_017087	none	none
GLEAN3_17113	SPU_017113	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_17114	SPU_017114	contains 6 ANK superfamily motifs and Arp domains and Ion_trans domain	none
GLEAN3_17126	SPU_017126	contains 2 zf- superfamily motifs	none
GLEAN3_17130	SPU_017130	none	none
GLEAN3_17173	SPU_017173	contains Dynein_heavy domain	none
GLEAN3_17265	SPU_017265	contains 7 ANK superfamily motifs and 2 ZU5 superfamily motifs and Arp domain motifs	none
GLEAN3_17283	SPU_017283	contains 9 TRP superfamily motifs	none
GLEAN3_17303	SPU_017303	contains 8 SPEC superfamily motifs	none
GLEAN3_17310	SPU_017310	none	none
GLEAN3_17312	SPU_017312	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17369	SPU_017369	contains Smc domain	none
GLEAN3_17376	SPU_017376	none	none
GLEAN3_17390	SPU_017390	contains AIR1 domain	none
GLEAN3_17413	SPU_017413	contains 2 tandem repeat of DM15 superfamily motifs and LHP1 domain	none
GLEAN3_17531	SPU_017531	none	none
GLEAN3_17532	SPU_017532	contains 2 SPEC superfamily motifs and Smc domain and UCH domain and UBP5 domain and ULP1 domain	none
GLEAN3_17580	SPU_017580	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17584	SPU_017584	none	none
GLEAN3_17588	SPU_017588	Sp-specific protein	none
GLEAN3_17626	SPU_017626	contains RING superfamily motif at C-terminus	none
GLEAN3_17648	SPU_017648	contains 2 DNA_Pol_B_2 domain motifs	none
GLEAN3_17678	SPU_017678	contains 4 EGF_CA superfamily motifs and 2 vWA superfamily motifs	none
GLEAN3_17711	SPU_017711	none	none
GLEAN3_17713	SPU_017713	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17822	SPU_017822	contains 4 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17823	SPU_017823	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17825	SPU_017825	contains 5 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17831	SPU_017831	none	none
GLEAN3_17841	SPU_017841	contains 2 Sfi1 superfamily motifs	none
GLEAN3_17845	SPU_017845	none	none
GLEAN3_17870	SPU_017870	none	none
GLEAN3_17875	SPU_017875	contains 2 WD40 superfamily motifs	none
GLEAN3_17879	SPU_017879	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17928	SPU_017928	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_17931	SPU_017931	none	none
GLEAN3_17944	SPU_017944	contains 2 Vps53_N superfamily motifs	none
GLEAN3_17946	SPU_017946	contains MRS6 domain	none
GLEAN3_17953	SPU_017953	none	none
GLEAN3_17974	SPU_017974	contains 2 ANK superfamily motifs	none
GLEAN3_18066	SPU_018066	contains AIR1 domain	none
GLEAN3_18133	SPU_018133	contains SMC_N domain	none
GLEAN3_18136	SPU_018136	contains 11 EGF_CA superfamily motifs	none
GLEAN3_18138	SPU_018138	contains SMC_N domain. Sp-specific protein family.	none
GLEAN3_18142	SPU_018142	contains Ion_trans domain and COG3883 domain	none
GLEAN3_18221	SPU_018221	contains 2 KH-I superfamily motifs	none
GLEAN3_18265	SPU_018265	contains SMC_N domain	none
GLEAN3_18275	SPU_018275	none	none
GLEAN3_18323	SPU_018323	none	none
GLEAN3_18343	SPU_018343	none	none
GLEAN3_18355	SPU_018355	none	none
GLEAN3_18362	SPU_018362	none	none
GLEAN3_18390	SPU_018390	contains VPS9 superfamily motif at C-terminus. contains MDN1 domain.	none
GLEAN3_18400	SPU_018400	none	none
GLEAN3_18428	SPU_018428	Sp-specific protein	none
GLEAN3_18447	SPU_018447	none	none
GLEAN3_18491	SPU_018491	contains DYN1 domain and Dynein_heavy domain	none
GLEAN3_18537	SPU_018537	contains Ndh domain	none
GLEAN3_18587	SPU_018587	contains 2 UBQ domain motifs	none
GLEAN3_18602	SPU_018602	contains AIR1 domain	none
GLEAN3_18650	SPU_018650	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_18651	SPU_018651	contains 8 ANK superfamily motifs and Arp domain motifs and 2 ZU5 superfamily motifs	none
GLEAN3_18666	SPU_018666	contains TRF4 domain and PAT1 domain	none
GLEAN3_18668	SPU_018668	contains COG1112 domain and Keratin_B2 domain	none
GLEAN3_18671	SPU_018671	contains 7 EGF_CA superfamily motifs and 2 Tryp_SP superfamily motifs and 4 CUB superfamily motifs and 4 LDLa superfamily motifs	none
GLEAN3_18686	SPU_018686	none	none
GLEAN3_18709	SPU_018709	contains 11 FA58C superfamily motifs and 9 EGF_CA superfamily motifs	none
GLEAN3_18713	SPU_018713	contains 10 EGF_CA superfamily motifs and 3 KR superfamily motifs	none
GLEAN3_18721	SPU_018721	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_18730	SPU_018730	none	none
GLEAN3_18745	SPU_018745	contains MDN1 domain	none
GLEAN3_18754	SPU_018754	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_18796	SPU_018796	contains Ric8 domain	none
GLEAN3_18809	SPU_018809	contains 2 WD40 superfamily motifs and COG2319 domain	none
GLEAN3_18814	SPU_018814	none	none
GLEAN3_18821	SPU_018821	contains 2 EGF_CA superfamily motifs	none
GLEAN3_18837	SPU_018837	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_18842	SPU_018842	contains 2 Smc domain motifs and SMC_N domain. Sp-specific family.	none
GLEAN3_18862	SPU_018862	none	none
GLEAN3_18932	SPU_018932	none	none
GLEAN3_18950	SPU_018950	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_18958	SPU_018958	contains CH superfamily motifs at C-terminus	none
GLEAN3_19083	SPU_019083	Sp-specific protein	none
GLEAN3_19119	SPU_019119	contains COG4485 domain	none
GLEAN3_19128	SPU_019128	contains 2 LamG superfamily motifs at N-terminus	none
GLEAN3_19130	SPU_019130	contains VSP domain	none
GLEAN3_19149	SPU_019149	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_19156	SPU_019156	Sp-specific unique protein	none
GLEAN3_19191	SPU_019191	Sp-specific protein	none
GLEAN3_19222	SPU_019222	Sp-specific protein	none
GLEAN3_19226	SPU_019226	contains PRK08581 domain. Sp-specific protein.	none
GLEAN3_19265	SPU_019265	none	none
GLEAN3_19305	SPU_019305	none	none
GLEAN3_19310	SPU_019310	contains 2 Smc domain motifs and 2 SMC_N domain motifs. Sp-specific protein.	none
GLEAN3_19336	SPU_019336	contains 3 NDPk superfamily motifs. Sp-specific protein.	none
GLEAN3_19347	SPU_019347	contains COG5635 domain. large protein family specific to S. purpuratus and Branchiostoma floridae.	none
GLEAN3_19359	SPU_019359	none	none
GLEAN3_19371	SPU_019371	contains Hamartin domain and Smc domain	none
GLEAN3_19380	SPU_019380	contains Smc domain	none
GLEAN3_19383	SPU_019383	none	none
GLEAN3_19416	SPU_019416	none	none
GLEAN3_19461	SPU_019461	contains HECTc superfamily motif and HECTc domain at C-terminus	none
GLEAN3_19525	SPU_019525	none	none
GLEAN3_19556	SPU_019556	contains 3 P_loop_NTPase superfamily motifs	none
GLEAN3_19564	SPU_019564	contains RING superfamily motif near N-terminus	none
GLEAN3_19570	SPU_019570	none	none
GLEAN3_19613	SPU_019613	none	none
GLEAN3_19724	SPU_019724	contains Smc domain and SMC_N domain	none
GLEAN3_19729	SPU_019729	contains 2 TPR superfamily motifs	none
GLEAN3_19745	SPU_019745	contains C2 superfamily motif near C-terminus	none
GLEAN3_19765	SPU_019765	contains COG4913 domain. Sp-specific protein family.	none
GLEAN3_19785	SPU_019785	none	none
GLEAN3_19820	SPU_019820	contains 10 Gal-3-0_sulfotr superfamily motifs	none
GLEAN3_19840	SPU_019840	none	none
GLEAN3_19854	SPU_019854	contains 4 HYR superfamily motifs and 7 EGF_CA superfamily motifs	none
GLEAN3_19855	SPU_019855	none	none
GLEAN3_19865	SPU_019865	contains 8 FA58C superfamily motifs and 18 EGF_CA superfamily motifs	none
GLEAN3_19884	SPU_019884	contains DYN1 domain and Dynein_heavy domain	none
GLEAN3_19923	SPU_019923	none	none
GLEAN3_19936	SPU_019936	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_19953	SPU_019953	contains 2 SMC_N domain motifs	none
GLEAN3_19991	SPU_019991	contains 16 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_19994	SPU_019994	none	none
GLEAN3_20044	SPU_020044	contains 4 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20052	SPU_020052	contains 3 UBQ superfamily motifs and SMC_N domain and PRK12704 domain	none
GLEAN3_20072	SPU_020072	none	none
GLEAN3_20076	SPU_020076	contains 6 EGF_CA superfamily motifs	none
GLEAN3_20083	SPU_020083	contains 3 PH-like superfamily motifs and 2 zf-RING superfamily motifs	none
GLEAN3_20097	SPU_020097	contains MDN1 domain	none
GLEAN3_20132	SPU_020132	contains COG1112 domain and Keratin_B2 domain	none
GLEAN3_20167	SPU_020167	contains MDN1 domain	none
GLEAN3_20183	SPU_020183	contains 2 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20216	SPU_020216	none	none
GLEAN3_20238	SPU_020238	contains Smc domain and SMC_N domain	none
GLEAN3_20288	SPU_020288	contains KAP95 domain	none
GLEAN3_20307	SPU_020307	contains 4 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20315	SPU_020315	contains 3 CUB superfamily motifs and 5 LDLa superfamily motifs	none
GLEAN3_20320	SPU_020320	contains 7 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20394	SPU_020394	contains 4 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20410	SPU_020410	contains PAT1 domain	none
GLEAN3_20436	SPU_020436	contains COG5635 domain	none
GLEAN3_20442	SPU_020442	contains Bromodomain superfamily motif at C-terminue	none
GLEAN3_20453	SPU_020453	none	none
GLEAN3_20455	SPU_020455	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20459	SPU_020459	contains GOG10033 domain	none
GLEAN3_20484	SPU_020484	none	none
GLEAN3_20508	SPU_020508	none	none
GLEAN3_20526	SPU_020526	contains SMC_N domain and Smc domain	none
GLEAN3_20561	SPU_020561	contains COG5635 domain	none
GLEAN3_20605	SPU_020605	contains 2 Herpes_teg_N superfamily motifs and 4 PRK13596 domain motifs	none
GLEAN3_20616	SPU_020616	contains 2 SAM superfamily motifs	none
GLEAN3_20625	SPU_020625	contains 2 P_loop_NTPase superfamily motifs	none
GLEAN3_20645	SPU_020645	none	none
GLEAN3_20659	SPU_020659	none	none
GLEAN3_20662	SPU_020662	none	none
GLEAN3_20731	SPU_020731	none	none
GLEAN3_20770	SPU_020770	contains XPG superfamily motif at N-terminus	none
GLEAN3_20809	SPU_020809	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20820	SPU_020820	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20824	SPU_020824	contains 6 CUB superfamily motifs	none
GLEAN3_20828	SPU_020828	contains COG1112 domain and Keratin_B2 domain	none
GLEAN3_20874	SPU_020874	none	none
GLEAN3_20909	SPU_020909	contains 3 IPT superfamily motifs	none
GLEAN3_20944	SPU_020944	contains 7 CCP superfamily motifs and 2 HYR superfamily motifs	none
GLEAN3_20971	SPU_020971	contains 24 HYR superfamily motifs	none
GLEAN3_20976	SPU_020976	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_20980	SPU_020980	contains 2 IBR superfamily motifs	none
GLEAN3_21024	SPU_021024	none	none
GLEAN3_21025	SPU_021025	none	none
GLEAN3_21036	SPU_021036	none	none
GLEAN3_21045	SPU_021045	none	none
GLEAN3_21076	SPU_021076	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21079	SPU_021079	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21084	SPU_021084	none	none
GLEAN3_21089	SPU_021089	contains 4 HYR superfamily motifs and RecD domain	none
GLEAN3_21106	SPU_021106	none	none
GLEAN3_21142	SPU_021142	none	none
GLEAN3_21152	SPU_021152	contains 17 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21196	SPU_021196	none	none
GLEAN3_21222	SPU_021222	contains Smc domain	none
GLEAN3_21234	SPU_021234	none	none
GLEAN3_21266	SPU_021266	contains MFS_1 domain	none
GLEAN3_21284	SPU_021284	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21296	SPU_021296	none	none
GLEAN3_21318	SPU_021318	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21411	SPU_021411	contains HEC1 domain	none
GLEAN3_21419	SPU_021419	contains 2 PHD superfamily motifs and MDN1 domain	none
GLEAN3_21440	SPU_021440	none	none
GLEAN3_21448	SPU_021448	contains MRS6 domain	none
GLEAN3_21547	SPU_021547	contains 4 MBT superfamily motifs	none
GLEAN3_21600	SPU_021600	contains 4 EGF_CA superfamily motifs	none
GLEAN3_21603	SPU_021603	none	none
GLEAN3_21619	SPU_021619	none	none
GLEAN3_21622	SPU_021622	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21642	SPU_021642	none	none
GLEAN3_21665	SPU_021665	Sp-specific protein	none
GLEAN3_21678	SPU_021678	none	none
GLEAN3_21681	SPU_021681	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21697	SPU_021697	none	none
GLEAN3_21728	SPU_021728	contains 3 Peptidase_C19 superfamily motifs	none
GLEAN3_21767	SPU_021767	none	none
GLEAN3_21785	SPU_021785	contains 2 Esterase_lipase superfamily motifs	none
GLEAN3_21808	SPU_021808	contains 2 ANK superfamily motifs	none
GLEAN3_21821	SPU_021821	none	none
GLEAN3_21822	SPU_021822	contains COG1112 domain	none
GLEAN3_21866	SPU_021866	none	none
GLEAN3_21896	SPU_021896	none	none
GLEAN3_21922	SPU_021922	none	none
GLEAN3_21954	SPU_021954	contains 6 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_21958	SPU_021958	none	none
GLEAN3_21959	SPU_021959	contains 7 PDZ superfamily motifs	none
GLEAN3_21969	SPU_021969	none	none
GLEAN3_21973	SPU_021973	none	none
GLEAN3_21976	SPU_021976	none	none
GLEAN3_21992	SPU_021992	none	none
GLEAN3_22011	SPU_022011	none	none
GLEAN3_22013	SPU_022013	contains 2 Smc domain motifs	none
GLEAN3_22017	SPU_022017	Sp-specific protein	none
GLEAN3_22058	SPU_022058	contains Fib_alpha domain and SMC_N domain	none
GLEAN3_22087	SPU_022087	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_22204	SPU_022204	none	none
GLEAN3_22311	SPU_022311	none	none
GLEAN3_22315	SPU_022315	contains 8 EGF_CA superfamily motifs	none
GLEAN3_22341	SPU_022341	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_22345	SPU_022345	none	none
GLEAN3_22355	SPU_022355	none	none
GLEAN3_22361	SPU_022361	none	none
GLEAN3_22368	SPU_022368	contains MDN1 domain	none
GLEAN3_22388	SPU_022388	contains 17 ANK superfamily motifs and Arp domain motifs and 2 ZU5 superfamily motifs	none
GLEAN3_22393	SPU_022393	none	none
GLEAN3_22430	SPU_022430	contains DNA_pol_B2 domain	none
GLEAN3_22443	SPU_022443	contains 9 MAM superfamily motifs and 3 LDLa superfamily motifs	none
GLEAN3_22449	SPU_022449	none	none
GLEAN3_22485	SPU_022485	none	none
GLEAN3_22500	SPU_022500	contains 4 PDZ superfamily motifs	none
GLEAN3_22509	SPU_022509	none	none
GLEAN3_22514	SPU_022514	contains Pneumo_att_G domain	none
GLEAN3_22544	SPU_022544	contains 6 SRCR superfamily motifs	none
GLEAN3_22553	SPU_022553	contains 6 ANK superfamily motifs and Arp domain motifs and 2 ZU5 superfamily motifs	none
GLEAN3_22559	SPU_022559	contains 2 BBOX superfamily motifs and GRASP55_65 domain	none
GLEAN3_22563	SPU_022563	none	none
GLEAN3_22582	SPU_022582	contains 6 EGF_CA superfamily motifs and 3 HYR superfamily motifs	none
GLEAN3_22618	SPU_022618	contains 17 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_22642	SPU_022642	contains 3 ANK superfamily motifs	none
GLEAN3_22690	SPU_022690	contains f1hF domain	none
GLEAN3_22766	SPU_022766	none	none
GLEAN3_22787	SPU_022787	none	none
GLEAN3_22808	SPU_022808	contains SbcC domain	none
GLEAN3_22819	SPU_022819	none	none
GLEAN3_22822	SPU_022822	none	none
GLEAN3_22825	SPU_022825	contains 5 CUB superfamily motifs	none
GLEAN3_22831	SPU_022831	none	none
GLEAN3_22905	SPU_022905	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22970	SPU_022970	none	none
GLEAN3_22973	SPU_022973	none	none
GLEAN3_23020	SPU_023020	none	none
GLEAN3_23060	SPU_023060	contains 22 MAM superfamily motifs and 10 LDLa superfamily motifs	none
GLEAN3_23072	SPU_023072	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_23087	SPU_023087	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_23093	SPU_023093	contains MDN1 domain	none
GLEAN3_23102	SPU_023102	none	none
GLEAN3_23142	SPU_023142	contains 4 SPEC superfamily motifs and 8 IG superfamily motifs	none
GLEAN3_23258	SPU_023258	none	none
GLEAN3_23267	SPU_023267	contains 2 PHD superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23282	SPU_023282	contains 3 CUB superfamily motifs	none
GLEAN3_23300	SPU_023300	none	none
GLEAN3_23302	SPU_023302	none	none
GLEAN3_23313	SPU_023313	none	none
GLEAN3_23324	SPU_023324	none	none
GLEAN3_23358	SPU_023358	none	none
GLEAN3_23370	SPU_023370	none	none
GLEAN3_23404	SPU_023404	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_23412	SPU_023412	none	none
GLEAN3_23454	SPU_023454	contains 2 DHC_N1 domain motifs	none
GLEAN3_23477	SPU_023477	contains 7 Ldl_re superfamily motifs	none
GLEAN3_23483	SPU_023483	none	none
GLEAN3_23545	SPU_023545	contains 5 MAM superfamily motifs	none
GLEAN3_23570	SPU_023570	contains HRD1 domain	none
GLEAN3_23596	SPU_023596	none	none
GLEAN3_23665	SPU_023665	none	none
GLEAN3_23725	SPU_023725	none	none
GLEAN3_23745	SPU_023745	contains 2 EGF_CA superfamily motifs	none
GLEAN3_23746	SPU_023746	contains COG1112 domain and SSL2 domain	none
GLEAN3_23749	SPU_023749	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_23774	SPU_023774	none	none
GLEAN3_23775	SPU_023775	none	none
GLEAN3_23795	SPU_023795	none	none
GLEAN3_23796	SPU_023796	contains 9 MAM superfamily motifs	none
GLEAN3_23844	SPU_023844	none	none
GLEAN3_23877	SPU_023877	none	none
GLEAN3_23921	SPU_023921	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_23929	SPU_023929	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23956	SPU_023956	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23996	SPU_023996	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_24018	SPU_024018	none	none
GLEAN3_24066	SPU_024066	none	none
GLEAN3_24125	SPU_024125	contains 2 rve superfamily motifs	none
GLEAN3_24126	SPU_024126	none	none
GLEAN3_24129	SPU_024129	none	none
GLEAN3_24135	SPU_024135	none	none
GLEAN3_24160	SPU_024160	none	none
GLEAN3_24175	SPU_024175	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24176	SPU_024176	contains Actin superfamily motif at C-terminus	none
GLEAN3_24223	SPU_024223	contains 3 RRM superfamily motifs	none
GLEAN3_24242	SPU_024242	contains 34 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_24270	SPU_024270	none	none
GLEAN3_24281	SPU_024281	contains PRK02362 domain	none
GLEAN3_24282	SPU_024282	none	none
GLEAN3_24284	SPU_024284	none	none
GLEAN3_24285	SPU_024285	none	none
GLEAN3_24309	SPU_024309	none	none
GLEAN3_24435	SPU_024435	none	none
GLEAN3_24466	SPU_024466	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24491	SPU_024491	none	none
GLEAN3_24521	SPU_024521	contains SMC_N domain	none
GLEAN3_24552	SPU_024552	none	none
GLEAN3_24558	SPU_024558	contains recD domain	none
GLEAN3_24592	SPU_024592	none	none
GLEAN3_24609	SPU_024609	contains 9 HYR superfamily motifs and recD domain	none
GLEAN3_24626	SPU_024626	none	none
GLEAN3_24647	SPU_024647	contains Smc domain	none
GLEAN3_24652	SPU_024652	none	none
GLEAN3_24663	SPU_024663	contains 16 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_24695	SPU_024695	none	none
GLEAN3_24771	SPU_024771	contains COG5222 domain	none
GLEAN3_24834	SPU_024834	contains 7 MAM superfamily motifs and 3 LDLa superfamily motifs	none
GLEAN3_24856	SPU_024856	contains 5 SPEC superfamily motifs	none
GLEAN3_24945	SPU_024945	none	none
GLEAN3_24950	SPU_024950	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24958	SPU_024958	Strongly homologous to mammalian hypothetical proteins.	none
GLEAN3_24976	SPU_024976	none	none
GLEAN3_25012	SPU_025012	none	none
GLEAN3_25027	SPU_025027	contains COG4783 domain	none
GLEAN3_25030	SPU_025030	contains 3 PDZ superfamily motifs	none
GLEAN3_25064	SPU_025064	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25117	SPU_025117	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25137	SPU_025137	none	none
GLEAN3_25161	SPU_025161	none	none
GLEAN3_25197	SPU_025197	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25220	SPU_025220	contains SbcC domain and SMC_N domain	none
GLEAN3_25255	SPU_025255	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25274	SPU_025274	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25286	SPU_025286	none	none
GLEAN3_25354	SPU_025354	none	none
GLEAN3_25383	SPU_025383	none	none
GLEAN3_25385	SPU_025385	none	none
GLEAN3_25388	SPU_025388	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25399	SPU_025399	none	none
GLEAN3_25427	SPU_025427	none	none
GLEAN3_25456	SPU_025456	contains 2 7tm_2 superfamily motifs	none
GLEAN3_25494	SPU_025494	none	none
GLEAN3_25496	SPU_025496	none	none
GLEAN3_25536	SPU_025536	none	none
GLEAN3_25573	SPU_025573	contains PAT1 domain	none
GLEAN3_25603	SPU_025603	contains 19 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25609	SPU_025609	none	none
GLEAN3_25610	SPU_025610	none	none
GLEAN3_25618	SPU_025618	none	none
GLEAN3_25624	SPU_025624	contains 2 Bromo_TFIID superfamily motifs	none
GLEAN3_25630	SPU_025630	contains COG3899 domain	none
GLEAN3_25667	SPU_025667	contains 22 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25691	SPU_025691	none	none
GLEAN3_25723	SPU_025723	none	none
GLEAN3_25889	SPU_025889	none	none
GLEAN3_25724	SPU_025724	none	none
GLEAN3_25769	SPU_025769	contains 24 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_25781	SPU_025781	none	none
GLEAN3_25846	SPU_025846	none	none
GLEAN3_25869	SPU_025869	none	none
GLEAN3_25876	SPU_025876	contains PRK10416 domain	none
GLEAN3_25900	SPU_025900	none	none
GLEAN3_25913	SPU_025913	none	none
GLEAN3_25987	SPU_025987	contains COG1413 domain	none
GLEAN3_26006	SPU_026006	contains 13 HYR superfamily motifs	none
GLEAN3_26018	SPU_026018	contains infB domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26041	SPU_026041	none	none
GLEAN3_26053	SPU_026053	none	none
GLEAN3_26069	SPU_026069	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_26097	SPU_026097	none	none
GLEAN3_26111	SPU_026111	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26121	SPU_026121	contains 14 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_26126	SPU_026126	none	none
GLEAN3_26131	SPU_026131	contains 3 UDPGT domain motifs	none
GLEAN3_26151	SPU_026151	contains COG1112 domain	none
GLEAN3_26152	SPU_026152	contains COG1112 domain and Furin-1 domain	none
GLEAN3_26157	SPU_026157	none	none
GLEAN3_26191	SPU_026191	contains 4 LDLa superfamily motifs and 2 CUB superfamily motifs and 2 Tryp_SPc superfamily motifs	none
GLEAN3_26215	SPU_026215	none	none
GLEAN3_26266	SPU_026266	none	none
GLEAN3_26267	SPU_026267	none	none
GLEAN3_26279	SPU_026279		none
GLEAN3_26293	SPU_026293	contains 18 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_26342	SPU_026342	contains DYN1 domain	none
GLEAN3_26356	SPU_026356	none	none
GLEAN3_26412	SPU_026412	contains PRK08691 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26417	SPU_026417	contains 5 EGF_CA superfamily motifs and 3 CUB superfamily motifs	none
GLEAN3_26433	SPU_026433	none	none
GLEAN3_26454	SPU_026454	contains 6 MAM superfamily motifs	none
GLEAN3_26456	SPU_026456	none	none
GLEAN3_26517	SPU_026517	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26526	SPU_026526	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26527	SPU_026527	none	none
GLEAN3_26543	SPU_026543	contains 2 CCP superfamily motifs and 5 HYR superfamily motifs and 11 EGF_CA superfamily motifs	none
GLEAN3_26567	SPU_026567	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26569	SPU_026569	contains 2 CUB superfamily motifs and 8 EGF_CA superfamily motifs	none
GLEAN3_26607	SPU_026607	contains 18 MAM superfamily motifs	none
GLEAN3_26623	SPU_026623	contains 2 KAZAL_FS superfamily motifs at N-terminus and 1 KAZAL_FS superfamily motif at C-terminus. 5383 amino acids. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26704	SPU_026704	none	none
GLEAN3_26733	SPU_026733	contains CDC6 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26773	SPU_026773	contains 2 PBP1_GPCR_family_C-like superfamily motifs and 5 Periplasmic_Binding_Protein superfamily motifs and ANF_receptor domain	none
GLEAN3_26785	SPU_026785	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26797	SPU_026797	none	none
GLEAN3_26803	SPU_026803	none	none
GLEAN3_26804	SPU_026804	none	none
GLEAN3_26836	SPU_026836	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26847	SPU_026847	none	none
GLEAN3_26896	SPU_026896	none	none
GLEAN3_26939	SPU_026939	contains RecD domain	none
GLEAN3_26952	SPU_026952	none	none
GLEAN3_26976	SPU_026976	contains 12 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_27013	SPU_027013	none	none
GLEAN3_27025	SPU_027025	contains COG1112 domain	none
GLEAN3_27095	SPU_027095	contains 6 SRCR superfamily motifs and COG4886 domain	none
GLEAN3_27108	SPU_027108	none	none
GLEAN3_27109	SPU_027109	none	none
GLEAN3_27148	SPU_027148	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27257	SPU_027257	none	none
GLEAN3_27259	SPU_027259	none	none
GLEAN3_27260	SPU_027260	none	none
GLEAN3_27263	SPU_027263	none	none
GLEAN3_27264	SPU_027264	none	none
GLEAN3_27265	SPU_027265	none	none
GLEAN3_27266	SPU_027266	none	none
GLEAN3_27267	SPU_027267	none	none
GLEAN3_27286	SPU_027286	none	none
GLEAN3_27302	SPU_027302	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27310	SPU_027310	contains 3 MRS6 domain motifs	none
GLEAN3_27313	SPU_027313	none	none
GLEAN3_27329	SPU_027329	contains 2 TST_Repeat superfamily motifs and 2 RING superfamily motifs and SseA domain and UBP5 domain	none
GLEAN3_27330	SPU_027330	none	none
GLEAN3_27333	SPU_027333	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_27338	SPU_027338	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27368	SPU_027368	none	none
GLEAN3_27387	SPU_027387	none	none
GLEAN3_27403	SPU_027403	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27465	SPU_027465	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_27469	SPU_027469	none	none
GLEAN3_27489	SPU_027489	none	none
GLEAN3_27509	SPU_027509	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27544	SPU_027544	none	none
GLEAN3_27546	SPU_027546	none	none
GLEAN3_27601	SPU_027601	none	none
GLEAN3_27602	SPU_027602	none	none
GLEAN3_27605	SPU_027605	none	none
GLEAN3_27615	SPU_027615	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27637	SPU_027637	none	none
GLEAN3_27653	SPU_027653	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27665	SPU_027665	none	none
GLEAN3_27668	SPU_027668	contains 21 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_27678	SPU_027678	none	none
GLEAN3_27679	SPU_027679	none	none
GLEAN3_27691	SPU_027691	none	none
GLEAN3_27696	SPU_027696	contains COG4942 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27762	SPU_027762	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27772	SPU_027772	contains 13 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_27782	SPU_027782	none	none
GLEAN3_27790	SPU_027790	contains MDN1 domain	none
GLEAN3_27820	SPU_027820	none	none
GLEAN3_27821	SPU_027821	none	none
GLEAN3_27829	SPU_027829	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27861	SPU_027861	contains 6 MAM superfamily motifs	none
GLEAN3_27862	SPU_027862	contains 6 MAM superfamily motifs	none
GLEAN3_27898	SPU_027898	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27939	SPU_027939	contains 2 Peptidase_C54 superfamily motifs	none
GLEAN3_27959	SPU_027959	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27998	SPU_027998	none	none
GLEAN3_28009	SPU_028009	contains PH-like superfamily motif at C-terminus	none
GLEAN3_28014	SPU_028014	contains CYK3 domain	none
GLEAN3_28029	SPU_028029	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28064	SPU_028064	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28076	SPU_028076	contains RecD domain	none
GLEAN3_28196	SPU_028196	contains SbcC domain and SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28197	SPU_028197	none	none
GLEAN3_28251	SPU_028251	contains 3 C2 superfamily motifs	none
GLEAN3_28256	SPU_028256	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28260	SPU_028260	none	none
GLEAN3_28291	SPU_028291	contains 11 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28292	SPU_028292	contains 10 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28293	SPU_028293	contains 8 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28303	SPU_028303	contains RecD domain	none
GLEAN3_28333	SPU_028333	contains SNF2_N domain	none
GLEAN3_28340	SPU_028340	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28349	SPU_028349	none	none
GLEAN3_28361	SPU_028361	contains 16 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28383	SPU_028383	none	none
GLEAN3_28388	SPU_028388	none	none
GLEAN3_28393	SPU_028393	contains 14 EGF_CA superfamily motifs	none
GLEAN3_28415	SPU_028415	contains 15 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28439	SPU_028439	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28481	SPU_028481	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28508	SPU_028508	none	none
GLEAN3_28512	SPU_028512	contains 4 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28533	SPU_028533	contains 6 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28535	SPU_028535	contains 7 LDL_recept_b superfamily motifs and 2 HYR superfamily motifs and 3 CCP superfamily motifs	none
GLEAN3_28571	SPU_028571	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28585	SPU_028585	none	none
GLEAN3_28599	SPU_028599	none	none
GLEAN3_28600	SPU_028600	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28619	SPU_028619	none	none
GLEAN3_28640	SPU_028640	none	none
GLEAN3_28695	SPU_028695	none	none
GLEAN3_28713	SPU_028713	contains Mitofilin domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28745	SPU_028745	contains 20 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28768	SPU_028768	none	none
GLEAN3_28779	SPU_028779	contains COG1413 domain	none
GLEAN3_28786	SPU_028786	contains DYN1 domain and 3 Dynein_heavy domains	none
GLEAN3_28788	SPU_028788	contains COG4886 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28812	SPU_028812	none	none
GLEAN3_28813	SPU_028813	none	none
GLEAN3_28877	SPU_028877	none	none
GLEAN3_28890	SPU_028890	contains 9 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_28909	SPU_028909	none	none
GLEAN3_28916	SPU_028916	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28942	SPU_028942	none	none
GLEAN3_28943	SPU_028943	none	none
GLEAN3_00003	SPU_000003	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00004	SPU_000004	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00040	SPU_000040	none	none
GLEAN3_00079	SPU_000079	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00081	SPU_000081	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00087	SPU_000087	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00100	SPU_000100	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00133	SPU_000133	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00182	SPU_000182	none	none
GLEAN3_00185	SPU_000185	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00189	SPU_000189	none	none
GLEAN3_00250	SPU_000250	none	none
GLEAN3_00261	SPU_000261	none	none
GLEAN3_00320	SPU_000320	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00325	SPU_000325	contains COG4886 domain	none
GLEAN3_00335	SPU_000335	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00339	SPU_000339	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00356	SPU_000356	none	none
GLEAN3_00358	SPU_000358	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00361	SPU_000361	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00367	SPU_000367	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00368	SPU_000368	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00369	SPU_000369	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00370	SPU_000370	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00372	SPU_000372	none	none
GLEAN3_00377	SPU_000377	none	none
GLEAN3_00381	SPU_000381	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00382	SPU_000382	contains Glyco_hydro_47 superfamily motif at C-terminus	none
GLEAN3_00389	SPU_000389	none	none
GLEAN3_00397	SPU_000397	none	none
GLEAN3_00401	SPU_000401	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00410	SPU_000410	contains 6 EGF_CA superfamily motifs	none
GLEAN3_00412	SPU_000412	none	none
GLEAN3_00413	SPU_000413	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00419	SPU_000419	contains 2 EGF_CA superfamily motifs	none
GLEAN3_00432	SPU_000432	none	none
GLEAN3_00439	SPU_000439	none	none
GLEAN3_00452	SPU_000452	none	none
GLEAN3_00459	SPU_000459	none	none
GLEAN3_00462	SPU_000462	none	none
GLEAN3_00479	SPU_000479	none	none
GLEAN3_00481	SPU_000481		none
GLEAN3_00488	SPU_000488	none	none
GLEAN3_00491	SPU_000491	none	none
GLEAN3_00498	SPU_000498	none	none
GLEAN3_00516	SPU_000516	none	none
GLEAN3_00517	SPU_000517	none	none
GLEAN3_00519	SPU_000519	none	none
GLEAN3_00525	SPU_000525	none	none
GLEAN3_00527	SPU_000527	contains 2 CUB superfamily motifs	none
GLEAN3_00528	SPU_000528	none	none
GLEAN3_00530	SPU_000530	none	none
GLEAN3_00535	SPU_000535	none	none
GLEAN3_00536	SPU_000536	none	none
GLEAN3_00537	SPU_000537	none	none
GLEAN3_00555	SPU_000555	contains NADB_Rossmann superfamily motif in the C-terminal half. probable assembly chimera.	none
GLEAN3_00564	SPU_000564	contains 2 ANK superfamily motifs	none
GLEAN3_00568	SPU_000568	none	none
GLEAN3_00569	SPU_000569	none	none
GLEAN3_00572	SPU_000572	none	none
GLEAN3_00576	SPU_000576	none	none
GLEAN3_00585	SPU_000585	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_00586	SPU_000586	contains LdhA domain	none
GLEAN3_00589	SPU_000589	none	none
GLEAN3_00593	SPU_000593	none	none
GLEAN3_00605	SPU_000605	none	none
GLEAN3_00616	SPU_000616	none	none
GLEAN3_00620	SPU_000620	contains 2 LDLa superfamily motifs	none
GLEAN3_00622	SPU_000622	none	none
GLEAN3_00624	SPU_000624	contains PAT1 domain	none
GLEAN3_00626	SPU_000626	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00628	SPU_000628	contains Arp domain	none
GLEAN3_00629	SPU_000629	none	none
GLEAN3_00630	SPU_000630	none	none
GLEAN3_00632	SPU_000632	none	none
GLEAN3_00634	SPU_000634	none	none
GLEAN3_00640	SPU_000640	contains Smc domain and Reo_sigma1 domain	none
GLEAN3_00653	SPU_000653	contains 2 FA58C superfamily motifs	none
GLEAN3_00656	SPU_000656	none	none
GLEAN3_00658	SPU_000658	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00660	SPU_000660	contains GatA domain	none
GLEAN3_00662	SPU_000662	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00671	SPU_000671	contains RING superfamily motif and COG5540 domain at C-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00681	SPU_000681	none	none
GLEAN3_00682	SPU_000682	contains Sulfotransfer_1 domain	none
GLEAN3_00683	SPU_000683	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00684	SPU_000684	contains COG0429 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00693	SPU_000693	none	none
GLEAN3_00695	SPU_000695	contains 2 WW superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00698	SPU_000698	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00699	SPU_000699	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00700	SPU_000700	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00701	SPU_000701	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00703	SPU_000703	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00704	SPU_000704	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00705	SPU_000705	contains RAD18 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00711	SPU_000711	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00712	SPU_000712	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00713	SPU_000713	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00718	SPU_000718	contains Sultotransfer_1 domain	none
GLEAN3_00719	SPU_000719	none	none
GLEAN3_00720	SPU_000720	none	none
GLEAN3_00728	SPU_000728	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00732	SPU_000732	none	none
GLEAN3_00733	SPU_000733	none	none
GLEAN3_00741	SPU_000741	none	none
GLEAN3_00763	SPU_000763	none	none
GLEAN3_00767	SPU_000767	none	none
GLEAN3_00780	SPU_000780	none	none
GLEAN3_00783	SPU_000783	none	none
GLEAN3_00796	SPU_000796	none	none
GLEAN3_00800	SPU_000800	none	none
GLEAN3_00804	SPU_000804	contains CALCOCO1 domain	none
GLEAN3_00808	SPU_000808	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00810	SPU_000810	none	none
GLEAN3_00819	SPU_000819	none	none
GLEAN3_00840	SPU_000840	none	none
GLEAN3_00860	SPU_000860	none	none
GLEAN3_00878	SPU_000878	none	none
GLEAN3_00880	SPU_000880	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00895	SPU_000895	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00902	SPU_000902	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00903	SPU_000903	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00905	SPU_000905	none	none
GLEAN3_00913	SPU_000913	none	none
GLEAN3_00916	SPU_000916	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00917	SPU_000917	contains 3 LDL_recept_b superfamily motifs	none
GLEAN3_00928	SPU_000928	contains 2 TM2 superfamily motifs	none
GLEAN3_00929	SPU_000929	contains 3 TM2 superfamily motifs	none
GLEAN3_00930	SPU_000930	contains 2 TM2 superfamily motifs	none
GLEAN3_00940	SPU_000940	none	none
GLEAN3_00942	SPU_000942	contains 2 LuxE superfamily motifs	none
GLEAN3_00944	SPU_000944	none	none
GLEAN3_00956	SPU_000956	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00968	SPU_000968	none	none
GLEAN3_00978	SPU_000978	none	none
GLEAN3_00979	SPU_000979	contains Sulfotransfer_1 domain	none
GLEAN3_00980	SPU_000980	none	none
GLEAN3_00982	SPU_000982	none	none
GLEAN3_00983	SPU_000983	contains 2 TSP_1 superfamily motifs	none
GLEAN3_00989	SPU_000989	contains 2 MAM superfamily motifs	none
GLEAN3_01007	SPU_001007	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01008	SPU_001008	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01028	SPU_001028	none	none
GLEAN3_01031	SPU_001031	none	none
GLEAN3_01032	SPU_001032	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01034	SPU_001034	none	none
GLEAN3_01046	SPU_001046	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01050	SPU_001050	none	none
GLEAN3_01051	SPU_001051	none	none
GLEAN3_01059	SPU_001059	none	none
GLEAN3_01061	SPU_001061	none	none
GLEAN3_01065	SPU_001065	none	none
GLEAN3_01076	SPU_001076	none	none
GLEAN3_01077	SPU_001077	none	none
GLEAN3_01078	SPU_001078	contains MRP-S27 domain	none
GLEAN3_01080	SPU_001080	contains mutL domain	none
GLEAN3_01082	SPU_001082	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01087	SPU_001087	none	none
GLEAN3_01095	SPU_001095	none	none
GLEAN3_01096	SPU_001096	none	none
GLEAN3_01102	SPU_001102	contains SMC_N domain	none
GLEAN3_01105	SPU_001105	none	none
GLEAN3_01108	SPU_001108	contains COG4886 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01115	SPU_001115	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01122	SPU_001122	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01133	SPU_001133	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_01145	SPU_001145	contains metH domain	none
GLEAN3_01149	SPU_001149	none	none
GLEAN3_01162	SPU_001162	none	none
GLEAN3_01166	SPU_001166	none	none
GLEAN3_01168	SPU_001168	none	none
GLEAN3_01173	SPU_001173	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01176	SPU_001176	none	none
GLEAN3_01179	SPU_001179	none	none
GLEAN3_01180	SPU_001180	none	none
GLEAN3_01186	SPU_001186	none	none
GLEAN3_01196	SPU_001196	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01197	SPU_001197	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01212	SPU_001212	contains Na_H_Exchanger domain	none
GLEAN3_01219	SPU_001219	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01220	SPU_001220	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01223	SPU_001223	none	none
GLEAN3_01224	SPU_001224	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01242	SPU_001242	none	none
GLEAN3_01246	SPU_001246	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01247	SPU_001247	none	none
GLEAN3_01248	SPU_001248	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01250	SPU_001250	contains 2 Ras-like_GTPase superfamily motifs	none
GLEAN3_01265	SPU_001265	contains 2 PHD superfamily motifs and SFP1 domain	none
GLEAN3_01289	SPU_001289	none	none
GLEAN3_01295	SPU_001295	contains FRQ1 domain	none
GLEAN3_01297	SPU_001297	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01304	SPU_001304	none	none
GLEAN3_01305	SPU_001305	none	none
GLEAN3_01313	SPU_001313	none	none
GLEAN3_01316	SPU_001316	contains Sulfotransfer_1 domain	none
GLEAN3_01329	SPU_001329	none	none
GLEAN3_01331	SPU_001331	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01334	SPU_001334	none	none
GLEAN3_01337	SPU_001337	contains 2 RF-1 superfamily motifs and prfA domain	none
GLEAN3_01339	SPU_001339	none	none
GLEAN3_01345	SPU_001345	contains 10 EGF_CA superfamily motifs	none
GLEAN3_01353	SPU_001353	none	none
GLEAN3_01361	SPU_001361	none	none
GLEAN3_01367	SPU_001367	none	none
GLEAN3_01372	SPU_001372	none	none
GLEAN3_01381	SPU_001381	contains 2 MFS superfamily motifs	none
GLEAN3_01386	SPU_001386	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01389	SPU_001389	none	none
GLEAN3_01390	SPU_001390	none	none
GLEAN3_01391	SPU_001391	contains COG3391 domain	none
GLEAN3_01407	SPU_001407	none	none
GLEAN3_01411	SPU_001411	none	none
GLEAN3_01412	SPU_001412	contains 2 DUF605 domain motifs	none
GLEAN3_01413	SPU_001413	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01418	SPU_001418	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01419	SPU_001419	none	none
GLEAN3_01425	SPU_001425	none	none
GLEAN3_01426	SPU_001426	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01429	SPU_001429	none	none
GLEAN3_01438	SPU_001438	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01442	SPU_001442	none	none
GLEAN3_01447	SPU_001447	none	none
GLEAN3_01448	SPU_001448	contains DNA_pol_B_2 domain	none
GLEAN3_01456	SPU_001456	only N-terminal 80 amino acids have homologies to other proteins. probable assembly chimera.	none
GLEAN3_01462	SPU_001462	contains B41 domain	none
GLEAN3_01464	SPU_001464	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01468	SPU_001468	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01476	SPU_001476	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01479	SPU_001479	contains 2 NPR2 superfamily motifs	none
GLEAN3_01485	SPU_001485	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01486	SPU_001486	none	none
GLEAN3_01487	SPU_001487	none	none
GLEAN3_01488	SPU_001488	none	none
GLEAN3_01490	SPU_001490	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01499	SPU_001499	contains Sulfotransfer_1 domain	none
GLEAN3_01502	SPU_001502	none	none
GLEAN3_01503	SPU_001503	none	none
GLEAN3_01504	SPU_001504	none	none
GLEAN3_01511	SPU_001511	none	none
GLEAN3_01530	SPU_001530	none	none
GLEAN3_01535	SPU_001535	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_01537	SPU_001537	none	none
GLEAN3_01544	SPU_001544	none	none
GLEAN3_01546	SPU_001546	none	none
GLEAN3_01547	SPU_001547	none	none
GLEAN3_01554	SPU_001554	none	none
GLEAN3_01557	SPU_001557	none	none
GLEAN3_01567	SPU_001567	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01580	SPU_001580	none	none
GLEAN3_01581	SPU_001581	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01587	SPU_001587	none	none
GLEAN3_01602	SPU_001602	contains 2 V-set domain motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01614	SPU_001614	none	none
GLEAN3_01617	SPU_001617	none	none
GLEAN3_01625	SPU_001625	none	none
GLEAN3_01626	SPU_001626	none	none
GLEAN3_01627	SPU_001627	contains PRK07800 domain	none
GLEAN3_01631	SPU_001631	none	none
GLEAN3_01639	SPU_001639	none	none
GLEAN3_01641	SPU_001641	contains COG4886 domain	none
GLEAN3_01642	SPU_001642	none	none
GLEAN3_01644	SPU_001644	none	none
GLEAN3_01645	SPU_001645	contains Smc domain	none
GLEAN3_01654	SPU_001654	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01655	SPU_001655	contains Sulfotransfer_1 domain	none
GLEAN3_01664	SPU_001664	none	none
GLEAN3_01666	SPU_001666	contains 2 PBPb superfamily motifs	none
GLEAN3_01667	SPU_001667	none	none
GLEAN3_01672	SPU_001672	contains COG4886 domain	none
GLEAN3_01673	SPU_001673	homolgous to bacterial and plant proteins	none
GLEAN3_01674	SPU_001674	none	none
GLEAN3_01675	SPU_001675	none	none
GLEAN3_01678	SPU_001678	none	none
GLEAN3_01687	SPU_001687	none	none
GLEAN3_01688	SPU_001688	none	none
GLEAN3_01691	SPU_001691	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01699	SPU_001699	contains DYN1 domain	none
GLEAN3_01708	SPU_001708	none	none
GLEAN3_01717	SPU_001717	none	none
GLEAN3_01719	SPU_001719	contains ComEA domain and Tex domain	none
GLEAN3_01720	SPU_001720	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01728	SPU_001728	none	none
GLEAN3_01737	SPU_001737	contains 3 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01744	SPU_001744	none	none
GLEAN3_01747	SPU_001747	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01751	SPU_001751	contains COG4870 domain	none
GLEAN3_01754	SPU_001754	contains 4 FA58C superfamily motifs	none
GLEAN3_01755	SPU_001755	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01783	SPU_001783	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01790	SPU_001790	none	none
GLEAN3_01801	SPU_001801	none	none
GLEAN3_01802	SPU_001802	none	none
GLEAN3_01806	SPU_001806	none	none
GLEAN3_01819	SPU_001819	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01834	SPU_001834	none	none
GLEAN3_01839	SPU_001839	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01841	SPU_001841	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01842	SPU_001842	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01843	SPU_001843	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01844	SPU_001844	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01845	SPU_001845	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01858	SPU_001858	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_01865	SPU_001865	none	none
GLEAN3_01867	SPU_001867	none	none
GLEAN3_01868	SPU_001868	none	none
GLEAN3_01869	SPU_001869	none	none
GLEAN3_01872	SPU_001872	none	none
GLEAN3_01876	SPU_001876	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01883	SPU_001883	none	none
GLEAN3_01886	SPU_001886	none	none
GLEAN3_01892	SPU_001892	none	none
GLEAN3_01913	SPU_001913	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01917	SPU_001917	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01920	SPU_001920	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01940	SPU_001940	contains Fanconi_C domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01941	SPU_001941	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01943	SPU_001943	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01944	SPU_001944	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01950	SPU_001950	none	none
GLEAN3_01951	SPU_001951	none	none
GLEAN3_01953	SPU_001953	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_01955	SPU_001955	none	none
GLEAN3_01963	SPU_001963	contains Pkinase domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01964	SPU_001964	none	none
GLEAN3_01965	SPU_001965	none	none
GLEAN3_01966	SPU_001966	none	none
GLEAN3_01976	SPU_001976	none	none
GLEAN3_01981	SPU_001981	contains Smc domain	none
GLEAN3_01983	SPU_001983	none	none
GLEAN3_01985	SPU_001985	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01989	SPU_001989	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01997	SPU_001997	contains 6 HYR superfamily motifs	none
GLEAN3_01999	SPU_001999	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02000	SPU_002000	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02001	SPU_002001	none	none
GLEAN3_02027	SPU_002027	none	none
GLEAN3_02034	SPU_002034	contains Borrelia_P83 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02038	SPU_002038	none	none
GLEAN3_02039	SPU_002039	none	none
GLEAN3_02040	SPU_002040	none	none
GLEAN3_02045	SPU_002045	none	none
GLEAN3_02048	SPU_002048	none	none
GLEAN3_02053	SPU_002053	none	none
GLEAN3_02064	SPU_002064	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02065	SPU_002065	none	none
GLEAN3_02077	SPU_002077	contains 2 MIB_HERC2 superfamily motifs	none
GLEAN3_02078	SPU_002078	none	none
GLEAN3_02087	SPU_002087	none	none
GLEAN3_02094	SPU_002094	none	none
GLEAN3_02097	SPU_002097	none	none
GLEAN3_02098	SPU_002098	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02099	SPU_002099	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02104	SPU_002104	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_02126	SPU_002126	none	none
GLEAN3_02131	SPU_002131	none	none
GLEAN3_02134	SPU_002134	none	none
GLEAN3_02135	SPU_002135	none	none
GLEAN3_02138	SPU_002138	none	none
GLEAN3_02143	SPU_002143	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02146	SPU_002146	none	none
GLEAN3_02155	SPU_002155	none	none
GLEAN3_02166	SPU_002166	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02167	SPU_002167	none	none
GLEAN3_02169	SPU_002169	none	none
GLEAN3_02171	SPU_002171	none	none
GLEAN3_02188	SPU_002188	none	none
GLEAN3_02208	SPU_002208	none	none
GLEAN3_02216	SPU_002216	contains Smc domain	none
GLEAN3_02221	SPU_002221	none	none
GLEAN3_02229	SPU_002229	none	none
GLEAN3_02232	SPU_002232	none	none
GLEAN3_02235	SPU_002235	none	none
GLEAN3_02237	SPU_002237	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_02239	SPU_002239	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02240	SPU_002240	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02242	SPU_002242	none	none
GLEAN3_02243	SPU_002243	contains 2 EGF_CA superfamily motifs	none
GLEAN3_02244	SPU_002244	none	none
GLEAN3_02245	SPU_002245	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02248	SPU_002248	none	none
GLEAN3_02253	SPU_002253	contains ACTIN domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02255	SPU_002255	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_02259	SPU_002259	none	none
GLEAN3_02261	SPU_002261	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02263	SPU_002263	none	none
GLEAN3_02268	SPU_002268	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_02269	SPU_002269	none	none
GLEAN3_02271	SPU_002271	none	none
GLEAN3_02275	SPU_002275	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_02276	SPU_002276	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_02277	SPU_002277	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_02283	SPU_002283	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02286	SPU_002286	none	none
GLEAN3_02287	SPU_002287	none	none
GLEAN3_02288	SPU_002288	none	none
GLEAN3_02289	SPU_002289	contains Mrp domain	none
GLEAN3_02296	SPU_002296	homologous to numerous putative Branchiostoma floridae proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02301	SPU_002301	none	none
GLEAN3_02311	SPU_002311	none	none
GLEAN3_02317	SPU_002317	none	none
GLEAN3_02331	SPU_002331	none	none
GLEAN3_02332	SPU_002332	contains PRK00142 domain	none
GLEAN3_02344	SPU_002344	contains COG2319 domain	none
GLEAN3_02345	SPU_002345	none	none
GLEAN3_02348	SPU_002348	contains Sultotransfer_1 domain	none
GLEAN3_02351	SPU_002351	none	none
GLEAN3_02358	SPU_002358	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02365	SPU_002365	none	none
GLEAN3_02388	SPU_002388	none	none
GLEAN3_02393	SPU_002393	none	none
GLEAN3_02400	SPU_002400	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02395	SPU_002395	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02403	SPU_002403	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02408	SPU_002408	none	none
GLEAN3_02410	SPU_002410	contains 2 FA58C superfamily motifs	none
GLEAN3_02419	SPU_002419	none	none
GLEAN3_02422	SPU_002422	none	none
GLEAN3_02424	SPU_002424	contains Tektin domain	none
GLEAN3_02427	SPU_002427	probable assembly chimera	none
GLEAN3_02430	SPU_002430	none	none
GLEAN3_02440	SPU_002440	contains COG3391 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02451	SPU_002451	none	none
GLEAN3_02452	SPU_002452	contains 3 CCP superfamily motifs	none
GLEAN3_02454	SPU_002454	contains 2 CCP superfamily motifs	none
GLEAN3_02488	SPU_002488	none	none
GLEAN3_02495	SPU_002495	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02515	SPU_002515	none	none
GLEAN3_02519	SPU_002519	none	none
GLEAN3_02523	SPU_002523	none	none
GLEAN3_02526	SPU_002526	none	none
GLEAN3_02527	SPU_002527	none	none
GLEAN3_02528	SPU_002528	none	none
GLEAN3_02540	SPU_002540	contains 4 IG superfamily motifs	none
GLEAN3_02542	SPU_002542	none	none
GLEAN3_02551	SPU_002551	contains 4 EGF_CA superfamily motifs	none
GLEAN3_02557	SPU_002557	none	none
GLEAN3_02560	SPU_002560	none	none
GLEAN3_02565	SPU_002565	none	none
GLEAN3_02568	SPU_002568	none	none
GLEAN3_02573	SPU_002573	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02583	SPU_002583	none	none
GLEAN3_02588	SPU_002588	none	none
GLEAN3_02597	SPU_002597	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02599	SPU_002599	none	none
GLEAN3_02620	SPU_002620	contains 2 TPR superfamily motifs	none
GLEAN3_02629	SPU_002629	contains MesJ domain	none
GLEAN3_02640	SPU_002640	contains COG2433 domain	none
GLEAN3_02642	SPU_002642	contains ArgE domain	none
GLEAN3_02643	SPU_002643	none	none
GLEAN3_02652	SPU_002652	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02672	SPU_002672	none	none
GLEAN3_02675	SPU_002675	contains 3 MAM superfamily motifs	none
GLEAN3_02678	SPU_002678	none	none
GLEAN3_02684	SPU_002684	contains 2 MFS superfamily motifs	none
GLEAN3_02689	SPU_002689	contains 3 zf-C2HC superfamily motifs and Myosin_tail_1 domain	none
GLEAN3_02710	SPU_002710	none	none
GLEAN3_02716	SPU_002716	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02726	SPU_002726	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02729	SPU_002729	contains RecD domain	none
GLEAN3_02731	SPU_002731	none	none
GLEAN3_02739	SPU_002739	none	none
GLEAN3_02746	SPU_002746	contains 2 EGF_CA superfamily motifs	none
GLEAN3_02751	SPU_002751	contains 2 CCP superfamily motifs	none
GLEAN3_02753	SPU_002753	none	none
GLEAN3_02761	SPU_002761	none	none
GLEAN3_02762	SPU_002762	contains 2 ANK superfamily motifs	none
GLEAN3_02768	SPU_002768	none	none
GLEAN3_02775	SPU_002775	none	none
GLEAN3_02781	SPU_002781	none	none
GLEAN3_02791	SPU_002791	none	none
GLEAN3_02799	SPU_002799	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02814	SPU_002814	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02826	SPU_002826	contains 2 IG superfamily motifs	none
GLEAN3_02834	SPU_002834	contains 2 FA58C superfamily motifs	none
GLEAN3_02838	SPU_002838	none	none
GLEAN3_02841	SPU_002841	none	none
GLEAN3_02845	SPU_002845	none	none
GLEAN3_02850	SPU_002850	none	none
GLEAN3_02856	SPU_002856	none	none
GLEAN3_02897	SPU_002897	none	none
GLEAN3_02906	SPU_002906	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02913	SPU_002913	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02916	SPU_002916	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02918	SPU_002918	none	none
GLEAN3_02920	SPU_002920	contains Smc domain	none
GLEAN3_02934	SPU_002934	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02935	SPU_002935	contains COG4886 domain	none
GLEAN3_02940	SPU_002940	none	none
GLEAN3_02949	SPU_002949	none	none
GLEAN3_02956	SPU_002956	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02967	SPU_002967	none	none
GLEAN3_02976	SPU_002976	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02985	SPU_002985	none	none
GLEAN3_02999	SPU_002999	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03002	SPU_003002	none	none
GLEAN3_03003	SPU_003003	none	none
GLEAN3_03004	SPU_003004	none	none
GLEAN3_03011	SPU_003011	none	none
GLEAN3_03012	SPU_003012	none	none
GLEAN3_03013	SPU_003013	none	none
GLEAN3_03014	SPU_003014	none	none
GLEAN3_03015	SPU_003015	none	none
GLEAN3_03016	SPU_003016	none	none
GLEAN3_03018	SPU_003018	none	none
GLEAN3_03019	SPU_003019	none	none
GLEAN3_03020	SPU_003020	none	none
GLEAN3_03024	SPU_003024	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03036	SPU_003036	none	none
GLEAN3_03044	SPU_003044	none	none
GLEAN3_03053	SPU_003053	contains RimI domain	none
GLEAN3_03058	SPU_003058	none	none
GLEAN3_03066	SPU_003066	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03068	SPU_003068	none	none
GLEAN3_03074	SPU_003074	contains COG4886 domain	none
GLEAN3_03079	SPU_003079	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03090	SPU_003090	none	none
GLEAN3_03096	SPU_003096	contains Herpes_BLLF1 domain	none
GLEAN3_03097	SPU_003097	none	none
GLEAN3_03098	SPU_003098	none	none
GLEAN3_03105	SPU_003105	none	none
GLEAN3_03112	SPU_003112	none	none
GLEAN3_03118	SPU_003118	contains 2 HYR superfamily motifs	none
GLEAN3_03121	SPU_003121	none	none
GLEAN3_03125	SPU_003125	none	none
GLEAN3_03128	SPU_003128	none	none
GLEAN3_03133	SPU_003133	none	none
GLEAN3_03139	SPU_003139	none	none
GLEAN3_03143	SPU_003143	none	none
GLEAN3_03147	SPU_003147	none	none
GLEAN3_03150	SPU_003150	none	none
GLEAN3_03155	SPU_003155	contains PAT1 domain	none
GLEAN3_03173	SPU_003173	none	none
GLEAN3_03178	SPU_003178	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_03180	SPU_003180	contains 2 class_II_aaRS-like_core superfamily motifs	none
GLEAN3_03187	SPU_003187	probable assembly chimera	none
GLEAN3_03188	SPU_003188	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_03193	SPU_003193	none	none
GLEAN3_03201	SPU_003201	none	none
GLEAN3_03204	SPU_003204	none	none
GLEAN3_03205	SPU_003205	contains PilF domain	none
GLEAN3_03208	SPU_003208	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03212	SPU_003212	none	none
GLEAN3_03215	SPU_003215	none	none
GLEAN3_03237	SPU_003237	none	none
GLEAN3_03238	SPU_003238	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03254	SPU_003254	none	none
GLEAN3_03255	SPU_003255	none	none
GLEAN3_03270	SPU_003270	none	none
GLEAN3_03271	SPU_003271	none	none
GLEAN3_03275	SPU_003275	none	none
GLEAN3_03279	SPU_003279	none	none
GLEAN3_03288	SPU_003288	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03301	SPU_003301	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03306	SPU_003306	none	none
GLEAN3_03309	SPU_003309	none	none
GLEAN3_03327	SPU_003327	none	none
GLEAN3_03336	SPU_003336	none	none
GLEAN3_03337	SPU_003337	contains 2 DEXDc superfamily motifs	none
GLEAN3_03338	SPU_003338	none	none
GLEAN3_03339	SPU_003339	none	none
GLEAN3_03340	SPU_003340	none	none
GLEAN3_03341	SPU_003341	none	none
GLEAN3_03342	SPU_003342	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03352	SPU_003352	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03353	SPU_003353	none	none
GLEAN3_03354	SPU_003354	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03359	SPU_003359	contains 2 IG superfamily motifs	none
GLEAN3_03361	SPU_003361	none	none
GLEAN3_03379	SPU_003379	none	none
GLEAN3_03383	SPU_003383	none	none
GLEAN3_03385	SPU_003385	contains 2 ANK superfamily motifs and Arp domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03386	SPU_003386	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03391	SPU_003391	none	none
GLEAN3_03393	SPU_003393	none	none
GLEAN3_03395	SPU_003395	none	none
GLEAN3_03409	SPU_003409	none	none
GLEAN3_03410	SPU_003410	none	none
GLEAN3_03420	SPU_003420	none	none
GLEAN3_03423	SPU_003423	none	none
GLEAN3_03427	SPU_003427	contains 3 IG superfamily motifs	none
GLEAN3_03442	SPU_003442	none	none
GLEAN3_03471	SPU_003471	none	none
GLEAN3_03482	SPU_003482	none	none
GLEAN3_03484	SPU_003484	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03489	SPU_003489	none	none
GLEAN3_03492	SPU_003492	none	none
GLEAN3_03493	SPU_003493	none	none
GLEAN3_03498	SPU_003498	none	none
GLEAN3_03502	SPU_003502	none	none
GLEAN3_03503	SPU_003503	contains PRK05771	none
GLEAN3_03519	SPU_003519	contains SSM4 domain	none
GLEAN3_03523	SPU_003523	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03525	SPU_003525	none	none
GLEAN3_03527	SPU_003527	none	none
GLEAN3_03535	SPU_003535	none	none
GLEAN3_03543	SPU_003543	none	none
GLEAN3_03546	SPU_003546	none	none
GLEAN3_03562	SPU_003562	contains 2 FNR-like superfamily motifs	none
GLEAN3_03571	SPU_003571	none	none
GLEAN3_03572	SPU_003572	contains Arp domain	none
GLEAN3_03577	SPU_003577	contains 2 FA58C superfamily motifs	none
GLEAN3_03590	SPU_003590	none	none
GLEAN3_03629	SPU_003629	probable assembly chimera	none
GLEAN3_03636	SPU_003636	none	none
GLEAN3_03638	SPU_003638	none	none
GLEAN3_03645	SPU_003645	none	none
GLEAN3_03647	SPU_003647	none	none
GLEAN3_03648	SPU_003648	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_03650	SPU_003650	contains 2 DnaQ-like_exo superfamily and PolB domain	none
GLEAN3_03652	SPU_003652	contains DUF1394 domain	none
GLEAN3_03656	SPU_003656	none	none
GLEAN3_03661	SPU_003661	none	none
GLEAN3_03662	SPU_003662	probable assembly chimera	none
GLEAN3_03665	SPU_003665	none	none
GLEAN3_03668	SPU_003668	none	none
GLEAN3_03670	SPU_003670	none	none
GLEAN3_03673	SPU_003673	none	none
GLEAN3_03682	SPU_003682	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03686	SPU_003686	none	none
GLEAN3_03688	SPU_003688	none	none
GLEAN3_03693	SPU_003693	none	none
GLEAN3_03695	SPU_003695	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03721	SPU_003721	none	none
GLEAN3_03727	SPU_003727	none	none
GLEAN3_03734	SPU_003734	contains PRK05771 domain and UhpB domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03737	SPU_003737	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03738	SPU_003738	contains SNC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03739	SPU_003739	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03745	SPU_003745	contains Sulfotransfer_1 domain	none
GLEAN3_03747	SPU_003747	none	none
GLEAN3_03751	SPU_003751	contains 7 Ldl_recept_b superfamily motifs	none
GLEAN3_03753	SPU_003753	none	none
GLEAN3_03756	SPU_003756	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03764	SPU_003764	none	none
GLEAN3_03765	SPU_003765	none	none
GLEAN3_03771	SPU_003771	none	none
GLEAN3_03779	SPU_003779	contains COG4886 domain	none
GLEAN3_03780	SPU_003780	none	none
GLEAN3_03789	SPU_003789	none	none
GLEAN3_03794	SPU_003794	none	none
GLEAN3_03801	SPU_003801	contains SbcC domain	none
GLEAN3_03808	SPU_003808	none	none
GLEAN3_03813	SPU_003813	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_03817	SPU_003817	none	none
GLEAN3_03819	SPU_003819	none	none
GLEAN3_03820	SPU_003820	none	none
GLEAN3_03822	SPU_003822	none	none
GLEAN3_03838	SPU_003838	contains ATS1 domain	none
GLEAN3_03858	SPU_003858	contains Sulfotransfer_1 domain	none
GLEAN3_03871	SPU_003871	contains 2 PHD superfamily motifs	none
GLEAN3_03875	SPU_003875	none	none
GLEAN3_03877	SPU_003877	none	none
GLEAN3_03879	SPU_003879	contains AcuC domain	none
GLEAN3_03880	SPU_003880	none	none
GLEAN3_03883	SPU_003883	none	none
GLEAN3_03885	SPU_003885	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03888	SPU_003888	contains Smc domain	none
GLEAN3_03889	SPU_003889	contains Sulfotransfer_1 domain	none
GLEAN3_03909	SPU_003909	none	none
GLEAN3_03921	SPU_003921	none	none
GLEAN3_03940	SPU_003940	none	none
GLEAN3_03947	SPU_003947	none	none
GLEAN3_03948	SPU_003948	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03950	SPU_003950	contains 2 ANK superfamily motifs	none
GLEAN3_03952	SPU_003952	contains SMC_N domain and SGL domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03953	SPU_003953	none	none
GLEAN3_03960	SPU_003960	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03968	SPU_003968	none	none
GLEAN3_03980	SPU_003980	none	none
GLEAN3_03986	SPU_003986	none	none
GLEAN3_03991	SPU_003991	none	none
GLEAN3_03997	SPU_003997	none	none
GLEAN3_04002	SPU_004002	none	none
GLEAN3_04003	SPU_004003	none	none
GLEAN3_04012	SPU_004012	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_04034	SPU_004034	none	none
GLEAN3_04044	SPU_004044	none	none
GLEAN3_04047	SPU_004047	none	none
GLEAN3_04052	SPU_004052	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04054	SPU_004054	contains DnaJ domain. Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_04058	SPU_004058	none	none
GLEAN3_04060	SPU_004060	none	none
GLEAN3_04063	SPU_004063	contains ATS1 domain	none
GLEAN3_04081	SPU_004081	contains RecQ domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04082	SPU_004082	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04090	SPU_004090	SPP	none
GLEAN3_04093	SPU_004093	none	none
GLEAN3_04098	SPU_004098	none	none
GLEAN3_04099	SPU_004099	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04108	SPU_004108	contains Uvr domain	none
GLEAN3_04126	SPU_004126	contains 3 RRM superfamily motifs	none
GLEAN3_04129	SPU_004129	none	none
GLEAN3_04132	SPU_004132	none	none
GLEAN3_04140	SPU_004140	none	none
GLEAN3_04168	SPU_004168	none	none
GLEAN3_04175	SPU_004175	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_04184	SPU_004184	none	none
GLEAN3_04186	SPU_004186	contains PnbA domain	none
GLEAN3_04187	SPU_004187	contains AMP-binding domain	none
GLEAN3_04199	SPU_004199	none	none
GLEAN3_04201	SPU_004201	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04202	SPU_004202	none	none
GLEAN3_04204	SPU_004204	none	none
GLEAN3_04205	SPU_004205	none	none
GLEAN3_04214	SPU_004214	none	none
GLEAN3_04221	SPU_004221	none	none
GLEAN3_04222	SPU_004222	none	none
GLEAN3_04223	SPU_004223	none	none
GLEAN3_04231	SPU_004231	contains 2 LamG superfamily motifs	none
GLEAN3_04234	SPU_004234	none	none
GLEAN3_04248	SPU_004248	none	none
GLEAN3_04253	SPU_004253	none	none
GLEAN3_04254	SPU_004254	none	none
GLEAN3_04264	SPU_004264	none	none
GLEAN3_04270	SPU_004270	none	none
GLEAN3_04273	SPU_004273	none	none
GLEAN3_04274	SPU_004274	none	none
GLEAN3_04288	SPU_004288	none	none
GLEAN3_04297	SPU_004297	none	none
GLEAN3_04302	SPU_004302	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04307	SPU_004307	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04308	SPU_004308	none	none
GLEAN3_04316	SPU_004316	none	none
GLEAN3_04331	SPU_004331	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04334	SPU_004334	none	none
GLEAN3_04335	SPU_004335		none
GLEAN3_04337	SPU_004337	none	none
GLEAN3_04351	SPU_004351	contains AF-4 domain	none
GLEAN3_04355	SPU_004355	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04364	SPU_004364	none	none
GLEAN3_04368	SPU_004368	none	none
GLEAN3_04369	SPU_004369	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04370	SPU_004370	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04374	SPU_004374	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04383	SPU_004383	none	none
GLEAN3_04387	SPU_004387	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04392	SPU_004392	none	none
GLEAN3_04400	SPU_004400	contains 2 KR superfamily motifs	none
GLEAN3_04410	SPU_004410	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04418	SPU_004418	none	none
GLEAN3_04422	SPU_004422	contains 3 PDZ superfamily motifs	none
GLEAN3_04428	SPU_004428	none	none
GLEAN3_04429	SPU_004429	none	none
GLEAN3_04430	SPU_004430	none	none
GLEAN3_04439	SPU_004439	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04441	SPU_004441	none	none
GLEAN3_04457	SPU_004457	none	none
GLEAN3_04460	SPU_004460	none	none
GLEAN3_04465	SPU_004465	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04469	SPU_004469	none	none
GLEAN3_04471	SPU_004471	none	none
GLEAN3_04472	SPU_004472	contains PAT1 domain. probable assembly chimera.	none
GLEAN3_04475	SPU_004475	none	none
GLEAN3_04479	SPU_004479	none	none
GLEAN3_04480	SPU_004480	contains UDPGT domain	none
GLEAN3_04481	SPU_004481	contains UDPGT domain	none
GLEAN3_04484	SPU_004484	none	none
GLEAN3_04488	SPU_004488	contains 2 LDLa superfamily motifs	none
GLEAN3_04491	SPU_004491	contains 3 MAM superfamily motifs	none
GLEAN3_04497	SPU_004497	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_04502	SPU_004502	none	none
GLEAN3_04509	SPU_004509	contains Filament domain	none
GLEAN3_04514	SPU_004514	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04515	SPU_004515	none	none
GLEAN3_04516	SPU_004516	contains COG4886 domain	none
GLEAN3_04520	SPU_004520	none	none
GLEAN3_04523	SPU_004523	none	none
GLEAN3_04530	SPU_004530	contains 3 EGF_CA superfamily motifs	none
GLEAN3_04532	SPU_004532	none	none
GLEAN3_04538	SPU_004538	none	none
GLEAN3_04539	SPU_004539	none	none
GLEAN3_04542	SPU_004542	contains 2 EGF_CA superfamily motifs	none
GLEAN3_04560	SPU_004560	probable assembly chimera	none
GLEAN3_04562	SPU_004562	none	none
GLEAN3_04565	SPU_004565	contains 2 Cupin superfamily motifs	none
GLEAN3_04568	SPU_004568		none
GLEAN3_04571	SPU_004571	none	none
GLEAN3_04574	SPU_004574	none	none
GLEAN3_04576	SPU_004576	none	none
GLEAN3_04577	SPU_004577	none	none
GLEAN3_04578	SPU_004578	none	none
GLEAN3_04579	SPU_004579	none	none
GLEAN3_04580	SPU_004580	none	none
GLEAN3_04581	SPU_004581	none	none
GLEAN3_04583	SPU_004583	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04591	SPU_004591	none	none
GLEAN3_04617	SPU_004617	none	none
GLEAN3_04618	SPU_004618	contains 3 KR superfamily motifs	none
GLEAN3_04629	SPU_004629	none	none
GLEAN3_04631	SPU_004631	none	none
GLEAN3_04635	SPU_004635	probable assembly chimera	none
GLEAN3_04636	SPU_004636	contains 2 MAM superfamily motifs	none
GLEAN3_04677	SPU_004677	none	none
GLEAN3_04681	SPU_004681	none	none
GLEAN3_04682	SPU_004682	contains Adaptin_N domain	none
GLEAN3_04686	SPU_004686	contains 2 FA58C superfamily motifs	none
GLEAN3_04695	SPU_004695	none	none
GLEAN3_04703	SPU_004703	contains DLH domain	none
GLEAN3_04706	SPU_004706	contains 2 WSC superfamily motifs	none
GLEAN3_04707	SPU_004707	contains 2 WSC superfamily motifs	none
GLEAN3_04709	SPU_004709	none	none
GLEAN3_04712	SPU_004712	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04727	SPU_004727	none	none
GLEAN3_04728	SPU_004728	none	none
GLEAN3_04729	SPU_004729	none	none
GLEAN3_04730	SPU_004730	none	none
GLEAN3_04732	SPU_004732	none	none
GLEAN3_04733	SPU_004733	none	none
GLEAN3_04749	SPU_004749	none	none
GLEAN3_04752	SPU_004752	none	none
GLEAN3_04754	SPU_004754	none	none
GLEAN3_04757	SPU_004757	contains PRK05431 domain	none
GLEAN3_04761	SPU_004761	contains PnbA domain	none
GLEAN3_04764	SPU_004764	none	none
GLEAN3_04767	SPU_004767	none	none
GLEAN3_04768	SPU_004768	none	none
GLEAN3_04774	SPU_004774	none	none
GLEAN3_04775	SPU_004775	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04780	SPU_004780	contains PRK06116 domain. probable assembly chimera.	none
GLEAN3_04793	SPU_004793	none	none
GLEAN3_04803	SPU_004803	none	none
GLEAN3_04804	SPU_004804	none	none
GLEAN3_04819	SPU_004819	none	none
GLEAN3_04820	SPU_004820	none	none
GLEAN3_04825	SPU_004825	none	none
GLEAN3_04828	SPU_004828	none	none
GLEAN3_04829	SPU_004829	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04843	SPU_004843	probable assembly chimera	none
GLEAN3_04851	SPU_004851	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04871	SPU_004871	none	none
GLEAN3_04873	SPU_004873	none	none
GLEAN3_04878	SPU_004878	none	none
GLEAN3_04879	SPU_004879	none	none
GLEAN3_04882	SPU_004882	none	none
GLEAN3_04883	SPU_004883	none	none
GLEAN3_04884	SPU_004884	none	none
GLEAN3_04889	SPU_004889	none	none
GLEAN3_04900	SPU_004900	none	none
GLEAN3_04904	SPU_004904	homologous to only 1 hypothetical Branchiostoma floridae protein. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04906	SPU_004906	homologous to only 1 hypothetical Branchiostoma floridae protein. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04921	SPU_004921	none	none
GLEAN3_04925	SPU_004925	none	none
GLEAN3_04931	SPU_004931	contains 2 IG superfamily motifs	none
GLEAN3_04936	SPU_004936	none	none
GLEAN3_04942	SPU_004942	probable assembly chimera	none
GLEAN3_04948	SPU_004948	none	none
GLEAN3_04979	SPU_004979	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04981	SPU_004981	none	none
GLEAN3_05010	SPU_005010	none	none
GLEAN3_05012	SPU_005012	contains 2 CCP superfamily motifs	none
GLEAN3_05014	SPU_005014	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05016	SPU_005016	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05018	SPU_005018	none	none
GLEAN3_05025	SPU_005025	contains MFS_1 domain	none
GLEAN3_05040	SPU_005040	none	none
GLEAN3_05042	SPU_005042	none	none
GLEAN3_05043	SPU_005043	contains V-set domain	none
GLEAN3_05046	SPU_005046	contains SMC_N domain	none
GLEAN3_05052	SPU_005052	none	none
GLEAN3_05058	SPU_005058	none	none
GLEAN3_05064	SPU_005064	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05069	SPU_005069	contains 2 IG superfamily motifs	none
GLEAN3_05072	SPU_005072	none	none
GLEAN3_05076	SPU_005076	none	none
GLEAN3_05080	SPU_005080	none	none
GLEAN3_05087	SPU_005087	none	none
GLEAN3_05093	SPU_005093	contains Sulfotransfer_1 domain	none
GLEAN3_05098	SPU_005098	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05101	SPU_005101	contains MFS domain	none
GLEAN3_05108	SPU_005108	none	none
GLEAN3_05109	SPU_005109	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05110	SPU_005110	none	none
GLEAN3_05116	SPU_005116	none	none
GLEAN3_05121	SPU_005121	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05128	SPU_005128	none	none
GLEAN3_05130	SPU_005130	contains 2 HYR superfamily motifs. probable assembly chimera.	none
GLEAN3_05141	SPU_005141	none	none
GLEAN3_05159	SPU_005159	contains Sultotransfer_1 domain	none
GLEAN3_05163	SPU_005163	contains Ion_trans domain	none
GLEAN3_05165	SPU_005165	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05168	SPU_005168	contains PnbA domain	none
GLEAN3_05179	SPU_005179	contains COG3415 domain	none
GLEAN3_05180	SPU_005180	none	none
GLEAN3_05181	SPU_005181	none	none
GLEAN3_05184	SPU_005184	none	none
GLEAN3_05185	SPU_005185	none	none
GLEAN3_05191	SPU_005191	contains 2 ARM superfamily motifs	none
GLEAN3_05197	SPU_005197	none	none
GLEAN3_05199	SPU_005199	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05206	SPU_005206	contains 2 EGF_CA superfamily	none
GLEAN3_05218	SPU_005218	contains YidC domain	none
GLEAN3_05227	SPU_005227	none	none
GLEAN3_05230	SPU_005230	none	none
GLEAN3_05233	SPU_005233	none	none
GLEAN3_05242	SPU_005242	none	none
GLEAN3_05263	SPU_005263	contains ACTIN domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05278	SPU_005278	none	none
GLEAN3_05279	SPU_005279	none	none
GLEAN3_05281	SPU_005281	none	none
GLEAN3_05282	SPU_005282	none	none
GLEAN3_05283	SPU_005283	contains S_TKc domain	none
GLEAN3_05284	SPU_005284	none	none
GLEAN3_05286	SPU_005286	none	none
GLEAN3_05289	SPU_005289	poor sequence data: ~25% of amino acids are X	none
GLEAN3_05294	SPU_005294	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05297	SPU_005297	contains PRK00409 domain	none
GLEAN3_05299	SPU_005299	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_05304	SPU_005304	none	none
GLEAN3_05310	SPU_005310	homologous to bacterial putative proteins	none
GLEAN3_05322	SPU_005322	none	none
GLEAN3_05330	SPU_005330	contains OATP domain	none
GLEAN3_05341	SPU_005341	none	none
GLEAN3_05373	SPU_005373	contains Sulfotransfer_1 domain	none
GLEAN3_05390	SPU_005390	none	none
GLEAN3_05391	SPU_005391	none	none
GLEAN3_05392	SPU_005392	none	none
GLEAN3_05395	SPU_005395	none	none
GLEAN3_05404	SPU_005404	none	none
GLEAN3_05408	SPU_005408	none	none
GLEAN3_05417	SPU_005417	none	none
GLEAN3_05421	SPU_005421	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_05425	SPU_005425	none	none
GLEAN3_05440	SPU_005440	none	none
GLEAN3_05448	SPU_005448	contains ECM4 domain	none
GLEAN3_05449	SPU_005449	none	none
GLEAN3_05455	SPU_005455	none	none
GLEAN3_05471	SPU_005471	none	none
GLEAN3_05473	SPU_005473	none	none
GLEAN3_05474	SPU_005474	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05477	SPU_005477	contains Integrin_alpha2 domain	none
GLEAN3_05479	SPU_005479	none	none
GLEAN3_05482	SPU_005482	none	none
GLEAN3_05490	SPU_005490	none	none
GLEAN3_05494	SPU_005494	none	none
GLEAN3_05496	SPU_005496	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_05497	SPU_005497	none	none
GLEAN3_05504	SPU_005504	none	none
GLEAN3_05505	SPU_005505	none	none
GLEAN3_05506	SPU_005506	none	none
GLEAN3_05510	SPU_005510	none	none
GLEAN3_05525	SPU_005525	none	none
GLEAN3_05536	SPU_005536	none	none
GLEAN3_05542	SPU_005542	none	none
GLEAN3_05546	SPU_005546	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05548	SPU_005548	contains COG4886 domain	none
GLEAN3_05555	SPU_005555	none	none
GLEAN3_05578	SPU_005578	none	none
GLEAN3_05584	SPU_005584	none	none
GLEAN3_05586	SPU_005586	none	none
GLEAN3_05589	SPU_005589	none	none
GLEAN3_05608	SPU_005608	none	none
GLEAN3_05614	SPU_005614	none	none
GLEAN3_05615	SPU_005615	none	none
GLEAN3_05617	SPU_005617	contains 2 MFS superfamily motifs	none
GLEAN3_05631	SPU_005631	contains RhaT domain	none
GLEAN3_05632	SPU_005632	contains 2 DUF6 superfamily motifs	none
GLEAN3_05633	SPU_005633	contains RhaT domain	none
GLEAN3_05634	SPU_005634	contains RhaT domain	none
GLEAN3_05635	SPU_005635	contains RhaT domain	none
GLEAN3_05647	SPU_005647	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05648	SPU_005648	contains S-methyl_trans domain	none
GLEAN3_05650	SPU_005650	none	none
GLEAN3_05656	SPU_005656	none	none
GLEAN3_05660	SPU_005660	probable assembly chimera	none
GLEAN3_05664	SPU_005664	contains 2 DM13 superfamily motifs	none
GLEAN3_05671	SPU_005671	none	none
GLEAN3_05678	SPU_005678	none	none
GLEAN3_05682	SPU_005682	contains 3 HYR superfamily motifs	none
GLEAN3_05683	SPU_005683	none	none
GLEAN3_05685	SPU_005685	contains 6 rve superfamily motifs	none
GLEAN3_05690	SPU_005690	none	none
GLEAN3_05692	SPU_005692	none	none
GLEAN3_05693	SPU_005693	none	none
GLEAN3_05705	SPU_005705	none	none
GLEAN3_05711	SPU_005711	contains PRK05431 domain	none
GLEAN3_05721	SPU_005721	none	none
GLEAN3_05724	SPU_005724	contains 5 CUB superfamily motifs	none
GLEAN3_05749	SPU_005749	none	none
GLEAN3_05751	SPU_005751	none	none
GLEAN3_05756	SPU_005756	none	none
GLEAN3_05763	SPU_005763	none	none
GLEAN3_05764	SPU_005764	contains RAD18 domain	none
GLEAN3_05770	SPU_005770	contains PRK07003 domain	none
GLEAN3_05778	SPU_005778	none	none
GLEAN3_05790	SPU_005790	none	none
GLEAN3_05793	SPU_005793	none	none
GLEAN3_05801	SPU_005801	none	none
GLEAN3_05802	SPU_005802	none	none
GLEAN3_05804	SPU_005804	none	none
GLEAN3_05809	SPU_005809	contains 2 MAM superfamily motifs	none
GLEAN3_05812	SPU_005812	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05815	SPU_005815	none	none
GLEAN3_05816	SPU_005816	none	none
GLEAN3_05818	SPU_005818	none	none
GLEAN3_05819	SPU_005819	none	none
GLEAN3_05822	SPU_005822	none	none
GLEAN3_05823	SPU_005823	none	none
GLEAN3_05824	SPU_005824	contains SMC_N domain	none
GLEAN3_05827	SPU_005827	contains PRK04778 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05841	SPU_005841	none	none
GLEAN3_05846	SPU_005846	none	none
GLEAN3_05857	SPU_005857	none	none
GLEAN3_05861	SPU_005861	none	none
GLEAN3_05879	SPU_005879	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05883	SPU_005883	none	none
GLEAN3_05889	SPU_005889	none	none
GLEAN3_05891	SPU_005891	none	none
GLEAN3_05898	SPU_005898	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05902	SPU_005902	contains V-set domain	none
GLEAN3_05907	SPU_005907	none	none
GLEAN3_05918	SPU_005918	contains 2 MAM superfamily motifs	none
GLEAN3_05921	SPU_005921	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05922	SPU_005922	none	none
GLEAN3_05925	SPU_005925	none	none
GLEAN3_05727	SPU_005727	contains 2 MAM superfamily motifs	none
GLEAN3_05929	SPU_005929	none	none
GLEAN3_05937	SPU_005937	none	none
GLEAN3_05939	SPU_005939	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05940	SPU_005940	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05946	SPU_005946	none	none
GLEAN3_05949	SPU_005949	none	none
GLEAN3_05953	SPU_005953	contains 2 Periplasmic_Binding_Protein_type_1 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05959	SPU_005959	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05967	SPU_005967	none	none
GLEAN3_05981	SPU_005981	none	none
GLEAN3_05985	SPU_005985	none	none
GLEAN3_06007	SPU_006007	contains COG0790 domain	none
GLEAN3_06009	SPU_006009	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06013	SPU_006013	none	none
GLEAN3_06015	SPU_006015	none	none
GLEAN3_06023	SPU_006023	contains 3 IG superfamily motifs	none
GLEAN3_06032	SPU_006032	probable assembly chimera	none
GLEAN3_06036	SPU_006036	probable assembly chimera	none
GLEAN3_06039	SPU_006039	contains COG4486 domain	none
GLEAN3_06049	SPU_006049	none	none
GLEAN3_06050	SPU_006050	contains COG3415 domain	none
GLEAN3_06075	SPU_006075	none	none
GLEAN3_06077	SPU_006077	none	none
GLEAN3_06078	SPU_006078	none	none
GLEAN3_06079	SPU_006079	none	none
GLEAN3_06080	SPU_006080	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06081	SPU_006081	none	none
GLEAN3_06090	SPU_006090	none	none
GLEAN3_06091	SPU_006091	none	none
GLEAN3_06093	SPU_006093	none	none
GLEAN3_06094	SPU_006094	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_06095	SPU_006095	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_06100	SPU_006100	none	none
GLEAN3_06104	SPU_006104	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06108	SPU_006108	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06113	SPU_006113	contains MDN1 domain	none
GLEAN3_06115	SPU_006115	none	none
GLEAN3_06120	SPU_006120	none	none
GLEAN3_06121	SPU_006121	none	none
GLEAN3_06127	SPU_006127	contains GCC2_GCC3 superfamily motifs	none
GLEAN3_06130	SPU_006130	none	none
GLEAN3_06133	SPU_006133	contains SNC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06137	SPU_006137	none	none
GLEAN3_06140	SPU_006140	none	none
GLEAN3_06146	SPU_006146	none	none
GLEAN3_06147	SPU_006147	none	none
GLEAN3_06167	SPU_006167	none	none
GLEAN3_06168	SPU_006168	none	none
GLEAN3_06171	SPU_006171	none	none
GLEAN3_06179	SPU_006179	none	none
GLEAN3_06182	SPU_006182	none	none
GLEAN3_06187	SPU_006187	contains Sulfotransfer_1 domain	none
GLEAN3_06190	SPU_006190	none	none
GLEAN3_06193	SPU_006193	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06194	SPU_006194	contains Transposase domain	none
GLEAN3_06196	SPU_006196	none	none
GLEAN3_06208	SPU_006208	none	none
GLEAN3_06227	SPU_006227	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06231	SPU_006231	contains 3 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_06233	SPU_006233	contains 2 Gelsolin superfamily motifs	none
GLEAN3_06240	SPU_006240	contains PEX10 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06246	SPU_006246	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06247	SPU_006247	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06248	SPU_006248	none	none
GLEAN3_06249	SPU_006249	none	none
GLEAN3_06250	SPU_006250	none	none
GLEAN3_06251	SPU_006251	none	none
GLEAN3_06252	SPU_006252	none	none
GLEAN3_06253	SPU_006253	none	none
GLEAN3_06255	SPU_006255	probable assembly chimera	none
GLEAN3_06256	SPU_006256	none	none
GLEAN3_06266	SPU_006266	none	none
GLEAN3_06269	SPU_006269	none	none
GLEAN3_06273	SPU_006273	none	none
GLEAN3_06277	SPU_006277	none	none
GLEAN3_06281	SPU_006281	none	none
GLEAN3_06282	SPU_006282	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06288	SPU_006288	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06289	SPU_006289	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06292	SPU_006292	contains SMC_N domain	none
GLEAN3_06295	SPU_006295	none	none
GLEAN3_06304	SPU_006304	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06307	SPU_006307	contains AslA domain	none
GLEAN3_06309	SPU_006309	none	none
GLEAN3_06314	SPU_006314	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06316	SPU_006316	none	none
GLEAN3_06321	SPU_006321	none	none
GLEAN3_06325	SPU_006325	none	none
GLEAN3_06336	SPU_006336	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06337	SPU_006337	none	none
GLEAN3_06356	SPU_006356	none	none
GLEAN3_06360	SPU_006360	none	none
GLEAN3_06371	SPU_006371	none	none
GLEAN3_06376	SPU_006376	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06384	SPU_006384	none	none
GLEAN3_06407	SPU_006407	contains Smc domain	none
GLEAN3_06408	SPU_006408	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06409	SPU_006409	none	none
GLEAN3_06411	SPU_006411	none	none
GLEAN3_06415	SPU_006415	probable assembly chimera	none
GLEAN3_06416	SPU_006416	contains 5 EGF_CA superfamily	none
GLEAN3_06422	SPU_006422	contains 2 MFS superfamily motifs	none
GLEAN3_06424	SPU_006424	none	none
GLEAN3_06428	SPU_006428	none	none
GLEAN3_06431	SPU_006431	none	none
GLEAN3_06433	SPU_006433	none	none
GLEAN3_06436	SPU_006436	contains SEC21 domain	none
GLEAN3_06441	SPU_006441	none	none
GLEAN3_06442	SPU_006442	none	none
GLEAN3_06447	SPU_006447	none	none
GLEAN3_06448	SPU_006448	none	none
GLEAN3_06449	SPU_006449	none	none
GLEAN3_06450	SPU_006450	none	none
GLEAN3_06478	SPU_006478	none	none
GLEAN3_06482	SPU_006482	none	none
GLEAN3_06500	SPU_006500	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06501	SPU_006501	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06502	SPU_006502	none	none
GLEAN3_06507	SPU_006507	none	none
GLEAN3_06512	SPU_006512	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06515	SPU_006515	poor sequence data: >80% of the amino acids are X	none
GLEAN3_06516	SPU_006516	none	none
GLEAN3_06537	SPU_006537	contains 2 MFS superfamily motifs	none
GLEAN3_06559	SPU_006559	none	none
GLEAN3_06575	SPU_006575	none	none
GLEAN3_06584	SPU_006584	none	none
GLEAN3_06593	SPU_006593	none	none
GLEAN3_06594	SPU_006594	none	none
GLEAN3_06597	SPU_006597	none	none
GLEAN3_06598	SPU_006598	none	none
GLEAN3_06600	SPU_006600	none	none
GLEAN3_06613	SPU_006613	contains 2 HYR superfamily motifs	none
GLEAN3_06616	SPU_006616	contains 3 IG superfamily motifs	none
GLEAN3_06618	SPU_006618	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06626	SPU_006626	none	none
GLEAN3_06631	SPU_006631	none	none
GLEAN3_06638	SPU_006638	none	none
GLEAN3_06641	SPU_006641	none	none
GLEAN3_06642	SPU_006642	none	none
GLEAN3_06643	SPU_006643	none	none
GLEAN3_06646	SPU_006646	none	none
GLEAN3_06652	SPU_006652	none	none
GLEAN3_06653	SPU_006653	none	none
GLEAN3_06661	SPU_006661	none	none
GLEAN3_06667	SPU_006667	contains 2 Neutralized superfamily motifs	none
GLEAN3_06668	SPU_006668	none	none
GLEAN3_06681	SPU_006681	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06687	SPU_006687	none	none
GLEAN3_06688	SPU_006688	none	none
GLEAN3_06690	SPU_006690	none	none
GLEAN3_06694	SPU_006694	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06697	SPU_006697	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06709	SPU_006709	none	none
GLEAN3_06712	SPU_006712	probable assembly chimera	none
GLEAN3_06713	SPU_006713	none	none
GLEAN3_06717	SPU_006717	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06725	SPU_006725	none	none
GLEAN3_06735	SPU_006735	none	none
GLEAN3_06747	SPU_006747	none	none
GLEAN3_06760	SPU_006760	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06765	SPU_006765	none	none
GLEAN3_06769	SPU_006769	none	none
GLEAN3_06774	SPU_006774	contains COG3391 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06775	SPU_006775	none	none
GLEAN3_06777	SPU_006777	none	none
GLEAN3_06780	SPU_006780	poor sequence data: ~40% of amino acids are X	none
GLEAN3_06796	SPU_006796	none	none
GLEAN3_06798	SPU_006798	none	none
GLEAN3_06799	SPU_006799	none	none
GLEAN3_06844	SPU_006844	none	none
GLEAN3_06854	SPU_006854	none	none
GLEAN3_06856	SPU_006856	contains 3 EGF_CA superfamily motifs	none
GLEAN3_06860	SPU_006860	none	none
GLEAN3_06867	SPU_006867	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06868	SPU_006868	none	none
GLEAN3_06873	SPU_006873	none	none
GLEAN3_06885	SPU_006885	probable assembly chimera	none
GLEAN3_06898	SPU_006898	probable assembly chimera	none
GLEAN3_06904	SPU_006904	none	none
GLEAN3_06905	SPU_006905	contains SMC_N domain	none
GLEAN3_06906	SPU_006906	none	none
GLEAN3_06907	SPU_006907	none	none
GLEAN3_06918	SPU_006918	none	none
GLEAN3_06920	SPU_006920	none	none
GLEAN3_06924	SPU_006924	none	none
GLEAN3_06928	SPU_006928	none	none
GLEAN3_06930	SPU_006930	none	none
GLEAN3_06933	SPU_006933	none	none
GLEAN3_06934	SPU_006934	contains 3 CCP superfamily motifs	none
GLEAN3_06940	SPU_006940	none	none
GLEAN3_06957	SPU_006957	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_06960	SPU_006960	none	none
GLEAN3_06962	SPU_006962	none	none
GLEAN3_06964	SPU_006964	none	none
GLEAN3_06966	SPU_006966	none	none
GLEAN3_06975	SPU_006975	none	none
GLEAN3_06978	SPU_006978	contains 2 S1-like superfamily motifs	none
GLEAN3_06996	SPU_006996	none	none
GLEAN3_06999	SPU_006999	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07010	SPU_007010	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07016	SPU_007016	contains 2 IG superfamily motifs	none
GLEAN3_07025	SPU_007025	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07037	SPU_007037	none	none
GLEAN3_07039	SPU_007039	contains 2 FA58C superfamily motifs	none
GLEAN3_07053	SPU_007053	none	none
GLEAN3_07054	SPU_007054	contains ArgK domain	none
GLEAN3_07058	SPU_007058	none	none
GLEAN3_07069	SPU_007069	none	none
GLEAN3_07076	SPU_007076	contains 3 CCP superfamily motifs	none
GLEAN3_07089	SPU_007089	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07090	SPU_007090	none	none
GLEAN3_07094	SPU_007094	none	none
GLEAN3_07120	SPU_007120	none	none
GLEAN3_07124	SPU_007124	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_07128	SPU_007128	none	none
GLEAN3_07133	SPU_007133	none	none
GLEAN3_07136	SPU_007136	none	none
GLEAN3_07144	SPU_007144	none	none
GLEAN3_07146	SPU_007146	none	none
GLEAN3_07156	SPU_007156	none	none
GLEAN3_07162	SPU_007162	none	none
GLEAN3_07173	SPU_007173	contains DUF2146 domain	none
GLEAN3_07177	SPU_007177	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07179	SPU_007179	contains Ins134_P3_kin domain	none
GLEAN3_07197	SPU_007197	none	none
GLEAN3_07198	SPU_007198	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07199	SPU_007199	none	none
GLEAN3_07201	SPU_007201	none	none
GLEAN3_07207	SPU_007207	none	none
GLEAN3_07212	SPU_007212	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07220	SPU_007220	none	none
GLEAN3_07223	SPU_007223	none	none
GLEAN3_07232	SPU_007232	none	none
GLEAN3_07234	SPU_007234	none	none
GLEAN3_07236	SPU_007236	none	none
GLEAN3_07250	SPU_007250	contains FrhG domain	none
GLEAN3_07252	SPU_007252	none	none
GLEAN3_07255	SPU_007255	none	none
GLEAN3_07256	SPU_007256	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07266	SPU_007266	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07269	SPU_007269	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07271	SPU_007271	none	none
GLEAN3_07278	SPU_007278	none	none
GLEAN3_07288	SPU_007288	none	none
GLEAN3_07290	SPU_007290	none	none
GLEAN3_07293	SPU_007293	none	none
GLEAN3_07295	SPU_007295	none	none
GLEAN3_07303	SPU_007303	contains SpoVK domain	none
GLEAN3_07304	SPU_007304	none	none
GLEAN3_07305	SPU_007305	none	none
GLEAN3_07316	SPU_007316	none	none
GLEAN3_07329	SPU_007329	none	none
GLEAN3_07334	SPU_007334	none	none
GLEAN3_07338	SPU_007338	none	none
GLEAN3_07345	SPU_007345	none	none
GLEAN3_07348	SPU_007348	contains COG5635 domain	none
GLEAN3_07351	SPU_007351	none	none
GLEAN3_07356	SPU_007356	none	none
GLEAN3_07362	SPU_007362	none	none
GLEAN3_07365	SPU_007365	none	none
GLEAN3_07367	SPU_007367	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07369	SPU_007369	none	none
GLEAN3_07375	SPU_007375	none	none
GLEAN3_07378	SPU_007378	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07383	SPU_007383	none	none
GLEAN3_07388	SPU_007388	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07395	SPU_007395	none	none
GLEAN3_07396	SPU_007396	none	none
GLEAN3_07399	SPU_007399	none	none
GLEAN3_07415	SPU_007415	none	none
GLEAN3_07417	SPU_007417	contains PRK11664 domain	none
GLEAN3_07432	SPU_007432	none	none
GLEAN3_07439	SPU_007439	none	none
GLEAN3_07442	SPU_007442	none	none
GLEAN3_07447	SPU_007447	none	none
GLEAN3_07451	SPU_007451	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07453	SPU_007453	none	none
GLEAN3_07459	SPU_007459	none	none
GLEAN3_07464	SPU_007464	contains 3 TPR superfamily motifs	none
GLEAN3_07465	SPU_007465	none	none
GLEAN3_07467	SPU_007467	none	none
GLEAN3_07496	SPU_007496	probable assembly chimera	none
GLEAN3_07501	SPU_007501	none	none
GLEAN3_07504	SPU_007504	contains 2 ANK superfamily motifs	none
GLEAN3_07524	SPU_007524	none	none
GLEAN3_07534	SPU_007534	contains 2 EGF_CA superfamily motifs	none
GLEAN3_07542	SPU_007542	none	none
GLEAN3_07545	SPU_007545	none	none
GLEAN3_07546	SPU_007546	none	none
GLEAN3_07550	SPU_007550	contains 2 EFh superfamily motifs	none
GLEAN3_07559	SPU_007559	none	none
GLEAN3_07560	SPU_007560	none	none
GLEAN3_07561	SPU_007561	contains COG5635 domain	none
GLEAN3_07569	SPU_007569	none	none
GLEAN3_07570	SPU_007570	contains 2 MFS superfamily motifs	none
GLEAN3_07573	SPU_007573	none	none
GLEAN3_07574	SPU_007574	none	none
GLEAN3_07575	SPU_007575	none	none
GLEAN3_07577	SPU_007577	none	none
GLEAN3_07584	SPU_007584	none	none
GLEAN3_07586	SPU_007586	none	none
GLEAN3_07588	SPU_007588	contains Tweety domain	none
GLEAN3_07602	SPU_007602	none	none
GLEAN3_07608	SPU_007608	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07614	SPU_007614	none	none
GLEAN3_07615	SPU_007615	none	none
GLEAN3_07619	SPU_007619	none	none
GLEAN3_07625	SPU_007625	none	none
GLEAN3_07626	SPU_007626	none	none
GLEAN3_07627	SPU_007627	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07632	SPU_007632	none	none
GLEAN3_07635	SPU_007635	contains GalT domain	none
GLEAN3_07642	SPU_007642	none	none
GLEAN3_07643	SPU_007643	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07646	SPU_007646	none	none
GLEAN3_07647	SPU_007647	none	none
GLEAN3_07648	SPU_007648	contains 2 CUB superfamily motifs	none
GLEAN3_07653	SPU_007653	none	none
GLEAN3_07659	SPU_007659	none	none
GLEAN3_07663	SPU_007663	none	none
GLEAN3_07678	SPU_007678	none	none
GLEAN3_07680	SPU_007680	contains Smc domain	none
GLEAN3_07681	SPU_007681	contains Smc domain	none
GLEAN3_07689	SPU_007689	contains UDPGT domain	none
GLEAN3_07694	SPU_007694	none	none
GLEAN3_07700	SPU_007700	none	none
GLEAN3_07707	SPU_007707	none	none
GLEAN3_07723	SPU_007723	none	none
GLEAN3_07726	SPU_007726	none	none
GLEAN3_07746	SPU_007746	none	none
GLEAN3_07752	SPU_007752	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_07754	SPU_007754	contains Smc domain	none
GLEAN3_07761	SPU_007761	none	none
GLEAN3_07767	SPU_007767	none	none
GLEAN3_07772	SPU_007772	none	none
GLEAN3_07776	SPU_007776	none	none
GLEAN3_07786	SPU_007786	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07792	SPU_007792	none	none
GLEAN3_07793	SPU_007793	none	none
GLEAN3_07797	SPU_007797	none	none
GLEAN3_07788	SPU_007788	none	none
GLEAN3_07798	SPU_007798	none	none
GLEAN3_07803	SPU_007803	none	none
GLEAN3_07804	SPU_007804	none	none
GLEAN3_07806	SPU_007806	none	none
GLEAN3_07807	SPU_007807	none	none
GLEAN3_07814	SPU_007814	none	none
GLEAN3_07817	SPU_007817	contains COG5222 domain	none
GLEAN3_07824	SPU_007824	contains 3 IG superfamily motifs	none
GLEAN3_07826	SPU_007826	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07829	SPU_007829	none	none
GLEAN3_07848	SPU_007848	none	none
GLEAN3_07858	SPU_007858	none	none
GLEAN3_07871	SPU_007871	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_07872	SPU_007872	contains Sulfotransfer_1 domain	none
GLEAN3_07874	SPU_007874	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07878	SPU_007878	none	none
GLEAN3_07879	SPU_007879	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07885	SPU_007885	none	none
GLEAN3_07886	SPU_007886	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07903	SPU_007903	none	none
GLEAN3_07904	SPU_007904	none	none
GLEAN3_07912	SPU_007912	contains COG3391 domain	none
GLEAN3_07921	SPU_007921	none	none
GLEAN3_07926	SPU_007926	contains Sulfotransfer_1 domain	none
GLEAN3_07931	SPU_007931	none	none
GLEAN3_07945	SPU_007945	none	none
GLEAN3_07947	SPU_007947	contains V-set domain	none
GLEAN3_07956	SPU_007956	contains TruA domain	none
GLEAN3_07959	SPU_007959	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07963	SPU_007963	contains Ion_trans domain	none
GLEAN3_07967	SPU_007967	none	none
GLEAN3_07970	SPU_007970	none	none
GLEAN3_07974	SPU_007974	none	none
GLEAN3_07975	SPU_007975	none	none
GLEAN3_07978	SPU_007978	none	none
GLEAN3_07982	SPU_007982	none	none
GLEAN3_07988	SPU_007988	none	none
GLEAN3_07989	SPU_007989	none	none
GLEAN3_07996	SPU_007996	none	none
GLEAN3_07998	SPU_007998	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07999	SPU_007999	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08002	SPU_008002	ATS1 domain	none
GLEAN3_08010	SPU_008010	contains VacB domain	none
GLEAN3_08012	SPU_008012	none	none
GLEAN3_08023	SPU_008023	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08026	SPU_008026	contains 2 LIM superfamily motifs	none
GLEAN3_08028	SPU_008028	contains 3 EGF_CA superfamily motifs. probable assembly chimera.	none
GLEAN3_08031	SPU_008031	none	none
GLEAN3_08033	SPU_008033	none	none
GLEAN3_08035	SPU_008035	none	none
GLEAN3_08043	SPU_008043	none	none
GLEAN3_08047	SPU_008047	none	none
GLEAN3_08052	SPU_008052	none	none
GLEAN3_08059	SPU_008059	contains 3 BTB superfamily motifs	none
GLEAN3_08060	SPU_008060	contains 2 BTB superfamily motifs	none
GLEAN3_08061	SPU_008061	contains 2 BTB superfamily motifs	none
GLEAN3_08086	SPU_008086	none	none
GLEAN3_08087	SPU_008087	none	none
GLEAN3_08088	SPU_008088	none	none
GLEAN3_08100	SPU_008100	contains 3 CLECT superfamily motifs	none
GLEAN3_08101	SPU_008101	none	none
GLEAN3_08103	SPU_008103	none	none
GLEAN3_08106	SPU_008106	none	none
GLEAN3_08124	SPU_008124	none	none
GLEAN3_08126	SPU_008126	none	none
GLEAN3_08127	SPU_008127	none	none
GLEAN3_08129	SPU_008129	contains 2 TUDOR superfamily motifs	none
GLEAN3_08145	SPU_008145	contains 2 EGF_CA superfamily motifs	none
GLEAN3_08156	SPU_008156	contains 2 FA58C superfamily motifs	none
GLEAN3_08162	SPU_008162	none	none
GLEAN3_08163	SPU_008163	none	none
GLEAN3_08165	SPU_008165	contains 2 EGF_CA superfamily motifs	none
GLEAN3_08167	SPU_008167	none	none
GLEAN3_08168	SPU_008168	none	none
GLEAN3_08187	SPU_008187	none	none
GLEAN3_08190	SPU_008190	none	none
GLEAN3_08191	SPU_008191	none	none
GLEAN3_08193	SPU_008193	none	none
GLEAN3_08198	SPU_008198	none	none
GLEAN3_08202	SPU_008202	none	none
GLEAN3_08203	SPU_008203	none	none
GLEAN3_08205	SPU_008205	none	none
GLEAN3_08216	SPU_008216	none	none
GLEAN3_08220	SPU_008220	none	none
GLEAN3_08223	SPU_008223	none	none
GLEAN3_08224	SPU_008224	none	none
GLEAN3_08226	SPU_008226	none	none
GLEAN3_08234	SPU_008234	contains 6 HMA superfamily motifs	none
GLEAN3_08238	SPU_008238	none	none
GLEAN3_08239	SPU_008239	none	none
GLEAN3_08243	SPU_008243	none	none
GLEAN3_08251	SPU_008251	none	none
GLEAN3_08252	SPU_008252	none	none
GLEAN3_08257	SPU_008257	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_08262	SPU_008262	none	none
GLEAN3_08269	SPU_008269	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08273	SPU_008273	none	none
GLEAN3_08285	SPU_008285	none	none
GLEAN3_08306	SPU_008306	none	none
GLEAN3_08307	SPU_008307	none	none
GLEAN3_08311	SPU_008311	none	none
GLEAN3_08316	SPU_008316	none	none
GLEAN3_08321	SPU_008321	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_08323	SPU_008323	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08330	SPU_008330	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08335	SPU_008335	contains Torsin domain	none
GLEAN3_08339	SPU_008339	none	none
GLEAN3_08342	SPU_008342	none	none
GLEAN3_08343	SPU_008343	contains MFS_1 domain	none
GLEAN3_08345	SPU_008345	contains 2 SAM superfamily motifs	none
GLEAN3_08348	SPU_008348	none	none
GLEAN3_08355	SPU_008355	none	none
GLEAN3_08364	SPU_008364	none	none
GLEAN3_08367	SPU_008367	none	none
GLEAN3_08375	SPU_008375	none	none
GLEAN3_08376	SPU_008376	none	none
GLEAN3_08383	SPU_008383	contains 2 EGF_CA superfamily motifs	none
GLEAN3_08384	SPU_008384	none	none
GLEAN3_08392	SPU_008392	contains Myosin_tail_1 domain	none
GLEAN3_08400	SPU_008400	none	none
GLEAN3_08414	SPU_008414	contains 2 MFS superfamily motifs	none
GLEAN3_08419	SPU_008419	none	none
GLEAN3_08430	SPU_008430	contains 2 FA58C superfamily motifs	none
GLEAN3_08436	SPU_008436	none	none
GLEAN3_08446	SPU_008446	none	none
GLEAN3_08449	SPU_008449	none	none
GLEAN3_08453	SPU_008453	none	none
GLEAN3_08455	SPU_008455	none	none
GLEAN3_08460	SPU_008460	none	none
GLEAN3_08461	SPU_008461	none	none
GLEAN3_08468	SPU_008468	none	none
GLEAN3_08473	SPU_008473	contains PnbA domain	none
GLEAN3_08474	SPU_008474	contains COG4886 domain	none
GLEAN3_08476	SPU_008476	none	none
GLEAN3_08478	SPU_008478	none	none
GLEAN3_08482	SPU_008482	none	none
GLEAN3_08484	SPU_008484	probable assembly chimera	none
GLEAN3_08488	SPU_008488	none	none
GLEAN3_08490	SPU_008490	none	none
GLEAN3_08491	SPU_008491	probable assembly chimera	none
GLEAN3_08494	SPU_008494	none	none
GLEAN3_08497	SPU_008497	none	none
GLEAN3_08501	SPU_008501	none	none
GLEAN3_08508	SPU_008508	contains AIR1 domain	none
GLEAN3_08509	SPU_008509	none	none
GLEAN3_08518	SPU_008518	contains SMC_N domain	none
GLEAN3_08519	SPU_008519	none	none
GLEAN3_08523	SPU_008523	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08524	SPU_008524	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08530	SPU_008530	none	none
GLEAN3_08546	SPU_008546	contains Nop14 domain	none
GLEAN3_08555	SPU_008555	none	none
GLEAN3_08556	SPU_008556	none	none
GLEAN3_08564	SPU_008564	none	none
GLEAN3_08565	SPU_008565	none	none
GLEAN3_08568	SPU_008568	none	none
GLEAN3_08569	SPU_008569	contains DYN1 domain	none
GLEAN3_08570	SPU_008570	none	none
GLEAN3_08573	SPU_008573	none	none
GLEAN3_08574	SPU_008574	none	none
GLEAN3_08579	SPU_008579	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08580	SPU_008580	none	none
GLEAN3_08584	SPU_008584	none	none
GLEAN3_08588	SPU_008588	none	none
GLEAN3_08601	SPU_008601	none	none
GLEAN3_08603	SPU_008603	none	none
GLEAN3_08607	SPU_008607	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_08608	SPU_008608	none	none
GLEAN3_08613	SPU_008613	contains 2 EGF_CA superfamily motifs	none
GLEAN3_08629	SPU_008629	none	none
GLEAN3_08630	SPU_008630	contains COG1204 domain	none
GLEAN3_08635	SPU_008635	none	none
GLEAN3_08636	SPU_008636	contains COG4700 domain	none
GLEAN3_08637	SPU_008637	contains 3 LRR_RI superfamily motifs	none
GLEAN3_08639	SPU_008639	contains PRK08315 domain	none
GLEAN3_08640	SPU_008640	none	none
GLEAN3_08644	SPU_008644	none	none
GLEAN3_08654	SPU_008654	none	none
GLEAN3_08660	SPU_008660	none	none
GLEAN3_08661	SPU_008661	none	none
GLEAN3_08664	SPU_008664	none	none
GLEAN3_08666	SPU_008666	none	none
GLEAN3_08671	SPU_008671	probable assembly chimera	none
GLEAN3_08682	SPU_008682	none	none
GLEAN3_08689	SPU_008689	none	none
GLEAN3_08708	SPU_008708	none	none
GLEAN3_08711	SPU_008711	none	none
GLEAN3_08715	SPU_008715	none	none
GLEAN3_08723	SPU_008723	none	none
GLEAN3_08725	SPU_008725	none	none
GLEAN3_08726	SPU_008726	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08727	SPU_008727	none	none
GLEAN3_08744	SPU_008744	none	none
GLEAN3_08745	SPU_008745	none	none
GLEAN3_08753	SPU_008753	none	none
GLEAN3_08757	SPU_008757	none	none
GLEAN3_08758	SPU_008758	none	none
GLEAN3_08784	SPU_008784	contains SMC_N domain	none
GLEAN3_08788	SPU_008788	contains Sulfotransfer_1 domain	none
GLEAN3_08790	SPU_008790	none	none
GLEAN3_08792	SPU_008792	none	none
GLEAN3_08794	SPU_008794	none	none
GLEAN3_08796	SPU_008796	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_08797	SPU_008797	none	none
GLEAN3_08805	SPU_008805	contains MFS_1 domain	none
GLEAN3_08807	SPU_008807	none	none
GLEAN3_08809	SPU_008809	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08810	SPU_008810	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08813	SPU_008813	none	none
GLEAN3_08814	SPU_008814	none	none
GLEAN3_08817	SPU_008817	none	none
GLEAN3_08827	SPU_008827	none	none
GLEAN3_08829	SPU_008829	none	none
GLEAN3_08830	SPU_008830	none	none
GLEAN3_08831	SPU_008831	contains 2 BTB superfamily motifs	none
GLEAN3_08837	SPU_008837	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08852	SPU_008852	none	none
GLEAN3_08856	SPU_008856	none	none
GLEAN3_08873	SPU_008873	contains PRK06116 domain	none
GLEAN3_08883	SPU_008883	none	none
GLEAN3_08890	SPU_008890	none	none
GLEAN3_08900	SPU_008900	none	none
GLEAN3_08902	SPU_008902	none	none
GLEAN3_08904	SPU_008904	none	none
GLEAN3_08920	SPU_008920	none	none
GLEAN3_08921	SPU_008921	none	none
GLEAN3_08929	SPU_008929	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08941	SPU_008941	none	none
GLEAN3_08947	SPU_008947	none	none
GLEAN3_08961	SPU_008961	contains SMC_N domain	none
GLEAN3_08965	SPU_008965	none	none
GLEAN3_08968	SPU_008968	none	none
GLEAN3_08972	SPU_008972	contains 2 SPEC superfamily motifs	none
GLEAN3_08973	SPU_008973	none	none
GLEAN3_08998	SPU_008998	none	none
GLEAN3_08999	SPU_008999	none	none
GLEAN3_09001	SPU_009001	contains DadA domain	none
GLEAN3_09033	SPU_009033	contains 2 FA58C superfamily motifs	none
GLEAN3_09038	SPU_009038	none	none
GLEAN3_09041	SPU_009041	none	none
GLEAN3_09046	SPU_009046	none	none
GLEAN3_09048	SPU_009048	none	none
GLEAN3_09058	SPU_009058	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09062	SPU_009062	none	none
GLEAN3_09063	SPU_009063	none	none
GLEAN3_09068	SPU_009068	none	none
GLEAN3_09074	SPU_009074	none	none
GLEAN3_09077	SPU_009077	contains 2 FA58C superfamily motifs	none
GLEAN3_09087	SPU_009087	none	none
GLEAN3_09088	SPU_009088	none	none
GLEAN3_09094	SPU_009094	contains MFS_1 domain	none
GLEAN3_09096	SPU_009096	none	none
GLEAN3_09097	SPU_009097	none	none
GLEAN3_09098	SPU_009098	none	none
GLEAN3_09104	SPU_009104	none	none
GLEAN3_09110	SPU_009110	contains 2 VWC superfamily motifs	none
GLEAN3_09119	SPU_009119	none	none
GLEAN3_09120	SPU_009120	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09136	SPU_009136	none	none
GLEAN3_09137	SPU_009137	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09143	SPU_009143	none	none
GLEAN3_09144	SPU_009144	none	none
GLEAN3_09156	SPU_009156	none	none
GLEAN3_09157	SPU_009157	none	none
GLEAN3_09162	SPU_009162	none	none
GLEAN3_09166	SPU_009166	contains COG3889 domain	none
GLEAN3_09179	SPU_009179	none	none
GLEAN3_09183	SPU_009183	none	none
GLEAN3_09186	SPU_009186	none	none
GLEAN3_09187	SPU_009187	contains SMC_N domain	none
GLEAN3_09200	SPU_009200	none	none
GLEAN3_09205	SPU_009205	none	none
GLEAN3_09207	SPU_009207	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09211	SPU_009211	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09227	SPU_009227	none	none
GLEAN3_09228	SPU_009228	none	none
GLEAN3_09229	SPU_009229	none	none
GLEAN3_09244	SPU_009244	contains 8 EGF_CA superfamily motifs	none
GLEAN3_09247	SPU_009247	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09252	SPU_009252	none	none
GLEAN3_09253	SPU_009253	none	none
GLEAN3_09257	SPU_009257	none	none
GLEAN3_09261	SPU_009261	none	none
GLEAN3_09272	SPU_009272	none	none
GLEAN3_09273	SPU_009273	none	none
GLEAN3_09275	SPU_009275	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09282	SPU_009282	none	none
GLEAN3_09285	SPU_009285	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09286	SPU_009286	none	none
GLEAN3_09287	SPU_009287	contains PAT1 domain and Smc domain	none
GLEAN3_09291	SPU_009291	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09294	SPU_009294	none	none
GLEAN3_09299	SPU_009299	none	none
GLEAN3_09302	SPU_009302	none	none
GLEAN3_09304	SPU_009304	probable assembly chimera	none
GLEAN3_09306	SPU_009306	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09312	SPU_009312	none	none
GLEAN3_09313	SPU_009313	none	none
GLEAN3_09315	SPU_009315	none	none
GLEAN3_09325	SPU_009325	none	none
GLEAN3_09330	SPU_009330	none	none
GLEAN3_09333	SPU_009333	none	none
GLEAN3_09341	SPU_009341	none	none
GLEAN3_09342	SPU_009342	none	none
GLEAN3_09344	SPU_009344	contains 2 DEXDc superfamily motifs	none
GLEAN3_09350	SPU_009350	none	none
GLEAN3_09353	SPU_009353	none	none
GLEAN3_09363	SPU_009363	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09364	SPU_009364	contains AmpC domain	none
GLEAN3_09366	SPU_009366	none	none
GLEAN3_09370	SPU_009370	none	none
GLEAN3_09371	SPU_009371	none	none
GLEAN3_09373	SPU_009373	contains COG4886 domain	none
GLEAN3_09379	SPU_009379	contains COG5222 domain	none
GLEAN3_09380	SPU_009380	none	none
GLEAN3_09391	SPU_009391	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_09396	SPU_009396	none	none
GLEAN3_09402	SPU_009402	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09403	SPU_009403	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09406	SPU_009406	none	none
GLEAN3_09412	SPU_009412	none	none
GLEAN3_09426	SPU_009426	none	none
GLEAN3_09445	SPU_009445	contains DadA domain	none
GLEAN3_09448	SPU_009448	none	none
GLEAN3_09449	SPU_009449	contains DadA domain	none
GLEAN3_09451	SPU_009451	none	none
GLEAN3_09460	SPU_009460	none	none
GLEAN3_09466	SPU_009466	contains 2 HYR superfamily motifs	none
GLEAN3_09467	SPU_009467	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09468	SPU_009468	none	none
GLEAN3_09472	SPU_009472	none	none
GLEAN3_09493	SPU_009493	none	none
GLEAN3_09498	SPU_009498	none	none
GLEAN3_09500	SPU_009500	none	none
GLEAN3_09510	SPU_009510	none	none
GLEAN3_09532	SPU_009532	none	none
GLEAN3_09538	SPU_009538	none	none
GLEAN3_09540	SPU_009540	none	none
GLEAN3_09542	SPU_009542	none	none
GLEAN3_09551	SPU_009551	probable assembly chimera	none
GLEAN3_09552	SPU_009552	none	none
GLEAN3_09556	SPU_009556	none	none
GLEAN3_09561	SPU_009561	none	none
GLEAN3_09563	SPU_009563	contains 4 SRCR superfamily motifs	none
GLEAN3_09567	SPU_009567	none	none
GLEAN3_09568	SPU_009568	none	none
GLEAN3_09570	SPU_009570	contains 3 IG superfamily motifs	none
GLEAN3_09578	SPU_009578	none	none
GLEAN3_09581	SPU_009581	none	none
GLEAN3_09583	SPU_009583	none	none
GLEAN3_09585	SPU_009585	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09587	SPU_009587	none	none
GLEAN3_09597	SPU_009597	none	none
GLEAN3_09603	SPU_009603	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09607	SPU_009607	none	none
GLEAN3_09609	SPU_009609	none	none
GLEAN3_09623	SPU_009623		none
GLEAN3_09641	SPU_009641	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09660	SPU_009660	none	none
GLEAN3_09671	SPU_009671	probable assembly chimera	none
GLEAN3_09672	SPU_009672	contains GltB domain	none
GLEAN3_09675	SPU_009675	none	none
GLEAN3_09709	SPU_009709	none	none
GLEAN3_09710	SPU_009710	none	none
GLEAN3_09719	SPU_009719	none	none
GLEAN3_09722	SPU_009722	none	none
GLEAN3_09726	SPU_009726	none	none
GLEAN3_09727	SPU_009727	none	none
GLEAN3_09735	SPU_009735	none	none
GLEAN3_09744	SPU_009744	contains 2 C2 superfamily motifs	none
GLEAN3_09746	SPU_009746	none	none
GLEAN3_09754	SPU_009754	none	none
GLEAN3_09763	SPU_009763	none	none
GLEAN3_09765	SPU_009765	none	none
GLEAN3_09770	SPU_009770	none	none
GLEAN3_09778	SPU_009778	none	none
GLEAN3_09780	SPU_009780	none	none
GLEAN3_09781	SPU_009781	none	none
GLEAN3_09784	SPU_009784	contains COG5222 domain	none
GLEAN3_09785	SPU_009785	none	none
GLEAN3_09787	SPU_009787	contains SMC_N domain	none
GLEAN3_09790	SPU_009790	none	none
GLEAN3_09795	SPU_009795	none	none
GLEAN3_09799	SPU_009799	none	none
GLEAN3_09806	SPU_009806	none	none
GLEAN3_09809	SPU_009809	none	none
GLEAN3_09811	SPU_009811	contains Dynein_heavy domain	none
GLEAN3_09822	SPU_009822	none	none
GLEAN3_09827	SPU_009827	contains 5307 domain	none
GLEAN3_09833	SPU_009833	none	none
GLEAN3_09839	SPU_009839	none	none
GLEAN3_09857	SPU_009857	none	none
GLEAN3_09860	SPU_009860	none	none
GLEAN3_09868	SPU_009868	none	none
GLEAN3_09875	SPU_009875	contains 4 KAZAL_FS superfamily motifs	none
GLEAN3_09880	SPU_009880	none	none
GLEAN3_09895	SPU_009895	none	none
GLEAN3_09900	SPU_009900	none	none
GLEAN3_09903	SPU_009903	none	none
GLEAN3_09907	SPU_009907	none	none
GLEAN3_09918	SPU_009918	none	none
GLEAN3_09919	SPU_009919	none	none
GLEAN3_09920	SPU_009920	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09926	SPU_009926	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09935	SPU_009935	contains AST1 domain	none
GLEAN3_09938	SPU_009938	contains 2 FA58C superfamily motifs	none
GLEAN3_09941	SPU_009941	none	none
GLEAN3_09957	SPU_009957	none	none
GLEAN3_09958	SPU_009958	none	none
GLEAN3_09965	SPU_009965	none	none
GLEAN3_09968	SPU_009968	none	none
GLEAN3_09971	SPU_009971	contains COG4942 domain	none
GLEAN3_09972	SPU_009972	contains COG1579 domain	none
GLEAN3_09973	SPU_009973	contains COG1579 domain	none
GLEAN3_09974	SPU_009974	contains SMC_N domain	none
GLEAN3_09975	SPU_009975	none	none
GLEAN3_09977	SPU_009977	none	none
GLEAN3_10005	SPU_010005	none	none
GLEAN3_10008	SPU_010008	none	none
GLEAN3_10013	SPU_010013	none	none
GLEAN3_10015	SPU_010015	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10022	SPU_010022	none	none
GLEAN3_10023	SPU_010023	none	none
GLEAN3_10025	SPU_010025	none	none
GLEAN3_10027	SPU_010027	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10034	SPU_010034	contains 2 FA58C superfamily motifs	none
GLEAN3_10042	SPU_010042	none	none
GLEAN3_10047	SPU_010047	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10048	SPU_010048	none	none
GLEAN3_10055	SPU_010055	none	none
GLEAN3_10061	SPU_010061	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10085	SPU_010085	none	none
GLEAN3_10088	SPU_010088	contains 2 MFS superfamily motifs	none
GLEAN3_10092	SPU_010092	contains SMC_N domain	none
GLEAN3_10103	SPU_010103	none	none
GLEAN3_10106	SPU_010106	none	none
GLEAN3_10117	SPU_010117	none	none
GLEAN3_10122	SPU_010122	none	none
GLEAN3_10126	SPU_010126	none	none
GLEAN3_10137	SPU_010137	none	none
GLEAN3_10138	SPU_010138	none	none
GLEAN3_10142	SPU_010142	contains 2 MFS superfamily motifs	none
GLEAN3_10150	SPU_010150	none	none
GLEAN3_10154	SPU_010154	none	none
GLEAN3_10169	SPU_010169	probable assembly chimera. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10173	SPU_010173	none	none
GLEAN3_10179	SPU_010179	none	none
GLEAN3_10182	SPU_010182	none	none
GLEAN3_10184	SPU_010184	none	none
GLEAN3_10187	SPU_010187	none	none
GLEAN3_10194	SPU_010194	contains COG2433 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10195	SPU_010195	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10199	SPU_010199	none	none
GLEAN3_10218	SPU_010218	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10223	SPU_010223	none	none
GLEAN3_10224	SPU_010224	none	none
GLEAN3_10229	SPU_010229	none	none
GLEAN3_10231	SPU_010231	none	none
GLEAN3_10237	SPU_010237	none	none
GLEAN3_10256	SPU_010256	none	none
GLEAN3_10257	SPU_010257	contains 4 LDLa superfamily motifs and 2 CUB superfamily motifs	none
GLEAN3_10258	SPU_010258	none	none
GLEAN3_10259	SPU_010259	none	none
GLEAN3_10260	SPU_010260	none	none
GLEAN3_10272	SPU_010272	contains PRK09039 domain	none
GLEAN3_10273	SPU_010273	none	none
GLEAN3_10286	SPU_010286	none	none
GLEAN3_10288	SPU_010288	none	none
GLEAN3_10292	SPU_010292	contains COG5540 domain	none
GLEAN3_10298	SPU_010298	none	none
GLEAN3_10300	SPU_010300	none	none
GLEAN3_10305	SPU_010305	none	none
GLEAN3_10309	SPU_010309	none	none
GLEAN3_10314	SPU_010314	none	none
GLEAN3_10323	SPU_010323	none	none
GLEAN3_10331	SPU_010331	none	none
GLEAN3_10344	SPU_010344	none	none
GLEAN3_10345	SPU_010345	none	none
GLEAN3_10346	SPU_010346	none	none
GLEAN3_10355	SPU_010355	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10356	SPU_010356	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10357	SPU_010357	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10359	SPU_010359	none	none
GLEAN3_10361	SPU_010361	none	none
GLEAN3_10369	SPU_010369	none	none
GLEAN3_10373	SPU_010373	none	none
GLEAN3_10378	SPU_010378	none	none
GLEAN3_10380	SPU_010380	none	none
GLEAN3_10386	SPU_010386	none	none
GLEAN3_10387	SPU_010387	none	none
GLEAN3_10388	SPU_010388	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10397	SPU_010397	none	none
GLEAN3_10400	SPU_010400	contains UvrA domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10401	SPU_010401	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10405	SPU_010405	none	none
GLEAN3_10406	SPU_010406	none	none
GLEAN3_10414	SPU_010414	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_10431	SPU_010431	none	none
GLEAN3_10434	SPU_010434	poor sequence data: ~70% of amino acids are X	none
GLEAN3_10435	SPU_010435	none	none
GLEAN3_10452	SPU_010452	none	none
GLEAN3_10454	SPU_010454	none	none
GLEAN3_10457	SPU_010457	none	none
GLEAN3_10462	SPU_010462	none	none
GLEAN3_10465	SPU_010465	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_10482	SPU_010482	none	none
GLEAN3_10484	SPU_010484	none	none
GLEAN3_10487	SPU_010487	none	none
GLEAN3_10490	SPU_010490	none	none
GLEAN3_10492	SPU_010492	contains SMC_N domain	none
GLEAN3_10493	SPU_010493	none	none
GLEAN3_10497	SPU_010497	none	none
GLEAN3_10509	SPU_010509	none	none
GLEAN3_10510	SPU_010510	none	none
GLEAN3_10512	SPU_010512	none	none
GLEAN3_10521	SPU_010521	none	none
GLEAN3_10524	SPU_010524	none	none
GLEAN3_10526	SPU_010526	none	none
GLEAN3_10532	SPU_010532	contains 6 EGF_CA fft motifs	none
GLEAN3_10560	SPU_010560	none	none
GLEAN3_10566	SPU_010566	contains 2 PBPb superfamily motifs	none
GLEAN3_10568	SPU_010568	contains 2 PBPb superfamily motifs	none
GLEAN3_10569	SPU_010569	none	none
GLEAN3_10583	SPU_010583	none	none
GLEAN3_10584	SPU_010584	none	none
GLEAN3_10586	SPU_010586	contains PRK00409 domain	none
GLEAN3_10588	SPU_010588	contains SMC_N domain	none
GLEAN3_10589	SPU_010589	matches only to itself	none
GLEAN3_10591	SPU_010591	none	none
GLEAN3_10594	SPU_010594	matches only to itself	none
GLEAN3_10599	SPU_010599	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10600	SPU_010600	none	none
GLEAN3_10604	SPU_010604	none	none
GLEAN3_10605	SPU_010605	none	none
GLEAN3_10607	SPU_010607	none	none
GLEAN3_10616	SPU_010616	contains FAA1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10632	SPU_010632	none	none
GLEAN3_10652	SPU_010652	none	none
GLEAN3_10654	SPU_010654	contains RAD18 domain	none
GLEAN3_10657	SPU_010657	none	none
GLEAN3_10665	SPU_010665	contains PAT1 domain	none
GLEAN3_10668	SPU_010668	none	none
GLEAN3_10674	SPU_010674	none	none
GLEAN3_10679	SPU_010679	none	none
GLEAN3_10682	SPU_010682	probable assembly chimera	none
GLEAN3_10694	SPU_010694	none	none
GLEAN3_10709	SPU_010709	contains SMC_N domain	none
GLEAN3_10717	SPU_010717	none	none
GLEAN3_10723	SPU_010723	none	none
GLEAN3_10728	SPU_010728	none	none
GLEAN3_10731	SPU_010731	none	none
GLEAN3_10749	SPU_010749	none	none
GLEAN3_10751	SPU_010751	none	none
GLEAN3_10756	SPU_010756	none	none
GLEAN3_10757	SPU_010757	none	none
GLEAN3_10759	SPU_010759	none	none
GLEAN3_10768	SPU_010768	none	none
GLEAN3_10779	SPU_010779	none	none
GLEAN3_10781	SPU_010781	contains RfaG domain	none
GLEAN3_10790	SPU_010790	none	none
GLEAN3_10791	SPU_010791	PnbA domain	none
GLEAN3_10794	SPU_010794	none	none
GLEAN3_10795	SPU_010795	none	none
GLEAN3_10798	SPU_010798	none	none
GLEAN3_10806	SPU_010806	none	none
GLEAN3_10810	SPU_010810	none	none
GLEAN3_10817	SPU_010817	contains 5 ANK superfamily motifs. homologous to numerous ankyrin repeat proteins in Trichomonas vaginalis.	none
GLEAN3_10818	SPU_010818	contains COG4886 domain	none
GLEAN3_10819	SPU_010819	none	none
GLEAN3_10836	SPU_010836	none	none
GLEAN3_10837	SPU_010837	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10838	SPU_010838	none	none
GLEAN3_10862	SPU_010862	none	none
GLEAN3_10869	SPU_010869	none	none
GLEAN3_10870	SPU_010870	none	none
GLEAN3_10878	SPU_010878	none	none
GLEAN3_10879	SPU_010879	none	none
GLEAN3_10880	SPU_010880	none	none
GLEAN3_10882	SPU_010882	none	none
GLEAN3_10883	SPU_010883	none	none
GLEAN3_10884	SPU_010884	none	none
GLEAN3_10893	SPU_010893	none	none
GLEAN3_10894	SPU_010894	none	none
GLEAN3_10895	SPU_010895	contains 3 EGF_CA superfamily motifs	none
GLEAN3_10898	SPU_010898	none	none
GLEAN3_10901	SPU_010901	none	none
GLEAN3_10902	SPU_010902	none	none
GLEAN3_10903	SPU_010903	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10907	SPU_010907	contains 2 MFS superfamily motifs	none
GLEAN3_10912	SPU_010912	none	none
GLEAN3_10920	SPU_010920	none	none
GLEAN3_10927	SPU_010927	none	none
GLEAN3_10932	SPU_010932	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10933	SPU_010933	none	none
GLEAN3_10935	SPU_010935	none	none
GLEAN3_10937	SPU_010937	none	none
GLEAN3_10949	SPU_010949	none	none
GLEAN3_10955	SPU_010955	none	none
GLEAN3_10960	SPU_010960	none	none
GLEAN3_10979	SPU_010979	none	none
GLEAN3_10980	SPU_010980	none	none
GLEAN3_10983	SPU_010983	contains MPH1 domain	none
GLEAN3_10984	SPU_010984	none	none
GLEAN3_10985	SPU_010985	contains 2 Kelch_1 superfamily motifs	none
GLEAN3_10987	SPU_010987	none	none
GLEAN3_10988	SPU_010988	none	none
GLEAN3_10989	SPU_010989	none	none
GLEAN3_10998	SPU_010998	none	none
GLEAN3_10999	SPU_010999	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11005	SPU_011005	none	none
GLEAN3_11040	SPU_011040	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11049	SPU_011049	none	none
GLEAN3_11050	SPU_011050	none	none
GLEAN3_11053	SPU_011053	none	none
GLEAN3_11055	SPU_011055	none	none
GLEAN3_11056	SPU_011056	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11060	SPU_011060	none	none
GLEAN3_11067	SPU_011067	none	none
GLEAN3_11087	SPU_011087	none	none
GLEAN3_11089	SPU_011089	none	none
GLEAN3_11090	SPU_011090	none	none
GLEAN3_11096	SPU_011096	none	none
GLEAN3_11108	SPU_011108	none	none
GLEAN3_11109	SPU_011109	none	none
GLEAN3_11120	SPU_011120	none	none
GLEAN3_11122	SPU_011122	contains DAO domain	none
GLEAN3_11123	SPU_011123	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11128	SPU_011128	none	none
GLEAN3_11129	SPU_011129	none	none
GLEAN3_11132	SPU_011132	none	none
GLEAN3_11137	SPU_011137	contains 2 MFS superfamily motifs	none
GLEAN3_11138	SPU_011138	probable assembly chimera	none
GLEAN3_11142	SPU_011142	none	none
GLEAN3_11143	SPU_011143	none	none
GLEAN3_11149	SPU_011149	none	none
GLEAN3_11165	SPU_011165	contains 2 BTB superfamily motifs	none
GLEAN3_11169	SPU_011169	homologous to numerous putative Branchiostoma floridae proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11170	SPU_011170	homologous to numerous putative Branchiostoma floridae proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11178	SPU_011178	contains STT3 domain	none
GLEAN3_11183	SPU_011183	none	none
GLEAN3_11184	SPU_011184	none	none
GLEAN3_11185	SPU_011185	contains Sulfotransfer_1 domain	none
GLEAN3_11186	SPU_011186	none	none
GLEAN3_11205	SPU_011205	contains 4 PDZ superfamily motifs	none
GLEAN3_11207	SPU_011207	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_11209	SPU_011209	none	none
GLEAN3_11210	SPU_011210	none	none
GLEAN3_11221	SPU_011221	none	none
GLEAN3_11224	SPU_011224	none	none
GLEAN3_11227	SPU_011227	none	none
GLEAN3_11228	SPU_011228	none	none
GLEAN3_11236	SPU_011236	contains PnbA domain	none
GLEAN3_11240	SPU_011240	none	none
GLEAN3_11245	SPU_011245	none	none
GLEAN3_11247	SPU_011247	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11249	SPU_011249	none	none
GLEAN3_11255	SPU_011255	contains Sulfotransfer_1 domain	none
GLEAN3_11263	SPU_011263	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_11268	SPU_011268	none	none
GLEAN3_11274	SPU_011274	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_11276	SPU_011276	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11278	SPU_011278	contains Mnd1 domain	none
GLEAN3_11284	SPU_011284	none	none
GLEAN3_28611	SPU_028611	none	none
GLEAN3_11292	SPU_011292	none	none
GLEAN3_11316	SPU_011316	none	none
GLEAN3_11322	SPU_011322	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11324	SPU_011324	contains Smc domain. homologous only to a dozen putative S. purpuratus proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11325	SPU_011325	none	none
GLEAN3_11326	SPU_011326	none	none
GLEAN3_11330	SPU_011330	none	none
GLEAN3_11363	SPU_011363	none	none
GLEAN3_11366	SPU_011366	none	none
GLEAN3_11370	SPU_011370	none	none
GLEAN3_11371	SPU_011371	contains infB domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11377	SPU_011377	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11387	SPU_011387	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11392	SPU_011392	contains 2 PBPb superfamily motifs	none
GLEAN3_11393	SPU_011393	none	none
GLEAN3_11401	SPU_011401	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11402	SPU_011402	none	none
GLEAN3_11403	SPU_011403	none	none
GLEAN3_11405	SPU_011405	none	none
GLEAN3_11409	SPU_011409	none	none
GLEAN3_11410	SPU_011410	none	none
GLEAN3_11411	SPU_011411	none	none
GLEAN3_11430	SPU_011430	none	none
GLEAN3_11445	SPU_011445	none	none
GLEAN3_11450	SPU_011450	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_11465	SPU_011465	contains Smc domain	none
GLEAN3_11469	SPU_011469	none	none
GLEAN3_11472	SPU_011472	none	none
GLEAN3_11478	SPU_011478	none	none
GLEAN3_11482	SPU_011482	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11480	SPU_011480	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11483	SPU_011483	none	none
GLEAN3_11484	SPU_011484	none	none
GLEAN3_11488	SPU_011488	none	none
GLEAN3_11489	SPU_011489	none	none
GLEAN3_11490	SPU_011490	none	none
GLEAN3_11505	SPU_011505	none	none
GLEAN3_11507	SPU_011507	none	none
GLEAN3_11510	SPU_011510	none	none
GLEAN3_11511	SPU_011511	none	none
GLEAN3_11518	SPU_011518	none	none
GLEAN3_11520	SPU_011520	none	none
GLEAN3_11526	SPU_011526	none	none
GLEAN3_11528	SPU_011528	none	none
GLEAN3_11530	SPU_011530	none	none
GLEAN3_11542	SPU_011542	none	none
GLEAN3_11545	SPU_011545	none	none
GLEAN3_11553	SPU_011553	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11556	SPU_011556	none	none
GLEAN3_11557	SPU_011557	none	none
GLEAN3_11558	SPU_011558	none	none
GLEAN3_11568	SPU_011568	contains Borrelia_P83 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11577	SPU_011577	Adaptin_N domain	none
GLEAN3_11578	SPU_011578	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11586	SPU_011586	contains csdA domain	none
GLEAN3_11591	SPU_011591	none	none
GLEAN3_11592	SPU_011592	none	none
GLEAN3_11593	SPU_011593	none	none
GLEAN3_11600	SPU_011600	none	none
GLEAN3_11609	SPU_011609	none	none
GLEAN3_11615	SPU_011615	none	none
GLEAN3_11618	SPU_011618	none	none
GLEAN3_11619	SPU_011619	none	none
GLEAN3_11628	SPU_011628	none	none
GLEAN3_11630	SPU_011630	contains 2 RING superfamily motifs	none
GLEAN3_11636	SPU_011636	none	none
GLEAN3_11642	SPU_011642	none	none
GLEAN3_11643	SPU_011643	contains RecQ domain	none
GLEAN3_11646	SPU_011646	none	none
GLEAN3_11651	SPU_011651	none	none
GLEAN3_11662	SPU_011662	none	none
GLEAN3_11670	SPU_011670	none	none
GLEAN3_11672	SPU_011672	none	none
GLEAN3_11673	SPU_011673	none	none
GLEAN3_11674	SPU_011674	none	none
GLEAN3_11676	SPU_011676	none	none
GLEAN3_11702	SPU_011702	none	none
GLEAN3_11708	SPU_011708	none	none
GLEAN3_11709	SPU_011709	none	none
GLEAN3_11712	SPU_011712	none	none
GLEAN3_11716	SPU_011716	none	none
GLEAN3_11719	SPU_011719	none	none
GLEAN3_11721	SPU_011721	none	none
GLEAN3_11723	SPU_011723	none	none
GLEAN3_11728	SPU_011728	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11731	SPU_011731	none	none
GLEAN3_11741	SPU_011741	none	none
GLEAN3_11744	SPU_011744	none	none
GLEAN3_11749	SPU_011749	none	none
GLEAN3_11763	SPU_011763	contains Sulfotransfer_1 domain	none
GLEAN3_11764	SPU_011764	none	none
GLEAN3_11766	SPU_011766	none	none
GLEAN3_11781	SPU_011781	none	none
GLEAN3_11793	SPU_011793	none	none
GLEAN3_11795	SPU_011795	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_11802	SPU_011802	contains 4 FA58C superfamily motifs	none
GLEAN3_11805	SPU_011805	none	none
GLEAN3_25040	SPU_025040	none	none
GLEAN3_11807	SPU_011807	none	none
GLEAN3_11809	SPU_011809	probable assembly chimera	none
GLEAN3_11810	SPU_011810	probable assembly chimera	none
GLEAN3_11817	SPU_011817	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_11818	SPU_011818	contains 2 UBQ superfamily motifs	none
GLEAN3_11820	SPU_011820	contains 2 CCP superfamily motifs	none
GLEAN3_11821	SPU_011821	none	none
GLEAN3_11824	SPU_011824	none	none
GLEAN3_11826	SPU_011826	none	none
GLEAN3_11827	SPU_011827	none	none
GLEAN3_11831	SPU_011831	none	none
GLEAN3_11833	SPU_011833	none	none
GLEAN3_11835	SPU_011835	contains 6 Kelch_1 superfamily motifs	none
GLEAN3_11840	SPU_011840	none	none
GLEAN3_11850	SPU_011850	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11852	SPU_011852	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11854	SPU_011854	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11858	SPU_011858	contains 2 EGF_CA superfamily motifs	none
GLEAN3_11859	SPU_011859	none	none
GLEAN3_11861	SPU_011861	none	none
GLEAN3_11863	SPU_011863	none	none
GLEAN3_11870	SPU_011870	none	none
GLEAN3_11871	SPU_011871	none	none
GLEAN3_11874	SPU_011874	none	none
GLEAN3_11881	SPU_011881	none	none
GLEAN3_11883	SPU_011883	contains 4 CUB superfamily motifs	none
GLEAN3_11888	SPU_011888	contains 2 MAM superfamily motifs	none
GLEAN3_11891	SPU_011891	none	none
GLEAN3_11892	SPU_011892	contains 3 SCP superfamily motifs	none
GLEAN3_11899	SPU_011899	none	none
GLEAN3_11901	SPU_011901	none	none
GLEAN3_11915	SPU_011915	none	none
GLEAN3_11924	SPU_011924	none	none
GLEAN3_11926	SPU_011926	none	none
GLEAN3_11929	SPU_011929	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11932	SPU_011932	none	none
GLEAN3_11939	SPU_011939	none	none
GLEAN3_11941	SPU_011941	none	none
GLEAN3_11947	SPU_011947	none	none
GLEAN3_11948	SPU_011948	none	none
GLEAN3_11953	SPU_011953	none	none
GLEAN3_11963	SPU_011963	contains SMC_N domain	none
GLEAN3_11968	SPU_011968	none	none
GLEAN3_11974	SPU_011974	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11982	SPU_011982	none	none
GLEAN3_11992	SPU_011992	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_11994	SPU_011994	none	none
GLEAN3_11995	SPU_011995	none	none
GLEAN3_12007	SPU_012007	none	none
GLEAN3_12011	SPU_012011	contains 2 FA58C superfamily motifs	none
GLEAN3_12013	SPU_012013	none	none
GLEAN3_12014	SPU_012014	none	none
GLEAN3_12019	SPU_012019	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12023	SPU_012023	none	none
GLEAN3_12024	SPU_012024	none	none
GLEAN3_12031	SPU_012031	none	none
GLEAN3_12036	SPU_012036	none	none
GLEAN3_12037	SPU_012037	none	none
GLEAN3_12038	SPU_012038	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12054	SPU_012054	none	none
GLEAN3_12057	SPU_012057	none	none
GLEAN3_12062	SPU_012062	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_12064	SPU_012064	contains 3 EGF_CA superfamily motifs	none
GLEAN3_12065	SPU_012065	none	none
GLEAN3_12079	SPU_012079	none	none
GLEAN3_12082	SPU_012082	none	none
GLEAN3_12091	SPU_012091	none	none
GLEAN3_12094	SPU_012094	none	none
GLEAN3_12102	SPU_012102	none	none
GLEAN3_12115	SPU_012115	none	none
GLEAN3_12117	SPU_012117	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12119	SPU_012119	none	none
GLEAN3_12126	SPU_012126	none	none
GLEAN3_12128	SPU_012128	none	none
GLEAN3_12129	SPU_012129	none	none
GLEAN3_12133	SPU_012133	none	none
GLEAN3_12135	SPU_012135	none	none
GLEAN3_12143	SPU_012143	none	none
GLEAN3_12152	SPU_012152	none	none
GLEAN3_12154	SPU_012154	none	none
GLEAN3_12155	SPU_012155	none	none
GLEAN3_12156	SPU_012156	none	none
GLEAN3_12160	SPU_012160	none	none
GLEAN3_12161	SPU_012161	none	none
GLEAN3_12168	SPU_012168	none	none
GLEAN3_12171	SPU_012171	none	none
GLEAN3_12175	SPU_012175	none	none
GLEAN3_12176	SPU_012176	none	none
GLEAN3_12180	SPU_012180	none	none
GLEAN3_12182	SPU_012182	none	none
GLEAN3_12185	SPU_012185	none	none
GLEAN3_12189	SPU_012189	none	none
GLEAN3_12193	SPU_012193	none	none
GLEAN3_12201	SPU_012201	contains COG3217 domain	none
GLEAN3_12217	SPU_012217	none	none
GLEAN3_12233	SPU_012233	contains 4 EGF_CA superfamily motifs	none
GLEAN3_12237	SPU_012237	none	none
GLEAN3_12246	SPU_012246	none	none
GLEAN3_12248	SPU_012248	none	none
GLEAN3_12249	SPU_012249	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12260	SPU_012260	none	none
GLEAN3_12263	SPU_012263	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12266	SPU_012266	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12274	SPU_012274	none	none
GLEAN3_12280	SPU_012280	contains TRF4 domain	none
GLEAN3_12284	SPU_012284	none	none
GLEAN3_12305	SPU_012305	contains 3 IG superfamily motifs	none
GLEAN3_12311	SPU_012311	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12323	SPU_012323	none	none
GLEAN3_12327	SPU_012327	contains Ion_trans domain	none
GLEAN3_12330	SPU_012330	none	none
GLEAN3_12333	SPU_012333	none	none
GLEAN3_12336	SPU_012336	contains PRK11824 domain	none
GLEAN3_12342	SPU_012342	contains Sulfotransfer_1 domain	none
GLEAN3_12343	SPU_012343	contains Sulfotransfer_1 domain	none
GLEAN3_12356	SPU_012356	contains BRO1 domain	none
GLEAN3_12357	SPU_012357	contains COG5273 domain	none
GLEAN3_12363	SPU_012363	none	none
GLEAN3_12366	SPU_012366	none	none
GLEAN3_12370	SPU_012370	none	none
GLEAN3_12375	SPU_012375	none	none
GLEAN3_12377	SPU_012377		none
GLEAN3_12378	SPU_012378	none	none
GLEAN3_12379	SPU_012379	contains 4 PLAT superfamily motifs	none
GLEAN3_12391	SPU_012391	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12400	SPU_012400	none	none
GLEAN3_12406	SPU_012406	none	none
GLEAN3_12408	SPU_012408	none	none
GLEAN3_12418	SPU_012418	contains DYN1 domain and Dynein_heavy domain	none
GLEAN3_12419	SPU_012419	contains DHC_N1 domain	none
GLEAN3_12423	SPU_012423	none	none
GLEAN3_12426	SPU_012426	none	none
GLEAN3_12432	SPU_012432	contains 4 Ldl_recept_b superfamily motifs	none
GLEAN3_12435	SPU_012435	none	none
GLEAN3_12436	SPU_012436	none	none
GLEAN3_12437	SPU_012437	none	none
GLEAN3_12438	SPU_012438	none	none
GLEAN3_12440	SPU_012440	none	none
GLEAN3_12444	SPU_012444	contains RecQ domain	none
GLEAN3_12446	SPU_012446	none	none
GLEAN3_12447	SPU_012447	none	none
GLEAN3_12468	SPU_012468	contains Sulfotransfer_1 domain	none
GLEAN3_12481	SPU_012481	contains Spc7 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12484	SPU_012484	contains Laminin_EGF domain	none
GLEAN3_12486	SPU_012486	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12497	SPU_012497	none	none
GLEAN3_12502	SPU_012502	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12510	SPU_012510	none	none
GLEAN3_12520	SPU_012520	none	none
GLEAN3_12526	SPU_012526	none	none
GLEAN3_12535	SPU_012535	none	none
GLEAN3_12536	SPU_012536	none	none
GLEAN3_12537	SPU_012537	none	none
GLEAN3_12541	SPU_012541	none	none
GLEAN3_12552	SPU_012552	none	none
GLEAN3_12561	SPU_012561	none	none
GLEAN3_12562	SPU_012562	none	none
GLEAN3_12568	SPU_012568	none	none
GLEAN3_12570	SPU_012570	none	none
GLEAN3_12574	SPU_012574	none	none
GLEAN3_12577	SPU_012577	none	none
GLEAN3_12601	SPU_012601	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12603	SPU_012603	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12608	SPU_012608	none	none
GLEAN3_12613	SPU_012613	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12614	SPU_012614	none	none
GLEAN3_12622	SPU_012622	none	none
GLEAN3_12625	SPU_012625	none	none
GLEAN3_12629	SPU_012629	none	none
GLEAN3_12641	SPU_012641	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12647	SPU_012647	none	none
GLEAN3_12653	SPU_012653	none	none
GLEAN3_12660	SPU_012660	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12665	SPU_012665	none	none
GLEAN3_12666	SPU_012666	contains COG4886 domain	none
GLEAN3_12680	SPU_012680	none	none
GLEAN3_12681	SPU_012681	none	none
GLEAN3_12687	SPU_012687	none	none
GLEAN3_12689	SPU_012689	none	none
GLEAN3_12691	SPU_012691	contains Smc domain	none
GLEAN3_12697	SPU_012697	contains Smc domain	none
GLEAN3_12700	SPU_012700	none	none
GLEAN3_12706	SPU_012706	none	none
GLEAN3_12714	SPU_012714	none	none
GLEAN3_12723.1	SPU_012723.1	none	none
GLEAN3_12726	SPU_012726	contains 2 RHOD superfamily motifs and SseA domain	none
GLEAN3_12727	SPU_012727	contains 2 RHOD superfamily motifs and SseA domain	none
GLEAN3_12729	SPU_012729	none	none
GLEAN3_12731	SPU_012731	none	none
GLEAN3_12738	SPU_012738	none	none
GLEAN3_12752	SPU_012752	contains 2 LDLa superfamily motifs	none
GLEAN3_12762	SPU_012762	none	none
GLEAN3_12780	SPU_012780	none	none
GLEAN3_12781	SPU_012781	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12782	SPU_012782	none	none
GLEAN3_12795	SPU_012795	contains NptA domain	none
GLEAN3_12812	SPU_012812	contains DHC_N1 domain	none
GLEAN3_12830	SPU_012830	none	none
GLEAN3_12842	SPU_012842	none	none
GLEAN3_12853	SPU_012853	none	none
GLEAN3_12854	SPU_012854	contains PRK12704 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12857	SPU_012857	contains Mon1 domain	none
GLEAN3_12862	SPU_012862	none	none
GLEAN3_12868	SPU_012868	none	none
GLEAN3_12878	SPU_012878	none	none
GLEAN3_12880	SPU_012880	contains Smc domain and COG3391 domain	none
GLEAN3_12882	SPU_012882	none	none
GLEAN3_12890	SPU_012890	none	none
GLEAN3_12902	SPU_012902	none	none
GLEAN3_12905	SPU_012905	none	none
GLEAN3_12923	SPU_012923	none	none
GLEAN3_12925	SPU_012925	none	none
GLEAN3_12927	SPU_012927	none	none
GLEAN3_12934	SPU_012934	none	none
GLEAN3_12937	SPU_012937	none	none
GLEAN3_12952	SPU_012952	none	none
GLEAN3_12953	SPU_012953	none	none
GLEAN3_12957	SPU_012957	none	none
GLEAN3_12962	SPU_012962	none	none
GLEAN3_12970	SPU_012970	none	none
GLEAN3_12971	SPU_012971	none	none
GLEAN3_12974	SPU_012974	none	none
GLEAN3_12990	SPU_012990	none	none
GLEAN3_12994	SPU_012994	contains Elp3 domain	none
GLEAN3_12996	SPU_012996	contains 2 PH-like superfamily motifs	none
GLEAN3_13000	SPU_013000	none	none
GLEAN3_13001	SPU_013001	none	none
GLEAN3_13004	SPU_013004	none	none
GLEAN3_13008	SPU_013008	none	none
GLEAN3_13032	SPU_013032	none	none
GLEAN3_13033	SPU_013033	none	none
GLEAN3_13043	SPU_013043	probable assembly chimera	none
GLEAN3_13057	SPU_013057	none	none
GLEAN3_13058	SPU_013058	none	none
GLEAN3_13067	SPU_013067	none	none
GLEAN3_13068	SPU_013068	none	none
GLEAN3_13075	SPU_013075	none	none
GLEAN3_13080	SPU_013080	none	none
GLEAN3_13094	SPU_013094	probable assembly chimera	none
GLEAN3_13110	SPU_013110	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13114	SPU_013114	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_13120	SPU_013120	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13122	SPU_013122	none	none
GLEAN3_13127	SPU_013127	none	none
GLEAN3_13128	SPU_013128	none	none
GLEAN3_13137	SPU_013137	contains COG4886 domain	none
GLEAN3_13141	SPU_013141	none	none
GLEAN3_13143	SPU_013143	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13150	SPU_013150	contains Sultotransfer_1 domain	none
GLEAN3_13153	SPU_013153	none	none
GLEAN3_13154	SPU_013154	none	none
GLEAN3_13155	SPU_013155	none	none
GLEAN3_13158	SPU_013158	none	none
GLEAN3_13175	SPU_013175	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13176	SPU_013176	none	none
GLEAN3_13177	SPU_013177	none	none
GLEAN3_13196	SPU_013196	contains COG5222 domain	none
GLEAN3_13211	SPU_013211	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_13212	SPU_013212	none	none
GLEAN3_13216	SPU_013216	none	none
GLEAN3_13217	SPU_013217	none	none
GLEAN3_13218	SPU_013218	none	none
GLEAN3_13229	SPU_013229	contains SMC_N domain	none
GLEAN3_13236	SPU_013236	none	none
GLEAN3_13252	SPU_013252	none	none
GLEAN3_13263	SPU_013263	none	none
GLEAN3_13267	SPU_013267	none	none
GLEAN3_13268	SPU_013268	none	none
GLEAN3_13270	SPU_013270	none	none
GLEAN3_13284	SPU_013284	none	none
GLEAN3_13285	SPU_013285	none	none
GLEAN3_13291	SPU_013291	none	none
GLEAN3_13293	SPU_013293	contains 2 CCP superfamily motifs and 2 HYR superfamily motifs	none
GLEAN3_13300	SPU_013300	contains Smc domain	none
GLEAN3_13306	SPU_013306	none	none
GLEAN3_13316	SPU_013316	none	none
GLEAN3_13318	SPU_013318	none	none
GLEAN3_13327	SPU_013327	contains 4 HYR superfamily motifs	none
GLEAN3_13359	SPU_013359	none	none
GLEAN3_13367	SPU_013367	contains COG5222 domain	none
GLEAN3_13368	SPU_013368	none	none
GLEAN3_13371	SPU_013371	none	none
GLEAN3_13374	SPU_013374	none	none
GLEAN3_13375	SPU_013375	none	none
GLEAN3_13390	SPU_013390	none	none
GLEAN3_13391	SPU_013391	none	none
GLEAN3_13395	SPU_013395	none	none
GLEAN3_13400	SPU_013400	none	none
GLEAN3_13411	SPU_013411	contains SMC_N domain	none
GLEAN3_13417	SPU_013417	contains 2 FA58C superfamily motifs	none
GLEAN3_13437	SPU_013437	contains flhF domain	none
GLEAN3_13444	SPU_013444	none	none
GLEAN3_13445	SPU_013445	none	none
GLEAN3_13446	SPU_013446	none	none
GLEAN3_13455	SPU_013455	none	none
GLEAN3_13456	SPU_013456	none	none
GLEAN3_13469	SPU_013469	none	none
GLEAN3_13475	SPU_013475	none	none
GLEAN3_13483	SPU_013483	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13484	SPU_013484	none	none
GLEAN3_13488	SPU_013488	contains Myosin_tail_1 domain	none
GLEAN3_13490	SPU_013490	none	none
GLEAN3_13498	SPU_013498	none	none
GLEAN3_13500	SPU_013500	contains Vps16_N domain	none
GLEAN3_13503	SPU_013503	none	none
GLEAN3_13506	SPU_013506	none	none
GLEAN3_13511	SPU_013511	none	none
GLEAN3_13512	SPU_013512	none	none
GLEAN3_13515	SPU_013515	none	none
GLEAN3_13516	SPU_013516	contains RAD18 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13523	SPU_013523	none	none
GLEAN3_13525	SPU_013525	none	none
GLEAN3_13543	SPU_013543	none	none
GLEAN3_13545	SPU_013545	contains 3 LDL_recept_b superfamily motifs	none
GLEAN3_13552	SPU_013552	none	none
GLEAN3_13554	SPU_013554	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13555	SPU_013555	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13556	SPU_013556	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13558	SPU_013558	poor sequence data: ~40% of amino acids are X. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13559	SPU_013559	contains FAT domain	none
GLEAN3_13581	SPU_013581	none	none
GLEAN3_13588	SPU_013588	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13594	SPU_013594	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13595	SPU_013595	none	none
GLEAN3_13599	SPU_013599	none	none
GLEAN3_13600	SPU_013600	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13610	SPU_013610	none	none
GLEAN3_13612	SPU_013612	none	none
GLEAN3_13623	SPU_013623	none	none
GLEAN3_13625	SPU_013625	none	none
GLEAN3_13628	SPU_013628	none	none
GLEAN3_13632	SPU_013632	none	none
GLEAN3_13633	SPU_013633	none	none
GLEAN3_13641	SPU_013641	none	none
GLEAN3_13645	SPU_013645	none	none
GLEAN3_13652	SPU_013652	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13656	SPU_013656	none	none
GLEAN3_13657	SPU_013657	none	none
GLEAN3_13659	SPU_013659	none	none
GLEAN3_13663	SPU_013663	none	none
GLEAN3_13665	SPU_013665	contains COG5635 domain	none
GLEAN3_13671	SPU_013671	none	none
GLEAN3_13673	SPU_013673	none	none
GLEAN3_13683	SPU_013683	none	none
GLEAN3_13684	SPU_013684	contains OpuAC domain	none
GLEAN3_13695	SPU_013695	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13696	SPU_013696	none	none
GLEAN3_13713	SPU_013713	none	none
GLEAN3_13723	SPU_013723	contains 2 CUB superfamily motifs	none
GLEAN3_13740	SPU_013740	none	none
GLEAN3_13746	SPU_013746	none	none
GLEAN3_13753	SPU_013753	probable assembly chimera	none
GLEAN3_13758	SPU_013758	contains 3 CCP superfamily motifs	none
GLEAN3_13759	SPU_013759	none	none
GLEAN3_13760	SPU_013760	none	none
GLEAN3_13761	SPU_013761	none	none
GLEAN3_13763	SPU_013763	none	none
GLEAN3_13767	SPU_013767	none	none
GLEAN3_13774	SPU_013774	none	none
GLEAN3_13777	SPU_013777	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13778	SPU_013778	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13780	SPU_013780	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13781	SPU_013781	none	none
GLEAN3_13782	SPU_013782	none	none
GLEAN3_13783	SPU_013783	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13789	SPU_013789	none	none
GLEAN3_13787	SPU_013787	none	none
GLEAN3_13790	SPU_013790	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13791	SPU_013791	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13795	SPU_013795	none	none
GLEAN3_13798	SPU_013798	none	none
GLEAN3_13801	SPU_013801	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13802	SPU_013802	none	none
GLEAN3_13809	SPU_013809	none	none
GLEAN3_13836	SPU_013836	none	none
GLEAN3_13845	SPU_013845	none	none
GLEAN3_13865	SPU_013865	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13866	SPU_013866	none	none
GLEAN3_13870	SPU_013870	probable assembly chimera	none
GLEAN3_13874	SPU_013874	none	none
GLEAN3_13877	SPU_013877	probable assembly chimera	none
GLEAN3_13879	SPU_013879	none	none
GLEAN3_13880	SPU_013880	none	none
GLEAN3_13881	SPU_013881	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13886	SPU_013886	none	none
GLEAN3_13896	SPU_013896	none	none
GLEAN3_13897	SPU_013897	none	none
GLEAN3_13898	SPU_013898	none	none
GLEAN3_13899	SPU_013899	contains PRK13042 domain	none
GLEAN3_13900	SPU_013900	none	none
GLEAN3_13901	SPU_013901	none	none
GLEAN3_13915	SPU_013915	none	none
GLEAN3_13918	SPU_013918	none	none
GLEAN3_13920	SPU_013920	none	none
GLEAN3_13921	SPU_013921	none	none
GLEAN3_13924	SPU_013924	none	none
GLEAN3_13945	SPU_013945	none	none
GLEAN3_13947	SPU_013947	none	none
GLEAN3_13956	SPU_013956	contains 5 Filamin superfamily motifs	none
GLEAN3_13968	SPU_013968	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_13969	SPU_013969	none	none
GLEAN3_13970	SPU_013970	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13972	SPU_013972	none	none
GLEAN3_13977	SPU_013977	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13981	SPU_013981	contains COG5635 domain	none
GLEAN3_13983	SPU_013983	contains 3 ANK superfamily motifs	none
GLEAN3_13985	SPU_013985	none	none
GLEAN3_13986	SPU_013986	contains Sultotransfer_1 domain	none
GLEAN3_13989	SPU_013989	none	none
GLEAN3_13991	SPU_013991	none	none
GLEAN3_13992	SPU_013992	none	none
GLEAN3_13993	SPU_013993	none	none
GLEAN3_13997	SPU_013997	none	none
GLEAN3_13998	SPU_013998	none	none
GLEAN3_14003	SPU_014003	none	none
GLEAN3_14006	SPU_014006	none	none
GLEAN3_14012	SPU_014012	none	none
GLEAN3_14015	SPU_014015	contains OmpH domain	none
GLEAN3_14022	SPU_014022	contains 3 ANK superfamily motifs	none
GLEAN3_14023	SPU_014023	none	none
GLEAN3_14029	SPU_014029	none	none
GLEAN3_14030	SPU_014030	contains 2 WSC superfamily motifs	none
GLEAN3_14040	SPU_014040	none	none
GLEAN3_14047	SPU_014047	none	none
GLEAN3_14050	SPU_014050	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14061	SPU_014061	none	none
GLEAN3_14065	SPU_014065	none	none
GLEAN3_14070	SPU_014070	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14077	SPU_014077	none	none
GLEAN3_14083	SPU_014083	none	none
GLEAN3_14086	SPU_014086	none	none
GLEAN3_14089	SPU_014089	none	none
GLEAN3_14098	SPU_014098	none	none
GLEAN3_14107	SPU_014107	none	none
GLEAN3_14115	SPU_014115	none	none
GLEAN3_14121	SPU_014121	none	none
GLEAN3_14124	SPU_014124	homologous to numerous putative proteins in Branchiostoma floridae and Nematostella vectensis	none
GLEAN3_14125	SPU_014125	none	none
GLEAN3_14129	SPU_014129	none	none
GLEAN3_14133	SPU_014133	none	none
GLEAN3_14138	SPU_014138	none	none
GLEAN3_14139	SPU_014139	none	none
GLEAN3_14144	SPU_014144	none	none
GLEAN3_14146	SPU_014146	none	none
GLEAN3_14147	SPU_014147	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14149	SPU_014149	probable assembly chimera	none
GLEAN3_14150	SPU_014150	contains 2 WSC superfamily motifs	none
GLEAN3_14155	SPU_014155	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14181	SPU_014181	none	none
GLEAN3_14183	SPU_014183	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_14188	SPU_014188	none	none
GLEAN3_14189	SPU_014189	none	none
GLEAN3_14192	SPU_014192	contains 2 FA58C superfamily motifs	none
GLEAN3_14198	SPU_014198	none	none
GLEAN3_14205	SPU_014205	none	none
GLEAN3_14208	SPU_014208	none	none
GLEAN3_14212	SPU_014212	none	none
GLEAN3_14225	SPU_014225	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14233	SPU_014233	none	none
GLEAN3_14259	SPU_014259	none	none
GLEAN3_14265	SPU_014265	none	none
GLEAN3_14267	SPU_014267	none	none
GLEAN3_14272	SPU_014272	none	none
GLEAN3_14275	SPU_014275	none	none
GLEAN3_14282	SPU_014282	none	none
GLEAN3_14297	SPU_014297	none	none
GLEAN3_14301	SPU_014301	none	none
GLEAN3_14302	SPU_014302	contains Sulfotransfer_1 domain	none
GLEAN3_14304	SPU_014304	contains Sulfotransfer_1 domain	none
GLEAN3_14309	SPU_014309	none	none
GLEAN3_14318	SPU_014318	none	none
GLEAN3_14327	SPU_014327	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14329	SPU_014329	none	none
GLEAN3_14340	SPU_014340	contains 2 CCP superfamily motifs	none
GLEAN3_14348	SPU_014348	none	none
GLEAN3_14349	SPU_014349	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14360	SPU_014360	none	none
GLEAN3_14364	SPU_014364	none	none
GLEAN3_14367	SPU_014367	none	none
GLEAN3_14370	SPU_014370	none	none
GLEAN3_14371	SPU_014371	none	none
GLEAN3_14376	SPU_014376	none	none
GLEAN3_14378	SPU_014378	homolgous to bacterial and plant proteins	none
GLEAN3_14379	SPU_014379	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14380	SPU_014380	none	none
GLEAN3_20142	SPU_020142	none	none
GLEAN3_23573	SPU_023573	none	none
GLEAN3_14384	SPU_014384	none	none
GLEAN3_14385	SPU_014385	none	none
GLEAN3_14386	SPU_014386	none	none
GLEAN3_14388	SPU_014388	contains 2 BTB superfamily motifs	none
GLEAN3_14390	SPU_014390	none	none
GLEAN3_14393	SPU_014393	none	none
GLEAN3_14394	SPU_014394	none	none
GLEAN3_14419	SPU_014419	none	none
GLEAN3_14422	SPU_014422	contains 9 EGF_CA superfamily motifs	none
GLEAN3_14424	SPU_014424	none	none
GLEAN3_14425	SPU_014425	none	none
GLEAN3_14429	SPU_014429	none	none
GLEAN3_14434	SPU_014434	none	none
GLEAN3_14436	SPU_014436	none	none
GLEAN3_14438	SPU_014438	none	none
GLEAN3_14443	SPU_014443	poor amino acid sequence (~40 % of amino acids are X)	none
GLEAN3_14447	SPU_014447	none	none
GLEAN3_14448	SPU_014448	none	none
GLEAN3_14449	SPU_014449	contains Aes domain	none
GLEAN3_14452	SPU_014452	none	none
GLEAN3_14453	SPU_014453	probable assembly chimera	none
GLEAN3_14456	SPU_014456	none	none
GLEAN3_14459	SPU_014459	contains ATS1 domain	none
GLEAN3_14464	SPU_014464	contains 2 TSP_1 superfamily motifs	none
GLEAN3_14465	SPU_014465	contains 2 TSP_1 superfamily motifs	none
GLEAN3_14470	SPU_014470	none	none
GLEAN3_14475	SPU_014475	none	none
GLEAN3_14476	SPU_014476	none	none
GLEAN3_14486	SPU_014486	none	none
GLEAN3_14487	SPU_014487	none	none
GLEAN3_14493	SPU_014493	none	none
GLEAN3_14494	SPU_014494	none	none
GLEAN3_14499	SPU_014499	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14500	SPU_014500	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14501	SPU_014501	none	none
GLEAN3_14510	SPU_014510	none	none
GLEAN3_14514	SPU_014514	none	none
GLEAN3_14518	SPU_014518	none	none
GLEAN3_14524	SPU_014524	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14533	SPU_014533	none	none
GLEAN3_14538	SPU_014538	none	none
GLEAN3_14542	SPU_014542	none	none
GLEAN3_14549	SPU_014549	none	none
GLEAN3_14550	SPU_014550	none	none
GLEAN3_14554	SPU_014554	none	none
GLEAN3_14555	SPU_014555	none	none
GLEAN3_14556	SPU_014556	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14559	SPU_014559	none	none
GLEAN3_14560	SPU_014560	none	none
GLEAN3_14581	SPU_014581	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14590	SPU_014590	none	none
GLEAN3_14592	SPU_014592	contains O-FucT domain	none
GLEAN3_14608	SPU_014608	none	none
GLEAN3_14616	SPU_014616	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14635	SPU_014635	contains EURL domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14638	SPU_014638	none	none
GLEAN3_24080	SPU_024080	none	none
GLEAN3_14640	SPU_014640	contains 2 NHL superfamily motifs	none
GLEAN3_14641	SPU_014641	none	none
GLEAN3_14644	SPU_014644	none	none
GLEAN3_14647	SPU_014647	none	none
GLEAN3_14649	SPU_014649	none	none
GLEAN3_14651	SPU_014651	contains MDN1 domain	none
GLEAN3_14652	SPU_014652	none	none
GLEAN3_14656	SPU_014656	none	none
GLEAN3_14682	SPU_014682	contains flgG domain	none
GLEAN3_14690	SPU_014690	none	none
GLEAN3_14692	SPU_014692	none	none
GLEAN3_14699	SPU_014699	contains 4 ANK superfamily motifs	none
GLEAN3_14700	SPU_014700	none	none
GLEAN3_14707	SPU_014707	none	none
GLEAN3_14709	SPU_014709	none	none
GLEAN3_14714	SPU_014714	none	none
GLEAN3_14717	SPU_014717	none	none
GLEAN3_14723	SPU_014723	none	none
GLEAN3_14727	SPU_014727	none	none
GLEAN3_14729	SPU_014729	none	none
GLEAN3_14737	SPU_014737	none	none
GLEAN3_14738	SPU_014738	none	none
GLEAN3_14741	SPU_014741	none	none
GLEAN3_14742	SPU_014742	none	none
GLEAN3_14743	SPU_014743	none	none
GLEAN3_14744	SPU_014744	none	none
GLEAN3_14745	SPU_014745	none	none
GLEAN3_14748	SPU_014748	none	none
GLEAN3_14757	SPU_014757	none	none
GLEAN3_14764	SPU_014764	none	none
GLEAN3_14774	SPU_014774	none	none
GLEAN3_14794	SPU_014794	contains PRK12496 domain and COG1439 domain	none
GLEAN3_14820	SPU_014820	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14834	SPU_014834	none	none
GLEAN3_14838	SPU_014838	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14847	SPU_014847	contains 2 Ldl_recept_b superfamily motifs	none
GLEAN3_14848	SPU_014848	none	none
GLEAN3_14849	SPU_014849	none	none
GLEAN3_14853	SPU_014853	none	none
GLEAN3_14861	SPU_014861	none	none
GLEAN3_14863	SPU_014863	none	none
GLEAN3_14877	SPU_014877	none	none
GLEAN3_14891	SPU_014891	none	none
GLEAN3_14895	SPU_014895	none	none
GLEAN3_14898	SPU_014898	none	none
GLEAN3_14902	SPU_014902	none	none
GLEAN3_14904	SPU_014904	contains 2 CCP superfamily motifs	none
GLEAN3_14907	SPU_014907	homolgous to bacterial and plant proteins	none
GLEAN3_14911	SPU_014911	none	none
GLEAN3_14912	SPU_014912	contains PnbA domain	none
GLEAN3_14919	SPU_014919	none	none
GLEAN3_14920	SPU_014920	contains 2 Mito_carr superfamily motifs and PTZ00169 domain	none
GLEAN3_14921	SPU_014921	none	none
GLEAN3_14924	SPU_014924	none	none
GLEAN3_14964	SPU_014964	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14971	SPU_014971	none	none
GLEAN3_14973	SPU_014973	probable assembly chimera	none
GLEAN3_14983	SPU_014983	contains 6 LDLa superfamily motifs	none
GLEAN3_14985	SPU_014985	none	none
GLEAN3_14998	SPU_014998	none	none
GLEAN3_15000	SPU_015000	contains S-methyl_trans domain	none
GLEAN3_15001	SPU_015001	contains 3 IG superfamily motifs and V-set domain	none
GLEAN3_15003	SPU_015003	none	none
GLEAN3_15022	SPU_015022	none	none
GLEAN3_15032	SPU_015032	none	none
GLEAN3_15034	SPU_015034	none	none
GLEAN3_15042	SPU_015042	none	none
GLEAN3_15043	SPU_015043	none	none
GLEAN3_15045	SPU_015045	contains RAD18 domain	none
GLEAN3_15063	SPU_015063	none	none
GLEAN3_15047	SPU_015047	none	none
GLEAN3_15057	SPU_015057	probable assembly chimera	none
GLEAN3_15046	SPU_015046	none	none
GLEAN3_15070	SPU_015070	contains EGF_Lam domain	none
GLEAN3_15072	SPU_015072	none	none
GLEAN3_15073	SPU_015073	none	none
GLEAN3_15074	SPU_015074	none	none
GLEAN3_15075	SPU_015075	none	none
GLEAN3_15076	SPU_015076	none	none
GLEAN3_15077	SPU_015077	none	none
GLEAN3_15078	SPU_015078	none	none
GLEAN3_15085	SPU_015085	none	none
GLEAN3_15098	SPU_015098	none	none
GLEAN3_15103	SPU_015103	none	none
GLEAN3_15109	SPU_015109	none	none
GLEAN3_15110	SPU_015110	none	none
GLEAN3_12906	SPU_012906	none	none
GLEAN3_15111	SPU_015111	none	none
GLEAN3_15112	SPU_015112	none	none
GLEAN3_15113	SPU_015113	none	none
GLEAN3_15119	SPU_015119	none	none
GLEAN3_15125	SPU_015125	none	none
GLEAN3_15129	SPU_015129	none	none
GLEAN3_15130	SPU_015130	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15131	SPU_015131	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15134	SPU_015134	none	none
GLEAN3_15135	SPU_015135	none	none
GLEAN3_15138	SPU_015138	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15141	SPU_015141	contains MPH1 domain	none
GLEAN3_15143	SPU_015143	none	none
GLEAN3_15148	SPU_015148	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15151	SPU_015151	contains 2 homeodomain superfamily motifs	none
GLEAN3_15159	SPU_015159	none	none
GLEAN3_15172	SPU_015172	none	none
GLEAN3_15174	SPU_015174	none	none
GLEAN3_15181	SPU_015181	none	none
GLEAN3_15182	SPU_015182	none	none
GLEAN3_15186	SPU_015186	none	none
GLEAN3_15189	SPU_015189	contains 3 IG superfamily motifs	none
GLEAN3_15204	SPU_015204	contains PRK05648 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15217	SPU_015217	none	none
GLEAN3_15218	SPU_015218	none	none
GLEAN3_15220	SPU_015220	contains 2 BTB superfamily motifs	none
GLEAN3_15225	SPU_015225	contains COG3264 domain	none
GLEAN3_15230	SPU_015230	none	none
GLEAN3_15233	SPU_015233	none	none
GLEAN3_15234	SPU_015234	none	none
GLEAN3_15239	SPU_015239	contains COG3391 domain	none
GLEAN3_15250	SPU_015250	none	none
GLEAN3_15251	SPU_015251	none	none
GLEAN3_15255	SPU_015255	none	none
GLEAN3_15258	SPU_015258	none	none
GLEAN3_15265	SPU_015265	none	none
GLEAN3_15268	SPU_015268	contains 2 MFS superfamily motifs	none
GLEAN3_15275	SPU_015275	contains FRQ1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15277	SPU_015277	none	none
GLEAN3_15278	SPU_015278	none	none
GLEAN3_15287	SPU_015287	none	none
GLEAN3_15300	SPU_015300	none	none
GLEAN3_15306	SPU_015306	contains DUF803 domain	none
GLEAN3_15307	SPU_015307	contains CytochromB561_N domain	none
GLEAN3_15332	SPU_015332	none	none
GLEAN3_15353	SPU_015353	none	none
GLEAN3_15363	SPU_015363	none	none
GLEAN3_15366	SPU_015366	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15367	SPU_015367	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15368	SPU_015368	none	none
GLEAN3_15373	SPU_015373	none	none
GLEAN3_15384	SPU_015384	none	none
GLEAN3_15393	SPU_015393	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15400	SPU_015400	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_15402	SPU_015402	none	none
GLEAN3_15409	SPU_015409	none	none
GLEAN3_15410	SPU_015410	contains 2 CUB superfamily motifs	none
GLEAN3_15413	SPU_015413	contains 2 EGF_CA superfamily motifs	none
GLEAN3_15417	SPU_015417	none	none
GLEAN3_15427	SPU_015427	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_15433	SPU_015433	contains TraB_pillus domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15457	SPU_015457	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15458	SPU_015458	contains 2A0113 domain	none
GLEAN3_15469	SPU_015469	none	none
GLEAN3_15470.1	SPU_015470.1	contains Ion_trans domain	none
GLEAN3_15471	SPU_015471	none	none
GLEAN3_15475	SPU_015475	none	none
GLEAN3_15493	SPU_015493	contains 2 EGF_A superfamily motifs	none
GLEAN3_15496	SPU_015496	homologous to numerous putative proteins in Branchiostoma floridae	none
GLEAN3_15500	SPU_015500	none	none
GLEAN3_15501	SPU_015501	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_15509	SPU_015509	none	none
GLEAN3_15510	SPU_015510	none	none
GLEAN3_15512	SPU_015512	none	none
GLEAN3_15518	SPU_015518	none	none
GLEAN3_15522	SPU_015522	none	none
GLEAN3_15523	SPU_015523	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_15537	SPU_015537	probable assembly chimera	none
GLEAN3_15538	SPU_015538	none	none
GLEAN3_15542	SPU_015542	contains RAB domain	none
GLEAN3_15546	SPU_015546	none	none
GLEAN3_15549	SPU_015549	none	none
GLEAN3_15552	SPU_015552	probable assembly chimera	none
GLEAN3_15554	SPU_015554	none	none
GLEAN3_15556	SPU_015556	contains Adaptin_N domain	none
GLEAN3_15557	SPU_015557	none	none
GLEAN3_15558	SPU_015558	none	none
GLEAN3_15585	SPU_015585	contains 4 IG superfamily motifs and V-set domain	none
GLEAN3_15594	SPU_015594	none	none
GLEAN3_15596	SPU_015596	contains 5 KAZAL_FS superfamily motifs	none
GLEAN3_15597	SPU_015597	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15608	SPU_015608	none	none
GLEAN3_15615	SPU_015615	none	none
GLEAN3_15617	SPU_015617	none	none
GLEAN3_15623	SPU_015623	none	none
GLEAN3_15626	SPU_015626	none	none
GLEAN3_15627	SPU_015627	none	none
GLEAN3_15630	SPU_015630	none	none
GLEAN3_15631	SPU_015631	none	none
GLEAN3_15632	SPU_015632	none	none
GLEAN3_15636	SPU_015636	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15637	SPU_015637	none	none
GLEAN3_15638	SPU_015638	none	none
GLEAN3_15639	SPU_015639	none	none
GLEAN3_15649	SPU_015649	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15656	SPU_015656	contains RhaT domain	none
GLEAN3_15659	SPU_015659	none	none
GLEAN3_15667	SPU_015667	none	none
GLEAN3_15684	SPU_015684	probable assembly chimera	none
GLEAN3_15693	SPU_015693	probable assembly chimera	none
GLEAN3_15695	SPU_015695	none	none
GLEAN3_15711	SPU_015711	contains 2A0113 domain	none
GLEAN3_15715	SPU_015715	none	none
GLEAN3_15717	SPU_015717	none	none
GLEAN3_15728	SPU_015728	none	none
GLEAN3_15730	SPU_015730	none	none
GLEAN3_15731	SPU_015731	none	none
GLEAN3_15750	SPU_015750	none	none
GLEAN3_15756	SPU_015756	none	none
GLEAN3_15757	SPU_015757	none	none
GLEAN3_15761	SPU_015761	none	none
GLEAN3_15762	SPU_015762	none	none
GLEAN3_15764	SPU_015764	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_15769	SPU_015769	none	none
GLEAN3_15773	SPU_015773	ArgS domain	none
GLEAN3_15779	SPU_015779	none	none
GLEAN3_15780	SPU_015780	none	none
GLEAN3_15783	SPU_015783	none	none
GLEAN3_15784	SPU_015784	probable assembly chimera	none
GLEAN3_15785	SPU_015785	contains SrmB domain	none
GLEAN3_15787	SPU_015787	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15791	SPU_015791	none	none
GLEAN3_15796	SPU_015796	contains RAD18 domain. homologous to numerous putative Branchiostoma floridae proteins.	none
GLEAN3_15816	SPU_015816	none	none
GLEAN3_15821	SPU_015821	none	none
GLEAN3_15822	SPU_015822	none	none
GLEAN3_15831	SPU_015831	none	none
GLEAN3_15847	SPU_015847	none	none
GLEAN3_15852	SPU_015852	none	none
GLEAN3_15855	SPU_015855	none	none
GLEAN3_15858	SPU_015858	none	none
GLEAN3_15864	SPU_015864	probable assembly chimera	none
GLEAN3_15874	SPU_015874	contains RVT-1 domain	none
GLEAN3_15876	SPU_015876	none	none
GLEAN3_15886	SPU_015886	none	none
GLEAN3_15890	SPU_015890	none	none
GLEAN3_15891	SPU_015891	none	none
GLEAN3_15892	SPU_015892	none	none
GLEAN3_15893	SPU_015893	none	none
GLEAN3_15895	SPU_015895	none	none
GLEAN3_15897	SPU_015897	contains 2 IG superfamily motifs and V-set domain	none
GLEAN3_15902	SPU_015902	none	none
GLEAN3_15903	SPU_015903	contains 3 IG superfamily motifs	none
GLEAN3_15910	SPU_015910	probable assembly chimera	none
GLEAN3_24827	SPU_024827	contains 2 Epimerase domains	none
GLEAN3_15912	SPU_015912	contains Epimerase domain	none
GLEAN3_15917	SPU_015917	none	none
GLEAN3_15927	SPU_015927	none	none
GLEAN3_15933	SPU_015933	probable assembly chimera	none
GLEAN3_15944	SPU_015944	probable assembly chimera	none
GLEAN3_15948	SPU_015948	none	none
GLEAN3_15960	SPU_015960	none	none
GLEAN3_15964	SPU_015964	none	none
GLEAN3_15968	SPU_015968	none	none
GLEAN3_15980	SPU_015980	none	none
GLEAN3_15985	SPU_015985	contains LcrH_SycD domain	none
GLEAN3_15993	SPU_015993	none	none
GLEAN3_16000	SPU_016000	contains AIR1 domain	none
GLEAN3_16002	SPU_016002	contains ABC_tran domain	none
GLEAN3_16004	SPU_016004	none	none
GLEAN3_16021	SPU_016021	none	none
GLEAN3_16022	SPU_016022	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_16025	SPU_016025	none	none
GLEAN3_16035	SPU_016035	none	none
GLEAN3_16040	SPU_016040	contains 4 PDZ superfamily motifs	none
GLEAN3_16051	SPU_016051	contains COG4886 domain	none
GLEAN3_16057	SPU_016057	none	none
GLEAN3_16064	SPU_016064	contains SMC_prok_B domain	none
GLEAN3_16071	SPU_016071	contains COG5635 domain. homologous to numerous putative Branchiostoma floridae proteins.	none
GLEAN3_16083	SPU_016083	none	none
GLEAN3_16084	SPU_016084	none	none
GLEAN3_16085	SPU_016085	none	none
GLEAN3_16086	SPU_016086	none	none
GLEAN3_16102	SPU_016102	none	none
GLEAN3_16106	SPU_016106	contains 3 ANK superfamily motifs	none
GLEAN3_16112	SPU_016112	contains UDPGT domain	none
GLEAN3_16120	SPU_016120	none	none
GLEAN3_16127	SPU_016127	contains LIC domain	none
GLEAN3_16133	SPU_016133	none	none
GLEAN3_16135	SPU_016135	none	none
GLEAN3_16138	SPU_016138	none	none
GLEAN3_16147	SPU_016147	none	none
GLEAN3_16148	SPU_016148	none	none
GLEAN3_16155	SPU_016155	contains sulP domain	none
GLEAN3_16160	SPU_016160	contains 4 IG superfamily motifs	none
GLEAN3_16166	SPU_016166	none	none
GLEAN3_16178	SPU_016178	none	none
GLEAN3_16180	SPU_016180	contains 2 Ldl_recept_b superfamily motifs	none
GLEAN3_16184	SPU_016184	contains MFS_1 domain	none
GLEAN3_16185	SPU_016185	contains 2 MFS superfamily motifs and 2A0119 domain	none
GLEAN3_16192	SPU_016192	contains 2A0113 domain	none
GLEAN3_16197	SPU_016197	contains 2A0113 domain	none
GLEAN3_16202	SPU_016202	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_16204	SPU_016204	none	none
GLEAN3_16206	SPU_016206	none	none
GLEAN3_16216	SPU_016216	none	none
GLEAN3_16220	SPU_016220	none	none
GLEAN3_16235	SPU_016235	contains EzrA domain	none
GLEAN3_16237	SPU_016237	none	none
GLEAN3_18828	SPU_018828	none	none
GLEAN3_16242	SPU_016242	contains 2 PSI superfamily motifs	none
GLEAN3_16245	SPU_016245	contains Dcp domain	none
GLEAN3_16246	SPU_016246	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16248	SPU_016248	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16255	SPU_016255	none	none
GLEAN3_16260	SPU_016260	none	none
GLEAN3_16265	SPU_016265	none	none
GLEAN3_16266	SPU_016266	none	none
GLEAN3_16268	SPU_016268	none	none
GLEAN3_16282	SPU_016282	none	none
GLEAN3_16288	SPU_016288	none	none
GLEAN3_16290	SPU_016290	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16301	SPU_016301	none	none
GLEAN3_16309	SPU_016309	none	none
GLEAN3_16320	SPU_016320	none	none
GLEAN3_16325	SPU_016325	none	none
GLEAN3_25303	SPU_025303	none	none
GLEAN3_16334	SPU_016334	none	none
GLEAN3_16336	SPU_016336	contains PtrB domain	none
GLEAN3_16337	SPU_016337	none	none
GLEAN3_16348	SPU_016348	contains 2 BTB superfamily motifs	none
GLEAN3_16353	SPU_016353	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16355	SPU_016355	contains COG5141 domain	none
GLEAN3_16365	SPU_016365	none	none
GLEAN3_16366	SPU_016366	none	none
GLEAN3_16368	SPU_016368	none	none
GLEAN3_16378	SPU_016378	none	none
GLEAN3_16381	SPU_016381	none	none
GLEAN3_16384	SPU_016384	none	none
GLEAN3_16389	SPU_016389	none	none
GLEAN3_16391	SPU_016391	none	none
GLEAN3_16393	SPU_016393	none	none
GLEAN3_16395	SPU_016395	none	none
GLEAN3_16396	SPU_016396	none	none
GLEAN3_16397	SPU_016397	none	none
GLEAN3_16400	SPU_016400	none	none
GLEAN3_16405	SPU_016405	contains 2 CUB superfamily motifs	none
GLEAN3_16416	SPU_016416	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16419	SPU_016419	none	none
GLEAN3_16422	SPU_016422	none	none
GLEAN3_16428	SPU_016428	none	none
GLEAN3_16434	SPU_016434	none	none
GLEAN3_16436	SPU_016436	none	none
GLEAN3_16440	SPU_016440	contains 2 EGF_CA superfamily motifs	none
GLEAN3_16451	SPU_016451	none	none
GLEAN3_16453	SPU_016453	none	none
GLEAN3_16454	SPU_016454	contains 2 ANK superfamily motifs	none
GLEAN3_16465	SPU_016465	none	none
GLEAN3_16474	SPU_016474	contains SMC_prok_A domain	none
GLEAN3_16480	SPU_016480	none	none
GLEAN3_16481	SPU_016481	none	none
GLEAN3_16513	SPU_016513	none	none
GLEAN3_16516	SPU_016516	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16535	SPU_016535	contains 2 FA58C superfamily motifs	none
GLEAN3_16541	SPU_016541	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16543	SPU_016543	contains 2A0119 domain	none
GLEAN3_16557	SPU_016557	probable assembly chimera	none
GLEAN3_16558	SPU_016558	none	none
GLEAN3_16560	SPU_016560	none	none
GLEAN3_16566	SPU_016566	contains 2A0113 domain	none
GLEAN3_16571	SPU_016571	none	none
GLEAN3_16576	SPU_016576	none	none
GLEAN3_16577	SPU_016577	none	none
GLEAN3_16583	SPU_016583	none	none
GLEAN3_16584	SPU_016584	none	none
GLEAN3_16597	SPU_016597	none	none
GLEAN3_16622	SPU_016622	none	none
GLEAN3_16626	SPU_016626	none	none
GLEAN3_16638	SPU_016638	none	none
GLEAN3_16640	SPU_016640	none	none
GLEAN3_16666	SPU_016666	none	none
GLEAN3_16677	SPU_016677	none	none
GLEAN3_16690	SPU_016690	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16692	SPU_016692	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16696	SPU_016696	none	none
GLEAN3_16704	SPU_016704	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16710	SPU_016710	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16712	SPU_016712	none	none
GLEAN3_16717	SPU_016717	none	none
GLEAN3_16719	SPU_016719	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16721	SPU_016721	contains 2 MFS superfamily motifs	none
GLEAN3_16722	SPU_016722	none	none
GLEAN3_16724	SPU_016724	contains 2A0113 domain	none
GLEAN3_16727	SPU_016727	none	none
GLEAN3_16737	SPU_016737	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16750	SPU_016750	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_16751	SPU_016751	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16753	SPU_016753	none	none
GLEAN3_16756	SPU_016756	contains PTZ00322 domain	none
GLEAN3_16760	SPU_016760	none	none
GLEAN3_16765	SPU_016765	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16768	SPU_016768	none	none
GLEAN3_16774	SPU_016774	none	none
GLEAN3_16776	SPU_016776	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16778	SPU_016778	contains 3 ANK superfamily motifs	none
GLEAN3_16780	SPU_016780	none	none
GLEAN3_16781	SPU_016781	contains S_TKc domain	none
GLEAN3_16782	SPU_016782	contains PcnB domain	none
GLEAN3_16788	SPU_016788	none	none
GLEAN3_16797	SPU_016797	contains TIGR00376 domain	none
GLEAN3_16800	SPU_016800	none	none
GLEAN3_16804	SPU_016804	none	none
GLEAN3_16811	SPU_016811	none	none
GLEAN3_16813	SPU_016813	none	none
GLEAN3_16814	SPU_016814	none	none
GLEAN3_16822	SPU_016822	none	none
GLEAN3_16828	SPU_016828	none	none
GLEAN3_16832	SPU_016832	none	none
GLEAN3_16833	SPU_016833	none	none
GLEAN3_16834	SPU_016834	none	none
GLEAN3_16847	SPU_016847	none	none
GLEAN3_16848	SPU_016848	none	none
GLEAN3_16855	SPU_016855	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16856	SPU_016856	none	none
GLEAN3_16857	SPU_016857	none	none
GLEAN3_16858	SPU_016858	none	none
GLEAN3_16868	SPU_016868	none	none
GLEAN3_16869	SPU_016869	none	none
GLEAN3_16870	SPU_016870	none	none
GLEAN3_16871	SPU_016871	contains SMC_prok_A domain	none
GLEAN3_16877	SPU_016877	contains 2 WW superfamily motifs	none
GLEAN3_16889	SPU_016889	contains 3 IG superfamily motifs	none
GLEAN3_16891	SPU_016891	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16893	SPU_016893	contains DUF607 domain	none
GLEAN3_16894	SPU_016894	none	none
GLEAN3_16909	SPU_016909	none	none
GLEAN3_16911	SPU_016911	contains 2A0113 domain	none
GLEAN3_16915	SPU_016915	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16920	SPU_016920	none	none
GLEAN3_16923	SPU_016923	Strongylocentrotus purpuratus-specific protein. matches only to itself. probable assembly chimera.	none
GLEAN3_16931	SPU_016931	contains COG5540 domain	none
GLEAN3_16933	SPU_016933	none	none
GLEAN3_16939	SPU_016939	none	none
GLEAN3_16940	SPU_016940	none	none
GLEAN3_16941	SPU_016941	none	none
GLEAN3_16942	SPU_016942	none	none
GLEAN3_16950	SPU_016950	none	none
GLEAN3_16955	SPU_016955	none	none
GLEAN3_16959	SPU_016959	none	none
GLEAN3_16960	SPU_016960	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16962	SPU_016962	none	none
GLEAN3_16966	SPU_016966	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16967	SPU_016967	none	none
GLEAN3_16969	SPU_016969	none	none
GLEAN3_16978	SPU_016978	none	none
GLEAN3_16979	SPU_016979	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16992	SPU_016992	none	none
GLEAN3_16994	SPU_016994	none	none
GLEAN3_17001	SPU_017001	contains PRK00409 domain	none
GLEAN3_17005	SPU_017005	none	none
GLEAN3_17008	SPU_017008	none	none
GLEAN3_17009	SPU_017009	none	none
GLEAN3_17012	SPU_017012	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17023	SPU_017023	contains 2 IG superfamily motifs	none
GLEAN3_17025	SPU_017025	none	none
GLEAN3_17027	SPU_017027	contains Gag_spuma domain	none
GLEAN3_17035	SPU_017035	none	none
GLEAN3_17045	SPU_017045	none	none
GLEAN3_17057	SPU_017057	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17060	SPU_017060	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17065	SPU_017065	none	none
GLEAN3_17068	SPU_017068	none	none
GLEAN3_17075	SPU_017075	none	none
GLEAN3_17076	SPU_017076	none	none
GLEAN3_17077	SPU_017077	none	none
GLEAN3_17079	SPU_017079	none	none
GLEAN3_17080	SPU_017080	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17083	SPU_017083	none	none
GLEAN3_17084	SPU_017084	none	none
GLEAN3_17100	SPU_017100	contains PEP_TPR_lipo domain	none
GLEAN3_17101	SPU_017101	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_17102	SPU_017102	none	none
GLEAN3_17103	SPU_017103	none	none
GLEAN3_17105	SPU_017105	contains 2 Na_Pi_cotrans superfamily motifs and 2a58 domain	none
GLEAN3_17111	SPU_017111	none	none
GLEAN3_17128	SPU_017128	none	none
GLEAN3_17151	SPU_017151	contains 2A1904 domain	none
GLEAN3_17157	SPU_017157	none	none
GLEAN3_17165	SPU_017165	none	none
GLEAN3_17176	SPU_017176	none	none
GLEAN3_17177	SPU_017177	none	none
GLEAN3_17182	SPU_017182	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_17184	SPU_017184	none	none
GLEAN3_17188	SPU_017188	none	none
GLEAN3_17190	SPU_017190	contains SMC_prok_B domain	none
GLEAN3_17191	SPU_017191	probable assembly chimera	none
GLEAN3_17195	SPU_017195	none	none
GLEAN3_17199	SPU_017199	homologous to numerous putative zebrafish proteins	none
GLEAN3_17203	SPU_017203	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17206	SPU_017206	none	none
GLEAN3_17229	SPU_017229	none	none
GLEAN3_17230	SPU_017230	none	none
GLEAN3_17231	SPU_017231	none	none
GLEAN3_17240	SPU_017240	none	none
GLEAN3_17241	SPU_017241	none	none
GLEAN3_17248	SPU_017248	contains PRK05771 domain	none
GLEAN3_17251	SPU_017251	none	none
GLEAN3_17258	SPU_017258	none	none
GLEAN3_17266	SPU_017266	none	none
GLEAN3_17268	SPU_017268	none	none
GLEAN3_17269	SPU_017269	none	none
GLEAN3_17270	SPU_017270	contains SMC_prok_B domain	none
GLEAN3_17277	SPU_017277	none	none
GLEAN3_17278	SPU_017278	none	none
GLEAN3_17288	SPU_017288	none	none
GLEAN3_17290	SPU_017290	contains Sulfotransfer_1 domain	none
GLEAN3_17291	SPU_017291	none	none
GLEAN3_17292	SPU_017292	none	none
GLEAN3_17298	SPU_017298	none	none
GLEAN3_17300	SPU_017300	none	none
GLEAN3_17301	SPU_017301	contains COG4886 domain	none
GLEAN3_17302	SPU_017302	none	none
GLEAN3_17317	SPU_017317	contains 2 MFS superfamily motifs	none
GLEAN3_17307	SPU_017307	none	none
GLEAN3_17322	SPU_017322	none	none
GLEAN3_17324	SPU_017324	contains 2 CCP superfamily motifs	none
GLEAN3_17325	SPU_017325	none	none
GLEAN3_17327	SPU_017327	none	none
GLEAN3_17330	SPU_017330	none	none
GLEAN3_17337	SPU_017337	contains 2A0113 domain	none
GLEAN3_17342	SPU_017342	none	none
GLEAN3_17349	SPU_017349	none	none
GLEAN3_17355	SPU_017355	none	none
GLEAN3_17362	SPU_017362	contains 2 CUB superfamily motifs	none
GLEAN3_17367	SPU_017367	none	none
GLEAN3_17371	SPU_017371	none	none
GLEAN3_17383	SPU_017383	none	none
GLEAN3_17386	SPU_017386	none	none
GLEAN3_17389	SPU_017389	none	none
GLEAN3_17397	SPU_017397	contains 2 RF-1 superfamily motifs	none
GLEAN3_17405	SPU_017405	none	none
GLEAN3_17409	SPU_017409	contains 2 MFS superfamily motifs	none
GLEAN3_17410	SPU_017410	none	none
GLEAN3_17433	SPU_017433	none	none
GLEAN3_17443	SPU_017443	none	none
GLEAN3_17444	SPU_017444	none	none
GLEAN3_17446	SPU_017446	none	none
GLEAN3_17448	SPU_017448	none	none
GLEAN3_17454	SPU_017454	contains PRK10263 domain	none
GLEAN3_17458	SPU_017458	contains NmrA domain	none
GLEAN3_17459	SPU_017459	contains ERG8 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17461	SPU_017461	none	none
GLEAN3_17464	SPU_017464	contains NmrA domain	none
GLEAN3_17468	SPU_017468	none	none
GLEAN3_17471	SPU_017471	none	none
GLEAN3_17473	SPU_017473	none	none
GLEAN3_17486	SPU_017486	probable assembly chimera	none
GLEAN3_17489	SPU_017489	probable assembly chimera	none
GLEAN3_17494	SPU_017494	contains SMC_prok_B domain	none
GLEAN3_17495	SPU_017495	none	none
GLEAN3_17500	SPU_017500	none	none
GLEAN3_17507	SPU_017507	none	none
GLEAN3_17513	SPU_017513	none	none
GLEAN3_17516	SPU_017516	contains ABC_tran domain	none
GLEAN3_17517	SPU_017517	none	none
GLEAN3_17528	SPU_017528	none	none
GLEAN3_17544	SPU_017544	none	none
GLEAN3_17545	SPU_017545	none	none
GLEAN3_17546	SPU_017546	none	none
GLEAN3_17549	SPU_017549	none	none
GLEAN3_17552	SPU_017552	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17554	SPU_017554	none	none
GLEAN3_17568	SPU_017568	none	none
GLEAN3_17570	SPU_017570	contains PRK05035 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17583	SPU_017583	none	none
GLEAN3_17587	SPU_017587	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17589	SPU_017589	none	none
GLEAN3_17590	SPU_017590	none	none
GLEAN3_17598	SPU_017598	contains UDPGT domain	none
GLEAN3_17603	SPU_017603	none	none
GLEAN3_17607	SPU_017607	none	none
GLEAN3_17621	SPU_017621	none	none
GLEAN3_17624	SPU_017624	none	none
GLEAN3_17625	SPU_017625	contains 2A1904 domain	none
GLEAN3_17631	SPU_017631	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17633	SPU_017633	none	none
GLEAN3_17637	SPU_017637	none	none
GLEAN3_17640	SPU_017640	none	none
GLEAN3_17650	SPU_017650	none	none
GLEAN3_17651	SPU_017651	none	none
GLEAN3_17655	SPU_017655	none	none
GLEAN3_17665	SPU_017665	none	none
GLEAN3_17666	SPU_017666	none	none
GLEAN3_17673	SPU_017673	none	none
GLEAN3_17679	SPU_017679	none	none
GLEAN3_17687	SPU_017687	none	none
GLEAN3_17688	SPU_017688	none	none
GLEAN3_17689	SPU_017689	none	none
GLEAN3_17691	SPU_017691	contains Fucokinase domain	none
GLEAN3_17692	SPU_017692	none	none
GLEAN3_17693	SPU_017693	none	none
GLEAN3_17697	SPU_017697	probable assembly chimera	none
GLEAN3_17698	SPU_017698	none	none
GLEAN3_17699	SPU_017699	contains SMC_N domain	none
GLEAN3_17741	SPU_017741	none	none
GLEAN3_17743	SPU_017743	probable assembly chimera	none
GLEAN3_17744	SPU_017744	contains 2A0113 domain	none
GLEAN3_17755	SPU_017755	none	none
GLEAN3_17763	SPU_017763	none	none
GLEAN3_17764	SPU_017764	none	none
GLEAN3_17770	SPU_017770	contains AcAcCoA_reduct domain. poor sequence data: ~25% of amino acids are X.	none
GLEAN3_17774	SPU_017774	none	none
GLEAN3_17782	SPU_017782	none	none
GLEAN3_10677	SPU_010677	none	none
GLEAN3_17784	SPU_017784	none	none
GLEAN3_17789	SPU_017789	none	none
GLEAN3_17793	SPU_017793	none	none
GLEAN3_17812	SPU_017812	none	none
GLEAN3_17813	SPU_017813	none	none
GLEAN3_17819	SPU_017819	none	none
GLEAN3_17824	SPU_017824	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_17835	SPU_017835	poor sequence data: ~45% of amino acids are X.	none
GLEAN3_17840	SPU_017840	none	none
GLEAN3_17851	SPU_017851	none	none
GLEAN3_17856	SPU_017856	none	none
GLEAN3_17862	SPU_017862	none	none
GLEAN3_17863	SPU_017863	none	none
GLEAN3_17864	SPU_017864	none	none
GLEAN3_17865	SPU_017865	none	none
GLEAN3_17866	SPU_017866	none	none
GLEAN3_17867	SPU_017867	none	none
GLEAN3_17869	SPU_017869	none	none
GLEAN3_17885	SPU_017885	none	none
GLEAN3_17891	SPU_017891	none	none
GLEAN3_17898	SPU_017898	none	none
GLEAN3_17905	SPU_017905	none	none
GLEAN3_17907	SPU_017907	none	none
GLEAN3_17908	SPU_017908	none	none
GLEAN3_17909	SPU_017909	contains NagC domain	none
GLEAN3_17912	SPU_017912	none	none
GLEAN3_17919	SPU_017919	none	none
GLEAN3_17925	SPU_017925	none	none
GLEAN3_17941	SPU_017941	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17951	SPU_017951	none	none
GLEAN3_17957	SPU_017957	none	none
GLEAN3_17958	SPU_017958	none	none
GLEAN3_17967	SPU_017967	contains MFS_1 domain	none
GLEAN3_17968	SPU_017968	none	none
GLEAN3_17994	SPU_017994	none	none
GLEAN3_17999	SPU_017999	contains PRK02287 domain	none
GLEAN3_18001	SPU_018001	none	none
GLEAN3_18002	SPU_018002	none	none
GLEAN3_18006	SPU_018006	none	none
GLEAN3_18008	SPU_018008	contains COG5152 domain	none
GLEAN3_18009	SPU_018009	none	none
GLEAN3_18018	SPU_018018	none	none
GLEAN3_18031	SPU_018031	none	none
GLEAN3_18032	SPU_018032	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18033	SPU_018033	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18037	SPU_018037	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18038	SPU_018038	none	none
GLEAN3_18044	SPU_018044	none	none
GLEAN3_18054	SPU_018054	contains 2 Calx-beta superfamily motifs	none
GLEAN3_18063	SPU_018063	none	none
GLEAN3_18064	SPU_018064	none	none
GLEAN3_18069	SPU_018069	none	none
GLEAN3_18072	SPU_018072	none	none
GLEAN3_18076	SPU_018076	none	none
GLEAN3_18086	SPU_018086	contains IF-2B domain	none
GLEAN3_18090	SPU_018090	none	none
GLEAN3_18093	SPU_018093	none	none
GLEAN3_18094	SPU_018094	none	none
GLEAN3_18096	SPU_018096	none	none
GLEAN3_18104	SPU_018104	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_18105	SPU_018105	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_18119	SPU_018119	none	none
GLEAN3_18108	SPU_018108	probable assembly chimera	none
GLEAN3_18117	SPU_018117	none	none
GLEAN3_18120	SPU_018120	none	none
GLEAN3_18121	SPU_018121	none	none
GLEAN3_18122	SPU_018122	probable assembly chimera	none
GLEAN3_18124	SPU_018124	probable assembly chimera	none
GLEAN3_18128	SPU_018128	none	none
GLEAN3_18143	SPU_018143	none	none
GLEAN3_18144	SPU_018144	none	none
GLEAN3_18145	SPU_018145	contains PRK03992 domain	none
GLEAN3_18147	SPU_018147	none	none
GLEAN3_18151	SPU_018151	none	none
GLEAN3_18158	SPU_018158	none	none
GLEAN3_18160	SPU_018160	contains 6 CCP superfamily motifs	none
GLEAN3_18165	SPU_018165	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18169	SPU_018169	contains PRK05431 domain. probable assembly chimera.	none
GLEAN3_18174	SPU_018174	none	none
GLEAN3_18176	SPU_018176	contains SMC_prok_B domain	none
GLEAN3_18178	SPU_018178	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18186	SPU_018186	contains COG3217 domain	none
GLEAN3_18190	SPU_018190	none	none
GLEAN3_18200	SPU_018200	none	none
GLEAN3_18203	SPU_018203	none	none
GLEAN3_18206	SPU_018206	none	none
GLEAN3_18217	SPU_018217	none	none
GLEAN3_18222	SPU_018222	none	none
GLEAN3_18224	SPU_018224	contains DadA domain	none
GLEAN3_18227	SPU_018227	none	none
GLEAN3_18232	SPU_018232	none	none
GLEAN3_18233	SPU_018233	contains 2 TPR superfamily motifs	none
GLEAN3_18235	SPU_018235	contains 2 CUB superfamily motifs	none
GLEAN3_18241	SPU_018241	contains 2 MFS superfamily motifs	none
GLEAN3_18253	SPU_018253	none	none
GLEAN3_18260	SPU_018260	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18263	SPU_018263	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18266	SPU_018266	none	none
GLEAN3_18268	SPU_018268	none	none
GLEAN3_18284	SPU_018284	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18298	SPU_018298	none	none
GLEAN3_18308	SPU_018308	contains mTERF domain	none
GLEAN3_18325	SPU_018325	none	none
GLEAN3_18327	SPU_018327	contains RhaT domain	none
GLEAN3_18328	SPU_018328	none	none
GLEAN3_18332	SPU_018332	none	none
GLEAN3_18337	SPU_018337	contains SMC_prok_B domain	none
GLEAN3_18339	SPU_018339	none	none
GLEAN3_18340	SPU_018340	none	none
GLEAN3_18350	SPU_018350	contains COG5028 domain	none
GLEAN3_18357	SPU_018357	none	none
GLEAN3_18359	SPU_018359	none	none
GLEAN3_18361	SPU_018361	none	none
GLEAN3_18363	SPU_018363	none	none
GLEAN3_18365	SPU_018365	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18383	SPU_018383	contains LdhA domain	none
GLEAN3_18386	SPU_018386	contains mutS2 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18389	SPU_018389	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18399	SPU_018399	none	none
GLEAN3_18402	SPU_018402	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18405	SPU_018405	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18416	SPU_018416	none	none
GLEAN3_18418	SPU_018418	none	none
GLEAN3_18422	SPU_018422	probable assembly chimera	none
GLEAN3_18449	SPU_018449	none	none
GLEAN3_18451	SPU_018451	none	none
GLEAN3_18459	SPU_018459	contains rne domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18460	SPU_018460	none	none
GLEAN3_18463	SPU_018463	contains COG4886 domain	none
GLEAN3_18469	SPU_018469	none	none
GLEAN3_18486	SPU_018486	contains Sulfotransfer_1 domain	none
GLEAN3_18477	SPU_018477	contains 4 EGF_CA superfamily motifs	none
GLEAN3_18487	SPU_018487	contains Sulfotransfer_1 domain	none
GLEAN3_18494	SPU_018494	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18516	SPU_018516	none	none
GLEAN3_18525	SPU_018525	none	none
GLEAN3_18541	SPU_018541	none	none
GLEAN3_18544	SPU_018544	none	none
GLEAN3_18547	SPU_018547	none	none
GLEAN3_18551	SPU_018551	none	none
GLEAN3_18557	SPU_018557	none	none
GLEAN3_18561	SPU_018561	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18562	SPU_018562	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18574	SPU_018574	none	none
GLEAN3_18581	SPU_018581	none	none
GLEAN3_18584	SPU_018584	none	none
GLEAN3_18585	SPU_018585	none	none
GLEAN3_18590	SPU_018590	none	none
GLEAN3_18593	SPU_018593	contains sbcB domain	none
GLEAN3_18595	SPU_018595	none	none
GLEAN3_18599	SPU_018599	none	none
GLEAN3_18600	SPU_018600	none	none
GLEAN3_18603	SPU_018603	none	none
GLEAN3_18604	SPU_018604	none	none
GLEAN3_18622	SPU_018622	probable assembly chimera	none
GLEAN3_18625	SPU_018625	none	none
GLEAN3_18628	SPU_018628	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18635	SPU_018635	none	none
GLEAN3_18636	SPU_018636	none	none
GLEAN3_18641	SPU_018641	none	none
GLEAN3_18642	SPU_018642	none	none
GLEAN3_18646	SPU_018646	contains PABP-1234 domain	none
GLEAN3_18656	SPU_018656	none	none
GLEAN3_18660	SPU_018660	none	none
GLEAN3_18661	SPU_018661	none	none
GLEAN3_18677	SPU_018677	none	none
GLEAN3_18694	SPU_018694	none	none
GLEAN3_18698	SPU_018698	contains 3 EGF_CA superfamily motifs	none
GLEAN3_18706	SPU_018706	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18707	SPU_018707	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_18714	SPU_018714	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18734	SPU_018734	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_18738	SPU_018738	none	none
GLEAN3_18739	SPU_018739	none	none
GLEAN3_18740	SPU_018740	none	none
GLEAN3_18741	SPU_018741	none	none
GLEAN3_18750	SPU_018750	none	none
GLEAN3_18756	SPU_018756	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18762	SPU_018762	contains 2 MFS superfamily motifs and 2A0115 domain	none
GLEAN3_18785	SPU_018785	contains RhaT domain	none
GLEAN3_18763	SPU_018763	none	none
GLEAN3_18771	SPU_018771	none	none
GLEAN3_18788	SPU_018788	contains COG1565 domain	none
GLEAN3_18798	SPU_018798	contains 2 EGF_CA superfamily motifs	none
GLEAN3_18799	SPU_018799	none	none
GLEAN3_18800	SPU_018800	none	none
GLEAN3_18808	SPU_018808	contains SpoIIIAA domain	none
GLEAN3_18815	SPU_018815	none	none
GLEAN3_18818	SPU_018818	none	none
GLEAN3_18820	SPU_018820	none	none
GLEAN3_18827	SPU_018827	contains 2 LamG superfamily motifs	none
GLEAN3_18831	SPU_018831	none	none
GLEAN3_18832	SPU_018832	none	none
GLEAN3_18834	SPU_018834	none	none
GLEAN3_18836	SPU_018836	none	none
GLEAN3_18840	SPU_018840	none	none
GLEAN3_18847	SPU_018847	none	none
GLEAN3_18849	SPU_018849	none	none
GLEAN3_06534	SPU_006534	contains 7 HYR superfamily motifs. no good homologs are found in other species. member of highly S. purpuratus-specific protein group.	none
GLEAN3_18863	SPU_018863	contains 4 HYR superfamily motifs	none
GLEAN3_18864	SPU_018864	none	none
GLEAN3_18868	SPU_018868	none	none
GLEAN3_18871	SPU_018871	none	none
GLEAN3_18874	SPU_018874	contains Sulfotransfer_1 domain	none
GLEAN3_18876	SPU_018876	contains Sulfotransfer_1 domain	none
GLEAN3_18878	SPU_018878	none	none
GLEAN3_18879	SPU_018879	none	none
GLEAN3_18901	SPU_018901	none	none
GLEAN3_18905	SPU_018905	none	none
GLEAN3_18911	SPU_018911	none	none
GLEAN3_18914	SPU_018914	none	none
GLEAN3_18919	SPU_018919	contains 2A1904 domain	none
GLEAN3_18937	SPU_018937	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18945	SPU_018945	none	none
GLEAN3_18946	SPU_018946	none	none
GLEAN3_18947	SPU_018947	none	none
GLEAN3_18953	SPU_018953	none	none
GLEAN3_18959	SPU_018959	none	none
GLEAN3_18963	SPU_018963	none	none
GLEAN3_18966	SPU_018966	none	none
GLEAN3_18967	SPU_018967	none	none
GLEAN3_18970	SPU_018970	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18971	SPU_018971	none	none
GLEAN3_18972	SPU_018972	none	none
GLEAN3_18986	SPU_018986	contains PRK11281 domain	none
GLEAN3_18988	SPU_018988	none	none
GLEAN3_18989	SPU_018989	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18996	SPU_018996	none	none
GLEAN3_19000	SPU_019000	contains DYN1 domain	none
GLEAN3_19013	SPU_019013	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19014	SPU_019014	none	none
GLEAN3_19029	SPU_019029	none	none
GLEAN3_19044	SPU_019044	none	none
GLEAN3_19047	SPU_019047	contains soxA_mon domain	none
GLEAN3_19048	SPU_019048	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19057	SPU_019057	contains NmrA domain	none
GLEAN3_19061	SPU_019061	none	none
GLEAN3_19062	SPU_019062	none	none
GLEAN3_19065	SPU_019065	none	none
GLEAN3_19069	SPU_019069	poor sequence data: ~65% of amino acids are X.	none
GLEAN3_19076	SPU_019076	none	none
GLEAN3_19080	SPU_019080	contains Ins145_P3_rec domain and MIR domain	none
GLEAN3_19084	SPU_019084	none	none
GLEAN3_19106	SPU_019106	none	none
GLEAN3_19108	SPU_019108	none	none
GLEAN3_19110	SPU_019110	probable assembly chimera	none
GLEAN3_19114	SPU_019114	none	none
GLEAN3_19115	SPU_019115	none	none
GLEAN3_19120	SPU_019120	none	none
GLEAN3_19121	SPU_019121	contains 4 EGF_CA superfamily motifs	none
GLEAN3_19131	SPU_019131	none	none
GLEAN3_19146	SPU_019146	none	none
GLEAN3_19160	SPU_019160	none	none
GLEAN3_19162	SPU_019162	none	none
GLEAN3_19167	SPU_019167	Glyco_hydro_59 domain	none
GLEAN3_19168	SPU_019168	contains CLH domain	none
GLEAN3_19169	SPU_019169	none	none
GLEAN3_19171	SPU_019171	none	none
GLEAN3_19178	SPU_019178	none	none
GLEAN3_19183	SPU_019183	contains 2A1904 domain	none
GLEAN3_19185	SPU_019185	none	none
GLEAN3_19186	SPU_019186	none	none
GLEAN3_19187	SPU_019187	none	none
GLEAN3_19188	SPU_019188	none	none
GLEAN3_19193	SPU_019193	contains 2 MAM superfamily motifs	none
GLEAN3_19197	SPU_019197	none	none
GLEAN3_19217	SPU_019217	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19220	SPU_019220	none	none
GLEAN3_19221	SPU_019221	none	none
GLEAN3_19233	SPU_019233	none	none
GLEAN3_19235	SPU_019235	none	none
GLEAN3_19236	SPU_019236	matches only to S. purpuratus proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19245	SPU_019245	none	none
GLEAN3_19248	SPU_019248	none	none
GLEAN3_19249	SPU_019249	none	none
GLEAN3_19254	SPU_019254	contains 5 S1-like superfamily motifs	none
GLEAN3_19267	SPU_019267	none	none
GLEAN3_19270	SPU_019270	contains trp domain	none
GLEAN3_19279	SPU_019279	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_19281	SPU_019281	none	none
GLEAN3_19282	SPU_019282	none	none
GLEAN3_19289	SPU_019289	contains SMC_prok_B domain	none
GLEAN3_19299	SPU_019299	none	none
GLEAN3_19301	SPU_019301	none	none
GLEAN3_19316	SPU_019316	none	none
GLEAN3_19319	SPU_019319	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_19320	SPU_019320	contains SrmB domain	none
GLEAN3_19322	SPU_019322	none	none
GLEAN3_19333	SPU_019333	none	none
GLEAN3_19335	SPU_019335	contains 2 MAM superfamily motifs	none
GLEAN3_19344	SPU_019344	none	none
GLEAN3_19352	SPU_019352	none	none
GLEAN3_19353	SPU_019353	none	none
GLEAN3_19354	SPU_019354	contains 5 LDLa superfamily motifs and 2 CUB superfamily motifs	none
GLEAN3_19358	SPU_019358	none	none
GLEAN3_19360	SPU_019360	none	none
GLEAN3_19361	SPU_019361	none	none
GLEAN3_19362	SPU_019362	none	none
GLEAN3_19366	SPU_019366	none	none
GLEAN3_19375	SPU_019375	none	none
GLEAN3_19376	SPU_019376	none	none
GLEAN3_19377	SPU_019377	contains PRK05771 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19382	SPU_019382	none	none
GLEAN3_19390	SPU_019390	none	none
GLEAN3_19395	SPU_019395	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19406	SPU_019406	none	none
GLEAN3_19410	SPU_019410	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19411	SPU_019411	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19441	SPU_019441	none	none
GLEAN3_19447	SPU_019447	none	none
GLEAN3_19448	SPU_019448	contains Pyr_redox_2 domain. poor protein sequence data: ~40% of amino acids at C-terminus are X.	none
GLEAN3_19452	SPU_019452	contains 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19453	SPU_019453	none	none
GLEAN3_19455	SPU_019455	probable assembly chimera	none
GLEAN3_19456	SPU_019456	none	none
GLEAN3_19457	SPU_019457	none	none
GLEAN3_19463	SPU_019463	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19466	SPU_019466	none	none
GLEAN3_19469	SPU_019469	none	none
GLEAN3_19483	SPU_019483	none	none
GLEAN3_19485	SPU_019485	none	none
GLEAN3_19503	SPU_019503	none	none
GLEAN3_19513	SPU_019513	contains 2A1904 domain	none
GLEAN3_19515	SPU_019515	none	none
GLEAN3_19526	SPU_019526	none	none
GLEAN3_19527	SPU_019527	none	none
GLEAN3_19528	SPU_019528	none	none
GLEAN3_19531	SPU_019531	contains SMC_N domain	none
GLEAN3_19537	SPU_019537	none	none
GLEAN3_19539	SPU_019539	none	none
GLEAN3_19546	SPU_019546	none	none
GLEAN3_19547	SPU_019547	none	none
GLEAN3_19554	SPU_019554	none	none
GLEAN3_19560	SPU_019560	none	none
GLEAN3_19561	SPU_019561	none	none
GLEAN3_19569	SPU_019569	none	none
GLEAN3_19573	SPU_019573	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19574	SPU_019574	none	none
GLEAN3_19588	SPU_019588	none	none
GLEAN3_19595	SPU_019595	none	none
GLEAN3_19596	SPU_019596	none	none
GLEAN3_19598	SPU_019598	none	none
GLEAN3_19600	SPU_019600	contains 2 A2M_N domain motifs	none
GLEAN3_19608	SPU_019608	contains Man-6-P_recept domain	none
GLEAN3_19622	SPU_019622	none	none
GLEAN3_19628	SPU_019628	none	none
GLEAN3_19635	SPU_019635	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19641	SPU_019641	none	none
GLEAN3_19643	SPU_019643	none	none
GLEAN3_19644	SPU_019644	none	none
GLEAN3_19650	SPU_019650	none	none
GLEAN3_19654	SPU_019654	none	none
GLEAN3_19662	SPU_019662	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_19670	SPU_019670	contains 6 EGF_CA superfamily motifs	none
GLEAN3_19674	SPU_019674		none
GLEAN3_19676	SPU_019676	none	none
GLEAN3_19679	SPU_019679	none	none
GLEAN3_19681	SPU_019681	none	none
GLEAN3_19682	SPU_019682	none	none
GLEAN3_19684	SPU_019684	none	none
GLEAN3_19687	SPU_019687	contains COG4886 domain and PCC domain	none
GLEAN3_19688	SPU_019688	none	none
GLEAN3_19691	SPU_019691	contains COG2268 domain	none
GLEAN3_19697	SPU_019697	contains 2 HYR superfamily motifs	none
GLEAN3_19698	SPU_019698	none	none
GLEAN3_19720	SPU_019720	none	none
GLEAN3_19721	SPU_019721	none	none
GLEAN3_19741	SPU_019741	contains 2 IG superfamily motifs	none
GLEAN3_19742	SPU_019742	none	none
GLEAN3_19749	SPU_019749	contains 3 Kelch_1 superfamily motifs	none
GLEAN3_19750	SPU_019750	contains c_cpa1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19753	SPU_019753	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19760	SPU_019760	contains COG4886 domain	none
GLEAN3_19761	SPU_019761	none	none
GLEAN3_19776	SPU_019776	contains CUS1 domain	none
GLEAN3_19787	SPU_019787	contains Ion_trans domain	none
GLEAN3_19791	SPU_019791	none	none
GLEAN3_19796	SPU_019796	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19797	SPU_019797	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19798	SPU_019798	none	none
GLEAN3_19800	SPU_019800	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19806	SPU_019806	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19808	SPU_019808	none	none
GLEAN3_19812	SPU_019812	contains Dynein_heavy domain	none
GLEAN3_19813	SPU_019813	none	none
GLEAN3_19816	SPU_019816	none	none
GLEAN3_19841	SPU_019841	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19858	SPU_019858	contains PAT1 domain	none
GLEAN3_19862	SPU_019862	none	none
GLEAN3_19868	SPU_019868	none	none
GLEAN3_19877	SPU_019877	probable assembly chimera	none
GLEAN3_19887	SPU_019887	none	none
GLEAN3_19897	SPU_019897	none	none
GLEAN3_19901	SPU_019901	contains SMC_prok_A domain	none
GLEAN3_19910	SPU_019910	none	none
GLEAN3_19922	SPU_019922	none	none
GLEAN3_19924	SPU_019924	contains 3 ANK domain motifs. homologous to numerous putative Trichomonas vaginalis proteins.	none
GLEAN3_19926	SPU_019926	none	none
GLEAN3_19927	SPU_019927	contains MIP-T3 domain	none
GLEAN3_19929	SPU_019929	contains 3 Thioredoxin-like superfamily motifs	none
GLEAN3_19931	SPU_019931	none	none
GLEAN3_19935	SPU_019935	none	none
GLEAN3_19940	SPU_019940	none	none
GLEAN3_19945	SPU_019945	none	none
GLEAN3_19947	SPU_019947	none	none
GLEAN3_19948	SPU_019948	none	none
GLEAN3_19949	SPU_019949	none	none
GLEAN3_19955	SPU_019955	contains 2 EFh superfamily motifs	none
GLEAN3_19956	SPU_019956	none	none
GLEAN3_19960	SPU_019960	none	none
GLEAN3_19962	SPU_019962	none	none
GLEAN3_19964	SPU_019964	contains 2A0113 domain	none
GLEAN3_19965	SPU_019965	contains 2A0113 domain	none
GLEAN3_19966	SPU_019966	contains 2A0113 domain	none
GLEAN3_19971	SPU_019971	none	none
GLEAN3_19972	SPU_019972	none	none
GLEAN3_19982	SPU_019982	none	none
GLEAN3_19992	SPU_019992	none	none
GLEAN3_19997	SPU_019997	none	none
GLEAN3_19998	SPU_019998	none	none
GLEAN3_20013	SPU_020013	none	none
GLEAN3_20023	SPU_020023	none	none
GLEAN3_20024	SPU_020024	none	none
GLEAN3_20028	SPU_020028	none	none
GLEAN3_20030	SPU_020030	none	none
GLEAN3_20041	SPU_020041	none	none
GLEAN3_20043	SPU_020043	contains 2 IG superfamily motifs	none
GLEAN3_20046	SPU_020046	none	none
GLEAN3_20049	SPU_020049	none	none
GLEAN3_20051	SPU_020051	none	none
GLEAN3_20064	SPU_020064	none	none
GLEAN3_20066	SPU_020066	none	none
GLEAN3_20071	SPU_020071	none	none
GLEAN3_20073	SPU_020073	none	none
GLEAN3_20074	SPU_020074	none	none
GLEAN3_20077	SPU_020077	none	none
GLEAN3_20091	SPU_020091	none	none
GLEAN3_20093	SPU_020093	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20094	SPU_020094	contains Sulfotransfer_1 domain	none
GLEAN3_20100	SPU_020100	none	none
GLEAN3_20101	SPU_020101	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20106	SPU_020106	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20109	SPU_020109	poor amino acid sequence (~40 % of amino acids are X)	none
GLEAN3_20113	SPU_020113	none	none
GLEAN3_20114	SPU_020114	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20116	SPU_020116	none	none
GLEAN3_20136	SPU_020136	none	none
GLEAN3_20137	SPU_020137	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20139	SPU_020139	none	none
GLEAN3_20147	SPU_020147	none	none
GLEAN3_20169	SPU_020169	none	none
GLEAN3_20171	SPU_020171	none	none
GLEAN3_20177	SPU_020177	none	none
GLEAN3_20184	SPU_020184	none	none
GLEAN3_20178	SPU_020178	contains 4 ANK superfamily motifs and Arp domains and PTZ00322 domains and trp domains	none
GLEAN3_20187	SPU_020187	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20193	SPU_020193	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20202	SPU_020202	none	none
GLEAN3_20203	SPU_020203	contains 6 EGF_CA superfamily motifs	none
GLEAN3_20208	SPU_020208	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20217	SPU_020217	none	none
GLEAN3_20222	SPU_020222	contains BET4 domain	none
GLEAN3_20226	SPU_020226	none	none
GLEAN3_20230	SPU_020230	contains TIGR00376 domain. probable assembly chimera.	none
GLEAN3_20239	SPU_020239	none	none
GLEAN3_20241	SPU_020241	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20269	SPU_020269	none	none
GLEAN3_20274	SPU_020274	none	none
GLEAN3_20286	SPU_020286	none	none
GLEAN3_20298	SPU_020298	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20301	SPU_020301	contains PRK07768 domain	none
GLEAN3_20303	SPU_020303	contains Sugar_transport domain	none
GLEAN3_20305	SPU_020305	none	none
GLEAN3_20319	SPU_020319	contains 2 MFS superfamily motifs	none
GLEAN3_20321	SPU_020321	none	none
GLEAN3_20323	SPU_020323	contains 2 MFS superfamily motifs and 2A0114 domain	none
GLEAN3_20343	SPU_020343	none	none
GLEAN3_20348	SPU_020348	none	none
GLEAN3_20353	SPU_020353	contains PRK03918 domain	none
GLEAN3_20354	SPU_020354	contains 2 TSP1 superfamily motifs	none
GLEAN3_20355	SPU_020355	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20367	SPU_020367	probable assembly chimera	none
GLEAN3_20369	SPU_020369	none	none
GLEAN3_20372	SPU_020372	none	none
GLEAN3_20378	SPU_020378	none	none
GLEAN3_20381	SPU_020381	none	none
GLEAN3_20384	SPU_020384	none	none
GLEAN3_20389	SPU_020389	contains Dynein_heavy domain	none
GLEAN3_20399	SPU_020399	none	none
GLEAN3_20408	SPU_020408	none	none
GLEAN3_20409	SPU_020409	none	none
GLEAN3_20415	SPU_020415	none	none
GLEAN3_20416	SPU_020416	none	none
GLEAN3_20417	SPU_020417	none	none
GLEAN3_20418	SPU_020418	none	none
GLEAN3_20420	SPU_020420	contains 3 CUB superfamily motifs	none
GLEAN3_20431	SPU_020431	none	none
GLEAN3_20432	SPU_020432	none	none
GLEAN3_20440	SPU_020440	none	none
GLEAN3_20444	SPU_020444	none	none
GLEAN3_20460	SPU_020460	none	none
GLEAN3_20479	SPU_020479	none	none
GLEAN3_20495	SPU_020495	none	none
GLEAN3_20500	SPU_020500	contains HSP70 domain. probable assembly chimera.	none
GLEAN3_20505	SPU_020505	none	none
GLEAN3_20515	SPU_020515	contains Sulfotransfer_1 domain	none
GLEAN3_20516	SPU_020516	contains LdhA domain	none
GLEAN3_20520	SPU_020520	contains 2A0113 domain	none
GLEAN3_20538	SPU_020538	contains 4 CCP superfamily motifs	none
GLEAN3_20548	SPU_020548	contains COG5236 domain. probable assembly chimera.	none
GLEAN3_20557	SPU_020557	none	none
GLEAN3_20559	SPU_020559	contains RVT_1 domain	none
GLEAN3_20560	SPU_020560	none	none
GLEAN3_20562	SPU_020562	contains NarG domain	none
GLEAN3_20579	SPU_020579	contains CynX domain	none
GLEAN3_20580	SPU_020580	none	none
GLEAN3_20582	SPU_020582	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20585	SPU_020585	none	none
GLEAN3_20591	SPU_020591	none	none
GLEAN3_20593	SPU_020593	none	none
GLEAN3_20594	SPU_020594	none	none
GLEAN3_20596	SPU_020596	none	none
GLEAN3_20603	SPU_020603	none	none
GLEAN3_20612	SPU_020612	none	none
GLEAN3_20613	SPU_020613	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20619	SPU_020619	none	none
GLEAN3_20628	SPU_020628	none	none
GLEAN3_20630	SPU_020630	contains 3 ANK superfamily motifs	none
GLEAN3_20638	SPU_020638	none	none
GLEAN3_20640	SPU_020640	none	none
GLEAN3_20655	SPU_020655	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20667	SPU_020667	none	none
GLEAN3_20672	SPU_020672	none	none
GLEAN3_20681	SPU_020681	contains 2 IG superfamily motifs	none
GLEAN3_20683	SPU_020683	none	none
GLEAN3_20685	SPU_020685	none	none
GLEAN3_20696	SPU_020696	none	none
GLEAN3_20705	SPU_020705	contains 2 ANK superfamily motifs and Arp domains	none
GLEAN3_20710	SPU_020710	none	none
GLEAN3_20711	SPU_020711	contains hCaCC domain. probable assembly chimera.	none
GLEAN3_20713	SPU_020713	contains SMC_prok_B domain	none
GLEAN3_20715	SPU_020715	contains SMC_prok_B domain	none
GLEAN3_20719	SPU_020719	none	none
GLEAN3_20724	SPU_020724	none	none
GLEAN3_20725	SPU_020725	contains Sulfotransfer_1 domain	none
GLEAN3_20733	SPU_020733	none	none
GLEAN3_20734	SPU_020734	none	none
GLEAN3_20742	SPU_020742	none	none
GLEAN3_20743	SPU_020743	none	none
GLEAN3_20761	SPU_020761	none	none
GLEAN3_20777	SPU_020777	none	none
GLEAN3_20791	SPU_020791	contains COG4886 domain	none
GLEAN3_20792	SPU_020792	none	none
GLEAN3_20793	SPU_020793	none	none
GLEAN3_20796	SPU_020796	probable assembly chimera	none
GLEAN3_20797	SPU_020797	none	none
GLEAN3_20811	SPU_020811	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_20812	SPU_020812	none	none
GLEAN3_20815	SPU_020815	none	none
GLEAN3_20823	SPU_020823	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_20826	SPU_020826	contains Dynein_heavy domain	none
GLEAN3_20830	SPU_020830	probable assembly chimera	none
GLEAN3_20832	SPU_020832	none	none
GLEAN3_20840	SPU_020840	none	none
GLEAN3_20844	SPU_020844	none	none
GLEAN3_20847	SPU_020847	none	none
GLEAN3_20857	SPU_020857	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_20861	SPU_020861	contains Sulfotransfer_1 domain	none
GLEAN3_20871	SPU_020871	none	none
GLEAN3_20878	SPU_020878	none	none
GLEAN3_20883	SPU_020883	none	none
GLEAN3_20888	SPU_020888	none	none
GLEAN3_20889	SPU_020889	none	none
GLEAN3_20890	SPU_020890	contains RAD18 domain	none
GLEAN3_20891	SPU_020891	contains Smc domain	none
GLEAN3_20892	SPU_020892	none	none
GLEAN3_20893	SPU_020893	none	none
GLEAN3_20897	SPU_020897	none	none
GLEAN3_20898	SPU_020898	none	none
GLEAN3_20902	SPU_020902	none	none
GLEAN3_20906	SPU_020906	none	none
GLEAN3_20910	SPU_020910	none	none
GLEAN3_20911	SPU_020911	none	none
GLEAN3_20912	SPU_020912	none	none
GLEAN3_20915	SPU_020915	none	none
GLEAN3_20930	SPU_020930	none	none
GLEAN3_20934	SPU_020934	none	none
GLEAN3_20935	SPU_020935	none	none
GLEAN3_20941	SPU_020941	none	none
GLEAN3_20956	SPU_020956	none	none
GLEAN3_20962	SPU_020962	none	none
GLEAN3_20963	SPU_020963	none	none
GLEAN3_20969	SPU_020969	none	none
GLEAN3_20977	SPU_020977	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20982	SPU_020982	none	none
GLEAN3_21002	SPU_021002	none	none
GLEAN3_21004	SPU_021004	none	none
GLEAN3_21015	SPU_021015	none	none
GLEAN3_21017	SPU_021017	probable assembly chimera	none
GLEAN3_21018	SPU_021018	none	none
GLEAN3_21029	SPU_021029	none	none
GLEAN3_21031	SPU_021031	contains 2 IG superfamily motifs	none
GLEAN3_21033	SPU_021033	contains PnbA domain	none
GLEAN3_21034	SPU_021034	none	none
GLEAN3_21043	SPU_021043	none	none
GLEAN3_21048	SPU_021048	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21049	SPU_021049	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21052	SPU_021052	none	none
GLEAN3_21064	SPU_021064	contains RhaT domain	none
GLEAN3_21065	SPU_021065	none	none
GLEAN3_21077	SPU_021077	none	none
GLEAN3_21091	SPU_021091	contains 2 HYR superfamily motifs	none
GLEAN3_21093	SPU_021093	none	none
GLEAN3_21097	SPU_021097	none	none
GLEAN3_21099	SPU_021099	none	none
GLEAN3_21100	SPU_021100	contains 2 ANK superfamily motifs and PTZ00322 domain	none
GLEAN3_21101	SPU_021101	none	none
GLEAN3_21108	SPU_021108	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21112	SPU_021112	none	none
GLEAN3_21116	SPU_021116	none	none
GLEAN3_21125	SPU_021125	none	none
GLEAN3_21130	SPU_021130	none	none
GLEAN3_21134	SPU_021134	none	none
GLEAN3_21137	SPU_021137	none	none
GLEAN3_21140	SPU_021140	none	none
GLEAN3_21150	SPU_021150	probable assembly chimera	none
GLEAN3_21151	SPU_021151	none	none
GLEAN3_21157	SPU_021157	none	none
GLEAN3_21176	SPU_021176	none	none
GLEAN3_21181	SPU_021181	none	none
GLEAN3_21187	SPU_021187	none	none
GLEAN3_21189	SPU_021189	contains WecE domain	none
GLEAN3_21215	SPU_021215	none	none
GLEAN3_21231	SPU_021231	none	none
GLEAN3_21242	SPU_021242	none	none
GLEAN3_21245	SPU_021245	none	none
GLEAN3_21248	SPU_021248	none	none
GLEAN3_21250	SPU_021250	none	none
GLEAN3_21265	SPU_021265	none	none
GLEAN3_21271	SPU_021271	none	none
GLEAN3_21298	SPU_021298	none	none
GLEAN3_21300	SPU_021300	none	none
GLEAN3_21320	SPU_021320	none	none
GLEAN3_21321	SPU_021321	none	none
GLEAN3_21323	SPU_021323	contains PRK03427 domain and PRK08853 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21325	SPU_021325	none	none
GLEAN3_21327	SPU_021327	none	none
GLEAN3_21332	SPU_021332	none	none
GLEAN3_21333	SPU_021333	none	none
GLEAN3_21334	SPU_021334	poor protein sequence data: ~70% of the amino acids are X	none
GLEAN3_21339	SPU_021339	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21346	SPU_021346	contains 8 EGF_CA superfamily motifs	none
GLEAN3_21361	SPU_021361	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21363	SPU_021363	none	none
GLEAN3_21365	SPU_021365	contains infB domain	none
GLEAN3_21369	SPU_021369	none	none
GLEAN3_21376	SPU_021376	contains RimI domain	none
GLEAN3_21385	SPU_021385	none	none
GLEAN3_21386	SPU_021386	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21390.1	SPU_021390.1	none	none
GLEAN3_21393	SPU_021393	none	none
GLEAN3_21396	SPU_021396	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21400	SPU_021400	contains RVT_1 domain	none
GLEAN3_21401	SPU_021401	contains 6 Kelch_1 superfamily motifs	none
GLEAN3_21403	SPU_021403	none	none
GLEAN3_21417	SPU_021417	none	none
GLEAN3_21432	SPU_021432	none	none
GLEAN3_21434	SPU_021434	contains COG5222 domain	none
GLEAN3_21435	SPU_021435	none	none
GLEAN3_21436	SPU_021436	none	none
GLEAN3_21437	SPU_021437	none	none
GLEAN3_21451	SPU_021451	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21461	SPU_021461	none	none
GLEAN3_21463	SPU_021463	none	none
GLEAN3_21482	SPU_021482	contains YidC domain	none
GLEAN3_21485	SPU_021485	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_21486	SPU_021486	none	none
GLEAN3_21489	SPU_021489	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_21503	SPU_021503	none	none
GLEAN3_21507	SPU_021507	none	none
GLEAN3_21511	SPU_021511	contains 3 EFh superfamily motifs and 2 PTZ00184 domain motifs	none
GLEAN3_21513	SPU_021513	contains 3 EGF_CA superfamily motifs	none
GLEAN3_21514	SPU_021514	none	none
GLEAN3_21515	SPU_021515	none	none
GLEAN3_21518	SPU_021518	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_21519	SPU_021519	none	none
GLEAN3_21521	SPU_021521	none	none
GLEAN3_21527	SPU_021527	none	none
GLEAN3_21528	SPU_021528	none	none
GLEAN3_16646	SPU_016646	none	none
GLEAN3_21535	SPU_021535	none	none
GLEAN3_21542	SPU_021542	probable assembly chimera	none
GLEAN3_21545	SPU_021545	none	none
GLEAN3_21546	SPU_021546	none	none
GLEAN3_21551	SPU_021551	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21567	SPU_021567	none	none
GLEAN3_21568	SPU_021568	none	none
GLEAN3_21570	SPU_021570	contains 2 Kelch_1 superfamily motifs and COG3055 domain	none
GLEAN3_21573	SPU_021573	none	none
GLEAN3_21574	SPU_021574	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21575	SPU_021575	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21576	SPU_021576	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21578	SPU_021578	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21579	SPU_021579	none	none
GLEAN3_21584	SPU_021584	probable assembly chimera	none
GLEAN3_21591	SPU_021591	none	none
GLEAN3_21595	SPU_021595	none	none
GLEAN3_21596	SPU_021596	none	none
GLEAN3_21607	SPU_021607	contains UbiG domain	none
GLEAN3_21611	SPU_021611	poor sequence data: ~60% of amino acids are X	none
GLEAN3_21613	SPU_021613	none	none
GLEAN3_21624	SPU_021624	none	none
GLEAN3_21637	SPU_021637	none	none
GLEAN3_21639	SPU_021639	none	none
GLEAN3_21640	SPU_021640	none	none
GLEAN3_21641	SPU_021641	none	none
GLEAN3_21645	SPU_021645	none	none
GLEAN3_21653	SPU_021653	none	none
GLEAN3_21660	SPU_021660	contains PepP domain	none
GLEAN3_21671	SPU_021671	none	none
GLEAN3_21679	SPU_021679	none	none
GLEAN3_21684	SPU_021684	contains PRK12678 domain	none
GLEAN3_21687	SPU_021687	contains 2 P_loop_NTPase superfamily motifs and cas3_core domain	none
GLEAN3_21712	SPU_021712	probable assembly chimera	none
GLEAN3_21713	SPU_021713	none	none
GLEAN3_21715	SPU_021715	none	none
GLEAN3_21720	SPU_021720	none	none
GLEAN3_21722	SPU_021722	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21743	SPU_021743	none	none
GLEAN3_21746	SPU_021746	contains 2 IG superfamily motifs	none
GLEAN3_21748	SPU_021748	contains 4 IG superfamily motifs	none
GLEAN3_21750	SPU_021750	contains 2 FNR-like superfamily motifs	none
GLEAN3_21754	SPU_021754	none	none
GLEAN3_21761	SPU_021761	none	none
GLEAN3_21763	SPU_021763	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21768	SPU_021768	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21769	SPU_021769	none	none
GLEAN3_21771	SPU_021771	none	none
GLEAN3_21776	SPU_021776	none	none
GLEAN3_21780	SPU_021780	none	none
GLEAN3_21794	SPU_021794	none	none
GLEAN3_21796	SPU_021796	none	none
GLEAN3_21830	SPU_021830	none	none
GLEAN3_21855	SPU_021855	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_21861	SPU_021861	none	none
GLEAN3_21872	SPU_021872	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21875	SPU_021875	none	none
GLEAN3_21880	SPU_021880	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21884	SPU_021884	none	none
GLEAN3_21885	SPU_021885	none	none
GLEAN3_21886	SPU_021886	none	none
GLEAN3_21889	SPU_021889	none	none
GLEAN3_21900	SPU_021900	none	none
GLEAN3_21903	SPU_021903	none	none
GLEAN3_21905	SPU_021905	contains 2 IG superfamily motifs	none
GLEAN3_21909	SPU_021909	none	none
GLEAN3_21912	SPU_021912	contains matE domain	none
GLEAN3_21915	SPU_021915	none	none
GLEAN3_21925	SPU_021925	none	none
GLEAN3_21926	SPU_021926	none	none
GLEAN3_21927	SPU_021927	none	none
GLEAN3_21929	SPU_021929	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21938	SPU_021938	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21939	SPU_021939	contains Sulfotransfer_1 domain	none
GLEAN3_21944	SPU_021944	none	none
GLEAN3_21955	SPU_021955	none	none
GLEAN3_21960	SPU_021960	contains DHC_N1 domain	none
GLEAN3_21964	SPU_021964	none	none
GLEAN3_21965	SPU_021965	none	none
GLEAN3_21977	SPU_021977	none	none
GLEAN3_21978	SPU_021978	none	none
GLEAN3_21982	SPU_021982	none	none
GLEAN3_21989	SPU_021989	none	none
GLEAN3_21996	SPU_021996	none	none
GLEAN3_21998	SPU_021998	none	none
GLEAN3_22009	SPU_022009	none	none
GLEAN3_22020	SPU_022020	none	none
GLEAN3_22025	SPU_022025	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22027	SPU_022027	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22030	SPU_022030	none	none
GLEAN3_22031	SPU_022031	none	none
GLEAN3_22037	SPU_022037	none	none
GLEAN3_22041	SPU_022041	none	none
GLEAN3_22044	SPU_022044	contains 2 FA58C superfamily motifs	none
GLEAN3_22046	SPU_022046	none	none
GLEAN3_22071	SPU_022071	none	none
GLEAN3_22073	SPU_022073	none	none
GLEAN3_22077	SPU_022077	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22080	SPU_022080	none	none
GLEAN3_22082	SPU_022082	none	none
GLEAN3_22088	SPU_022088	none	none
GLEAN3_22103	SPU_022103	none	none
GLEAN3_22126	SPU_022126	none	none
GLEAN3_22127	SPU_022127	none	none
GLEAN3_22134	SPU_022134	none	none
GLEAN3_22139	SPU_022139	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22141	SPU_022141	none	none
GLEAN3_22142	SPU_022142	none	none
GLEAN3_22144	SPU_022144	none	none
GLEAN3_22168	SPU_022168	none	none
GLEAN3_22169	SPU_022169	none	none
GLEAN3_22171	SPU_022171	none	none
GLEAN3_22173	SPU_022173	none	none
GLEAN3_22181	SPU_022181	none	none
GLEAN3_22196	SPU_022196	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22198	SPU_022198	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22203	SPU_022203	none	none
GLEAN3_22205	SPU_022205	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_22217	SPU_022217	none	none
GLEAN3_22228	SPU_022228	none	none
GLEAN3_22229	SPU_022229	poor sequence data: ~65% of amino acids are X	none
GLEAN3_22241	SPU_022241	contains 2 PLDc superfamily motifs	none
GLEAN3_22253	SPU_022253	none	none
GLEAN3_22272	SPU_022272	none	none
GLEAN3_22276	SPU_022276	none	none
GLEAN3_22292	SPU_022292	none	none
GLEAN3_22297	SPU_022297	none	none
GLEAN3_22303	SPU_022303	contains Sulfotransfer_1 domain	none
GLEAN3_22310	SPU_022310	none	none
GLEAN3_22312	SPU_022312	contains Sulfotransfer_1 domain	none
GLEAN3_22313	SPU_022313	contains Sulfotransfer_1 domain	none
GLEAN3_22321	SPU_022321	none	none
GLEAN3_22328	SPU_022328	none	none
GLEAN3_22329	SPU_022329	contains Smc domain	none
GLEAN3_22331	SPU_022331	contains rad18 domain and PRK09603 domain and SGL domain	none
GLEAN3_22337	SPU_022337	none	none
GLEAN3_22344	SPU_022344	none	none
GLEAN3_22351	SPU_022351	none	none
GLEAN3_19378	SPU_019378	none	none
GLEAN3_22358	SPU_022358	contains COG5026 domain	none
GLEAN3_17090	SPU_017090	also homologous to a wide variety of FAD-dependent oxidoreductases in bacteria and fungi	none
GLEAN3_22362	SPU_022362	none	none
GLEAN3_22363	SPU_022363	none	none
GLEAN3_22379	SPU_022379	contains 2 FA58C superfamily motifs	none
GLEAN3_22390	SPU_022390	none	none
GLEAN3_22392	SPU_022392	contains DUF1898 domain	none
GLEAN3_22400	SPU_022400	none	none
GLEAN3_22401	SPU_022401	none	none
GLEAN3_22402	SPU_022402	none	none
GLEAN3_22418	SPU_022418	contains 2 EGF_CA superfamily motifs	none
GLEAN3_22427	SPU_022427	none	none
GLEAN3_22436	SPU_022436	none	none
GLEAN3_22437	SPU_022437	none	none
GLEAN3_22440	SPU_022440	none	none
GLEAN3_22444	SPU_022444	none	none
GLEAN3_22454	SPU_022454	contains 3 CUB superfamily motifs	none
GLEAN3_22464	SPU_022464	none	none
GLEAN3_22466	SPU_022466	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22467	SPU_022467	none	none
GLEAN3_22480	SPU_022480	none	none
GLEAN3_22481	SPU_022481	none	none
GLEAN3_22486	SPU_022486	none	none
GLEAN3_22488	SPU_022488	none	none
GLEAN3_22489	SPU_022489	none	none
GLEAN3_22491	SPU_022491	none	none
GLEAN3_22492	SPU_022492	none	none
GLEAN3_22496	SPU_022496	none	none
GLEAN3_22499	SPU_022499	none	none
GLEAN3_22505	SPU_022505	none	none
GLEAN3_22508	SPU_022508	contains Sulfotransfer_1 domain	none
GLEAN3_22513	SPU_022513	none	none
GLEAN3_22515	SPU_022515	none	none
GLEAN3_22517	SPU_022517	contains 3 ANK superfamily motifs	none
GLEAN3_22520	SPU_022520	contains 2 BTB superfamily motifs	none
GLEAN3_22521	SPU_022521	contains 2 BTB superfamily motifs	none
GLEAN3_22524	SPU_022524	none	none
GLEAN3_22525	SPU_022525	none	none
GLEAN3_22531	SPU_022531	none	none
GLEAN3_22540	SPU_022540	probable assembly chimera	none
GLEAN3_22542	SPU_022542	none	none
GLEAN3_22543	SPU_022543	none	none
GLEAN3_22545	SPU_022545	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22550	SPU_022550	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22589	SPU_022589	none	none
GLEAN3_22591	SPU_022591	probable assembly chimera	none
GLEAN3_22600	SPU_022600	contains MetG superfamily motif. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22601	SPU_022601	none	none
GLEAN3_22603	SPU_022603	none	none
GLEAN3_22604	SPU_022604	none	none
GLEAN3_22610	SPU_022610	contains Pkinase domain	none
GLEAN3_22620	SPU_022620	none	none
GLEAN3_22631	SPU_022631	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22640	SPU_022640	none	none
GLEAN3_22645	SPU_022645	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22654	SPU_022654	none	none
GLEAN3_22655	SPU_022655	none	none
GLEAN3_22656	SPU_022656	contains DYN1 domain	none
GLEAN3_22658	SPU_022658	none	none
GLEAN3_22659	SPU_022659	none	none
GLEAN3_22661	SPU_022661	none	none
GLEAN3_22662	SPU_022662	none	none
GLEAN3_22672	SPU_022672	none	none
GLEAN3_22677	SPU_022677	none	none
GLEAN3_22689	SPU_022689	none	none
GLEAN3_22691	SPU_022691	none	none
GLEAN3_22694	SPU_022694	none	none
GLEAN3_22700	SPU_022700	none	none
GLEAN3_22711	SPU_022711	none	none
GLEAN3_22712	SPU_022712	none	none
GLEAN3_22713	SPU_022713	none	none
GLEAN3_22720	SPU_022720	none	none
GLEAN3_22728	SPU_022728	none	none
GLEAN3_22730	SPU_022730	none	none
GLEAN3_22733	SPU_022733	none	none
GLEAN3_22738	SPU_022738	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22739	SPU_022739	contains 3 CCP superfamily motifs	none
GLEAN3_22740	SPU_022740	none	none
GLEAN3_22746	SPU_022746	none	none
GLEAN3_22747	SPU_022747	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22749	SPU_022749	none	none
GLEAN3_22758	SPU_022758	contains SGL domain	none
GLEAN3_22772	SPU_022772	none	none
GLEAN3_22773	SPU_022773	none	none
GLEAN3_22782	SPU_022782	contains recD_rel domain	none
GLEAN3_22786	SPU_022786	none	none
GLEAN3_22789	SPU_022789	contains COG4886 domain	none
GLEAN3_22794	SPU_022794	none	none
GLEAN3_22803	SPU_022803	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22811	SPU_022811	none	none
GLEAN3_22812	SPU_022812	contains SmtA domain	none
GLEAN3_22813	SPU_022813	none	none
GLEAN3_22830	SPU_022830	none	none
GLEAN3_22836	SPU_022836	none	none
GLEAN3_22843	SPU_022843	none	none
GLEAN3_22844	SPU_022844	none	none
GLEAN3_22850	SPU_022850	none	none
GLEAN3_22856	SPU_022856	contains 2 SANT superfamily motifs	none
GLEAN3_22862	SPU_022862	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22866	SPU_022866	none	none
GLEAN3_22868	SPU_022868	none	none
GLEAN3_22881	SPU_022881	contains 2 LDLa superfamily motifs	none
GLEAN3_22886	SPU_022886	contains 2 HYR superfamily motifs	none
GLEAN3_22894	SPU_022894	none	none
GLEAN3_22895	SPU_022895	none	none
GLEAN3_22918	SPU_022918	none	none
GLEAN3_22920	SPU_022920	none	none
GLEAN3_22921	SPU_022921	none	none
GLEAN3_22925	SPU_022925	none	none
GLEAN3_22927	SPU_022927	contains SMC_N domain and SbcC domain	none
GLEAN3_22930	SPU_022930	contains 2 DUF1900 superfamily motifs	none
GLEAN3_22931	SPU_022931	contains 2 AMK superfamily motifs	none
GLEAN3_22932	SPU_022932	none	none
GLEAN3_22937	SPU_022937	none	none
GLEAN3_22945	SPU_022945	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22961	SPU_022961	none	none
GLEAN3_22962	SPU_022962	none	none
GLEAN3_22966	SPU_022966	none	none
GLEAN3_22968	SPU_022968	none	none
GLEAN3_22983	SPU_022983	none	none
GLEAN3_22985	SPU_022985	none	none
GLEAN3_22987	SPU_022987	contains tpt domain	none
GLEAN3_22990	SPU_022990	contains tpt domain	none
GLEAN3_22992	SPU_022992	none	none
GLEAN3_22994	SPU_022994	none	none
GLEAN3_22995	SPU_022995	none	none
GLEAN3_22996	SPU_022996	none	none
GLEAN3_22999	SPU_022999	contains 2 BTB superfamily motifs	none
GLEAN3_23003	SPU_023003	contains HcaE domain	none
GLEAN3_23006	SPU_023006	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23010	SPU_023010	contains pol2 domain	none
GLEAN3_23030	SPU_023030	none	none
GLEAN3_23040	SPU_023040	none	none
GLEAN3_23047	SPU_023047	none	none
GLEAN3_23050	SPU_023050	none	none
GLEAN3_23051	SPU_023051	contains SMC_prok_A domain	none
GLEAN3_23054	SPU_023054	none	none
GLEAN3_23057	SPU_023057	none	none
GLEAN3_23066	SPU_023066	none	none
GLEAN3_23073	SPU_023073	none	none
GLEAN3_23078	SPU_023078	none	none
GLEAN3_23086	SPU_023086	none	none
GLEAN3_23089	SPU_023089	contains 2 MFS superfamily motifs	none
GLEAN3_23097	SPU_023097	none	none
GLEAN3_23101	SPU_023101	none	none
GLEAN3_23104	SPU_023104	none	none
GLEAN3_23110	SPU_023110	contains COG7 domain	none
GLEAN3_23111	SPU_023111	none	none
GLEAN3_23131	SPU_023131	none	none
GLEAN3_23139	SPU_023139	none	none
GLEAN3_23145	SPU_023145	none	none
GLEAN3_23156	SPU_023156	none	none
GLEAN3_23159	SPU_023159	contains TadD domain	none
GLEAN3_23167	SPU_023167	contains SMC_prok_A domain	none
GLEAN3_23168	SPU_023168	none	none
GLEAN3_23170	SPU_023170	none	none
GLEAN3_23171	SPU_023171	contains SMC_prok_B domain	none
GLEAN3_23186	SPU_023186	none	none
GLEAN3_23199	SPU_023199	contains 2 Fascin superfamily motifs	none
GLEAN3_23209	SPU_023209	none	none
GLEAN3_23213	SPU_023213	none	none
GLEAN3_23214	SPU_023214	none	none
GLEAN3_23218	SPU_023218	none	none
GLEAN3_23233	SPU_023233	none	none
GLEAN3_23234	SPU_023234	none	none
GLEAN3_23242	SPU_023242	none	none
GLEAN3_23243	SPU_023243	contains 2 IG superfamily motifs	none
GLEAN3_23244	SPU_023244	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23251	SPU_023251	none	none
GLEAN3_23253	SPU_023253	none	none
GLEAN3_23255	SPU_023255	none	none
GLEAN3_23256	SPU_023256	contains SMC_N domain	none
GLEAN3_23268	SPU_023268	none	none
GLEAN3_23272	SPU_023272	contains 2 MFS superfamily motifs	none
GLEAN3_23273	SPU_023273	contains 2 MFS superfamily motifs	none
GLEAN3_23277	SPU_023277	none	none
GLEAN3_23284	SPU_023284	poor sequence data: ~40% of amino acids are X. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23292	SPU_023292	contains RAD18 domain	none
GLEAN3_23303	SPU_023303	none	none
GLEAN3_23306	SPU_023306	contains PRK10263 domain	none
GLEAN3_23319	SPU_023319	none	none
GLEAN3_23326	SPU_023326	probable assembly chimera	none
GLEAN3_23328	SPU_023328	none	none
GLEAN3_23335	SPU_023335	none	none
GLEAN3_23337	SPU_023337	none	none
GLEAN3_23340	SPU_023340	none	none
GLEAN3_23346	SPU_023346	contains 2 E2F_TDP superfamily motifs	none
GLEAN3_23348	SPU_023348	none	none
GLEAN3_23371	SPU_023371	none	none
GLEAN3_23378	SPU_023378	none	none
GLEAN3_23384	SPU_023384	none	none
GLEAN3_23388	SPU_023388	none	none
GLEAN3_23405	SPU_023405	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23411	SPU_023411	none	none
GLEAN3_23414	SPU_023414	none	none
GLEAN3_23424	SPU_023424	none	none
GLEAN3_23430	SPU_023430	none	none
GLEAN3_23439	SPU_023439	none	none
GLEAN3_23441	SPU_023441	none	none
GLEAN3_23444	SPU_023444	none	none
GLEAN3_23447	SPU_023447	none	none
GLEAN3_23449	SPU_023449	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_23464	SPU_023464	none	none
GLEAN3_23465	SPU_023465	probable assembly chimera	none
GLEAN3_23467	SPU_023467	contains NmrA domain	none
GLEAN3_23468	SPU_023468	none	none
GLEAN3_23470	SPU_023470	contains SMC_prok_B domain	none
GLEAN3_23471	SPU_023471	none	none
GLEAN3_23476	SPU_023476	none	none
GLEAN3_23479	SPU_023479	none	none
GLEAN3_23481	SPU_023481	none	none
GLEAN3_23485	SPU_023485	none	none
GLEAN3_23489	SPU_023489	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23490	SPU_023490	none	none
GLEAN3_23493	SPU_023493	contains Ion_trans domain	none
GLEAN3_23496	SPU_023496	none	none
GLEAN3_23501	SPU_023501	none	none
GLEAN3_23513	SPU_023513	contains 2A0119 domain	none
GLEAN3_23524	SPU_023524	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23535	SPU_023535	probable assembly chimera	none
GLEAN3_23540	SPU_023540	none	none
GLEAN3_23546	SPU_023546	none	none
GLEAN3_23552	SPU_023552	none	none
GLEAN3_23554	SPU_023554	contains 2 SPEC superfamily motifs	none
GLEAN3_23556	SPU_023556	none	none
GLEAN3_23562	SPU_023562	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23563	SPU_023563	none	none
GLEAN3_23566	SPU_023566	contains Nckap1 domain	none
GLEAN3_23588	SPU_023588	contains UDPGT domain	none
GLEAN3_23592	SPU_023592	none	none
GLEAN3_23595	SPU_023595	none	none
GLEAN3_23602	SPU_023602	none	none
GLEAN3_23603	SPU_023603	none	none
GLEAN3_23626	SPU_023626	none	none
GLEAN3_23629	SPU_023629	none	none
GLEAN3_23639	SPU_023639	none	none
GLEAN3_23650	SPU_023650	probable assembly chimera	none
GLEAN3_23654	SPU_023654	none	none
GLEAN3_23657	SPU_023657	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23675	SPU_023675	none	none
GLEAN3_23681	SPU_023681	contains FMO-like domain	none
GLEAN3_23683	SPU_023683	contains 2 Ion_trans_2 superfamily motifs	none
GLEAN3_23687	SPU_023687	contains HUL4 domain	none
GLEAN3_23708	SPU_023708	none	none
GLEAN3_23709	SPU_023709	none	none
GLEAN3_23711	SPU_023711	none	none
GLEAN3_23713	SPU_023713	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_23719	SPU_023719	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_23720	SPU_023720	none	none
GLEAN3_23721	SPU_023721	contains COG3415 domain	none
GLEAN3_23742	SPU_023742	none	none
GLEAN3_23754	SPU_023754	none	none
GLEAN3_23755	SPU_023755	none	none
GLEAN3_23771	SPU_023771	none	none
GLEAN3_23779	SPU_023779	none	none
GLEAN3_23784	SPU_023784	contains SMC_prok_B domain	none
GLEAN3_23788	SPU_023788	none	none
GLEAN3_23792	SPU_023792	none	none
GLEAN3_23793	SPU_023793	none	none
GLEAN3_23804	SPU_023804	none	none
GLEAN3_23815	SPU_023815	none	none
GLEAN3_23817	SPU_023817	contains SMC_N domain	none
GLEAN3_23821	SPU_023821	contains COG3391 domain	none
GLEAN3_23825	SPU_023825	contains 2A0113 domain	none
GLEAN3_23826	SPU_023826	none	none
GLEAN3_23827	SPU_023827	none	none
GLEAN3_23836	SPU_023836	none	none
GLEAN3_23848	SPU_023848	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23852	SPU_023852	poor sequence data: ~55% of amino acids are X	none
GLEAN3_23860	SPU_023860	contains RAD18 domain	none
GLEAN3_23862	SPU_023862	none	none
GLEAN3_23871	SPU_023871	none	none
GLEAN3_23873	SPU_023873	none	none
GLEAN3_23874	SPU_023874	none	none
GLEAN3_23879	SPU_023879	contains 4 EGF_CA superfamily motifs	none
GLEAN3_23880	SPU_023880	none	none
GLEAN3_23902	SPU_023902	contains 2 CUB superfamily motifs	none
GLEAN3_23906	SPU_023906	none	none
GLEAN3_23915	SPU_023915	none	none
GLEAN3_23917	SPU_023917	none	none
GLEAN3_23946	SPU_023946	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_23953	SPU_023953	contains 3 Ldl_recept_b superfamily motifs and COG3391 domain	none
GLEAN3_23959	SPU_023959	contains SMC_prok_B domain	none
GLEAN3_23960	SPU_023960	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23966	SPU_023966	none	none
GLEAN3_23971	SPU_023971	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_23977	SPU_023977	none	none
GLEAN3_23969	SPU_023969	none	none
GLEAN3_23980	SPU_023980	none	none
GLEAN3_23981	SPU_023981	none	none
GLEAN3_23986	SPU_023986	none	none
GLEAN3_23987	SPU_023987	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24005	SPU_024005	contains SMC_prok_B domain	none
GLEAN3_24006	SPU_024006	none	none
GLEAN3_24008	SPU_024008	none	none
GLEAN3_24011	SPU_024011	none	none
GLEAN3_24015	SPU_024015	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24021	SPU_024021	contains 2 FReD superfamily motifs	none
GLEAN3_24025	SPU_024025	none	none
GLEAN3_24027	SPU_024027	none	none
GLEAN3_24034	SPU_024034	contains FRQ1 domain	none
GLEAN3_24037	SPU_024037	contains PRK05771 domain	none
GLEAN3_24039	SPU_024039	none	none
GLEAN3_24043	SPU_024043	contains Sulfotransfer_1 domain	none
GLEAN3_24047	SPU_024047	none	none
GLEAN3_24048	SPU_024048	none	none
GLEAN3_24051	SPU_024051	contains 2 BTB superfamily	none
GLEAN3_24053	SPU_024053	none	none
GLEAN3_24057	SPU_024057	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24072	SPU_024072	none	none
GLEAN3_24076	SPU_024076	none	none
GLEAN3_24079	SPU_024079	none	none
GLEAN3_24083	SPU_024083	none	none
GLEAN3_24090	SPU_024090	none	none
GLEAN3_24085	SPU_024085	contains 2 Ion_trans_2 superfamily motifs	none
GLEAN3_24096	SPU_024096	none	none
GLEAN3_24097	SPU_024097	none	none
GLEAN3_24104	SPU_024104	none	none
GLEAN3_24105	SPU_024105	none	none
GLEAN3_24110	SPU_024110	none	none
GLEAN3_24117	SPU_024117	none	none
GLEAN3_24118	SPU_024118	contains Laminin_EGF domain	none
GLEAN3_24136	SPU_024136	none	none
GLEAN3_24145	SPU_024145	contains type_I_hly domain	none
GLEAN3_24149	SPU_024149	none	none
GLEAN3_24152	SPU_024152	none	none
GLEAN3_24155	SPU_024155	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24157	SPU_024157	none	none
GLEAN3_24164	SPU_024164	none	none
GLEAN3_24186	SPU_024186	none	none
GLEAN3_24188	SPU_024188	contains 4 Annexin superfamily motifs	none
GLEAN3_24198	SPU_024198	contains SMC_prok_B superfamily	none
GLEAN3_24202	SPU_024202	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24210	SPU_024210	none	none
GLEAN3_24190	SPU_024190	none	none
GLEAN3_24213	SPU_024213	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24219	SPU_024219	none	none
GLEAN3_24221	SPU_024221	none	none
GLEAN3_24239	SPU_024239	none	none
GLEAN3_24265	SPU_024265	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24266	SPU_024266	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24268	SPU_024268	none	none
GLEAN3_24271	SPU_024271	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24272	SPU_024272	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_24280	SPU_024280	none	none
GLEAN3_24300	SPU_024300	none	none
GLEAN3_24304	SPU_024304	none	none
GLEAN3_24314	SPU_024314	contains 2 PNP_UDP_1 superfamily motifs	none
GLEAN3_24318	SPU_024318	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_24335	SPU_024335	none	none
GLEAN3_24338	SPU_024338	none	none
GLEAN3_24341	SPU_024341	contains PTZ00322 domain	none
GLEAN3_24345	SPU_024345	none	none
GLEAN3_24351	SPU_024351	none	none
GLEAN3_24358	SPU_024358	none	none
GLEAN3_24365	SPU_024365	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_24367	SPU_024367	none	none
GLEAN3_24370	SPU_024370	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24375	SPU_024375	none	none
GLEAN3_24380	SPU_024380	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24388	SPU_024388	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24389	SPU_024389	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24391	SPU_024391	contains tpt domain	none
GLEAN3_24400	SPU_024400	none	none
GLEAN3_24410	SPU_024410	none	none
GLEAN3_24415	SPU_024415	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_24424	SPU_024424	none	none
GLEAN3_24426	SPU_024426	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24428	SPU_024428	none	none
GLEAN3_24433	SPU_024433	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24441	SPU_024441	none	none
GLEAN3_24443	SPU_024443	none	none
GLEAN3_24444	SPU_024444	contains 2 MAM superfamily motifs	none
GLEAN3_24451	SPU_024451	none	none
GLEAN3_24458	SPU_024458	none	none
GLEAN3_24460	SPU_024460	none	none
GLEAN3_24463	SPU_024463	none	none
GLEAN3_24465	SPU_024465	contains SMC_prok_B domain and COG2423 domain	none
GLEAN3_24475	SPU_024475	contains 2 Arylesterase superfamily motifs and COG3386 domain	none
GLEAN3_24488	SPU_024488	contains Lon domain	none
GLEAN3_24489	SPU_024489	none	none
GLEAN3_24503	SPU_024503	none	none
GLEAN3_24505	SPU_024505	contains Sulfotransfer_1 domain	none
GLEAN3_24510	SPU_024510	none	none
GLEAN3_24511	SPU_024511	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24512	SPU_024512	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24516	SPU_024516	none	none
GLEAN3_24518	SPU_024518	none	none
GLEAN3_24523	SPU_024523	none	none
GLEAN3_04985	SPU_004985	none	none
GLEAN3_24538	SPU_024538	none	none
GLEAN3_24539	SPU_024539	none	none
GLEAN3_24542	SPU_024542	contains 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24548	SPU_024548	none	none
GLEAN3_24554	SPU_024554	contains pro_imino_pep_2 domain	none
GLEAN3_24572	SPU_024572	none	none
GLEAN3_24575	SPU_024575	none	none
GLEAN3_24582	SPU_024582	none	none
GLEAN3_24583	SPU_024583	none	none
GLEAN3_24596	SPU_024596	contains Dynein_heavy domain	none
GLEAN3_24600	SPU_024600	contains 4 Filamin superfamily motifs	none
GLEAN3_24603	SPU_024603	none	none
GLEAN3_24613	SPU_024613	contains 2 IG superfamily motifs	none
GLEAN3_24614	SPU_024614	none	none
GLEAN3_24617	SPU_024617	none	none
GLEAN3_24632	SPU_024632	none	none
GLEAN3_24634	SPU_024634	none	none
GLEAN3_24651	SPU_024651	none	none
GLEAN3_24655	SPU_024655	none	none
GLEAN3_24660	SPU_024660	contains 2 MAM superfamily motifs	none
GLEAN3_24661	SPU_024661	probable assembly chimera	none
GLEAN3_24664	SPU_024664	none	none
GLEAN3_24671	SPU_024671	none	none
GLEAN3_24680	SPU_024680	none	none
GLEAN3_24685	SPU_024685	none	none
GLEAN3_24692	SPU_024692	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24700	SPU_024700	none	none
GLEAN3_24702	SPU_024702	none	none
GLEAN3_24713	SPU_024713	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24720	SPU_024720	none	none
GLEAN3_24724	SPU_024724	contains Pkinase domain	none
GLEAN3_24734	SPU_024734	none	none
GLEAN3_24738	SPU_024738	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24742	SPU_024742	contains COG5412 domain	none
GLEAN3_24743	SPU_024743	none	none
GLEAN3_24747	SPU_024747	none	none
GLEAN3_24749	SPU_024749	none	none
GLEAN3_24758	SPU_024758	contains 2 C8 superfamily motifs and 2 TIL superfamily motifs	none
GLEAN3_24761	SPU_024761	none	none
GLEAN3_24765	SPU_024765	none	none
GLEAN3_24767	SPU_024767	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24783	SPU_024783	none	none
GLEAN3_24791	SPU_024791	contains 2 P_loop_NTPase superfamily motifs	none
GLEAN3_24795	SPU_024795	none	none
GLEAN3_24804	SPU_024804	contains 4 S1-like superfamily motifs	none
GLEAN3_24806	SPU_024806	none	none
GLEAN3_24807	SPU_024807	none	none
GLEAN3_24813	SPU_024813	none	none
GLEAN3_24822	SPU_024822	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24825	SPU_024825	contains hCaCC domain	none
GLEAN3_24826	SPU_024826	none	none
GLEAN3_24835	SPU_024835	none	none
GLEAN3_24839	SPU_024839	none	none
GLEAN3_24845	SPU_024845	none	none
GLEAN3_24853	SPU_024853	none	none
GLEAN3_24855	SPU_024855	none	none
GLEAN3_24860	SPU_024860	none	none
GLEAN3_24865	SPU_024865	none	none
GLEAN3_24876	SPU_024876	contains mutS2 domain	none
GLEAN3_24887	SPU_024887	none	none
GLEAN3_24888	SPU_024888	none	none
GLEAN3_24889	SPU_024889	none	none
GLEAN3_24894	SPU_024894	none	none
GLEAN3_24899	SPU_024899	none	none
GLEAN3_24904	SPU_024904	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24905	SPU_024905	none	none
GLEAN3_24906	SPU_024906	none	none
GLEAN3_24907	SPU_024907	none	none
GLEAN3_24908	SPU_024908	none	none
GLEAN3_24910	SPU_024910	none	none
GLEAN3_24911	SPU_024911	contains RPN2 domain and AIR1 domain	none
GLEAN3_24916	SPU_024916	none	none
GLEAN3_24921	SPU_024921	none	none
GLEAN3_24935	SPU_024935	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24938	SPU_024938	none	none
GLEAN3_24943	SPU_024943	none	none
GLEAN3_24954	SPU_024954	contains SMC_prok_B domain	none
GLEAN3_24956	SPU_024956	none	none
GLEAN3_24959	SPU_024959	contains rad50 domain	none
GLEAN3_24961	SPU_024961	contains 2 ANK superfamily motifs	none
GLEAN3_24962	SPU_024962	none	none
GLEAN3_24971	SPU_024971	none	none
GLEAN3_24972	SPU_024972	none	none
GLEAN3_24979	SPU_024979	none	none
GLEAN3_24988	SPU_024988	none	none
GLEAN3_24990	SPU_024990	contains DnaJ_bact domain	none
GLEAN3_24992	SPU_024992	none	none
GLEAN3_24995	SPU_024995	contains 3 IG superfamily	none
GLEAN3_24998	SPU_024998	none	none
GLEAN3_25007	SPU_025007	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25008	SPU_025008	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25016	SPU_025016	none	none
GLEAN3_25018	SPU_025018	none	none
GLEAN3_25022	SPU_025022	contains COG4886 domain	none
GLEAN3_25023	SPU_025023	none	none
GLEAN3_25026	SPU_025026	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25029	SPU_025029	none	none
GLEAN3_25032	SPU_025032	none	none
GLEAN3_25033	SPU_025033	none	none
GLEAN3_25034	SPU_025034	none	none
GLEAN3_25035	SPU_025035	none	none
GLEAN3_25037	SPU_025037	none	none
GLEAN3_25038	SPU_025038	none	none
GLEAN3_25039	SPU_025039	none	none
GLEAN3_25046	SPU_025046	none	none
GLEAN3_25066	SPU_025066	none	none
GLEAN3_25069	SPU_025069	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25083	SPU_025083	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25084	SPU_025084	none	none
GLEAN3_25085	SPU_025085	contains 2A0113 domain	none
GLEAN3_25088	SPU_025088	none	none
GLEAN3_25094	SPU_025094	none	none
GLEAN3_25095	SPU_025095	none	none
GLEAN3_25098	SPU_025098	none	none
GLEAN3_25105	SPU_025105	none	none
GLEAN3_25108	SPU_025108	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25114	SPU_025114	none	none
GLEAN3_25116	SPU_025116	contains 3 CLECT superfamily	none
GLEAN3_25118	SPU_025118	none	none
GLEAN3_25134	SPU_025134	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25146	SPU_025146	none	none
GLEAN3_25151	SPU_025151	none	none
GLEAN3_25154	SPU_025154	none	none
GLEAN3_25171	SPU_025171	none	none
GLEAN3_25175	SPU_025175	none	none
GLEAN3_25177	SPU_025177	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25180	SPU_025180	contains 2 CLECT superfamily motifs	none
GLEAN3_25183	SPU_025183	none	none
GLEAN3_25187	SPU_025187	none	none
GLEAN3_25190	SPU_025190	none	none
GLEAN3_25195	SPU_025195	none	none
GLEAN3_25199	SPU_025199	none	none
GLEAN3_25201	SPU_025201	none	none
GLEAN3_25203	SPU_025203	none	none
GLEAN3_25205	SPU_025205	none	none
GLEAN3_25207	SPU_025207	contains PRK02224 domain	none
GLEAN3_25209	SPU_025209	none	none
GLEAN3_25215	SPU_025215	none	none
GLEAN3_25231	SPU_025231	none	none
GLEAN3_25232	SPU_025232	none	none
GLEAN3_25233	SPU_025233	none	none
GLEAN3_25243	SPU_025243	none	none
GLEAN3_25246	SPU_025246	none	none
GLEAN3_25250	SPU_025250	none	none
GLEAN3_25251	SPU_025251	none	none
GLEAN3_25259	SPU_025259	none	none
GLEAN3_25261	SPU_025261	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25265	SPU_025265	none	none
GLEAN3_25271	SPU_025271	none	none
GLEAN3_25277	SPU_025277	contains 3 HYR superfamily motifs	none
GLEAN3_25282	SPU_025282	contains 3 EGF_CA superfamily motifs	none
GLEAN3_25285	SPU_025285	none	none
GLEAN3_25287	SPU_025287	probable assembly chimera	none
GLEAN3_25288	SPU_025288	none	none
GLEAN3_25294	SPU_025294	none	none
GLEAN3_25295	SPU_025295	contains 2 HYR superfamily motifs	none
GLEAN3_25297	SPU_025297	none	none
GLEAN3_25314	SPU_025314	none	none
GLEAN3_25339	SPU_025339	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25342	SPU_025342	none	none
GLEAN3_25349	SPU_025349	contains O-FucT domain	none
GLEAN3_25359	SPU_025359	none	none
GLEAN3_25361	SPU_025361	contains 2 MAM superfamily motifs	none
GLEAN3_25371	SPU_025371	none	none
GLEAN3_25372	SPU_025372	none	none
GLEAN3_25373	SPU_025373	none	none
GLEAN3_25379	SPU_025379	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25380	SPU_025380	none	none
GLEAN3_25402	SPU_025402	none	none
GLEAN3_25404	SPU_025404	none	none
GLEAN3_25406	SPU_025406	none	none
GLEAN3_25407	SPU_025407	contains DPPIV_N domain	none
GLEAN3_25412	SPU_025412	none	none
GLEAN3_25423	SPU_025423	none	none
GLEAN3_25441	SPU_025441	none	none
GLEAN3_25445	SPU_025445	none	none
GLEAN3_25446	SPU_025446	none	none
GLEAN3_25447	SPU_025447	none	none
GLEAN3_25448	SPU_025448	contains 2 TSP_1 superfamily motifs	none
GLEAN3_25454	SPU_025454	contains 2 TSP_1 superfamily motifs	none
GLEAN3_25457	SPU_025457	none	none
GLEAN3_25473	SPU_025473	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25477	SPU_025477	none	none
GLEAN3_25478	SPU_025478	contains SSM4 domain. probable assembly chimera.	none
GLEAN3_25479	SPU_025479	contains COG4886 domain	none
GLEAN3_25491	SPU_025491	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25498	SPU_025498	none	none
GLEAN3_25500	SPU_025500	none	none
GLEAN3_25513	SPU_025513	none	none
GLEAN3_25519	SPU_025519	contains Sulfotransfer_1 domain	none
GLEAN3_25528	SPU_025528	none	none
GLEAN3_25529	SPU_025529	none	none
GLEAN3_25542	SPU_025542	contains 3 EGF_CA superfamily motifs	none
GLEAN3_25556	SPU_025556	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25561	SPU_025561	none	none
GLEAN3_25563	SPU_025563	contains 3 MAM superfamily motifs	none
GLEAN3_25570	SPU_025570	none	none
GLEAN3_25577	SPU_025577	none	none
GLEAN3_25580	SPU_025580	none	none
GLEAN3_25581	SPU_025581	none	none
GLEAN3_25582	SPU_025582	contains DUF2450 domain	none
GLEAN3_25583	SPU_025583	contains PRK12678 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25594	SPU_025594	none	none
GLEAN3_25597	SPU_025597	none	none
GLEAN3_25598	SPU_025598	none	none
GLEAN3_25605	SPU_025605	contains 2 CCP superfamily motifs	none
GLEAN3_25606	SPU_025606	none	none
GLEAN3_25607	SPU_025607	none	none
GLEAN3_25620	SPU_025620	none	none
GLEAN3_25621	SPU_025621	none	none
GLEAN3_25625	SPU_025625	none	none
GLEAN3_25631	SPU_025631	contains Smc domain	none
GLEAN3_25637	SPU_025637	none	none
GLEAN3_25641	SPU_025641	none	none
GLEAN3_25647	SPU_025647	contains SH3BP5 domain	none
GLEAN3_25648	SPU_025648	none	none
GLEAN3_25656	SPU_025656	none	none
GLEAN3_25659	SPU_025659	contains Fanconi_C domain	none
GLEAN3_25662	SPU_025662	contains 2A0119 domain	none
GLEAN3_25663	SPU_025663	contains 2 FA58C superfamily motifs	none
GLEAN3_25666	SPU_025666	none	none
GLEAN3_25675	SPU_025675	none	none
GLEAN3_25677	SPU_025677	contains 2 IG superfamily motifs	none
GLEAN3_25685	SPU_025685	none	none
GLEAN3_25704	SPU_025704	none	none
GLEAN3_25706	SPU_025706	contains APH domain	none
GLEAN3_25715	SPU_025715	none	none
GLEAN3_25721	SPU_025721	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25739	SPU_025739	none	none
GLEAN3_25742	SPU_025742	none	none
GLEAN3_25743	SPU_025743	none	none
GLEAN3_25745	SPU_025745	none	none
GLEAN3_25747	SPU_025747	none	none
GLEAN3_25748	SPU_025748	contains SpoVK domain	none
GLEAN3_25750	SPU_025750	none	none
GLEAN3_25768	SPU_025768	none	none
GLEAN3_25777	SPU_025777	none	none
GLEAN3_25780	SPU_025780	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25789	SPU_025789	none	none
GLEAN3_25795	SPU_025795	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25799	SPU_025799	contains Deme6 domain	none
GLEAN3_25802	SPU_025802	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25816	SPU_025816	none	none
GLEAN3_25818	SPU_025818	none	none
GLEAN3_25821	SPU_025821	none	none
GLEAN3_25822	SPU_025822	none	none
GLEAN3_25828	SPU_025828	none	none
GLEAN3_25834	SPU_025834	none	none
GLEAN3_25835	SPU_025835	poor sequence data: ~75% of amino acids are X	none
GLEAN3_25855	SPU_025855	none	none
GLEAN3_25867	SPU_025867	contains 2A0113 domain	none
GLEAN3_25871	SPU_025871	none	none
GLEAN3_25879	SPU_025879	none	none
GLEAN3_25884	SPU_025884	none	none
GLEAN3_25895	SPU_025895	none	none
GLEAN3_25904	SPU_025904	none	none
GLEAN3_25906	SPU_025906	contains COG4886 domain	none
GLEAN3_25908	SPU_025908	none	none
GLEAN3_25910	SPU_025910	contains GTPBP1 domain	none
GLEAN3_25912	SPU_025912	none	none
GLEAN3_25921	SPU_025921	none	none
GLEAN3_25922	SPU_025922	none	none
GLEAN3_25923	SPU_025923	none	none
GLEAN3_25928	SPU_025928	contains COG4886 domain	none
GLEAN3_25935	SPU_025935	none	none
GLEAN3_25936	SPU_025936	contains SMC_prok_A domain	none
GLEAN3_25943	SPU_025943	none	none
GLEAN3_25945	SPU_025945	none	none
GLEAN3_25957	SPU_025957	none	none
GLEAN3_25962	SPU_025962	contains 3 Annexin superfamily motifs	none
GLEAN3_25963	SPU_025963	none	none
GLEAN3_25967	SPU_025967	none	none
GLEAN3_25971	SPU_025971	contains rad18 domain	none
GLEAN3_25977	SPU_025977	probable assembly chimera	none
GLEAN3_25981	SPU_025981	contains PRK05761 domain	none
GLEAN3_25984	SPU_025984	contains 3 CCP superfamily motifs	none
GLEAN3_25988	SPU_025988	none	none
GLEAN3_25992	SPU_025992	contains 2a57 domain	none
GLEAN3_25993	SPU_025993	none	none
GLEAN3_25994	SPU_025994	none	none
GLEAN3_26004	SPU_026004	none	none
GLEAN3_26014	SPU_026014	none	none
GLEAN3_26015	SPU_026015	none	none
GLEAN3_26021	SPU_026021	none	none
GLEAN3_26022	SPU_026022	none	none
GLEAN3_26028	SPU_026028	none	none
GLEAN3_26029	SPU_026029	none	none
GLEAN3_26035	SPU_026035	none	none
GLEAN3_26037	SPU_026037	none	none
GLEAN3_26038	SPU_026038	none	none
GLEAN3_26045	SPU_026045	none	none
GLEAN3_26055	SPU_026055	none	none
GLEAN3_26061	SPU_026061	none	none
GLEAN3_26063	SPU_026063	none	none
GLEAN3_26078	SPU_026078	none	none
GLEAN3_26087	SPU_026087	none	none
GLEAN3_26089	SPU_026089	contains 2 IPT superfamily motifs	none
GLEAN3_26090	SPU_026090	none	none
GLEAN3_26091	SPU_026091	none	none
GLEAN3_26093	SPU_026093	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26102	SPU_026102	none	none
GLEAN3_26112	SPU_026112	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26115	SPU_026115	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26117	SPU_026117	contains 4 LDLa superfamily motifs	none
GLEAN3_26134	SPU_026134	contains SMC_prok_A domain and SMC_prok_B domain	none
GLEAN3_26139	SPU_026139	none	none
GLEAN3_26140	SPU_026140	none	none
GLEAN3_26148	SPU_026148	none	none
GLEAN3_26149	SPU_026149	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26169	SPU_026169	none	none
GLEAN3_26163	SPU_026163	contains 2 IG superfamily motifs	none
GLEAN3_26167	SPU_026167	none	none
GLEAN3_26168	SPU_026168	none	none
GLEAN3_26171	SPU_026171	none	none
GLEAN3_26172	SPU_026172	none	none
GLEAN3_26175	SPU_026175	none	none
GLEAN3_26177	SPU_026177	none	none
GLEAN3_26182	SPU_026182	none	none
GLEAN3_26187	SPU_026187	none	none
GLEAN3_26190	SPU_026190	contains 2 NIPSNAP superfamily motifs	none
GLEAN3_26201	SPU_026201	none	none
GLEAN3_26204	SPU_026204	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26218	SPU_026218	none	none
GLEAN3_26232	SPU_026232	contains SMC_N domain	none
GLEAN3_26233	SPU_026233	contains 2 MFS superfamily motifs and MFS_1 domain	none
GLEAN3_26242	SPU_026242	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26243	SPU_026243	none	none
GLEAN3_26245	SPU_026245	none	none
GLEAN3_26254	SPU_026254	none	none
GLEAN3_26260	SPU_026260	none	none
GLEAN3_26264	SPU_026264	none	none
GLEAN3_26265	SPU_026265	none	none
GLEAN3_26276	SPU_026276	none	none
GLEAN3_26282	SPU_026282	contains SMC_N domain	none
GLEAN3_26283	SPU_026283	contains SMC_N domain	none
GLEAN3_26284	SPU_026284	none	none
GLEAN3_26290	SPU_026290	contains 2 WSC superfamily motifs	none
GLEAN3_26302	SPU_026302	contains 4 EGF_Lam superfamily motifs and 4 Laminin_EGF domain motifs	none
GLEAN3_26303	SPU_026303	contains 4 EGF_Lam superfamily motifs and Laminin_EGF domain	none
GLEAN3_26305	SPU_026305	contains SMC_prok_A domain	none
GLEAN3_26315	SPU_026315	none	none
GLEAN3_26316	SPU_026316	none	none
GLEAN3_26324	SPU_026324	none	none
GLEAN3_26343	SPU_026343	contains Dynein_heavy domain	none
GLEAN3_26345	SPU_026345	none	none
GLEAN3_26363	SPU_026363	none	none
GLEAN3_26365	SPU_026365	contains FRQ1 domain	none
GLEAN3_26368	SPU_026368	none	none
GLEAN3_26369	SPU_026369	none	none
GLEAN3_26370	SPU_026370	none	none
GLEAN3_26377	SPU_026377	none	none
GLEAN3_26387	SPU_026387	none	none
GLEAN3_26391	SPU_026391	none	none
GLEAN3_26397	SPU_026397	none	none
GLEAN3_26398	SPU_026398	none	none
GLEAN3_26401	SPU_026401	none	none
GLEAN3_26402	SPU_026402	contains PRK08566 domain	none
GLEAN3_26405	SPU_026405	contains SMC_prok_B domain	none
GLEAN3_26406	SPU_026406	none	none
GLEAN3_26407	SPU_026407	none	none
GLEAN3_26409	SPU_026409	contains COG4886 domain	none
GLEAN3_26410	SPU_026410	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26413	SPU_026413	none	none
GLEAN3_26414	SPU_026414	none	none
GLEAN3_26415	SPU_026415	none	none
GLEAN3_26421	SPU_026421	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26423	SPU_026423	none	none
GLEAN3_26424	SPU_026424	none	none
GLEAN3_26435	SPU_026435	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26443	SPU_026443	contains amidohydrolase domain and AbgB domain	none
GLEAN3_26445	SPU_026445	none	none
GLEAN3_26451	SPU_026451	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26453	SPU_026453	none	none
GLEAN3_26455	SPU_026455	none	none
GLEAN3_26468	SPU_026468	none	none
GLEAN3_26478	SPU_026478	contains PAT1 domain	none
GLEAN3_26481	SPU_026481	none	none
GLEAN3_26482	SPU_026482	none	none
GLEAN3_26494	SPU_026494	contains carnitine_bodg domain	none
GLEAN3_26504	SPU_026504	none	none
GLEAN3_26509	SPU_026509	none	none
GLEAN3_26512	SPU_026512	contains SMC_prok_B domain	none
GLEAN3_26544	SPU_026544	contains Ftcd domain	none
GLEAN3_26525	SPU_026525	none	none
GLEAN3_26546	SPU_026546	contains 2 ANK superfamily motifs and Arp domains	none
GLEAN3_26548	SPU_026548	contains Myosin_tail_1 domain and PRK13729 domain	none
GLEAN3_26551	SPU_026551	contains 2 Ldl_recept_b superfamily motifs and COG3391 domain	none
GLEAN3_26553	SPU_026553	contains 2 LRR_RI superfamily motifs	none
GLEAN3_26565	SPU_026565	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26566	SPU_026566	none	none
GLEAN3_26568	SPU_026568	none	none
GLEAN3_26570	SPU_026570	none	none
GLEAN3_26571	SPU_026571	contains 2 TPR superfamily motifs and PRK11788 domain	none
GLEAN3_26574	SPU_026574	none	none
GLEAN3_26575	SPU_026575	contains 2 EGF_CA superfamily motifs	none
GLEAN3_26584	SPU_026584	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26585	SPU_026585	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26589	SPU_026589	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26602	SPU_026602	contains 2 PHD superfamily motifs	none
GLEAN3_26603	SPU_026603	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26606	SPU_026606	none	none
GLEAN3_26651	SPU_026651	none	none
GLEAN3_26632	SPU_026632	none	none
GLEAN3_26650	SPU_026650	contains 3 Gelsolin superfamily motifs	none
GLEAN3_26652	SPU_026652	none	none
GLEAN3_26653	SPU_026653	contains 2 IG superfamily motifs and Marek_A domain	none
GLEAN3_26654	SPU_026654	none	none
GLEAN3_26657	SPU_026657	none	none
GLEAN3_26665	SPU_026665	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26668	SPU_026668	none	none
GLEAN3_26675	SPU_026675	none	none
GLEAN3_26677	SPU_026677	none	none
GLEAN3_26679	SPU_026679	none	none
GLEAN3_26685	SPU_026685	none	none
GLEAN3_26686	SPU_026686	contains 2 FA58C superfamily motifs	none
GLEAN3_26689	SPU_026689	none	none
GLEAN3_26691	SPU_026691	none	none
GLEAN3_26697	SPU_026697	contains RAD18 domain	none
GLEAN3_26699	SPU_026699	none	none
GLEAN3_26703	SPU_026703	contains 2 IG superfamily motifs and V-set domain	none
GLEAN3_26710	SPU_026710	none	none
GLEAN3_26713	SPU_026713	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26715	SPU_026715	none	none
GLEAN3_26726	SPU_026726	none	none
GLEAN3_26727	SPU_026727	none	none
GLEAN3_26728	SPU_026728	none	none
GLEAN3_26729	SPU_026729	none	none
GLEAN3_26730	SPU_026730	none	none
GLEAN3_26742	SPU_026742	none	none
GLEAN3_26744	SPU_026744	none	none
GLEAN3_26750	SPU_026750	contains 4 IG superfamily motifs	none
GLEAN3_26753	SPU_026753	contains Sulfotransfer_1 domain	none
GLEAN3_26760	SPU_026760	none	none
GLEAN3_26765	SPU_026765	contains MviM domain	none
GLEAN3_26769	SPU_026769	none	none
GLEAN3_26775	SPU_026775	none	none
GLEAN3_26776	SPU_026776	none	none
GLEAN3_26780	SPU_026780	none	none
GLEAN3_26795	SPU_026795	none	none
GLEAN3_26808	SPU_026808	none	none
GLEAN3_26813	SPU_026813	none	none
GLEAN3_26815	SPU_026815	none	none
GLEAN3_26816	SPU_026816	none	none
GLEAN3_26818	SPU_026818	none	none
GLEAN3_26820	SPU_026820	none	none
GLEAN3_26843	SPU_026843	contains 2 FA58C superfamily motifs	none
GLEAN3_26850	SPU_026850	contains 2 MFS superfamily motifs	none
GLEAN3_26851	SPU_026851	contains Smc domain	none
GLEAN3_26853	SPU_026853	none	none
GLEAN3_26854	SPU_026854	none	none
GLEAN3_26856	SPU_026856	none	none
GLEAN3_26866	SPU_026866	none	none
GLEAN3_26867	SPU_026867	none	none
GLEAN3_26868	SPU_026868	none	none
GLEAN3_26869	SPU_026869	contains MRS6 domain	none
GLEAN3_26870	SPU_026870	contains COG5222 domain	none
GLEAN3_26871	SPU_026871	none	none
GLEAN3_26873	SPU_026873	none	none
GLEAN3_26876	SPU_026876	contains LIC domain	none
GLEAN3_26878	SPU_026878	contains 2 Kelch_1 superfamily motifs and COG3055 domain	none
GLEAN3_26882	SPU_026882	none	none
GLEAN3_26885	SPU_026885	none	none
GLEAN3_26888	SPU_026888	none	none
GLEAN3_26890	SPU_026890	none	none
GLEAN3_26891	SPU_026891	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26893	SPU_026893	none	none
GLEAN3_26900	SPU_026900	none	none
GLEAN3_26901	SPU_026901	none	none
GLEAN3_26902	SPU_026902	none	none
GLEAN3_26903	SPU_026903	none	none
GLEAN3_26908	SPU_026908	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26915	SPU_026915	contains 2 TSP_1 superfamily motifs	none
GLEAN3_26922	SPU_026922	none	none
GLEAN3_26929	SPU_026929	none	none
GLEAN3_26935	SPU_026935	none	none
GLEAN3_26941	SPU_026941	none	none
GLEAN3_26951	SPU_026951	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26959	SPU_026959	none	none
GLEAN3_26963	SPU_026963	none	none
GLEAN3_26968	SPU_026968	none	none
GLEAN3_26969	SPU_026969	contains 2 ANK superfamily motifs	none
GLEAN3_26978	SPU_026978	contains CLH domain	none
GLEAN3_26980	SPU_026980	contains 2 Kelch_1 superfamily motifs and COG3055 domain	none
GLEAN3_27003	SPU_027003	contains COG1413 domain	none
GLEAN3_27011	SPU_027011	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27021	SPU_027021	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27024	SPU_027024	contains DNA_pol_B_2 domain and rne domain	none
GLEAN3_27031	SPU_027031	none	none
GLEAN3_27038	SPU_027038	none	none
GLEAN3_27042	SPU_027042	contains 2A0601 domain	none
GLEAN3_27044	SPU_027044	none	none
GLEAN3_27045	SPU_027045	none	none
GLEAN3_27051	SPU_027051	none	none
GLEAN3_27055	SPU_027055	none	none
GLEAN3_27062	SPU_027062	none	none
GLEAN3_27064	SPU_027064	contains 2 PDZ superfamily motifs	none
GLEAN3_27067	SPU_027067	none	none
GLEAN3_27070	SPU_027070	none	none
GLEAN3_27073	SPU_027073	none	none
GLEAN3_27081	SPU_027081	none	none
GLEAN3_27088	SPU_027088	none	none
GLEAN3_27091	SPU_027091	none	none
GLEAN3_27093	SPU_027093	none	none
GLEAN3_27094	SPU_027094	none	none
GLEAN3_27097	SPU_027097	none	none
GLEAN3_27099	SPU_027099	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27105	SPU_027105	none	none
GLEAN3_27106	SPU_027106	none	none
GLEAN3_27112	SPU_027112	none	none
GLEAN3_27124	SPU_027124	none	none
GLEAN3_27129	SPU_027129	none	none
GLEAN3_27134	SPU_027134	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27135	SPU_027135	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27139	SPU_027139	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27142	SPU_027142	contains 3 HYR superfamily motifs	none
GLEAN3_27154	SPU_027154	contains 2A0113 domain	none
GLEAN3_27161	SPU_027161	contains 2 Hyd_WA superfamily motifs	none
GLEAN3_27165	SPU_027165	none	none
GLEAN3_27169	SPU_027169	contains 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27170	SPU_027170	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27174	SPU_027174	contains SMC_prok_B domain	none
GLEAN3_27177	SPU_027177	none	none
GLEAN3_27175	SPU_027175	none	none
GLEAN3_27187	SPU_027187	contains infB domain	none
GLEAN3_27189	SPU_027189	none	none
GLEAN3_27196	SPU_027196	none	none
GLEAN3_27198	SPU_027198	none	none
GLEAN3_27200	SPU_027200	contains CRM1 domain	none
GLEAN3_27211	SPU_027211	contains EH domain	none
GLEAN3_27212	SPU_027212	none	none
GLEAN3_27217	SPU_027217	none	none
GLEAN3_27228	SPU_027228	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27231	SPU_027231	none	none
GLEAN3_27241	SPU_027241	none	none
GLEAN3_27243	SPU_027243	none	none
GLEAN3_27248	SPU_027248	none	none
GLEAN3_27251	SPU_027251	none	none
GLEAN3_27258	SPU_027258	none	none
GLEAN3_27262	SPU_027262	none	none
GLEAN3_27271	SPU_027271	probable assembly chimera	none
GLEAN3_27273	SPU_027273	none	none
GLEAN3_27283	SPU_027283	contains Qor domain	none
GLEAN3_27296	SPU_027296	contains PRK02106 domain	none
GLEAN3_27298	SPU_027298	contains PRK02106 domain	none
GLEAN3_27301	SPU_027301	none	none
GLEAN3_27303	SPU_027303	contains COG3603 domain	none
GLEAN3_27305	SPU_027305	none	none
GLEAN3_27311	SPU_027311	none	none
GLEAN3_27322	SPU_027322	none	none
GLEAN3_27324	SPU_027324	none	none
GLEAN3_27325	SPU_027325	none	none
GLEAN3_27339	SPU_027339	none	none
GLEAN3_27346	SPU_027346	none	none
GLEAN3_27347	SPU_027347	probable assembly chimera	none
GLEAN3_27348	SPU_027348	none	none
GLEAN3_27365	SPU_027365	none	none
GLEAN3_27366	SPU_027366	contains 2 MFS superfamily motifs	none
GLEAN3_27367	SPU_027367	none	none
GLEAN3_27373	SPU_027373	contains COG5222 domain and SMC_prok_A domain	none
GLEAN3_27385	SPU_027385	contains 3 IPT superfamily motifs	none
GLEAN3_27392	SPU_027392	contains DUF803 domain	none
GLEAN3_27396	SPU_027396	contains 2 FA58C superfamily motifs	none
GLEAN3_27397	SPU_027397	contains 2A0113 domain	none
GLEAN3_27409	SPU_027409	none	none
GLEAN3_27416	SPU_027416	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27417	SPU_027417	contains 2 IG superfamily motifs	none
GLEAN3_27418	SPU_027418	none	none
GLEAN3_27421	SPU_027421	none	none
GLEAN3_27422	SPU_027422	none	none
GLEAN3_27426	SPU_027426	none	none
GLEAN3_27427	SPU_027427	none	none
GLEAN3_27429	SPU_027429	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27434	SPU_027434	contains LIC domain	none
GLEAN3_27435	SPU_027435	none	none
GLEAN3_27439	SPU_027439	none	none
GLEAN3_27441	SPU_027441	none	none
GLEAN3_27442	SPU_027442	none	none
GLEAN3_27444	SPU_027444	contains HAD-SF-IA-v3 domain	none
GLEAN3_27453	SPU_027453	contains COG4955 domain	none
GLEAN3_27455	SPU_027455	none	none
GLEAN3_27459	SPU_027459	contains TelA domain	none
GLEAN3_27460	SPU_027460	probable assembly chimera	none
GLEAN3_27466	SPU_027466	none	none
GLEAN3_27470	SPU_027470	contains MPH1 domain	none
GLEAN3_27471	SPU_027471	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27477	SPU_027477	none	none
GLEAN3_27478	SPU_027478	none	none
GLEAN3_27485	SPU_027485	none	none
GLEAN3_27500	SPU_027500	contains 2A0119 domain	none
GLEAN3_27504	SPU_027504	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27506	SPU_027506	none	none
GLEAN3_27521	SPU_027521	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27536	SPU_027536	none	none
GLEAN3_27549	SPU_027549	none	none
GLEAN3_27554	SPU_027554	contains PABP-1234 domain	none
GLEAN3_27567	SPU_027567	contains COG4581 domain	none
GLEAN3_27574	SPU_027574	none	none
GLEAN3_27577	SPU_027577	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27585	SPU_027585	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27587	SPU_027587	none	none
GLEAN3_27588	SPU_027588	contains 2 EGF_CA superfamily motifs	none
GLEAN3_27589	SPU_027589	none	none
GLEAN3_27590	SPU_027590	none	none
GLEAN3_27595	SPU_027595	probable assembly chimera	none
GLEAN3_27611	SPU_027611	none	none
GLEAN3_27612	SPU_027612	none	none
GLEAN3_27621	SPU_027621	none	none
GLEAN3_27647	SPU_027647	none	none
GLEAN3_27656	SPU_027656	contains Nup88 domain	none
GLEAN3_27662	SPU_027662	none	none
GLEAN3_27675	SPU_027675	none	none
GLEAN3_27680	SPU_027680	none	none
GLEAN3_27681	SPU_027681	none	none
GLEAN3_27683	SPU_027683	none	none
GLEAN3_27688	SPU_027688	none	none
GLEAN3_27692	SPU_027692	contains COG2129 domain	none
GLEAN3_27704	SPU_027704	none	none
GLEAN3_27716	SPU_027716	none	none
GLEAN3_27722	SPU_027722	none	none
GLEAN3_27723	SPU_027723	contains CSE1 domain	none
GLEAN3_27726	SPU_027726	none	none
GLEAN3_27732	SPU_027732	none	none
GLEAN3_27733	SPU_027733	none	none
GLEAN3_27738	SPU_027738	none	none
GLEAN3_27744	SPU_027744	none	none
GLEAN3_27745	SPU_027745	none	none
GLEAN3_27759	SPU_027759	none	none
GLEAN3_27767	SPU_027767	none	none
GLEAN3_27771	SPU_027771	none	none
GLEAN3_27788	SPU_027788	none	none
GLEAN3_27809	SPU_027809	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27818	SPU_027818	none	none
GLEAN3_27823	SPU_027823	none	none
GLEAN3_27830	SPU_027830	none	none
GLEAN3_27831	SPU_027831	none	none
GLEAN3_27835	SPU_027835	Strongylocentrotus purpuratus-specific protein. probable assembly chimera.	none
GLEAN3_27860	SPU_027860	none	none
GLEAN3_27863	SPU_027863	contains 2 MAM superfamily motifs	none
GLEAN3_27838	SPU_027838	none	none
GLEAN3_27866	SPU_027866	none	none
GLEAN3_27867	SPU_027867	none	none
GLEAN3_27875	SPU_027875	none	none
GLEAN3_27881	SPU_027881	probable assembly chimera	none
GLEAN3_27884	SPU_027884	none	none
GLEAN3_27900	SPU_027900	none	none
GLEAN3_27908	SPU_027908	none	none
GLEAN3_27917	SPU_027917	contains 4 IG superfamily motifs	none
GLEAN3_27924	SPU_027924	none	none
GLEAN3_27925	SPU_027925	none	none
GLEAN3_27928	SPU_027928	none	none
GLEAN3_27929	SPU_027929	none	none
GLEAN3_27931	SPU_027931	none	none
GLEAN3_27933	SPU_027933	none	none
GLEAN3_27934	SPU_027934	none	none
GLEAN3_27943	SPU_027943	none	none
GLEAN3_27944	SPU_027944	none	none
GLEAN3_27945	SPU_027945	contains CaiA domain	none
GLEAN3_27948	SPU_027948	contains 3 IPT superfamily motifs	none
GLEAN3_27958	SPU_027958	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27963	SPU_027963	none	none
GLEAN3_27968	SPU_027968	none	none
GLEAN3_27974	SPU_027974	none	none
GLEAN3_27988	SPU_027988	none	none
GLEAN3_27989	SPU_027989	none	none
GLEAN3_27992	SPU_027992	none	none
GLEAN3_27996	SPU_027996	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27999	SPU_027999	none	none
GLEAN3_28008	SPU_028008	contains SMC_prok_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28012	SPU_028012	contains 3 ANK superfamily motifs	none
GLEAN3_28015	SPU_028015	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28022	SPU_028022	none	none
GLEAN3_28023	SPU_028023	none	none
GLEAN3_28024	SPU_028024	none	none
GLEAN3_28027	SPU_028027	none	none
GLEAN3_28037	SPU_028037	none	none
GLEAN3_28040	SPU_028040	none	none
GLEAN3_28044	SPU_028044	contains FIMAC domain and PRK08853 domain	none
GLEAN3_28052	SPU_028052	contains 3 SPEC superfamily motifs	none
GLEAN3_28056	SPU_028056	none	none
GLEAN3_28063	SPU_028063	none	none
GLEAN3_28072	SPU_028072	none	none
GLEAN3_28074	SPU_028074	probable assembly chimera	none
GLEAN3_28084	SPU_028084	none	none
GLEAN3_28085	SPU_028085	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28095	SPU_028095	none	none
GLEAN3_28106	SPU_028106	none	none
GLEAN3_28114	SPU_028114	none	none
GLEAN3_28115	SPU_028115	none	none
GLEAN3_28116	SPU_028116	contains RVT_1 domain	none
GLEAN3_15083	SPU_015083	none	none
GLEAN3_17559	SPU_017559	none	none
GLEAN3_21156	SPU_021156	none	none
GLEAN3_28119	SPU_028119	none	none
GLEAN3_28125	SPU_028125	none	none
GLEAN3_28126	SPU_028126	contains 3 ANK superfamily motifs	none
GLEAN3_28127	SPU_028127	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28136	SPU_028136	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28138	SPU_028138	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28143	SPU_028143	contains 4 HYR superfamily motifs	none
GLEAN3_28154	SPU_028154	none	none
GLEAN3_28156	SPU_028156	none	none
GLEAN3_28165	SPU_028165	none	none
GLEAN3_28166	SPU_028166	none	none
GLEAN3_28172	SPU_028172	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28173	SPU_028173	none	none
GLEAN3_28177	SPU_028177	none	none
GLEAN3_28186	SPU_028186	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28192	SPU_028192	none	none
GLEAN3_28193	SPU_028193	probable assembly chimera	none
GLEAN3_28194	SPU_028194	none	none
GLEAN3_28195	SPU_028195	none	none
GLEAN3_28206	SPU_028206	none	none
GLEAN3_28212	SPU_028212	contains 2 MAM superfamily motifs	none
GLEAN3_28215	SPU_028215	none	none
GLEAN3_28219	SPU_028219	contains DUF1253 domain	none
GLEAN3_28220	SPU_028220	none	none
GLEAN3_28231	SPU_028231	none	none
GLEAN3_28232	SPU_028232	none	none
GLEAN3_28235	SPU_028235	none	none
GLEAN3_28242	SPU_028242	none	none
GLEAN3_28243	SPU_028243	contains SGL domain	none
GLEAN3_28244	SPU_028244	none	none
GLEAN3_28248	SPU_028248	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28252	SPU_028252	none	none
GLEAN3_28262	SPU_028262	none	none
GLEAN3_28264	SPU_028264	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28269	SPU_028269	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28281	SPU_028281	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28283	SPU_028283	none	none
GLEAN3_28284	SPU_028284	contains 2 P_loop_NTPase superfamily motifs	none
GLEAN3_28285	SPU_028285	none	none
GLEAN3_28288	SPU_028288	none	none
GLEAN3_28289	SPU_028289	contains 2 MFS superfamily motifs and 2A0119 domain	none
GLEAN3_28314	SPU_028314	contains UvrA domain	none
GLEAN3_28315	SPU_028315	none	none
GLEAN3_28317	SPU_028317	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28321	SPU_028321	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28322	SPU_028322	contains 2 EFh superfamily motifs and FRQ1 domain	none
GLEAN3_28324	SPU_028324	none	none
GLEAN3_28335	SPU_028335	none	none
GLEAN3_28355	SPU_028355	none	none
GLEAN3_28357	SPU_028357	none	none
GLEAN3_28360	SPU_028360	contains CAL1 domain	none
GLEAN3_28364	SPU_028364	contains Dynein_heavy domain	none
GLEAN3_28365	SPU_028365	contains Dynein_heavy domain	none
GLEAN3_28369	SPU_028369	contains 2A0113 domain and CynX domain	none
GLEAN3_28371	SPU_028371	none	none
GLEAN3_28378	SPU_028378	none	none
GLEAN3_28386	SPU_028386	none	none
GLEAN3_28392	SPU_028392	none	none
GLEAN3_28405	SPU_028405	none	none
GLEAN3_28409	SPU_028409	contains COG2940 domain	none
GLEAN3_28414	SPU_028414	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28417	SPU_028417	none	none
GLEAN3_28418	SPU_028418	contains 2 MAM superfamily motifs	none
GLEAN3_28419	SPU_028419	contains Sulfotransfer_1 domain	none
GLEAN3_28426	SPU_028426	contains SMC_prok_B domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28427	SPU_028427	contains 2 ANK superfamily motifs and UbiH domain	none
GLEAN3_28435	SPU_028435	none	none
GLEAN3_28436	SPU_028436	none	none
GLEAN3_28441	SPU_028441	contains soxA_mon domain	none
GLEAN3_28442	SPU_028442	contains DadA domain	none
GLEAN3_28443	SPU_028443	contains DadA domain	none
GLEAN3_28444	SPU_028444	none	none
GLEAN3_28450	SPU_028450	contains 2 IG superfamily motifs	none
GLEAN3_28464	SPU_028464	none	none
GLEAN3_28466	SPU_028466	none	none
GLEAN3_28472	SPU_028472	contains Sulfotransfer_1 domain	none
GLEAN3_28476	SPU_028476	none	none
GLEAN3_28484	SPU_028484	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28488	SPU_028488	none	none
GLEAN3_28489	SPU_028489	none	none
GLEAN3_28490	SPU_028490	none	none
GLEAN3_28492	SPU_028492	none	none
GLEAN3_28500	SPU_028500	contains Filament domain	none
GLEAN3_28502	SPU_028502	contains 2 Filamin superfamily motifs	none
GLEAN3_28503	SPU_028503	none	none
GLEAN3_28507	SPU_028507	none	none
GLEAN3_28517	SPU_028517	none	none
GLEAN3_28519	SPU_028519	contains 5 HYR superfamily motifs	none
GLEAN3_28528	SPU_028528	none	none
GLEAN3_28531	SPU_028531	none	none
GLEAN3_28553	SPU_028553	none	none
GLEAN3_28574	SPU_028574	none	none
GLEAN3_28578	SPU_028578	none	none
GLEAN3_28582	SPU_028582	none	none
GLEAN3_28588	SPU_028588	none	none
GLEAN3_28591	SPU_028591	none	none
GLEAN3_28598	SPU_028598	none	none
GLEAN3_28614	SPU_028614	none	none
GLEAN3_28616	SPU_028616	none	none
GLEAN3_28623	SPU_028623	none	none
GLEAN3_28617	SPU_028617	none	none
GLEAN3_28624	SPU_028624	none	none
GLEAN3_28642	SPU_028642	none	none
GLEAN3_28643	SPU_028643	none	none
GLEAN3_28644	SPU_028644	none	none
GLEAN3_28645	SPU_028645	none	none
GLEAN3_28659	SPU_028659	contains RPN6 domain	none
GLEAN3_28663	SPU_028663	contains COG5273 domain	none
GLEAN3_28666	SPU_028666	none	none
GLEAN3_28672	SPU_028672	contains degP_htrA domain	none
GLEAN3_28674	SPU_028674	none	none
GLEAN3_28677	SPU_028677	none	none
GLEAN3_28702	SPU_028702	none	none
GLEAN3_28705	SPU_028705	contains 2 HYR superfamily motifs	none
GLEAN3_28715	SPU_028715	contains Macoilin domain	none
GLEAN3_28718	SPU_028718	none	none
GLEAN3_28719	SPU_028719	none	none
GLEAN3_28722	SPU_028722	none	none
GLEAN3_28731	SPU_028731	none	none
GLEAN3_28733	SPU_028733	none	none
GLEAN3_28737	SPU_028737	none	none
GLEAN3_28743	SPU_028743	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_28744	SPU_028744	none	none
GLEAN3_28747	SPU_028747	contains COG3696 domain	none
GLEAN3_28757	SPU_028757	none	none
GLEAN3_28759	SPU_028759	none	none
GLEAN3_28760	SPU_028760	none	none
GLEAN3_28761	SPU_028761	contains Aes domain	none
GLEAN3_28762	SPU_028762	none	none
GLEAN3_28764	SPU_028764	none	none
GLEAN3_28767	SPU_028767	none	none
GLEAN3_28782	SPU_028782	none	none
GLEAN3_28792	SPU_028792	none	none
GLEAN3_28793	SPU_028793	none	none
GLEAN3_28798	SPU_028798	none	none
GLEAN3_28801	SPU_028801	none	none
GLEAN3_28808	SPU_028808	none	none
GLEAN3_28818	SPU_028818	none	none
GLEAN3_28819	SPU_028819	none	none
GLEAN3_28822	SPU_028822	none	none
GLEAN3_28823	SPU_028823	none	none
GLEAN3_28824	SPU_028824	none	none
GLEAN3_28825	SPU_028825	none	none
GLEAN3_28832	SPU_028832	none	none
GLEAN3_28838	SPU_028838	none	none
GLEAN3_28839	SPU_028839	none	none
GLEAN3_28851	SPU_028851	contains PlsC domain	none
GLEAN3_28855	SPU_028855	none	none
GLEAN3_28863	SPU_028863	probable assembly chimera	none
GLEAN3_28865	SPU_028865	none	none
GLEAN3_28869	SPU_028869	none	none
GLEAN3_28901	SPU_028901	contains TRF4 domain	none
GLEAN3_28911	SPU_028911	contains MRS6 domain	none
GLEAN3_28915	SPU_028915	none	none
GLEAN3_28923	SPU_028923	none	none
GLEAN3_28924	SPU_028924	none	none
GLEAN3_28928	SPU_028928	contains hCaCC domain	none
GLEAN3_28935	SPU_028935	none	none
GLEAN3_28941	SPU_028941	none	none
GLEAN3_28944	SPU_028944	none	none
GLEAN3_00094	SPU_000094	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00124	SPU_000124	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00143	SPU_000143	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00245	SPU_000245	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00333	SPU_000333	contains 2 zf-BED superfamily motifs near N-terminus and hATC superfamily motif at C-terminus. Ac-like transposable element homolog.	none
GLEAN3_00345	SPU_000345	contains SAM superfamily motif at C-terminus. contains PRK12323 domain.	none
GLEAN3_00347	SPU_000347	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00353	SPU_000353	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00396	SPU_000396	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00405	SPU_000405	none	none
GLEAN3_00423	SPU_000423	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00438	SPU_000438	none	none
GLEAN3_00448	SPU_000448	none	none
GLEAN3_00477	SPU_000477	contains Smc domain	none
GLEAN3_00518	SPU_000518	none	none
GLEAN3_00529	SPU_000529	contains 2 CUB superfamily motifs	none
GLEAN3_00551	SPU_000551	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00559	SPU_000559	contains 2 COG5028 domains	none
GLEAN3_00565	SPU_000565	none	none
GLEAN3_00570	SPU_000570	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00573	SPU_000573	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00599	SPU_000599	contains KA1 superfamily motif at C-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_00610	SPU_000610	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00621	SPU_000621	contains 5 LDLa superfamily motifs and 2 CUB superfamily motifs	none
GLEAN3_00659	SPU_000659	none	none
GLEAN3_00687	SPU_000687	none	none
GLEAN3_00689	SPU_000689	none	none
GLEAN3_00696	SPU_000696	none	none
GLEAN3_00716	SPU_000716	none	none
GLEAN3_00730	SPU_000730	none	none
GLEAN3_00752	SPU_000752	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00765	SPU_000765	none	none
GLEAN3_00775	SPU_000775	none	none
GLEAN3_00792	SPU_000792	none	none
GLEAN3_00802	SPU_000802	none	none
GLEAN3_00805	SPU_000805	none	none
GLEAN3_00815	SPU_000815	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_00867	SPU_000867	none	none
GLEAN3_00868	SPU_000868	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00898	SPU_000898	none	none
GLEAN3_00919	SPU_000919	none	none
GLEAN3_00920	SPU_000920	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00934	SPU_000934	none	none
GLEAN3_00993	SPU_000993	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01001	SPU_001001	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01019	SPU_001019	contains 3 ANK superfamily motifs and Arp domain motifs	none
GLEAN3_01025	SPU_001025	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01029	SPU_001029	none	none
GLEAN3_01053	SPU_001053	none	none
GLEAN3_01057	SPU_001057	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01071	SPU_001071	none	none
GLEAN3_01100	SPU_001100	none	none
GLEAN3_01121	SPU_001121	contains SMC_N domain	none
GLEAN3_01136	SPU_001136	none	none
GLEAN3_01140	SPU_001140	none	none
GLEAN3_01157	SPU_001157	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01189	SPU_001189	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01206	SPU_001206	contains 2 Death superfamily motifs	none
GLEAN3_01257	SPU_001257	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01276	SPU_001276	none	none
GLEAN3_01298	SPU_001298	none	none
GLEAN3_01330	SPU_001330	none	none
GLEAN3_01332	SPU_001332	contains Adaptin_N domain	none
GLEAN3_01335	SPU_001335	contains Smc domain	none
GLEAN3_01340	SPU_001340	none	none
GLEAN3_01346	SPU_001346	contains 2 FA58C superfamily motifs and 11 EGF_CA superfamily motifs	none
GLEAN3_01347	SPU_001347	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01348	SPU_001348	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01349	SPU_001349	none	none
GLEAN3_01358	SPU_001358	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01409	SPU_001409	contains 4 SRCR superfamily motifs	none
GLEAN3_01427	SPU_001427	contains 4 SPEC superfamily motifs	none
GLEAN3_01434	SPU_001434	contains 2 COG4886 domains	none
GLEAN3_01445	SPU_001445	GLEAN3_01497	none
GLEAN3_01507	SPU_001507	none	none
GLEAN3_01516	SPU_001516	contains 5 MAM superfamily motifs	none
GLEAN3_01528	SPU_001528	contains COG2319 domain	none
GLEAN3_01540	SPU_001540	none	none
GLEAN3_01562	SPU_001562	contains 2 MFS superfamily motifs	none
GLEAN3_01564	SPU_001564	none	none
GLEAN3_01575	SPU_001575	none	none
GLEAN3_01583	SPU_001583	none	none
GLEAN3_01591	SPU_001591	contains COG5635 domain.Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_01606	SPU_001606	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01646	SPU_001646	none	none
GLEAN3_01669	SPU_001669	none	none
GLEAN3_01706	SPU_001706	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_01733	SPU_001733	none	none
GLEAN3_01762	SPU_001762	contains COG4886 domain	none
GLEAN3_01784	SPU_001784	none	none
GLEAN3_01791	SPU_001791	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_01816	SPU_001816	contains COG5141 domain	none
GLEAN3_01857	SPU_001857	contains 6 ANK superfamily motifs and Arp domains	none
GLEAN3_01896	SPU_001896	none	none
GLEAN3_01906	SPU_001906	contains 2 MFS superfamily motifs	none
GLEAN3_01922	SPU_001922	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_01946	SPU_001946	contains 6 ANK superfamily motifs and Arp domains	none
GLEAN3_01947	SPU_001947	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_01949	SPU_001949	none	none
GLEAN3_01952	SPU_001952	contains 2 7tm_1 superfamily motifs	none
GLEAN3_01974	SPU_001974	none	none
GLEAN3_02010	SPU_002010	contains 2 HYR superfamily motifs	none
GLEAN3_02011	SPU_002011	contains 2 DCX superfamily motifs	none
GLEAN3_02066	SPU_002066	none	none
GLEAN3_02071	SPU_002071	contains MCM2 domain and MCM domain	none
GLEAN3_02073	SPU_002073	none	none
GLEAN3_02080	SPU_002080	contains 4 LDLa_re superfamily motifs and 5 EGF_CA superfamily motifs	none
GLEAN3_02086	SPU_002086	contains 2 LRR_RI superfamily motifs	none
GLEAN3_02096	SPU_002096	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02111	SPU_002111	contains 6 ANK superfamily motifs and Arp domains	none
GLEAN3_02121	SPU_002121	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02125	SPU_002125	contains PRK12678 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02145	SPU_002145	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02159	SPU_002159	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02180	SPU_002180	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02187	SPU_002187	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02198	SPU_002198	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02212	SPU_002212	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02217	SPU_002217	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02222	SPU_002222	none	none
GLEAN3_02291	SPU_002291	none	none
GLEAN3_02312	SPU_002312	none	none
GLEAN3_02315	SPU_002315	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02326	SPU_002326	contains SAM superfamily motif at C-terminus and PRK12323 domain near N-terminus	none
GLEAN3_02330	SPU_002330	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02336	SPU_002336	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02340	SPU_002340	contains AST1 domain	none
GLEAN3_02341	SPU_002341	none	none
GLEAN3_02373	SPU_002373	none	none
GLEAN3_02390	SPU_002390	contains 2 BBOX superfamily motifs	none
GLEAN3_02396	SPU_002396	none	none
GLEAN3_02397	SPU_002397	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02409	SPU_002409	none	none
GLEAN3_02414	SPU_002414	none	none
GLEAN3_02426	SPU_002426	contains CYK3 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02444	SPU_002444	contains 2 C2 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02447	SPU_002447	none	none
GLEAN3_02475	SPU_002475	contains 2 MFS superfamily motifs	none
GLEAN3_02483	SPU_002483	none	none
GLEAN3_02494	SPU_002494	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02497	SPU_002497	contains FixC domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02499	SPU_002499	none	none
GLEAN3_02508	SPU_002508	none	none
GLEAN3_02512	SPU_002512	none	none
GLEAN3_02513	SPU_002513	contains 2 Exo_endo_phos superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02530	SPU_002530	none	none
GLEAN3_02539	SPU_002539	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02545	SPU_002545	contains 2 SH3 superfamily motifs	none
GLEAN3_02561	SPU_002561	contains 2 Sulfatase superfamily motifs	none
GLEAN3_02562	SPU_002562	none	none
GLEAN3_02574	SPU_002574	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02579	SPU_002579	contains 2 APG9 superfamily motifs	none
GLEAN3_02594	SPU_002594	none	none
GLEAN3_02635	SPU_002635	COG3055 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02651	SPU_002651	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02654	SPU_002654	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02665	SPU_002665	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02666	SPU_002666	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02671	SPU_002671	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_02679	SPU_002679	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02700	SPU_002700	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02701	SPU_002701	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02714	SPU_002714	none	none
GLEAN3_02728	SPU_002728	contains RecD domain	none
GLEAN3_02745	SPU_002745	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02748	SPU_002748	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02754	SPU_002754	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02755	SPU_002755	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02760	SPU_002760	none	none
GLEAN3_02842	SPU_002842	contains TRF4 domain	none
GLEAN3_02852	SPU_002852	low quality sequence info	none
GLEAN3_02865	SPU_002865	contains 2 WD40 superfamily motifs	none
GLEAN3_02878	SPU_002878	contains Smc domain	none
GLEAN3_02904	SPU_002904	none	none
GLEAN3_02917	SPU_002917	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02941	SPU_002941	contains IMD superfamily motif at N-terminus	none
GLEAN3_02946	SPU_002946	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02957	SPU_002957	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02965	SPU_002965	none	none
GLEAN3_02992	SPU_002992	none	none
GLEAN3_02966	SPU_002966	contains 3 Mito_carr superfamily motifs in tandem at C-terminus	none
GLEAN3_03023	SPU_003023	contains 8 ANK superfamily motifs and Arp domains	none
GLEAN3_03040	SPU_003040	contains 2 CUB superfamily motifs and 6 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03072	SPU_003072	none	none
GLEAN3_03075	SPU_003075	none	none
GLEAN3_03089	SPU_003089	contains 2 Neur_chan_LBD superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03091	SPU_003091	none	none
GLEAN3_03100	SPU_003100	contains MRS6 domain	none
GLEAN3_03120	SPU_003120	contains CDC55 domain	none
GLEAN3_03144	SPU_003144	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03146	SPU_003146	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03159	SPU_003159	none	none
GLEAN3_03163	SPU_003163	leucine-rich repeat kinase 2-like	none
GLEAN3_03185	SPU_003185	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03218	SPU_003218	endonuclease reverse transcriptase-like	none
GLEAN3_03233	SPU_003233	contains SH3 superfamily motif at N-terminus and COG5531 domain	none
GLEAN3_03243	SPU_003243	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03245	SPU_003245	contains PRK07768 domain and PRK12476 domain and AMP-binding domain	none
GLEAN3_03278	SPU_003278	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03300	SPU_003300	contains 2 FA58C superfamily motifs	none
GLEAN3_03328	SPU_003328	contains 2 NADB_Rossmann superfamily motifs	none
GLEAN3_03335	SPU_003335	none	none
GLEAN3_03344	SPU_003344	none	none
GLEAN3_03350	SPU_003350	contains 2 CUB superfamily motifs and 2 LDLa superfamily motifs	none
GLEAN3_03364	SPU_003364	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03365	SPU_003365	contains 2 7tm_1 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03434	SPU_003434	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03440	SPU_003440	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03448	SPU_003448	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03452	SPU_003452	contains flgC domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03461	SPU_003461	KIAA1753 protein-like	none
GLEAN3_03468	SPU_003468	none	none
GLEAN3_03475	SPU_003475	none	none
GLEAN3_03494	SPU_003494	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03497	SPU_003497	contains zf-MYND superfamily motif near C-terminus	none
GLEAN3_03499	SPU_003499	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03506	SPU_003506	none	none
GLEAN3_03516	SPU_003516	none	none
GLEAN3_03522	SPU_003522	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03533	SPU_003533	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03538	SPU_003538	contains 8 ANK superfamily motifs and Arp domains	none
GLEAN3_03545	SPU_003545	none	none
GLEAN3_03568	SPU_003568	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03592	SPU_003592	none	none
GLEAN3_03602	SPU_003602	contains COG1480 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03605	SPU_003605	contains 3 CCP superfamily motifs	none
GLEAN3_03630	SPU_003630	contains PRK13342 domain	none
GLEAN3_03634	SPU_003634	none	none
GLEAN3_03637	SPU_003637	contains 3 CUB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_03644	SPU_003644	contains SMC_N domain and TPR_MLP1_2 domain	none
GLEAN3_03657	SPU_003657	contains Smc domain	none
GLEAN3_03674	SPU_003674	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03675	SPU_003675	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03683	SPU_003683	none	none
GLEAN3_03755	SPU_003755	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03690	SPU_003690	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03692	SPU_003692	none	none
GLEAN3_03697	SPU_003697	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03707	SPU_003707	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03735	SPU_003735	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03752	SPU_003752	contains 2 LDL_recept_b superfamily motifs and 2 EGF_CA superfamily motifs	none
GLEAN3_03767	SPU_003767	contains Smc domain	none
GLEAN3_03769	SPU_003769	contains 4 LDLa superfamily motifs	none
GLEAN3_03776	SPU_003776	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03788	SPU_003788	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03807	SPU_003807	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03816	SPU_003816	contains 4 MAM superfamily motifs	none
GLEAN3_03836	SPU_003836	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03872	SPU_003872	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03873	SPU_003873	none	none
GLEAN3_03890	SPU_003890	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03897	SPU_003897	contains MAP65_ASE1 domain	none
GLEAN3_03910	SPU_003910	none	none
GLEAN3_03946	SPU_003946	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03949	SPU_003949	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03979	SPU_003979	none	none
GLEAN3_03989	SPU_003989	none	none
GLEAN3_04027	SPU_004027	contains 2 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04056	SPU_004056	none	none
GLEAN3_04073	SPU_004073	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04130	SPU_004130	contains Smc domain and SMC_N domain	none
GLEAN3_04164	SPU_004164	contains HSP70 domain	none
GLEAN3_04166	SPU_004166	none	none
GLEAN3_04167	SPU_004167	none	none
GLEAN3_04174	SPU_004174	none	none
GLEAN3_04181	SPU_004181	none	none
GLEAN3_04185	SPU_004185	none	none
GLEAN3_04189	SPU_004189	none	none
GLEAN3_04197	SPU_004197	none	none
GLEAN3_04215	SPU_004215	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04219	SPU_004219	contains rne domain	none
GLEAN3_04229	SPU_004229	contains DPPIV_N domain	none
GLEAN3_04240	SPU_004240	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04250	SPU_004250	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04277	SPU_004277	contains 2 RNA_Pol_B_RPB2 superfamily motifs	none
GLEAN3_04294	SPU_004294	none	none
GLEAN3_04299	SPU_004299	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04303	SPU_004303	none	none
GLEAN3_04306	SPU_004306	contains 2 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04325	SPU_004325	none	none
GLEAN3_04329	SPU_004329	none	none
GLEAN3_04348	SPU_004348	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04367	SPU_004367	none	none
GLEAN3_04372	SPU_004372	contains 2 EGF_CA superfamily motifs and 2 GCC2_GCC3 superfamily motifs	none
GLEAN3_04378	SPU_004378	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04385	SPU_004385	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04386	SPU_004386	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04394	SPU_004394	none	none
GLEAN3_04415	SPU_004415	none	none
GLEAN3_04423	SPU_004423	none	none
GLEAN3_04424	SPU_004424	none	none
GLEAN3_04433	SPU_004433	none	none
GLEAN3_04436	SPU_004436	none	none
GLEAN3_04437	SPU_004437	none	none
GLEAN3_04440	SPU_004440	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04447	SPU_004447	contains 3 LDL_recept_b superfamily motifs and 2 EGF_CA superfamily motifs and COG3391 domain	none
GLEAN3_04482	SPU_004482	contains UDPGT domain	none
GLEAN3_04489	SPU_004489	none	none
GLEAN3_04507	SPU_004507	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04521	SPU_004521	contains SMC_N domain	none
GLEAN3_04535	SPU_004535	none	none
GLEAN3_04589	SPU_004589	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04614	SPU_004614	none	none
GLEAN3_04620	SPU_004620	none	none
GLEAN3_04624	SPU_004624	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04645	SPU_004645	contains 2 Es2 superfamily motifs	none
GLEAN3_04646	SPU_004646	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04647	SPU_004647	contains DHC_N1 domain	none
GLEAN3_04672	SPU_004672	contains 2 WD40 superfamily motifs and COG2319 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04674	SPU_004674	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04678	SPU_004678	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04716	SPU_004716	contains 4 S1-like (Cold Shock Protein motif) superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04744	SPU_004744	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04748	SPU_004748	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04750	SPU_004750	contains AMP-binding domain	none
GLEAN3_04760	SPU_004760	none	none
GLEAN3_04777	SPU_004777	none	none
GLEAN3_04799	SPU_004799	contains PKinase domain	none
GLEAN3_04808	SPU_004808	none	none
GLEAN3_04822	SPU_004822	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04835	SPU_004835	none	none
GLEAN3_04839	SPU_004839	none	none
GLEAN3_04853	SPU_004853	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04864	SPU_004864	none	none
GLEAN3_04874	SPU_004874	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04881	SPU_004881	contains Na_H_Exchanger domain	none
GLEAN3_04959	SPU_004959	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04971	SPU_004971	none	none
GLEAN3_04982	SPU_004982	none	none
GLEAN3_04987	SPU_004987	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_04994	SPU_004994	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04995	SPU_004995	none	none
GLEAN3_05005	SPU_005005	none	none
GLEAN3_05009	SPU_005009	none	none
GLEAN3_05041	SPU_005041	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05048	SPU_005048	none	none
GLEAN3_05081	SPU_005081	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05086	SPU_005086	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05114	SPU_005114	contains THAP superfamily motif at N-terminus	none
GLEAN3_05120	SPU_005120	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05133	SPU_005133	none	none
GLEAN3_05134	SPU_005134	contains Prominin domain	none
GLEAN3_05178	SPU_005178	none	none
GLEAN3_05208	SPU_005208	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05217	SPU_005217	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05219	SPU_005219	none	none
GLEAN3_05247	SPU_005247	contains 2 IG superfamily motifs	none
GLEAN3_05251	SPU_005251	contains COG1233 domain	none
GLEAN3_05287	SPU_005287	contains Caldesmon domain	none
GLEAN3_05295	SPU_005295	none	none
GLEAN3_05303	SPU_005303	none	none
GLEAN3_05308	SPU_005308	contains SMC_N domain	none
GLEAN3_05326	SPU_005326	none	none
GLEAN3_05328	SPU_005328	none	none
GLEAN3_05348	SPU_005348	contains SPO22 domain	none
GLEAN3_05360	SPU_005360	none	none
GLEAN3_05362	SPU_005362	none	none
GLEAN3_05363	SPU_005363	contains 3 ANK superfamily motifs	none
GLEAN3_05365	SPU_005365	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05372	SPU_005372	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05374	SPU_005374	contains 6 ANK superfamily motifs and Arp domains	none
GLEAN3_05405	SPU_005405	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05416	SPU_005416		none
GLEAN3_05427	SPU_005427	contains 2 CUB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05431	SPU_005431	none	none
GLEAN3_05432	SPU_005432	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05445	SPU_005445	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05454	SPU_005454	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05470	SPU_005470	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05488	SPU_005488	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05502	SPU_005502	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05530	SPU_005530	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05562	SPU_005562	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05563	SPU_005563	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05564	SPU_005564	contains MukB domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05568	SPU_005568	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05601	SPU_005601	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05603	SPU_005603	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05604	SPU_005604	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05612	SPU_005612	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05629	SPU_005629	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05649	SPU_005649	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05673	SPU_005673	none	none
GLEAN3_05680	SPU_005680	contains 2 TIR superfamily motifs	none
GLEAN3_05694	SPU_005694	none	none
GLEAN3_05696	SPU_005696	none	none
GLEAN3_05716	SPU_005716	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05719	SPU_005719	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05755	SPU_005755	contains SSL2 domain	none
GLEAN3_05769	SPU_005769	contains 2 PRK05648 domain motifs	none
GLEAN3_05775	SPU_005775	contains 2 PBPb superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05783	SPU_005783	contains 2 Sulfotransfer_1 domain motifs	none
GLEAN3_05785	SPU_005785	none	none
GLEAN3_05796	SPU_005796	none	none
GLEAN3_05817	SPU_005817	none	none
GLEAN3_05840	SPU_005840	contains CDC6 domain	none
GLEAN3_05847	SPU_005847	contains 5 LDLa superfamily motifs	none
GLEAN3_05869	SPU_005869	contains COG4372 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_05872	SPU_005872	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05874	SPU_005874	none	none
GLEAN3_05911	SPU_005911	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05917	SPU_005917	contains 4 MAM superfamily motifs	none
GLEAN3_05941	SPU_005941	contains 4 CCP superfamily motifs	none
GLEAN3_05951	SPU_005951	none	none
GLEAN3_05964	SPU_005964	none	none
GLEAN3_05972	SPU_005972	contains 2 ANK superfamily motifs and Arp domains	none
GLEAN3_05988	SPU_005988	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05994	SPU_005994	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06029	SPU_006029	none	none
GLEAN3_06057	SPU_006057	none	none
GLEAN3_06071	SPU_006071	contains 3 NHL superfamily motifs and PRK07764 domain and DOG3391 domain	none
GLEAN3_06088	SPU_006088	none	none
GLEAN3_06139	SPU_006139	none	none
GLEAN3_06144	SPU_006144	none	none
GLEAN3_06145	SPU_006145	none	none
GLEAN3_06156	SPU_006156	none	none
GLEAN3_06166	SPU_006166	contains HAP1_N domain	none
GLEAN3_06195	SPU_006195	contains 3 SPEC superfamily motifs	none
GLEAN3_06243	SPU_006243	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06259	SPU_006259	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06280	SPU_006280	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06298	SPU_006298	contains CysJ domain	none
GLEAN3_06300	SPU_006300	none	none
GLEAN3_06331	SPU_006331	contains 4 HYR superfamily motifs	none
GLEAN3_06335	SPU_006335	contains 3 MAM superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06357	SPU_006357	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06369	SPU_006369	none	none
GLEAN3_06373	SPU_006373	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06387	SPU_006387	none	none
GLEAN3_06397	SPU_006397	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06398	SPU_006398	none	none
GLEAN3_06418	SPU_006418	contains PRK03918 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06419	SPU_006419	contains DUF2450 domain	none
GLEAN3_06425	SPU_006425	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06440	SPU_006440	none	none
GLEAN3_06460	SPU_006460	contains 2 Vps52 superfamily motifs	none
GLEAN3_06464	SPU_006464	contains 5 SRCR superfamily motifs	none
GLEAN3_06465	SPU_006465	contains Prominin domain	none
GLEAN3_06491	SPU_006491	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06498	SPU_006498	contains 3 TSP_1 superfamily motifs	none
GLEAN3_06505	SPU_006505	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06519	SPU_006519	none	none
GLEAN3_06522	SPU_006522	none	none
GLEAN3_06525	SPU_006525	none	none
GLEAN3_06543	SPU_006543	none	none
GLEAN3_06566	SPU_006566	none	none
GLEAN3_06586	SPU_006586	contains RAD18 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06590	SPU_006590	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06592	SPU_006592	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06657	SPU_006657	contains 3 PDZ superfamily motifs	none
GLEAN3_06673	SPU_006673	contains PRK03918 domain	none
GLEAN3_06674	SPU_006674	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06685	SPU_006685	none	none
GLEAN3_06686	SPU_006686	none	none
GLEAN3_06689	SPU_006689	none	none
GLEAN3_06696	SPU_006696	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06702	SPU_006702	none	none
GLEAN3_06703	SPU_006703	none	none
GLEAN3_06716	SPU_006716	contains COG2319 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06724	SPU_006724	none	none
GLEAN3_06726	SPU_006726	contains Vinculin domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_06728	SPU_006728	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06736	SPU_006736	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06749	SPU_006749	none	none
GLEAN3_06750	SPU_006750	none	none
GLEAN3_06771	SPU_006771	none	none
GLEAN3_06782	SPU_006782	none	none
GLEAN3_06805	SPU_006805	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06806	SPU_006806	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06812	SPU_006812	contains 3 IG superfamily motifs	none
GLEAN3_06841	SPU_006841	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06862	SPU_006862	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06863	SPU_006863	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06872	SPU_006872	none	none
GLEAN3_06909	SPU_006909	none	none
GLEAN3_06919	SPU_006919	contains 2 7tm_1 superfamily motifs	none
GLEAN3_06923	SPU_006923	none	none
GLEAN3_06925	SPU_006925	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06942	SPU_006942	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06949	SPU_006949	contains C2 superfamily motif at N-terminus	none
GLEAN3_06963	SPU_006963	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06979	SPU_006979	none	none
GLEAN3_06994	SPU_006994	none	none
GLEAN3_07003	SPU_007003	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07018	SPU_007018	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07022	SPU_007022	none	none
GLEAN3_07028	SPU_007028	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07085	SPU_007085	none	none
GLEAN3_07130	SPU_007130	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07145	SPU_007145	none	none
GLEAN3_07158	SPU_007158	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07167	SPU_007167	none	none
GLEAN3_07178	SPU_007178	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07182	SPU_007182	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_07187	SPU_007187	none	none
GLEAN3_07238	SPU_007238	contains Myosin_tail_1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07286	SPU_007286	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07289	SPU_007289	contains 2 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07299	SPU_007299	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_07354	SPU_007354	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07377	SPU_007377	none	none
GLEAN3_07385	SPU_007385	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07387	SPU_007387	none	none
GLEAN3_07394	SPU_007394	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07400	SPU_007400	none	none
GLEAN3_07408	SPU_007408	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07412	SPU_007412	none	none
GLEAN3_07428	SPU_007428	contains MFS_1 domain	none
GLEAN3_07433	SPU_007433	none	none
GLEAN3_07434	SPU_007434	contains 2 SecD_SecF superfamily motifs	none
GLEAN3_07458	SPU_007458	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07461	SPU_007461	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07472	SPU_007472	none	none
GLEAN3_07477	SPU_007477	none	none
GLEAN3_07478	SPU_007478	none	none
GLEAN3_07483	SPU_007483	none	none
GLEAN3_07488	SPU_007488	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_07513	SPU_007513	none	none
GLEAN3_07521	SPU_007521	contains 5 ARM superfamily motifs	none
GLEAN3_07531	SPU_007531	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07541	SPU_007541	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07562	SPU_007562	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07589	SPU_007589	none	none
GLEAN3_07597	SPU_007597	contains LisH superfamily motif at N-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07686	SPU_007686	contains COG1033 domain	none
GLEAN3_07688	SPU_007688	contains 2 LRR_RI superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07703	SPU_007703	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07708	SPU_007708	contains PRK10263 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07717	SPU_007717	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07728	SPU_007728	none	none
GLEAN3_07730	SPU_007730	none	none
GLEAN3_07800	SPU_007800	contains 5 TSP1 superfamily motifs	none
GLEAN3_07813	SPU_007813	contains SMC_N domain	none
GLEAN3_07828	SPU_007828	none	none
GLEAN3_07838	SPU_007838	contains CbiK domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07851	SPU_007851	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07875	SPU_007875	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07888	SPU_007888	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07889	SPU_007889	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07890	SPU_007890	none	none
GLEAN3_07901	SPU_007901	contains 2 RING superfamily motifs and PEX10 domain and PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_07923	SPU_007923	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07958	SPU_007958	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07965	SPU_007965	none	none
GLEAN3_07969	SPU_007969	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08020	SPU_008020	none	none
GLEAN3_08039	SPU_008039	none	none
GLEAN3_08041	SPU_008041	none	none
GLEAN3_08063	SPU_008063	none	none
GLEAN3_08076	SPU_008076	none	none
GLEAN3_08098	SPU_008098	contains COG4886 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08118	SPU_008118	contains 2 SMC_N domain motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08131	SPU_008131	none	none
GLEAN3_08136	SPU_008136	none	none
GLEAN3_08141	SPU_008141	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08143	SPU_008143	contains Adaptin_N domain at N-terminus	none
GLEAN3_08153	SPU_008153	contains RAD18 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08161	SPU_008161	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08171	SPU_008171	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_08200	SPU_008200	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08225	SPU_008225	contains Pkinase domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08236	SPU_008236	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08242	SPU_008242	contains 2 Smc domain motifs	none
GLEAN3_08256	SPU_008256	none	none
GLEAN3_08260	SPU_008260	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08324	SPU_008324	contains 2 DNA_Pol_B2 domain motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08325	SPU_008325	none	none
GLEAN3_08334	SPU_008334	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08340	SPU_008340	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08350	SPU_008350	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08354	SPU_008354	contains 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08381	SPU_008381	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08402	SPU_008402	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08405	SPU_008405	contains 2 Kelch_1 superfamily motifs	none
GLEAN3_08422	SPU_008422	contains 2 RCC1 superfamily motifs and ATS1 domain	none
GLEAN3_08424	SPU_008424	none	none
GLEAN3_08433	SPU_008433	none	none
GLEAN3_08462	SPU_008462	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08475	SPU_008475	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08480	SPU_008480	none	none
GLEAN3_08495	SPU_008495	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08510	SPU_008510	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08529	SPU_008529	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08531	SPU_008531	contains COG1204 domain	none
GLEAN3_08537	SPU_008537	contains DPPIV_N domain and COG1073 domain	none
GLEAN3_08571	SPU_008571	none	none
GLEAN3_08592	SPU_008592	contains 4 S1-lie superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08594	SPU_008594	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08596	SPU_008596	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08627	SPU_008627	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_08628	SPU_008628	contains 8 ANK superfamily motifs and Arp domains	none
GLEAN3_08657	SPU_008657	contains 2 WD40 superfamily motifs	none
GLEAN3_08675	SPU_008675	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08679	SPU_008679	none	none
GLEAN3_08702	SPU_008702	contains DUF2152 domain	none
GLEAN3_08710	SPU_008710	none	none
GLEAN3_08720	SPU_008720	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08732	SPU_008732	none	none
GLEAN3_08760	SPU_008760	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08772	SPU_008772	contains 5 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_08781	SPU_008781	none	none
GLEAN3_08811	SPU_008811	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08812	SPU_008812	none	none
GLEAN3_08821	SPU_008821	none	none
GLEAN3_08835	SPU_008835	none	none
GLEAN3_08881	SPU_008881	none	none
GLEAN3_08882	SPU_008882	none	none
GLEAN3_08918	SPU_008918	none	none
GLEAN3_08924	SPU_008924	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08935	SPU_008935	contains 7 ANK superfamily motifs and Arp domains	none
GLEAN3_08991	SPU_008991	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_08995	SPU_008995	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09013	SPU_009013	none	none
GLEAN3_09024	SPU_009024	contains 5 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09035	SPU_009035	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09060	SPU_009060	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_09061	SPU_009061	none	none
GLEAN3_09069	SPU_009069	none	none
GLEAN3_09070	SPU_009070	contains RecD domain	none
GLEAN3_09071	SPU_009071	none	none
GLEAN3_09072	SPU_009072	contains 2 WD40 superfamily motifs	none
GLEAN3_09078	SPU_009078	none	none
GLEAN3_09080	SPU_009080	none	none
GLEAN3_09092	SPU_009092	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09121	SPU_009121	contains AIR1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09122	SPU_009122	contains 3 CCP superfamily motifs and 2 EGF_CA superfamily motifs	none
GLEAN3_09133	SPU_009133	contains WcaA domain	none
GLEAN3_09138	SPU_009138	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09168	SPU_009168	contains ChaA domain	none
GLEAN3_09243	SPU_009243	contains 4 EGF_CA superfamily motifs	none
GLEAN3_09256	SPU_009256	contains COG4943 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09264	SPU_009264	none	none
GLEAN3_09265	SPU_009265	contains 3 Neuralized superfamily motifs	none
GLEAN3_09271	SPU_009271	contains 2 COG1444 domain motifs	none
GLEAN3_09280	SPU_009280	contains 2 RSI superfamily motifs	none
GLEAN3_09284	SPU_009284	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09288	SPU_009288	none	none
GLEAN3_09314	SPU_009314	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09321	SPU_009321	contains 4 FA58C superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09323	SPU_009323	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09324	SPU_009324	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09337	SPU_009337	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09404	SPU_009404	none	none
GLEAN3_09410	SPU_009410	contains COG5089 domain	none
GLEAN3_09411	SPU_009411	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09439	SPU_009439	contains 3 DUF1126 superfamily motifs	none
GLEAN3_09461	SPU_009461	contains 2 Ribophorin_II domain motifs	none
GLEAN3_09489	SPU_009489	none	none
GLEAN3_09519	SPU_009519	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09525	SPU_009525	contains LIM superfamily motif at C-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09530	SPU_009530	contains 2 Galactosyl_T superfamily motifs	none
GLEAN3_09531	SPU_009531	contains Vset domain and UDPGT domain	none
GLEAN3_09543	SPU_009543	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09550	SPU_009550	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09554	SPU_009554	contains ATS1 domain	none
GLEAN3_09577	SPU_009577	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09582	SPU_009582	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09584	SPU_009584	none	none
GLEAN3_09599	SPU_009599	contains 2 ANK superfamily motifs and Ion_trans domain	none
GLEAN3_09613	SPU_009613	none	none
GLEAN3_09627	SPU_009627	none	none
GLEAN3_09630	SPU_009630	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09649	SPU_009649	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09699	SPU_009699	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09705	SPU_009705	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09711	SPU_009711	none	none
GLEAN3_09721	SPU_009721	contains IMD superfamily motif at N-terminus	none
GLEAN3_09737	SPU_009737	contains COG5099 domain	none
GLEAN3_09774	SPU_009774	contains Smc domain	none
GLEAN3_09786	SPU_009786	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09791	SPU_009791	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_09802	SPU_009802	none	none
GLEAN3_09803	SPU_009803	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09813	SPU_009813	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09814	SPU_009814	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09873	SPU_009873	contains 2 WSC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09881	SPU_009881	none	none
GLEAN3_09891	SPU_009891	contains 2 NOP5NT superfamily motifs and SIK1 domain	none
GLEAN3_09896	SPU_009896	contains 3 DUF1126 superfamily motifs and FRQ1 domain	none
GLEAN3_09897	SPU_009897	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09934	SPU_009934	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09942	SPU_009942	contains 2 DEXDc superfamily motifs and MPH1 domain	none
GLEAN3_09947	SPU_009947	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09948	SPU_009948	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09959	SPU_009959	contains 2 Esterase_lipase superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09960	SPU_009960	Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09967	SPU_009967	contains RING superfamily motif at C-terminus. contains 2 Smc domain motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_09978	SPU_009978	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09986	SPU_009986	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09996	SPU_009996	none	none
GLEAN3_09999	SPU_009999	contains SNC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10002	SPU_010002	contains 5 IG superfamily motifs	none
GLEAN3_10010	SPU_010010	contains 2 Ion_trans domain motifs	none
GLEAN3_10012	SPU_010012	none	none
GLEAN3_10016	SPU_010016	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_10018	SPU_010018	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10031	SPU_010031	contains 2 Ion_trans domain motifs	none
GLEAN3_10044	SPU_010044	contains PAT1 domain at C-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10049	SPU_010049	contains 3 RT-like superfamily motifs	none
GLEAN3_10058	SPU_010058	contains COG5038 domain	none
GLEAN3_10059	SPU_010059	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10077	SPU_010077	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10079	SPU_010079	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10086	SPU_010086	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10087	SPU_010087	none	none
GLEAN3_10111	SPU_010111	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10112	SPU_010112	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10127	SPU_010127	none	none
GLEAN3_10148	SPU_010148	none	none
GLEAN3_10162	SPU_010162	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10170	SPU_010170	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10171	SPU_010171	1	none
GLEAN3_10176	SPU_010176	contains 5 MAM superfamily motifs	none
GLEAN3_10177	SPU_010177	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10186	SPU_010186	contains 4 TPR superfamily motifs	none
GLEAN3_10196	SPU_010196	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10206	SPU_010206	none	none
GLEAN3_10219	SPU_010219	none	none
GLEAN3_10233	SPU_010233	contains MIP-T3 domain	none
GLEAN3_10236	SPU_010236	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10269	SPU_010269	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10294	SPU_010294	contains 2 P_loop_NTPase superfamily motifs	none
GLEAN3_10315	SPU_010315	contains COG5219 domain	none
GLEAN3_10322	SPU_010322	contains 8 ANK superfamily motifs and Arp domains	none
GLEAN3_10326	SPU_010326	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10333	SPU_010333	none	none
GLEAN3_10363	SPU_010363	none	none
GLEAN3_10410	SPU_010410	none	none
GLEAN3_10420	SPU_010420	none	none
GLEAN3_10422	SPU_010422	none	none
GLEAN3_10439	SPU_010439	contains DUF2211 domain	none
GLEAN3_10440	SPU_010440	none	none
GLEAN3_10442	SPU_010442	contains 2 HYR superfamily motifs and 2 GCC2_GCC3 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10478	SPU_010478	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10481	SPU_010481	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_10485	SPU_010485	contains SMC_N domain and Borrelia_P83 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10486	SPU_010486	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10514	SPU_010514	none	none
GLEAN3_10515	SPU_010515	contains PRK10920 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10516	SPU_010516	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10518	SPU_010518	contains 2 Glyco_transf_10 superfamily motifs	none
GLEAN3_10522	SPU_010522	none	none
GLEAN3_10585	SPU_010585	none	none
GLEAN3_10614	SPU_010614	contains 3 ANK superfamily motifs and Arp domains	none
GLEAN3_10638	SPU_010638	none	none
GLEAN3_10661	SPU_010661	contains 2 ANK superfamily motifs	none
GLEAN3_10681	SPU_010681	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10718	SPU_010718	contains 6 EGF_CA superfamily motifs	none
GLEAN3_10724	SPU_010724	none	none
GLEAN3_10727	SPU_010727	none	none
GLEAN3_10753	SPU_010753	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_10766	SPU_010766	contains COG5222 domain	none
GLEAN3_10771	SPU_010771	none	none
GLEAN3_10792	SPU_010792	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10807	SPU_010807	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10812	SPU_010812	contains PRK07003 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10822	SPU_010822	contains 3 ANK superfamily motifs and Ion_trans domain	none
GLEAN3_10824	SPU_010824	none	none
GLEAN3_10826	SPU_010826	none	none
GLEAN3_10828	SPU_010828	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_10841	SPU_010841	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10859	SPU_010859	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10877	SPU_010877	contains SMC_N domain	none
GLEAN3_10885	SPU_010885	contains DYN1 domain	none
GLEAN3_10887	SPU_010887	none	none
GLEAN3_10905	SPU_010905	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10928	SPU_010928	none	none
GLEAN3_10938	SPU_010938	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10947	SPU_010947	contains 5 MAM superfamily motifs	none
GLEAN3_10975	SPU_010975	contains SMC_N domain	none
GLEAN3_10981	SPU_010981	none	none
GLEAN3_10986	SPU_010986	contains 2 Kelch_1 superfamily motifs	none
GLEAN3_11008	SPU_011008	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_11013	SPU_011013	contains 4 TSP_1 superfamily motifs and 3 FA58C superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11025	SPU_011025	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11026	SPU_011026	contains 2 LDLa superfamily motifs and 3 MAM superfamily motifs	none
GLEAN3_11043	SPU_011043	none	none
GLEAN3_11047	SPU_011047	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11068	SPU_011068	contains Smc domain	none
GLEAN3_11131	SPU_011131	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11150	SPU_011150	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11153	SPU_011153	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11164	SPU_011164	contains 6 BTB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11179	SPU_011179	none	none
GLEAN3_11181	SPU_011181	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11203	SPU_011203	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11208	SPU_011208	contains 7 ANK superfamily motifs and Arp domains	none
GLEAN3_11219	SPU_011219	contains COG3055 domain	none
GLEAN3_11237	SPU_011237	contains PnbA domain	none
GLEAN3_11253	SPU_011253	none	none
GLEAN3_11267	SPU_011267	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11302	SPU_011302	none	none
GLEAN3_11310	SPU_011310	contains COG5098 domain	none
GLEAN3_11386	SPU_011386	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11398	SPU_011398	none	none
GLEAN3_11422	SPU_011422	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11423	SPU_011423	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11426	SPU_011426	none	none
GLEAN3_11433	SPU_011433	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11448	SPU_011448	none	none
GLEAN3_11463	SPU_011463	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11468	SPU_011468	contains 3 Sell superfamily motifs and COG0790 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11492	SPU_011492	contains 7 CUB superfamily motifs	none
GLEAN3_11549	SPU_011549	contains 2 CUB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11555	SPU_011555	none	none
GLEAN3_11559	SPU_011559	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11565	SPU_011565	contains 5 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11575	SPU_011575	none	none
GLEAN3_11585	SPU_011585	contains COG0520 domain	none
GLEAN3_11606	SPU_011606	contains 2 AdoMet_MTase superfamily motifs	none
GLEAN3_11607	SPU_011607	none	none
GLEAN3_11608	SPU_011608	none	none
GLEAN3_11610	SPU_011610	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11647	SPU_011647	contains 2 DEXDc superfamily motifs and MPH1 domain	none
GLEAN3_11661	SPU_011661	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11678	SPU_011678	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11689	SPU_011689	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11701	SPU_011701	none	none
GLEAN3_11710	SPU_011710	contains TRF4 domain	none
GLEAN3_11717	SPU_011717	no match except self-matching. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11726	SPU_011726	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11738	SPU_011738	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11742	SPU_011742	contains Pkinase domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11767	SPU_011767	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11770	SPU_011770	contains 3 Lipocalin superfamily motifs	none
GLEAN3_11769	SPU_011769	contains 2 UBQ superfamily motifs and 2 ARM superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11777	SPU_011777	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11780	SPU_011780	contains 3 S1-like superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_11801	SPU_011801	none	none
GLEAN3_11813	SPU_011813	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11848	SPU_011848	contains ZU01 domain	none
GLEAN3_11851	SPU_011851	none	none
GLEAN3_11853	SPU_011853	none	none
GLEAN3_11865	SPU_011865	none	none
GLEAN3_11903	SPU_011903	contains Smc domain	none
GLEAN3_11905	SPU_011905	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11938	SPU_011938	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11969	SPU_011969	contains 2 Peptidase_C19 superfamily motifs	none
GLEAN3_11979	SPU_011979	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11988	SPU_011988	none	none
GLEAN3_12061	SPU_012061	none	none
GLEAN3_12063	SPU_012063	none	none
GLEAN3_12074	SPU_012074	contains COG1483 domain	none
GLEAN3_12075	SPU_012075	contains DAP2 domain	none
GLEAN3_12101	SPU_012101	contains Marek_A domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12114	SPU_012114	contains 2 PP2Cc superfamily motifs	none
GLEAN3_12134	SPU_012134	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12147	SPU_012147	contains 2 ANK superfamily motifs	none
GLEAN3_12164	SPU_012164	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12167	SPU_012167	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12181	SPU_012181	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12188	SPU_012188	contains SMC_N domain and Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12197	SPU_012197	contains XerD domain	none
GLEAN3_12214	SPU_012214	matches only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12224	SPU_012224	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_12225	SPU_012225	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12254	SPU_012254	contains 3 SH3 superfamily motifs	none
GLEAN3_12262	SPU_012262	none	none
GLEAN3_12268	SPU_012268	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12269	SPU_012269	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12285	SPU_012285	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12326	SPU_012326	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12334	SPU_012334	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12351	SPU_012351	contains 4 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12401	SPU_012401	contains 3 IG superfamily motifs and Transposase_22 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12402	SPU_012402	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12413	SPU_012413	contains A2M_N domain and COG2373 domain	none
GLEAN3_12420	SPU_012420	none	none
GLEAN3_12458	SPU_012458	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12474	SPU_012474	none	none
GLEAN3_12483	SPU_012483	none	none
GLEAN3_12499	SPU_012499	contains 2 7tm_1 superfamily motifs	none
GLEAN3_12569	SPU_012569	contains 6 ANK superfamily motifs and Arp domains	none
GLEAN3_12592	SPU_012592	none	none
GLEAN3_12593	SPU_012593	none	none
GLEAN3_12602	SPU_012602	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12635	SPU_012635	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12650	SPU_012650	none	none
GLEAN3_12652	SPU_012652	none	none
GLEAN3_12654	SPU_012654	contains 7 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12656	SPU_012656	contains 2 CUB superfamily motifs and 5 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12662	SPU_012662	none	none
GLEAN3_12670	SPU_012670	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12682	SPU_012682	none	none
GLEAN3_12728	SPU_012728	contains 2 WD40 superfamily motifs	none
GLEAN3_12730	SPU_012730	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12736	SPU_012736	contains 2 CCGC superfamily motifs	none
GLEAN3_12753	SPU_012753	none	none
GLEAN3_12783	SPU_012783	contains 6 CUB superfamily motifs	none
GLEAN3_12788	SPU_012788	contains Smc domain	none
GLEAN3_12808	SPU_012808	contains FabG domain	none
GLEAN3_12815	SPU_012815	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12822	SPU_012822	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12824	SPU_012824	none	none
GLEAN3_12833	SPU_012833	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12870	SPU_012870	contains SMC_N domain	none
GLEAN3_12903	SPU_012903	contains 6 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12938	SPU_012938	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12954	SPU_012954	contains 2 WSC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12965	SPU_012965	contains 2 Peptidase_C48 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_12973	SPU_012973	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12984	SPU_012984	none	none
GLEAN3_13006	SPU_013006	none	none
GLEAN3_13009	SPU_013009	contains 2 BTB superfamily motifs and 3 Kelch_1 superfamily motifs and COG3055 domain	none
GLEAN3_13040	SPU_013040	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13044	SPU_013044	none	none
GLEAN3_13062	SPU_013062	none	none
GLEAN3_13065	SPU_013065	none	none
GLEAN3_13072	SPU_013072	none	none
GLEAN3_13074	SPU_013074	none	none
GLEAN3_13078	SPU_013078	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13085	SPU_013085	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13115	SPU_013115	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13124	SPU_013124	contains COG0790 domain	none
GLEAN3_13139	SPU_013139	none	none
GLEAN3_13146	SPU_013146	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13152	SPU_013152	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13190	SPU_013190	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13193	SPU_013193	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13208	SPU_013208	contains Herpes_BLLF1 domain	none
GLEAN3_13219	SPU_013219	none	none
GLEAN3_13226	SPU_013226	none	none
GLEAN3_13233	SPU_013233	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13235	SPU_013235	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13238	SPU_013238	none	none
GLEAN3_13274	SPU_013274	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13280	SPU_013280	none	none
GLEAN3_13304	SPU_013304	contains 5 ANK superfamily motifs and Arp domains	none
GLEAN3_13308	SPU_013308	none	none
GLEAN3_13341	SPU_013341	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13343	SPU_013343	none	none
GLEAN3_13344	SPU_013344	none	none
GLEAN3_13345	SPU_013345	none	none
GLEAN3_13370	SPU_013370	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13379	SPU_013379	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13382	SPU_013382	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13402	SPU_013402	contains 2 DNA_pol_B_2 domain motifs	none
GLEAN3_13412	SPU_013412	none	none
GLEAN3_13419	SPU_013419	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_13420	SPU_013420	contains 4 ANK superfamily motifs and Arp domains	none
GLEAN3_13425	SPU_013425	none	none
GLEAN3_13433	SPU_013433	none	none
GLEAN3_13453	SPU_013453	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13461	SPU_013461	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13472	SPU_013472	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13485	SPU_013485	none	none
GLEAN3_13487	SPU_013487	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13537	SPU_013537	contains COG4799 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13541	SPU_013541	contains 4 MAM superfamily motifs	none
GLEAN3_13553	SPU_013553	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13583	SPU_013583	none	none
GLEAN3_13602	SPU_013602	contains DUF812 domain	none
GLEAN3_13605	SPU_013605	contains PRK1280 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13606	SPU_013606	Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13616	SPU_013616	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13631	SPU_013631	contains 3 EGF_CA superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13634	SPU_013634	none	none
GLEAN3_13640	SPU_013640	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13655	SPU_013655	none	none
GLEAN3_13666	SPU_013666	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13672	SPU_013672	contains 2 ANK superfamily motifs	none
GLEAN3_13680	SPU_013680	none	none
GLEAN3_13681	SPU_013681	none	none
GLEAN3_13700	SPU_013700	contains OCD_Mu_Crystal domain	none
GLEAN3_13732	SPU_013732	none	none
GLEAN3_13744	SPU_013744	none	none
GLEAN3_13747	SPU_013747	none	none
GLEAN3_13762	SPU_013762	none	none
GLEAN3_13771	SPU_013771	none	none
GLEAN3_13784	SPU_013784	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13805	SPU_013805	none	none
GLEAN3_13817	SPU_013817	contains Herpes_BLLF1 domain	none
GLEAN3_13826	SPU_013826	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13827	SPU_013827	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13830	SPU_013830	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13851	SPU_013851	contains NPY1 domain	none
GLEAN3_13875	SPU_013875	none	none
GLEAN3_13911	SPU_013911	contains 4 CUB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_13959	SPU_013959	none	none
GLEAN3_13963	SPU_013963	none	none
GLEAN3_13987	SPU_013987	none	none
GLEAN3_13996	SPU_013996	none	none
GLEAN3_14010	SPU_014010	contains Sulfotransfer_1 domain	none
GLEAN3_14049	SPU_014049	contains 2 WSC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14052	SPU_014052	contains 2 VWC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14054	SPU_014054	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14063	SPU_014063	none	none
GLEAN3_14074	SPU_014074	contains 7 LDLa superfamily motifs	none
GLEAN3_14101	SPU_014101	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14116	SPU_014116	contains PRK02224 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14117	SPU_014117	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14123	SPU_014123	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14127	SPU_014127	none	none
GLEAN3_14171	SPU_014171	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14194	SPU_014194	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14199	SPU_014199	contains UvrD domain	none
GLEAN3_14200	SPU_014200	contains B-block_TFIIIC domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14224	SPU_014224	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14235	SPU_014235	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14246	SPU_014246	none	none
GLEAN3_14261	SPU_014261	none	none
GLEAN3_14271	SPU_014271	none	none
GLEAN3_14273	SPU_014273	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14284	SPU_014284	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14291	SPU_014291	none	none
GLEAN3_14292	SPU_014292	contains Arp domain	none
GLEAN3_14300	SPU_014300	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14321	SPU_014321	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14330	SPU_014330	none	none
GLEAN3_14353	SPU_014353	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14354	SPU_014354	contains Smc domain	none
GLEAN3_14374	SPU_014374	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14377	SPU_014377	none	none
GLEAN3_14389	SPU_014389	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14427	SPU_014427	contains 2 FNR-like superfamily motifs	none
GLEAN3_14428	SPU_014428	contains PRK05912 domain	none
GLEAN3_14440	SPU_014440	contains Methyltransf_10 domain	none
GLEAN3_14495	SPU_014495	contains COG5635 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14508	SPU_014508	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14511	SPU_014511	none	none
GLEAN3_14540	SPU_014540	contains 4 NHL superfamily motifs and HRD1 domani and COG3386 domain	none
GLEAN3_14545	SPU_014545	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14547	SPU_014547	contains MRS 6 domain	none
GLEAN3_14571	SPU_014571	none	none
GLEAN3_14589	SPU_014589	contains 3 IPT superfamily motifs	none
GLEAN3_14600	SPU_014600	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_14610	SPU_014610	none	none
GLEAN3_14612	SPU_014612	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14631	SPU_014631	none	none
GLEAN3_14657	SPU_014657	none	none
GLEAN3_14666	SPU_014666	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14681	SPU_014681	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14688	SPU_014688	contains 4 LRR_RI superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14703	SPU_014703	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14728	SPU_014728	contains Vinculin domain	none
GLEAN3_14739	SPU_014739	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14776	SPU_014776	none	none
GLEAN3_14781	SPU_014781	contains 5 EGF_CA superfamily motifs	none
GLEAN3_14807	SPU_014807	none	none
GLEAN3_14811	SPU_014811	contains Coatomer_WDAD domain	none
GLEAN3_14823	SPU_014823	contains 2 CCP superfamily motifs	none
GLEAN3_14827	SPU_014827	contains 2 CUB superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14837	SPU_014837	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14855	SPU_014855	none	none
GLEAN3_14868	SPU_014868	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14925	SPU_014925	contains 2 Kelch_1 superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_14941	SPU_014941	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14948	SPU_014948	none	none
GLEAN3_14960	SPU_014960	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14972	SPU_014972	contains 5 PLAT superfamily motifs	none
GLEAN3_14987	SPU_014987	contains SGL domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15002	SPU_015002	none	none
GLEAN3_15014	SPU_015014	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15081	SPU_015081	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15104	SPU_015104	contains RAB domain	none
GLEAN3_15150	SPU_015150	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15158	SPU_015158	none	none
GLEAN3_15160	SPU_015160	none	none
GLEAN3_15168	SPU_015168	none	none
GLEAN3_15170	SPU_015170	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15175	SPU_015175	none	none
GLEAN3_15195	SPU_015195	contains SMC_N domain	none
GLEAN3_15196	SPU_015196	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15202	SPU_015202	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15208	SPU_015208	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15244	SPU_015244	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15272	SPU_015272	contains 4 EFh superfamily motifs and FRQ1 domain	none
GLEAN3_15273	SPU_015273	contains 4 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15280	SPU_015280	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15295	SPU_015295	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15298	SPU_015298	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15313	SPU_015313	none	none
GLEAN3_15324	SPU_015324	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15326	SPU_015326	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15331	SPU_015331	contains COG5191 domain	none
GLEAN3_15356	SPU_015356	none	none
GLEAN3_15370	SPU_015370	none	none
GLEAN3_15388	SPU_015388	none	none
GLEAN3_15392	SPU_015392	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15438	SPU_015438	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15439	SPU_015439	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15444	SPU_015444	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15455	SPU_015455	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15480	SPU_015480	none	none
GLEAN3_15505	SPU_015505	contains 2 Smc domain motifs	none
GLEAN3_15515	SPU_015515	none	none
GLEAN3_15540	SPU_015540	none	none
GLEAN3_15562	SPU_015562	none	none
GLEAN3_15569	SPU_015569	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15570	SPU_015570	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15574	SPU_015574	contains PRK12704 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15575	SPU_015575	none	none
GLEAN3_15583	SPU_015583	contains 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15584	SPU_015584	contains 5 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15589	SPU_015589	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15628	SPU_015628	contains COG3386 domain. homologous to numerous Branchiostoma floridae proteins.	none
GLEAN3_15634	SPU_015634	none	none
GLEAN3_15635	SPU_015635	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15650	SPU_015650	none	none
GLEAN3_15664	SPU_015664	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15669	SPU_015669	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15707	SPU_015707	contains PRK02106 domain	none
GLEAN3_15713	SPU_015713	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15722	SPU_015722	none	none
GLEAN3_15724	SPU_015724	contains Adaptin_N domain	none
GLEAN3_15755	SPU_015755	contains 2 EGF_CA superfamily motifs	none
GLEAN3_15759	SPU_015759	contains 3 VWD superfamily motifs	none
GLEAN3_15788	SPU_015788	contains COG1511 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15794	SPU_015794	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15810	SPU_015810	none	none
GLEAN3_15812	SPU_015812	contains 5 HYR superfamily motifs and 2 EGF_CA superfamily motifs	none
GLEAN3_15868	SPU_015868	none	none
GLEAN3_15885	SPU_015885	contains HRD1 domain	none
GLEAN3_15904	SPU_015904	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15905	SPU_015905	none	none
GLEAN3_15934	SPU_015934	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15954	SPU_015954	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15958	SPU_015958	none	none
GLEAN3_15965	SPU_015965	none	none
GLEAN3_15967	SPU_015967	contains 2 MFS superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15971	SPU_015971	homologous to numerous Hydra proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_15976	SPU_015976	none	none
GLEAN3_15992	SPU_015992	contains 2 PKc-like superfamily motifs and 2 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16010	SPU_016010	none	none
GLEAN3_16017	SPU_016017	none	none
GLEAN3_16030	SPU_016030	none	none
GLEAN3_16037	SPU_016037	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16041	SPU_016041	contains 7 EGF_CA superfamily motifs and 2 CCP superfamily motifs	none
GLEAN3_16044	SPU_016044	contains PRK03992 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16049	SPU_016049	none	none
GLEAN3_16054	SPU_016054	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16098	SPU_016098	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16099	SPU_016099	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16117	SPU_016117	none	none
GLEAN3_16126	SPU_016126	contains 2 MFS superfamily motifs	none
GLEAN3_16153	SPU_016153	none	none
GLEAN3_16159	SPU_016159	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16165	SPU_016165	contains ATS1 domain	none
GLEAN3_16170	SPU_016170	contains 5 MAM superfamily motifs	none
GLEAN3_16186	SPU_016186	contains COG1231 domain	none
GLEAN3_16199	SPU_016199	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16210	SPU_016210	contains 5 MAM superfamily motifs	none
GLEAN3_16221	SPU_016221	contains Smc domain	none
GLEAN3_16225	SPU_016225	none	none
GLEAN3_16252	SPU_016252	none	none
GLEAN3_16253	SPU_016253	none	none
GLEAN3_16262	SPU_016262	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16293	SPU_016293	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16323	SPU_016323	none	none
GLEAN3_16332	SPU_016332	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16340	SPU_016340	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16357	SPU_016357	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16358	SPU_016358	contains Smc domain	none
GLEAN3_16394	SPU_016394	none	none
GLEAN3_16412	SPU_016412	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16425	SPU_016425	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16435	SPU_016435	none	none
GLEAN3_16460	SPU_016460	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_16461	SPU_016461	contains 6 ANK superfamily motifs and Arp domain	none
GLEAN3_16462	SPU_016462	none	none
GLEAN3_16466	SPU_016466	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16470	SPU_016470	none	none
GLEAN3_16482	SPU_016482	none	none
GLEAN3_16483	SPU_016483	none	none
GLEAN3_16484	SPU_016484	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16491	SPU_016491	contains PRK02362 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16497	SPU_016497	contains Herpes_BLLF1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16509	SPU_016509	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16534	SPU_016534	none	none
GLEAN3_16539	SPU_016539	none	none
GLEAN3_16544	SPU_016544	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16567	SPU_016567	contains Myosin_tail_1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16568	SPU_016568	contains Myosin_tail_1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16608	SPU_016608	contains PRK10416 domain	none
GLEAN3_16628	SPU_016628	contains ATS1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16645	SPU_016645	none	none
GLEAN3_16654	SPU_016654	none	none
GLEAN3_16661	SPU_016661	none	none
GLEAN3_16662	SPU_016662	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16668	SPU_016668	contains ND5 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16676	SPU_016676	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16678	SPU_016678	none	none
GLEAN3_16680	SPU_016680	none	none
GLEAN3_16683	SPU_016683	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16714	SPU_016714	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16731	SPU_016731	none	none
GLEAN3_16757	SPU_016757	contains 7 Kelch_1 superfamily motifs	none
GLEAN3_16759	SPU_016759	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16766	SPU_016766	none	none
GLEAN3_16775	SPU_016775	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16777	SPU_016777	none	none
GLEAN3_16783	SPU_016783	contains COG5635 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16801	SPU_016801	contains SMC_N domain and Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17820	SPU_017820	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16842	SPU_016842	contains 2 PDZ superfamily motifs	none
GLEAN3_16859	SPU_016859	Strongylocentrotus purpuratus-specific protein. homologous to numerous putative Branchiostoma floridae proteins.	none
GLEAN3_16872	SPU_016872	none	none
GLEAN3_16874	SPU_016874	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16875	SPU_016875	none	none
GLEAN3_16888	SPU_016888	contains 3 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_16899	SPU_016899	none	none
GLEAN3_16904	SPU_016904	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16908	SPU_016908	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16913	SPU_016913	none	none
GLEAN3_16919	SPU_016919	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_16943	SPU_016943	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16953	SPU_016953	none	none
GLEAN3_16963	SPU_016963	contains PRK00055 domain	none
GLEAN3_16995	SPU_016995	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16998	SPU_016998	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17003	SPU_017003	Strongylocentrotus purpuratus-specific protein. homologous only to the protein itself.	none
GLEAN3_17011	SPU_017011	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17018	SPU_017018	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17042	SPU_017042	none	none
GLEAN3_17044	SPU_017044	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17066	SPU_017066	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17074	SPU_017074	contains 4 EGF_CA superfamily motifs and 3 GCC2_GCC3 superfamily motifs	none
GLEAN3_17088	SPU_017088	none	none
GLEAN3_17108	SPU_017108	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17118	SPU_017118	none	none
GLEAN3_17121	SPU_017121	none	none
GLEAN3_17131	SPU_017131	none	none
GLEAN3_17140	SPU_017140	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17145	SPU_017145	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17152	SPU_017152	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17183	SPU_017183	contains SNF2_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17192	SPU_017192	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17210	SPU_017210	none	none
GLEAN3_17214	SPU_017214	contains 2 TPR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17223	SPU_017223	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17260	SPU_017260	Strongylocentrotus purpuratus-specific protein. no match except to itself.	none
GLEAN3_17262	SPU_017262	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17272	SPU_017272	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17282	SPU_017282	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17294	SPU_017294	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17308	SPU_017308	none	none
GLEAN3_17309	SPU_017309	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17331	SPU_017331	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17332	SPU_017332	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17339	SPU_017339	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17353	SPU_017353	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17357	SPU_017357	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17359	SPU_017359	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17368	SPU_017368	contains ATS1 domain	none
GLEAN3_17382	SPU_017382	none	none
GLEAN3_17391	SPU_017391	contains AIR1 domain	none
GLEAN3_17392	SPU_017392	contains AIR1 domain	none
GLEAN3_17393	SPU_017393	contains AIR1 domain	none
GLEAN3_17394	SPU_017394	contains AIR1 domain	none
GLEAN3_17400	SPU_017400	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_17406	SPU_017406	contains 2 Cu-oxidase superfamily motifs. also homologous to laccase proteins in many insects.	none
GLEAN3_17411	SPU_017411	none	none
GLEAN3_17424	SPU_017424	none	none
GLEAN3_17430	SPU_017430	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17479	SPU_017479	none	none
GLEAN3_17480	SPU_017480	contains 3 IG superfamily motifs and Vset domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17499	SPU_017499	contains 3 SNF superfamily motifs	none
GLEAN3_17514	SPU_017514	contains 3 PDZ superfamily motifs and PAT1 domain	none
GLEAN3_17521	SPU_017521	none	none
GLEAN3_17539	SPU_017539	contains 4 HTH_psq superfamily motifs	none
GLEAN3_17550	SPU_017550	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17551	SPU_017551	none	none
GLEAN3_17566	SPU_017566	contains 2 ANK superfamily motifs and Arp domain	none
GLEAN3_17586	SPU_017586	contains 6 KAZAL_FS superfamily motifs	none
GLEAN3_17591	SPU_017591	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17630	SPU_017630	none	none
GLEAN3_17636	SPU_017636	none	none
GLEAN3_17641	SPU_017641	none	none
GLEAN3_17653	SPU_017653	contains 2 Gal-3-0_sulfotr superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17654	SPU_017654	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17670	SPU_017670	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17671	SPU_017671	contains COG2103 domain	none
GLEAN3_17677	SPU_017677	none	none
GLEAN3_17706	SPU_017706	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17709	SPU_017709	contains Transposase_22 domain	none
GLEAN3_17716	SPU_017716	none	none
GLEAN3_17728	SPU_017728	contains 4 WD40 superfamily motifs	none
GLEAN3_17746	SPU_017746	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17754	SPU_017754	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17785	SPU_017785	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17786	SPU_017786	contains 2 MBT superfamily motifs	none
GLEAN3_17792	SPU_017792	contains Adaptin_N domain	none
GLEAN3_17855	SPU_017855	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17858	SPU_017858	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17873	SPU_017873	contains 2 VWC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17883	SPU_017883	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17894	SPU_017894	contains Spc7 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17913	SPU_017913	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17924	SPU_017924	contains 2 BBOX superfamily motifs and SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_17942	SPU_017942	none	none
GLEAN3_17970	SPU_017970	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17975	SPU_017975	none	none
GLEAN3_17978	SPU_017978	none	none
GLEAN3_17984	SPU_017984	none	none
GLEAN3_17932	SPU_017932	none	none
GLEAN3_18004	SPU_018004	none	none
GLEAN3_18020	SPU_018020	none	none
GLEAN3_18024	SPU_018024	contains PRK09297 domain	none
GLEAN3_18025	SPU_018025	none	none
GLEAN3_18036	SPU_018036	none	none
GLEAN3_18042	SPU_018042	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18049	SPU_018049	contains 3 ANK superfamily motifs	none
GLEAN3_18050	SPU_018050	none	none
GLEAN3_18077	SPU_018077	contains Smc domain	none
GLEAN3_18135	SPU_018135	none	none
GLEAN3_18150	SPU_018150	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18157	SPU_018157	none	none
GLEAN3_18185	SPU_018185	none	none
GLEAN3_18189	SPU_018189	none	none
GLEAN3_18194	SPU_018194	none	none
GLEAN3_18208	SPU_018208	contains 6 NF-X1_zinc_finger superfamily motifs	none
GLEAN3_18223	SPU_018223	contains 5 CUB superfamily motifs	none
GLEAN3_18255	SPU_018255	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18257	SPU_018257	none	none
GLEAN3_18278	SPU_018278	none	none
GLEAN3_18280	SPU_018280	none	none
GLEAN3_18287	SPU_018287	contains PRK06241 domain. homologous to bacterial proteins.	none
GLEAN3_18293	SPU_018293	none	none
GLEAN3_18296	SPU_018296	contains 4 SRCR superfamily motifs	none
GLEAN3_18300	SPU_018300	contains Smc domain	none
GLEAN3_18303	SPU_018303	none	none
GLEAN3_18304	SPU_018304	none	none
GLEAN3_18309	SPU_018309	none	none
GLEAN3_18320	SPU_018320	none	none
GLEAN3_18334	SPU_018334	none	none
GLEAN3_18360	SPU_018360	none	none
GLEAN3_18387	SPU_018387	none	none
GLEAN3_18395	SPU_018395	possible chimera due to erroneous sequence assembly ?	none
GLEAN3_18414	SPU_018414	none	none
GLEAN3_18442	SPU_018442	none	none
GLEAN3_18454	SPU_018454	none	none
GLEAN3_18464	SPU_018464	none	none
GLEAN3_18478	SPU_018478	none	none
GLEAN3_18509	SPU_018509	none	none
GLEAN3_18517	SPU_018517	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18532	SPU_018532	none	none
GLEAN3_18539	SPU_018539	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18566	SPU_018566	none	none
GLEAN3_18594	SPU_018594	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18612	SPU_018612	none	none
GLEAN3_18616	SPU_018616	contains 2 Thioredoxin-like superfamily motifs	none
GLEAN3_18669	SPU_018669	none	none
GLEAN3_18696	SPU_018696	contains 2 MFS superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18712	SPU_018712	contains SXM1 domain	none
GLEAN3_18772	SPU_018772	none	none
GLEAN3_18789	SPU_018789	contains COG4372 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18790	SPU_018790	contains 4 COG5638 domain motifs	none
GLEAN3_18803	SPU_018803	none	none
GLEAN3_18839	SPU_018839	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_18942	SPU_018942	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_18956	SPU_018956	none	none
GLEAN3_19031	SPU_019031	none	none
GLEAN3_19034	SPU_019034	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19035	SPU_019035	none	none
GLEAN3_19055	SPU_019055	none	none
GLEAN3_19056	SPU_019056	none	none
GLEAN3_19063	SPU_019063	contains AST1 domain	none
GLEAN3_19068	SPU_019068	none	none
GLEAN3_19098	SPU_019098	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19125	SPU_019125	contains 2 TPR superfamily motifs	none
GLEAN3_19148	SPU_019148	contains FRQ1 domain	none
GLEAN3_19164	SPU_019164	none	none
GLEAN3_19173	SPU_019173	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19194	SPU_019194	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19200	SPU_019200	none	none
GLEAN3_19201	SPU_019201	none	none
GLEAN3_19211	SPU_019211	poor amino acid sequence (~50% of amino acids are X)	none
GLEAN3_19225	SPU_019225	contains COG1331 domain	none
GLEAN3_19237	SPU_019237	contains 4 SRCR superfamily motifs	none
GLEAN3_19240	SPU_019240	none	none
GLEAN3_19244	SPU_019244	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19250	SPU_019250	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19284	SPU_019284	contains RAD18 domain	none
GLEAN3_19293	SPU_019293	none	none
GLEAN3_19295	SPU_019295	none	none
GLEAN3_19311	SPU_019311	contains 3 CCP superfamily motifs and 2 GCC2_GCC3 superfamily motifs	none
GLEAN3_19296	SPU_019296	contains Golgin_A5 domain and SMC_N domain	none
GLEAN3_19313	SPU_019313	probable assembly chimera	none
GLEAN3_19315	SPU_019315	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19324	SPU_019324	none	none
GLEAN3_19343	SPU_019343	contains infB domain	none
GLEAN3_19348	SPU_019348	none	none
GLEAN3_19439	SPU_019439	contains Macoilin domain	none
GLEAN3_19495	SPU_019495	none	none
GLEAN3_19499	SPU_019499	none	none
GLEAN3_19511	SPU_019511	contains 4 C2 superfamily motifs	none
GLEAN3_19532	SPU_019532	none	none
GLEAN3_19538	SPU_019538	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19565	SPU_019565	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19566	SPU_019566	contains COG3386 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19578	SPU_019578	poor amino acid sequence (~40 % of amino acids are X)	none
GLEAN3_19611	SPU_019611	probable assembly chimera	none
GLEAN3_19621	SPU_019621	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_19642	SPU_019642	none	none
GLEAN3_19665	SPU_019665	contains 4 IG superfamily motifs	none
GLEAN3_19703	SPU_019703	none	none
GLEAN3_19707	SPU_019707	none	none
GLEAN3_19716	SPU_019716	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19731	SPU_019731	contains 5 Kelch_1 superfamily motifs	none
GLEAN3_19737	SPU_019737	none	none
GLEAN3_19739	SPU_019739	none	none
GLEAN3_19752	SPU_019752	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19763	SPU_019763	none	none
GLEAN3_19789	SPU_019789	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19801	SPU_019801	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19807	SPU_019807	contains Smc domain and SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19831	SPU_019831	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19851	SPU_019851	contains DNA_Pol_B_2 domain	none
GLEAN3_19853	SPU_019853	none	none
GLEAN3_19860	SPU_019860	contains 3 FA58C superfamily motifs	none
GLEAN3_19891	SPU_019891	contains 4 FA58C superfamily motifs	none
GLEAN3_19892	SPU_019892	contains 6 ANK superfamily motifs and Arp domain	none
GLEAN3_19913	SPU_019913	none	none
GLEAN3_19937	SPU_019937	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19938	SPU_019938	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19941	SPU_019941	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19963	SPU_019963	contains 4 SH3 superfamily motifs	none
GLEAN3_19978	SPU_019978	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19981	SPU_019981	contains 2 Ldl_recept_b superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_19993	SPU_019993	contains PRK03918 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20011	SPU_020011	none	none
GLEAN3_20047	SPU_020047	contains 2 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20048	SPU_020048	none	none
GLEAN3_20055	SPU_020055	none	none
GLEAN3_20067	SPU_020067	none	none
GLEAN3_20068	SPU_020068	contains COG1340 domain	none
GLEAN3_20086	SPU_020086	none	none
GLEAN3_20098	SPU_020098	none	none
GLEAN3_20111	SPU_020111	contains MgtA domain	none
GLEAN3_20122	SPU_020122	contains 8 HYR superfamily motifs	none
GLEAN3_20133	SPU_020133	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20134	SPU_020134	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20145	SPU_020145	contains 3 Kelch_1 superfamily motifs	none
GLEAN3_20152	SPU_020152	homologous to numerous Branchiostoma floridae proteins	none
GLEAN3_20180	SPU_020180	none	none
GLEAN3_20186	SPU_020186	none	none
GLEAN3_20190	SPU_020190	none	none
GLEAN3_20198	SPU_020198	contains 2 RGS superfamily motifs	none
GLEAN3_20204	SPU_020204	none	none
GLEAN3_20215	SPU_020215	none	none
GLEAN3_20223	SPU_020223	none	none
GLEAN3_20245	SPU_020245	contains 2 HYR superfamily motifs	none
GLEAN3_20248	SPU_020248	none	none
GLEAN3_20252	SPU_020252	none	none
GLEAN3_20263	SPU_020263	none	none
GLEAN3_20272	SPU_020272	none	none
GLEAN3_20327	SPU_020327	none	none
GLEAN3_20328	SPU_020328	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20356	SPU_020356	none	none
GLEAN3_20357	SPU_020357	none	none
GLEAN3_20364	SPU_020364	none	none
GLEAN3_20388	SPU_020388	none	none
GLEAN3_20400	SPU_020400	none	none
GLEAN3_20421	SPU_020421	contains 2 LRR_RI superfamily motifs	none
GLEAN3_20422	SPU_020422	contains 3 Neuralized superfamily motifs	none
GLEAN3_20456	SPU_020456	none	none
GLEAN3_20466	SPU_020466	none	none
GLEAN3_20468	SPU_020468	none	none
GLEAN3_20470	SPU_020470	contains 3 IG superfamily motifs	none
GLEAN3_20472	SPU_020472	none	none
GLEAN3_20483	SPU_020483	none	none
GLEAN3_20496	SPU_020496	none	none
GLEAN3_20507	SPU_020507	none	none
GLEAN3_20539	SPU_020539	contains V-set domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20543	SPU_020543	none	none
GLEAN3_20546	SPU_020546	contains COG3391 domain	none
GLEAN3_20552	SPU_020552	none	none
GLEAN3_20571	SPU_020571	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20572	SPU_020572	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20575	SPU_020575	none	none
GLEAN3_20589	SPU_020589	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20602	SPU_020602	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20608	SPU_020608	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20618	SPU_020618	contains V-set domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20660	SPU_020660	contains 10 ANK superfamily motifs and Arp domain	none
GLEAN3_20661	SPU_020661	contains COG5635 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20684	SPU_020684	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20691.1	SPU_020691.1	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20702	SPU_020702	contains 2 SAM superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20716	SPU_020716	contains 2 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20723	SPU_020723	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_20727	SPU_020727	homologous to numerous Hydra magnipapillata proteins	none
GLEAN3_20732	SPU_020732	none	none
GLEAN3_20746	SPU_020746	none	none
GLEAN3_20748	SPU_020748	none	none
GLEAN3_20763	SPU_020763	none	none
GLEAN3_20765	SPU_020765	contains COG1112 domain	none
GLEAN3_20768	SPU_020768	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20771	SPU_020771	none	none
GLEAN3_20774	SPU_020774	none	none
GLEAN3_20779	SPU_020779	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20783	SPU_020783	none	none
GLEAN3_20786	SPU_020786	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20787	SPU_020787	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20794	SPU_020794	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20805	SPU_020805	contains SGL domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20810	SPU_020810	contains 4 ANK superfamily motifs and Arp domain	none
GLEAN3_20817	SPU_020817	none	none
GLEAN3_20818	SPU_020818	contains 5 HYR superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20827	SPU_020827	none	none
GLEAN3_20829	SPU_020829	none	none
GLEAN3_20843	SPU_020843	contains 2 ANK superfamily motifs	none
GLEAN3_20848	SPU_020848	none	none
GLEAN3_20866	SPU_020866	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20877	SPU_020877	none	none
GLEAN3_20894	SPU_020894	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20896	SPU_020896	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_20905	SPU_020905	none	none
GLEAN3_20921	SPU_020921	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20948	SPU_020948	none	none
GLEAN3_20958	SPU_020958	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20964	SPU_020964	contains MviM domain	none
GLEAN3_20967	SPU_020967	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20989	SPU_020989	none	none
GLEAN3_20990	SPU_020990	none	none
GLEAN3_20998	SPU_020998	none	none
GLEAN3_21026	SPU_021026	none	none
GLEAN3_21038	SPU_021038	none	none
GLEAN3_21039	SPU_021039	none	none
GLEAN3_21050	SPU_021050	none	none
GLEAN3_21053	SPU_021053	contains 2 Galactosyl_T superfamily motifs	none
GLEAN3_21080	SPU_021080	none	none
GLEAN3_21081	SPU_021081	none	none
GLEAN3_21092	SPU_021092	none	none
GLEAN3_21107	SPU_021107	contains Smc domain and SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21115	SPU_021115	none	none
GLEAN3_21122	SPU_021122	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21129	SPU_021129	none	none
GLEAN3_21138	SPU_021138	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21139	SPU_021139	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21145	SPU_021145	none	none
GLEAN3_21163	SPU_021163	contains 2 SPEC superfamily motifs	none
GLEAN3_21180	SPU_021180	none	none
GLEAN3_21212	SPU_021212	none	none
GLEAN3_21214	SPU_021214	none	none
GLEAN3_21227	SPU_021227	contains DHC_N1 domain	none
GLEAN3_21232	SPU_021232	none	none
GLEAN3_21247	SPU_021247	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21258	SPU_021258	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21278	SPU_021278	none	none
GLEAN3_21294	SPU_021294	none	none
GLEAN3_21310	SPU_021310	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21314	SPU_021314	contains 2 MFS superfamily motifs	none
GLEAN3_21338	SPU_021338	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21373	SPU_021373	none	none
GLEAN3_21389	SPU_021389	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_21391	SPU_021391	none	none
GLEAN3_21413	SPU_021413	homologous only to 1 putative Strongylocentrotus purpuratus protein. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21429	SPU_021429	contains 3 DUF1126 superfamily motifs	none
GLEAN3_21444	SPU_021444	none	none
GLEAN3_21446	SPU_021446	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21460	SPU_021460	none	none
GLEAN3_21464	SPU_021464	none	none
GLEAN3_21481	SPU_021481	none	none
GLEAN3_21488	SPU_021488	none	none
GLEAN3_21501	SPU_021501	none	none
GLEAN3_21539	SPU_021539	none	none
GLEAN3_21541	SPU_021541	contains 2 CCP superfamily motifs and COG3889 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21585	SPU_021585	none	none
GLEAN3_21612	SPU_021612	none	none
GLEAN3_21618	SPU_021618	none	none
GLEAN3_21626	SPU_021626	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21644	SPU_021644	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_21659	SPU_021659	contains V-set domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21664	SPU_021664	none	none
GLEAN3_21710	SPU_021710	contains PAT1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_21777	SPU_021777	none	none
GLEAN3_21791	SPU_021791	contains 2 7tm_1 superfamily motifs	none
GLEAN3_21823	SPU_021823	none	none
GLEAN3_21831	SPU_021831	contains 2 MFS superfamily motifs	none
GLEAN3_21842	SPU_021842	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21845	SPU_021845	contains CH superfamily motif at C-terminus	none
GLEAN3_21846	SPU_021846	none	none
GLEAN3_21850	SPU_021850	contains CH superfamily motif at C-terminus and Smc domain	none
GLEAN3_21852	SPU_021852	none	none
GLEAN3_21856	SPU_021856	contains 2 Serinc superfamily motifs	none
GLEAN3_21857	SPU_021857	none	none
GLEAN3_21891	SPU_021891	none	none
GLEAN3_21910	SPU_021910	none	none
GLEAN3_21913	SPU_021913	none	none
GLEAN3_21935	SPU_021935	contains PRK07003 domain and AcuC domain	none
GLEAN3_21943	SPU_021943	none	none
GLEAN3_21953	SPU_021953	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21956	SPU_021956	none	none
GLEAN3_21961	SPU_021961	none	none
GLEAN3_21971	SPU_021971	contains SMC_N domain and PRK11281 domain	none
GLEAN3_21980	SPU_021980	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21999	SPU_021999	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22008	SPU_022008	contains DHC_N1 domain	none
GLEAN3_22018	SPU_022018	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22048	SPU_022048	none	none
GLEAN3_22069	SPU_022069	contains COG5141 domain	none
GLEAN3_22096	SPU_022096	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22106	SPU_022106	contains COG2319 domain	none
GLEAN3_22107	SPU_022107	contains Smc domain	none
GLEAN3_22119	SPU_022119	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22137	SPU_022137	contains 2 CUB superfamily motifs	none
GLEAN3_22138	SPU_022138	contains 4 CUB superfamily motifs	none
GLEAN3_22143	SPU_022143	contains 3 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22165	SPU_022165	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22176	SPU_022176	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_22182	SPU_022182	contains 2 EFh superfamily motifs	none
GLEAN3_22183	SPU_022183	none	none
GLEAN3_22191	SPU_022191	none	none
GLEAN3_22218	SPU_022218	none	none
GLEAN3_22219	SPU_022219	contains 2 7tm_1 superfamily motifs	none
GLEAN3_22249	SPU_022249	contains 5 Kelch_1 superfamily motifs	none
GLEAN3_22261	SPU_022261	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22293	SPU_022293	contains 9 ANK superfamily motifs and Arp domain	none
GLEAN3_22323	SPU_022323	none	none
GLEAN3_22324	SPU_022324	none	none
GLEAN3_22359	SPU_022359	contains 2 RNA_bind superfamily motifs	none
GLEAN3_22364	SPU_022364	none	none
GLEAN3_22366	SPU_022366	contains 5 CUB superfamily motifs	none
GLEAN3_22391	SPU_022391	none	none
GLEAN3_22408	SPU_022408	contains PnbA domain	none
GLEAN3_22416	SPU_022416	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22417	SPU_022417	homologous to numerous putative Hydra magnipapillata proteins	none
GLEAN3_22420	SPU_022420	contains COG1112 domain	none
GLEAN3_22442	SPU_022442	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22446	SPU_022446	contains COG1293 domain	none
GLEAN3_22471	SPU_022471	homologous to numerous putative Branchiostoma floridae proteins	none
GLEAN3_22494	SPU_022494	contains 2 IG superfamily motifs	none
GLEAN3_22516	SPU_022516	none	none
GLEAN3_22539	SPU_022539	contains 2 Noc2 superfamily motifs	none
GLEAN3_22551	SPU_022551	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22577	SPU_022577	none	none
GLEAN3_22579	SPU_022579	none	none
GLEAN3_22599	SPU_022599	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22606	SPU_022606	none	none
GLEAN3_22668	SPU_022668	none	none
GLEAN3_22671	SPU_022671	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22679	SPU_022679	none	none
GLEAN3_22702	SPU_022702	none	none
GLEAN3_22741	SPU_022741	contains 4 CCP superfamily motifs	none
GLEAN3_22745	SPU_022745	contains 2 SMC_N domain motifs	none
GLEAN3_22792	SPU_022792	none	none
GLEAN3_22804	SPU_022804	contains PRK00409 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22821	SPU_022821	none	none
GLEAN3_22823	SPU_022823	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22827	SPU_022827	none	none
GLEAN3_22828	SPU_022828	contains PRK06851 domain	none
GLEAN3_22835	SPU_022835	contains COG5236 domain	none
GLEAN3_22847	SPU_022847	none	none
GLEAN3_22852	SPU_022852	contains 2 EFh superfamily motifs	none
GLEAN3_22863	SPU_022863	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22885	SPU_022885	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22913	SPU_022913	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22933	SPU_022933	contains V-set domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_22948	SPU_022948	contains Deme6 domain	none
GLEAN3_22964	SPU_022964	contains PRK07003 domain	none
GLEAN3_22981	SPU_022981	contains 3 CCP superfamily motifs and 3 EGF_CA superfamily motifs	none
GLEAN3_22993	SPU_022993	none	none
GLEAN3_23007	SPU_023007	none	none
GLEAN3_23022	SPU_023022	none	none
GLEAN3_23024	SPU_023024	contains 2 PHD superfamily motifs	none
GLEAN3_23043	SPU_023043	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23045	SPU_023045	homologous only to 2 putative Strongylocentrotus purpuratus proteins, probably including itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23063	SPU_023063	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23074	SPU_023074	none	none
GLEAN3_23077	SPU_023077	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23081	SPU_023081	none	none
GLEAN3_23103	SPU_023103	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23105	SPU_023105	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23116	SPU_023116	low quality protein sequence: more than 50% of amino acids are uncertain.	none
GLEAN3_23135	SPU_023135	contains SGL domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23143	SPU_023143	contains 2 WD40 superfamily motifs	none
GLEAN3_23157	SPU_023157	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23184	SPU_023184	contains 3 FA58C superfamily motifs	none
GLEAN3_23190	SPU_023190	contains Pyr_redox_2 domain at C-terminus	none
GLEAN3_23194	SPU_023194	none	none
GLEAN3_23203	SPU_023203	none	none
GLEAN3_23205	SPU_023205	none	none
GLEAN3_23208	SPU_023208	contains 3 Lipocalin superfamily motifs	none
GLEAN3_23210	SPU_023210	none	none
GLEAN3_23212	SPU_023212	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23230	SPU_023230	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23235	SPU_023235	none	none
GLEAN3_23240	SPU_023240	none	none
GLEAN3_23264	SPU_023264	none	none
GLEAN3_23299	SPU_023299	none	none
GLEAN3_23301	SPU_023301	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23356	SPU_023356	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23365	SPU_023365	contains 3 TPR superfamily motifs and NrfG domain	none
GLEAN3_23377	SPU_023377	none	none
GLEAN3_23379	SPU_023379	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23382	SPU_023382	homologous to only itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23392	SPU_023392	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23410	SPU_023410	contains 3 CCP superfamily motifs	none
GLEAN3_23474	SPU_023474	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23478	SPU_023478	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23482	SPU_023482	none	none
GLEAN3_23536	SPU_023536	contains 2 IPT superfamily motifs	none
GLEAN3_23542	SPU_023542	none	none
GLEAN3_23543	SPU_023543	contains COG4946 domain	none
GLEAN3_23553	SPU_023553	contains 4 CUB superfamily motifs	none
GLEAN3_23558	SPU_023558	low quality protein sequence: 70% of amino acids are X. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23572	SPU_023572	contains 2 Hist_deacetyl superfamily motifs	none
GLEAN3_23586	SPU_023586	contains 7 ANK superfamily motifs and Arp domain	none
GLEAN3_23594	SPU_023594	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23597	SPU_023597	none	none
GLEAN3_23600	SPU_023600	none	none
GLEAN3_23606	SPU_023606	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23624	SPU_023624	none	none
GLEAN3_23679	SPU_023679	contains Dynein_heavy domain	none
GLEAN3_23682	SPU_023682	none	none
GLEAN3_23690	SPU_023690	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23695	SPU_023695	none	none
GLEAN3_23704	SPU_023704	none	none
GLEAN3_23710	SPU_023710	none	none
GLEAN3_23728	SPU_023728	none	none
GLEAN3_23729	SPU_023729	contains Smc domain	none
GLEAN3_23770	SPU_023770	none	none
GLEAN3_23786	SPU_023786	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23800	SPU_023800	none	none
GLEAN3_23820	SPU_023820	none	none
GLEAN3_23847	SPU_023847	none	none
GLEAN3_23861	SPU_023861	none	none
GLEAN3_23872	SPU_023872	none	none
GLEAN3_23896	SPU_023896	none	none
GLEAN3_23910	SPU_023910	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_23918	SPU_023918	contains 2 Sulfotransfer_1 domain motifs	none
GLEAN3_23939	SPU_023939	contains ROM1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_23950	SPU_023950	none	none
GLEAN3_24022	SPU_024022	contains 2 IG superfamily motifs and Ion_trans domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24024	SPU_024024	contains 2 CUB superfamily motifs	none
GLEAN3_24031	SPU_024031	none	none
GLEAN3_24050	SPU_024050	contains COG4886 domain	none
GLEAN3_24067	SPU_024067	none	none
GLEAN3_24000	SPU_024000	contains OmpH domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24054	SPU_024054	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24056	SPU_024056	contains SMC_N domain	none
GLEAN3_24077	SPU_024077	low quality protein sequence: >70% of amino aicds are X	none
GLEAN3_24086	SPU_024086	contains Smc domain	none
GLEAN3_24089	SPU_024089	contains COG5222 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24108	SPU_024108	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24109	SPU_024109	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24116	SPU_024116	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24122	SPU_024122	none	none
GLEAN3_24124	SPU_024124	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24147	SPU_024147	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24151	SPU_024151	contains 4 LDLa superfamily motifs	none
GLEAN3_24156	SPU_024156	contains COG4886 domain	none
GLEAN3_24171	SPU_024171	contains 5 ANK superfamily motifs	none
GLEAN3_24211	SPU_024211	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24212	SPU_024212	none	none
GLEAN3_24275	SPU_024275	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24276	SPU_024276	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24287	SPU_024287	contains CH superfamily motif at C-terminus and Smc domain	none
GLEAN3_24292	SPU_024292	contains Ion_trans domain	none
GLEAN3_24293	SPU_024293	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24303	SPU_024303	contains TEL1 domain	none
GLEAN3_24321	SPU_024321	none	none
GLEAN3_24322	SPU_024322	none	none
GLEAN3_24327	SPU_024327	none	none
GLEAN3_24359	SPU_024359	contains AST1 domain	none
GLEAN3_24363	SPU_024363	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24368	SPU_024368	none	none
GLEAN3_24369	SPU_024369	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24374	SPU_024374	none	none
GLEAN3_24402	SPU_024402	contains HSP70 domain	none
GLEAN3_24407	SPU_024407	none	none
GLEAN3_24418	SPU_024418	none	none
GLEAN3_24436	SPU_024436	contains 3 ANK superfamily motifs and Arp domain and Ion_trans domain	none
GLEAN3_24437	SPU_024437	contains 2 ANK superfamily motifs and Arp domain and Ion_trans domain	none
GLEAN3_24462	SPU_024462	none	none
GLEAN3_24519	SPU_024519	none	none
GLEAN3_24533	SPU_024533	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24544	SPU_024544	contains 3 ANK superfamily motifs	none
GLEAN3_24564	SPU_024564	contains A2M_N domain	none
GLEAN3_24574	SPU_024574	none	none
GLEAN3_24591	SPU_024591	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24597	SPU_024597	none	none
GLEAN3_24599	SPU_024599	contains 3 CH superfamily motifs and 4 Filamin superfamily motifs	none
GLEAN3_24624	SPU_024624	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24627	SPU_024627	none	none
GLEAN3_24630	SPU_024630	contains 2 BBOX superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24633	SPU_024633	contains PAT1 domain	none
GLEAN3_24682	SPU_024682	contains Rad18 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24683	SPU_024683	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24684	SPU_024684	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24689	SPU_024689	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24722	SPU_024722	contains COG5114 domain	none
GLEAN3_24723	SPU_024723	contains 2 EGF_CA superfamily motifs	none
GLEAN3_24759	SPU_024759	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24760	SPU_024760	contains AIR1 domain	none
GLEAN3_24776	SPU_024776	contains PDZ superfamily motif at C-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24779	SPU_024779	none	none
GLEAN3_24833	SPU_024833	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24852	SPU_024852	none	none
GLEAN3_24870	SPU_024870	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_24874	SPU_024874	contains SMC_N domain	none
GLEAN3_24879	SPU_024879	none	none
GLEAN3_24886	SPU_024886	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24897	SPU_024897	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24917	SPU_024917	none	none
GLEAN3_24928	SPU_024928	none	none
GLEAN3_24932	SPU_024932	none	none
GLEAN3_24934	SPU_024934	none	none
GLEAN3_24941	SPU_024941	none	none
GLEAN3_24996	SPU_024996	none	none
GLEAN3_25009	SPU_025009	none	none
GLEAN3_25011	SPU_025011	none	none
GLEAN3_25020	SPU_025020	homologous to numerous putative proteins in Branchiostoma floridae and Nematostella vectensis	none
GLEAN3_25059	SPU_025059	none	none
GLEAN3_25096	SPU_025096	contains 3 EGF_CA superfamily motifs	none
GLEAN3_25111	SPU_025111	none	none
GLEAN3_25124	SPU_025124	contains PRK12323 domain and COG2931 domain	none
GLEAN3_25128	SPU_025128	none	none
GLEAN3_25130	SPU_025130	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25131	SPU_025131	none	none
GLEAN3_25133	SPU_025133	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25142	SPU_025142	none	none
GLEAN3_25156	SPU_025156	none	none
GLEAN3_25157	SPU_025157	none	none
GLEAN3_25191	SPU_025191	contains PldB domain	none
GLEAN3_25198	SPU_025198	none	none
GLEAN3_25208	SPU_025208	contains Smc domain	none
GLEAN3_25216	SPU_025216	contains 3 CUB superfamily motifs	none
GLEAN3_25235	SPU_025235	contains 2 vWFA superfamily motifs	none
GLEAN3_25247	SPU_025247	contains DUF2353 domain	none
GLEAN3_25268	SPU_025268	none	none
GLEAN3_25269	SPU_025269	none	none
GLEAN3_25310	SPU_025310	contains 4 FA58C superfamily motifs	none
GLEAN3_25319	SPU_025319	none	none
GLEAN3_25320	SPU_025320	none	none
GLEAN3_25330	SPU_025330	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25331	SPU_025331	contains 2 KR superfamily motifs and 3 FA58C superfamily motifs	none
GLEAN3_25343	SPU_025343	contains B41 domain	none
GLEAN3_25351	SPU_025351	none	none
GLEAN3_25356	SPU_025356	contains SMC_N domain	none
GLEAN3_25364	SPU_025364	none	none
GLEAN3_25381	SPU_025381	none	none
GLEAN3_25384	SPU_025384	none	none
GLEAN3_25392	SPU_025392	contains PRK08279 domain	none
GLEAN3_25403	SPU_025403	contains COG3386 domain	none
GLEAN3_25415	SPU_025415	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25416	SPU_025416	contains 2 DEXDc superfamily motifs and SrmB domain	none
GLEAN3_25418	SPU_025418	contains Smc domain and SMC_N domain	none
GLEAN3_25420	SPU_025420	none	none
GLEAN3_25426	SPU_025426	contains MDN1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25430	SPU_025430	contains 4 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25431	SPU_025431	contains 4 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25466	SPU_025466	none	none
GLEAN3_25488	SPU_025488	none	none
GLEAN3_25499	SPU_025499	none	none
GLEAN3_25512	SPU_025512	none	none
GLEAN3_25532	SPU_025532	contains 3 ANK superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25534	SPU_025534	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25538	SPU_025538	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25539	SPU_025539	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25541	SPU_025541	contains SbcC domain	none
GLEAN3_25687	SPU_025687	none	none
GLEAN3_25587	SPU_025587	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25716	SPU_025716	none	none
GLEAN3_25720	SPU_025720	none	none
GLEAN3_25588	SPU_025588	none	none
GLEAN3_25627	SPU_025627	none	none
GLEAN3_25628	SPU_025628	contains 6 NF-X1-zinc-finger superfamily motifs and COG5219 domain	none
GLEAN3_25740	SPU_025740	none	none
GLEAN3_25741	SPU_025741	none	none
GLEAN3_25757	SPU_025757	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25778	SPU_025778	none	none
GLEAN3_25801	SPU_025801	none	none
GLEAN3_25805	SPU_025805	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25811	SPU_025811	none	none
GLEAN3_25836	SPU_025836	contains 2 CIMR superfamily motifs	none
GLEAN3_25839	SPU_025839	contains COG1340 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25842	SPU_025842	contains 3 HYR superfamily motifs	none
GLEAN3_25883	SPU_025883	contains 2 RPA2_OBF_family superfamily motifs	none
GLEAN3_25901	SPU_025901	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25905	SPU_025905	none	none
GLEAN3_25925	SPU_025925	none	none
GLEAN3_25932	SPU_025932	none	none
GLEAN3_25933	SPU_025933	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25938	SPU_025938	none	none
GLEAN3_25939	SPU_025939	none	none
GLEAN3_25941	SPU_025941	contains Ribosomal_L7Ae superfamily motif at C-terminus	none
GLEAN3_25950	SPU_025950	contains 4 ARM superfamily motifs	none
GLEAN3_25960	SPU_025960	none	none
GLEAN3_25965	SPU_025965	contains COG4783 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_25970	SPU_025970	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25978	SPU_025978	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26000	SPU_026000	contains V-set domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26024	SPU_026024	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26030	SPU_026030	none	none
GLEAN3_26033	SPU_026033	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26044	SPU_026044	contains 6 ANK superfamily motifs and Arp domain	none
GLEAN3_26085	SPU_026085	homologous to numerous putative Hydra magnipapillata proteins. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26086	SPU_026086	none	none
GLEAN3_26094	SPU_026094	none	none
GLEAN3_26120	SPU_026120	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26122	SPU_026122	none	none
GLEAN3_26133	SPU_026133	none	none
GLEAN3_26150	SPU_026150	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26192	SPU_026192	contains csdA domain	none
GLEAN3_26194	SPU_026194	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26213	SPU_026213	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26250	SPU_026250	none	none
GLEAN3_26262	SPU_026262	none	none
GLEAN3_26323	SPU_026323	contains 2 Mem_trans superfamily motifs	none
GLEAN3_26331	SPU_026331	none	none
GLEAN3_26334	SPU_026334	none	none
GLEAN3_26341	SPU_026341	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26344	SPU_026344	contains 2 Filamin superfamily motifs	none
GLEAN3_26381	SPU_026381	none	none
GLEAN3_26383	SPU_026383	contains COG1293 domain	none
GLEAN3_26354	SPU_026354	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26384	SPU_026384	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26392	SPU_026392	none	none
GLEAN3_26404	SPU_026404	none	none
GLEAN3_26426	SPU_026426	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26429	SPU_026429	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26449	SPU_026449	contains 2 RING superfamily and PRK03918 domain. poor sequence data: ~30% of amino acids are X. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26464	SPU_026464	contains 2 LRR_RI superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26484	SPU_026484	none	none
GLEAN3_26486	SPU_026486	none	none
GLEAN3_26489	SPU_026489	contains Ion_trans domain	none
GLEAN3_26505	SPU_026505	none	none
GLEAN3_26506	SPU_026506	none	none
GLEAN3_26522	SPU_026522	contains Smc domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26554	SPU_026554	none	none
GLEAN3_26557	SPU_026557	contains PRK13578 domain	none
GLEAN3_26562	SPU_026562	contains 2 Drf_FH3 superfamily motifs	none
GLEAN3_26564	SPU_026564	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26588	SPU_026588	contains 5 HYR superfamily motifs	none
GLEAN3_26604	SPU_026604	none	none
GLEAN3_26609	SPU_026609	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26655	SPU_026655	contains 8 ANK superfamily motifs and Arp domain	none
GLEAN3_26662	SPU_026662	contains 3 CUB superfamily motifs and 3 EGF_CA superfamily motifs	none
GLEAN3_26663	SPU_026663	none	none
GLEAN3_26666	SPU_026666	none	none
GLEAN3_26672	SPU_026672	none	none
GLEAN3_26674	SPU_026674	contains 9 ANK superfamily motifs and Arp domain	none
GLEAN3_26732	SPU_026732	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26738	SPU_026738	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26739	SPU_026739	contains 2 FA58C superfamily motifs	none
GLEAN3_26741	SPU_026741	contains 4 MAM superfamily motifs	none
GLEAN3_26752	SPU_026752	contains 3 ANK superfamily motifs and Arp domain	none
GLEAN3_26794	SPU_026794	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26809	SPU_026809	none	none
GLEAN3_26822	SPU_026822	none	none
GLEAN3_26826	SPU_026826	contains Smc domain	none
GLEAN3_26840	SPU_026840	none	none
GLEAN3_26844	SPU_026844	none	none
GLEAN3_26865	SPU_026865	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26875	SPU_026875	contains 3 NHL superfamily motifs and PRK07764 domain and COG3391 domain	none
GLEAN3_26887	SPU_026887	contains 2 Death superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_26906	SPU_026906	contains 2 RRM superfamily motifs	none
GLEAN3_26909	SPU_026909	none	none
GLEAN3_26912	SPU_026912	none	none
GLEAN3_26917	SPU_026917	contains 3 HYR superfamily motifs and 2 CUB superfamily motifs	none
GLEAN3_26927	SPU_026927	none	none
GLEAN3_26934	SPU_026934	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26942	SPU_026942	contains 2 HYR superfamily motifs and 2 GCC2_GCC3 superfamily motifs	none
GLEAN3_26945	SPU_026945	none	none
GLEAN3_26950	SPU_026950	contains 4 Kelch_1 superfamily motifs	none
GLEAN3_26965	SPU_026965	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26970	SPU_026970	contains Nop14 domain	none
GLEAN3_26981	SPU_026981	none	none
GLEAN3_26992	SPU_026992	none	none
GLEAN3_26998	SPU_026998	contains 4 CCP superfamily motifs	none
GLEAN3_27002	SPU_027002	contains HSP70 domain. poor sequence data: more than 60% of amino acids including C-terminal half are X.	none
GLEAN3_27005	SPU_027005	contains Smc domain	none
GLEAN3_27008	SPU_027008	contains SMC_N domain	none
GLEAN3_27016	SPU_027016	none	none
GLEAN3_27027	SPU_027027	none	none
GLEAN3_27029	SPU_027029	contains Torsin domain	none
GLEAN3_27046	SPU_027046	contains 3 IG superfamily motifs	none
GLEAN3_27058	SPU_027058	contains 2 ZU5 superfamily motifs	none
GLEAN3_27072	SPU_027072	none	none
GLEAN3_27074	SPU_027074	none	none
GLEAN3_27089	SPU_027089	none	none
GLEAN3_27102	SPU_027102	contains DNA_pol_B_2 domain	none
GLEAN3_27122	SPU_027122	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27182	SPU_027182	none	none
GLEAN3_27186	SPU_027186	none	none
GLEAN3_27256	SPU_027256	none	none
GLEAN3_27261	SPU_027261	none	none
GLEAN3_27270	SPU_027270	none	none
GLEAN3_27279	SPU_027279	contains 3 DM14 superfamily motifs	none
GLEAN3_27295	SPU_027295	contains PRK02106 domain	none
GLEAN3_27297	SPU_027297	contains PRK02106 domain	none
GLEAN3_27307	SPU_027307	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27317	SPU_027317	none	none
GLEAN3_27352	SPU_027352	contains SMC_N domain	none
GLEAN3_27354	SPU_027354	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27357	SPU_027357	none	none
GLEAN3_27363	SPU_027363	none	none
GLEAN3_27391	SPU_027391	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27395	SPU_027395	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27424	SPU_027424	contains 2 Galactosyl_T superfamily motifs	none
GLEAN3_27437	SPU_027437	contains 2 Galactosyl_T superfamily motifs	none
GLEAN3_27463	SPU_027463	contains MFS_1 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27464	SPU_027464	none	none
GLEAN3_27473	SPU_027473	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27515	SPU_027515	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27517	SPU_027517	none	none
GLEAN3_27558	SPU_027558	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27575	SPU_027575	contains 3 COG1357 domain motifs	none
GLEAN3_27627	SPU_027627	contains 3 Thioredoxin-like superfamily motifs	none
GLEAN3_27638	SPU_027638	contains 2 HYR superfamily motifs	none
GLEAN3_27641	SPU_027641	none	none
GLEAN3_27646	SPU_027646	contains MAM superfamily motif at N-terminus. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27651	SPU_027651	contains COG1219 domain	none
GLEAN3_27661	SPU_027661	contains Macoilin domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27667	SPU_027667	contains 2 IG superfamily motifs	none
GLEAN3_27690	SPU_027690	none	none
GLEAN3_27720	SPU_027720	none	none
GLEAN3_27736	SPU_027736	none	none
GLEAN3_27742	SPU_027742	contains 2 WSC superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27746	SPU_027746	contains 5 ANK superfamily motifs and Arp domain	none
GLEAN3_27757	SPU_027757	none	none
GLEAN3_27761	SPU_027761	none	none
GLEAN3_27770	SPU_027770	none	none
GLEAN3_27773	SPU_027773	contains 2 EGF_CA superfamily motifs	none
GLEAN3_27787	SPU_027787	contains SMC_N domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27807	SPU_027807	none	none
GLEAN3_27825	SPU_027825	none	none
GLEAN3_27846	SPU_027846	none	none
GLEAN3_27865	SPU_027865	none	none
GLEAN3_27869	SPU_027869	homologous only to itself. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27871	SPU_027871	contains 3 IPT superfamily motifs	none
GLEAN3_27877	SPU_027877	contains 2 SSF superfamily motifs	none
GLEAN3_27907	SPU_027907	contains PRK12704 domain. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27910	SPU_027910	none	none
GLEAN3_27955	SPU_027955	contains 3 IG superfamily motifs. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27961	SPU_027961	poor sequence data: ~50% of amino acids are X. Strongylocentrotus purpuratus-specific protein.	none
GLEAN3_27990	SPU_027990	none	none
GLEAN3_27991	SPU_027991	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27995	SPU_027995	none	none
GLEAN3_28017	SPU_028017	none	none
GLEAN3_28048	SPU_028048	contains 3 CUB superfamily motifs	none
GLEAN3_28057	SPU_028057	contains Dynein_heavy domain. probable assembly chimera (only N-terminal half matches).	none
GLEAN3_28080	SPU_028080	contains 2 CCP superfamily motifs	none
GLEAN3_28082	SPU_028082	contains Smc domain	none
GLEAN3_28091	SPU_028091	contains 5 FA58C superfamily motifs	none
GLEAN3_28099	SPU_028099	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28107	SPU_028107	contains 9 ANK superfamily motifs and Arp domain	none
GLEAN3_28144	SPU_028144	none	none
GLEAN3_28162	SPU_028162	none	none
GLEAN3_28176	SPU_028176	contains 2 Nramp superfamily motifs	none
GLEAN3_28223	SPU_028223	none	none
GLEAN3_28227	SPU_028227	contains COG1322 domain	none
GLEAN3_28259	SPU_028259	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28287	SPU_028287	contains Gelsolin superfamily motif at C-terminus	none
GLEAN3_28295	SPU_028295	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28306	SPU_028306	contains Chelatase_Class_II superfamily at N-terminus	none
GLEAN3_28313	SPU_028313	none	none
GLEAN3_28338	SPU_028338	none	none
GLEAN3_28341	SPU_028341	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28347	SPU_028347	none	none
GLEAN3_28390	SPU_028390	none	none
GLEAN3_28394	SPU_028394	none	none
GLEAN3_28400	SPU_028400	contains 4 HYR superfamily motifs	none
GLEAN3_28402	SPU_028402	contains 4 CCP superfamily motifs	none
GLEAN3_28438	SPU_028438	contains 3 zf-CXXC superfamily motifs	none
GLEAN3_28453	SPU_028453	contains AIR1 domain	none
GLEAN3_28474	SPU_028474	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28482	SPU_028482	contains FAT domain	none
GLEAN3_28495	SPU_028495	none	none
GLEAN3_28546	SPU_028546	none	none
GLEAN3_28556	SPU_028556	none	none
GLEAN3_28577	SPU_028577	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28581	SPU_028581	contains 2 LDLa superfamily motifs	none
GLEAN3_28604	SPU_028604	none	none
GLEAN3_28621	SPU_028621	none	none
GLEAN3_28638	SPU_028638	none	none
GLEAN3_28670	SPU_028670	none	none
GLEAN3_28689	SPU_028689	contains 2 IG superfamily motifs	none
GLEAN3_28691	SPU_028691	none	none
GLEAN3_28706	SPU_028706	contains eIF2A domain	none
GLEAN3_28727	SPU_028727	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28734	SPU_028734	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28738	SPU_028738	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28790	SPU_028790	contains 2 MFS superfamily motifs	none
GLEAN3_28803	SPU_028803	none	none
GLEAN3_28826	SPU_028826	none	none
GLEAN3_28864	SPU_028864	none	none
GLEAN3_28867	SPU_028867	none	none
GLEAN3_28895	SPU_028895	contains 4 PLAT superfamily motifs	none
GLEAN3_28913	SPU_028913	none	none
GLEAN3_28914	SPU_028914	none	none
GLEAN3_28918	SPU_028918	none	none
GLEAN3_00998	SPU_000998	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02441	SPU_002441	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03396	SPU_003396	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04698	SPU_004698	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05574	SPU_005574	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05935	SPU_005935	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06183	SPU_006183	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06474	SPU_006474	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07024	SPU_007024	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07654	SPU_007654	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07691	SPU_007691	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09475	SPU_009475	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12056	SPU_012056	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12297	SPU_012297	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13339	SPU_013339	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14410	SPU_014410	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16026	SPU_016026	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16036	SPU_016036	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16997	SPU_016997	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17658	SPU_017658	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18406	SPU_018406	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18407	SPU_018407	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19271	SPU_019271	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20393	SPU_020393	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20873	SPU_020873	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22032	SPU_022032	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22613	SPU_022613	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22617	SPU_022617	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_22979	SPU_022979	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24106	SPU_024106	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25395	SPU_025395	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26173	SPU_026173	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27358	SPU_027358	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28179	SPU_028179	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00643	SPU_000643	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06344	SPU_006344	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07042	SPU_007042	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11696	SPU_011696	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12191	SPU_012191	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12929	SPU_012929	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14090	SPU_014090	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_14561	SPU_014561	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15616	SPU_015616	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15925	SPU_015925	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17284	SPU_017284	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20150	SPU_020150	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28878	SPU_028878	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28879	SPU_028879	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_28047	SPU_028047	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25293	SPU_025293	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_24291	SPU_024291	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21717	SPU_021717	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21405	SPU_021405	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_21556	SPU_021556	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20173	SPU_020173	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_20586	SPU_020586	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18957	SPU_018957	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_19725	SPU_019725	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18276	SPU_018276	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18652	SPU_018652	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18736	SPU_018736	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18781	SPU_018781	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17535	SPU_017535	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18053	SPU_018053	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_18116	SPU_018116	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16795	SPU_016795	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17204	SPU_017204	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17205	SPU_017205	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17233	SPU_017233	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17246	SPU_017246	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_17360	SPU_017360	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16472	SPU_016472	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_16503	SPU_016503	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15345	SPU_015345	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15408	SPU_015408	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15696	SPU_015696	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15721	SPU_015721	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_15884	SPU_015884	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13246	SPU_013246	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13347	SPU_013347	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12528	SPU_012528	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12538	SPU_012538	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_12582	SPU_012582	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_13083	SPU_013083	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10110	SPU_010110	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_10473	SPU_010473	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_11323	SPU_011323	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09983	SPU_009983	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_08559	SPU_008559	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_09006	SPU_009006	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07462	SPU_007462	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_07543	SPU_007543	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06977	SPU_006977	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05644	SPU_005644	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05844	SPU_005844	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05919	SPU_005919	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05968	SPU_005968	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06207	SPU_006207	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_06274	SPU_006274	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05166	SPU_005166	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05211	SPU_005211	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_05444	SPU_005444	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04691	SPU_004691	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04701	SPU_004701	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04953	SPU_004953	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03399	SPU_003399	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03449	SPU_003449	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03554	SPU_003554	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03677	SPU_003677	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03680	SPU_003680	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03720	SPU_003720	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03732	SPU_003732	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03758	SPU_003758	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03761	SPU_003761	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03893	SPU_003893	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03907	SPU_003907	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_04211	SPU_004211	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_03250	SPU_003250	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02236	SPU_002236	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02327	SPU_002327	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02413	SPU_002413	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02623	SPU_002623	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02685	SPU_002685	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02735	SPU_002735	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01824	SPU_001824	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01853	SPU_001853	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_02052	SPU_002052	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01193	SPU_001193	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01254	SPU_001254	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01376	SPU_001376	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01395	SPU_001395	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01593	SPU_001593	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01594	SPU_001594	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_01680	SPU_001680	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00803	SPU_000803	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00807	SPU_000807	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_00909	SPU_000909	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_25920	SPU_025920	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_26802	SPU_026802	Strongylocentrotus purpuratus-specific protein	none
GLEAN3_27529	SPU_027529	Strongylocentrotus purpuratus-specific protein	none
