A typical scenario in information retrieval and web search is to index a given type of items (e.g., web pages, images) and provide search functionality for them. In such a scenario, the basic units of indexing and retrieval are the same. Efficient top-k computation in such settings has been studied extensively. This paper studies top-k processing for many emerging scenarios: efficiently retrieving top-k items of one type based on the inverted index of another type of items. Directly applying traditional top-k approaches in these scenarios would be very inefficient. Here we follow TA (the Threshold Algorithm) in this scenario. We present an aggregation-aware top-k computation framework with three pruning principles upon the conventional inverted index and a novel inverted index type HybridRank, which employs the item information of both types. Experimental results show that our proposed new index structure and the aggregation-aware top-k strategy provide an efficient solution for this aggre...
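For readers unfamiliar with TA, the following is a minimal Python sketch of the classic Threshold Algorithm that the abstract above takes as its starting point. It is not the aggregation-aware framework or the HybridRank index from the paper. The function name `threshold_algorithm`, the dictionary-based lists, the sum aggregation, and the convention that a missing item scores 0 are all assumptions made for illustration.

```python
import heapq


def threshold_algorithm(lists, k):
    """Return the top-k (score, item) pairs under sum aggregation.

    `lists` holds one dict per query term, mapping item -> non-negative
    score. An item missing from a list is assumed to score 0 there.
    (Hypothetical interface, chosen only for this sketch.)
    """
    if k <= 0:
        return []
    # Sorted access: each list is read in descending score order.
    sorted_lists = [sorted(l.items(), key=lambda kv: -kv[1]) for l in lists]
    seen = set()
    top = []  # min-heap holding the best k (score, item) pairs so far
    max_depth = max((len(s) for s in sorted_lists), default=0)
    for depth in range(max_depth):
        # Threshold: best total any still-unseen item could reach.
        threshold = 0.0
        for s in sorted_lists:
            if depth >= len(s):
                continue  # exhausted list contributes 0 to the bound
            item, score = s[depth]
            threshold += score
            if item in seen:
                continue
            seen.add(item)
            # Random access: fetch the item's score from every list.
            total = sum(l.get(item, 0.0) for l in lists)
            if len(top) < k:
                heapq.heappush(top, (total, item))
            elif total > top[0][0]:
                heapq.heapreplace(top, (total, item))
        # Stop once the k-th best score reaches the threshold.
        if len(top) == k and top[0][0] >= threshold:
            break
    return sorted(top, key=lambda t: (-t[0], t[1]))


lists = [{"a": 4.0, "b": 3.0, "c": 1.0}, {"b": 5.0, "c": 2.0, "a": 1.0}]
print(threshold_algorithm(lists, 2))  # [(8.0, 'b'), (5.0, 'a')]
```

In this example the scan stops after two rounds of sorted access, because the second-best total (5.0) already matches the threshold and no unseen item can exceed it.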
Proceedings of the 15th ACM international conference on Information and knowledge management - CIKM '06, 2006
This paper examines the problem of utilizing pseudo-anchor text to help rank Web objects in vertical search. We adopt a machine-learning-based approach to extract pseudo-anchor text for a vertical object from its candidate anchor blocks. Experiments in the academic search domain indicate that our approach is able to dramatically improve search performance.
Proceeding of the 17th ACM conference on Information and knowledge mining - CIKM '08, 2008
Modern web search engines, while indexing billions of web pages, are expected to process queries and return results in a very short time. Many approaches have been proposed for efficiently computing top-k query results, but most of them ignore one key factor in the ranking functions of commercial search engines: term-proximity, which is the metric of the distance between query terms in a document. When term-proximity is included in ranking functions, most of the existing top-k algorithms become inefficient. To address this problem, in this paper we propose to build a compact phrase index to speed up the search process when incorporating the term-proximity factor. The compact phrase index can help estimate the score upper bounds of unknown documents more accurately. The size of the phrase index is controlled by including only a small portion of phrases that are likely to help improve search performance. Phrase indexes have been used to process phrase queries in existing work. To the best of our knowledge, however, this is the first time that a phrase index is used to improve the performance of generic queries. Experimental results show that, compared with the state-of-the-art top-k computation approaches, our approach can reduce average query processing time to 1/5 for typical settings.
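To make the notion of term-proximity in the abstract above concrete, here is a small Python sketch of one common proximity measure: the length of the shortest window in a document that contains every query term at least once. This is an illustrative assumption, not necessarily the proximity function used in the paper, and the name `min_cover_span` and its input format are hypothetical.

```python
def min_cover_span(positions):
    """Length of the shortest window containing every query term.

    `positions` holds one list of word offsets per query term for a
    single document. Returns None if any term is absent.
    (Hypothetical helper, chosen only for this sketch.)
    """
    if not positions or any(not plist for plist in positions):
        return None
    # Merge all occurrences into one stream ordered by position.
    events = sorted(
        (pos, term) for term, plist in enumerate(positions) for pos in plist
    )
    counts = [0] * len(positions)  # occurrences of each term in the window
    covered = 0                    # number of distinct terms in the window
    best = None
    left = 0
    for pos, term in events:
        if counts[term] == 0:
            covered += 1
        counts[term] += 1
        # Shrink from the left while the window still covers all terms.
        while covered == len(positions):
            span = pos - events[left][0] + 1
            if best is None or span < best:
                best = span
            left_term = events[left][1]
            counts[left_term] -= 1
            if counts[left_term] == 0:
                covered -= 1
            left += 1
    return best


# Term 0 occurs at offsets 1 and 10, term 1 at offsets 4 and 12.
print(min_cover_span([[1, 10], [4, 12]]))  # 3
```

A smaller span means the query terms appear closer together, which a proximity-aware ranking function would typically reward.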
Calpains are crucial for the degradation of myofibrillar proteins in muscle. Calpastatin is a specific inhibitor of calpains. The objective of this study was to elucidate the effect of nutrient restriction on the activity of calpains and calpastatin in the skeletal muscle of both cows and fetuses. Beginning 30 d after conception, 20 cows were fed either a control diet consisting of native
To study the in vivo and in vitro effects of adding oxygen carbon nanotubes (CNTs) to chemotherapy for breast cancer. MCF-7 and SK-BR-3 breast cancer cells were co-cultured with paclitaxel and then exposed to oxygen-CNTs under hypoxic conditions. Cell proliferation, viability, and apoptosis rate were analyzed. Hypoxia-inducible factor-1 alpha (HIF-1α) expression was measured using reverse transcription-polymerase chain reaction (RT-PCR) and western blot. Nude mice were used as a human breast cancer model to explore the impact of oxygen-CNTs on the in vivo chemotherapeutic effect of paclitaxel. Oxygen-CNTs had no significant effects on the growth of breast cancer cells under normoxia and hypoxia. However, in the hypoxic environment, oxygen-CNTs significantly enhanced the inhibitory effect of paclitaxel on cell proliferation, as well as the apoptosis rate. Under hypoxia, downregulation of HIF-1α and upregulation of caspase-3, caspase-8, caspase-9, LC3 and Beclin-1 were observed when paclitaxel was combined with oxygen-CNTs. Furthermore, addition of oxygen-CNTs to chemotherapy was found to significantly reduce tumor weight in the tumor-bearing mice model. Oxygen-CNTs can significantly increase the chemotherapeutic effect of paclitaxel on breast cancer cells. Oxygen-CNTs may be a potential chemosensitizer in breast cancer therapy.
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - ACL-IJCNLP '09, 2009
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management - CIKM '07, 2007
... smaller than that of term weighting scores in real ranking functions, splitting inverted lists by impact scores might be more effective than purely ... collections in the experiment, as illustrated in Table 1. The GOV collection consists of about 1.25 million web pages crawled from web ...
Poly-L-lysine (PLL) can be replaced successfully by quaternized chitosan to prepare alginate-Ca microcapsules. By changing the components of the quaternized chitosan mixture, including N-trimethylated chitosan, octadecyl quaternized carboxymethyl chitosan (OQC) and poly(ethylene glycol) (PEG)-modified OQC (PEG-OQC), different alginate-Ca microcapsules with diverse properties can be obtained as carriers for cell transplantation. Compared with alginate-PLL (AP), the mechanical strength of the alginate-quaternized
Cyclic peptide (arginine-glycine-aspartic-glutamic-valine acid, cRGD)-modified monomethoxy (polyethylene glycol)-poly (D,L-lactide-co-glycolide)-poly (L-lysine) nanoparticles (mPEG-PLGA-PLL-cRGD NPs) with the antitumor drug Mitoxantrone (DHAQ) or the fluorescence agent Rhodamine B (Rb) encapsulated in their interior were prepared. The remarkable features of the mPEG-PLGA-PLL-cRGD NPs are the effective improvement of cytotoxicity and cellular uptake in vitro, and the significant enhancement of delivery ability for DHAQ or Rb in vivo. As a consequence, an excellent therapeutic efficiency for cancer is obtained, demonstrating that the mPEG-PLGA-PLL-cRGD NPs play a key role in enhancing cancer therapeutic efficiency.
Modern web search engines are expected to return the top-k results efficiently. Although many dynamic index pruning strategies have been proposed for efficient top-k computation, most of them are prone to ignoring some especially important factors in ranking functions, such as term-proximity (the distance relationship between query terms in a document). In our recent work [Zhu, M., Shi, S., Li,
Papers by Mingjie Zhu